Gradient Surgery for Multi-Task Learning

Tianhe Yu; Saurabh Kumar; Abhishek Gupta; Sergey Levine; Karol Hausman; Chelsea Finn

マルチタスク学習のための勾配手術

深層学習および深層強化学習（RL）システムは、画像分類、ゲームプレイ、ロボット制御などの分野で印象的な結果を示していますが、データ効率は依然として大きな課題です。マルチタスク学習は、より効率的な学習を可能にするために複数のタスク間で構造を共有するための有望なアプローチとして浮上しています。ただし、マルチタスク設定には多くの最適化の課題があり、タスクを個別に学習する場合と比べて効率を大幅に向上させることは困難です。マルチタスク学習がシングルタスク学習と比較して非常に難しい理由は完全には理解されていません。この作業では、有害な勾配干渉を引き起こすマルチタスク最適化ランドスケープの3つの条件のセットを識別し、タスク勾配間のこのような干渉を回避するためのシンプルでありながら一般的なアプローチを開発します。勾配の競合がある他のタスクの勾配の法線平面にタスクの勾配を投影する勾配手術の形式を提案します。困難な一連のマルチタスク監視およびマルチタスクRL問題では、このアプローチにより効率とパフォーマンスが大幅に向上します。さらに、これはモデルに依存せず、以前に提案されたマルチタスクアーキテクチャと組み合わせてパフォーマンスを向上させることができます。

While deep learning and deep reinforcement learning (RL) systems have demonstrated impressive results in domains such as image classification, game playing, and robotic control, data efficiency remains a major challenge. Multi-task learning has emerged as a promising approach for sharing structure across multiple tasks to enable more efficient learning. However, the multi-task setting presents a number of optimization challenges, making it difficult to realize large efficiency gains compared to learning tasks independently. The reasons why multi-task learning is so challenging compared to single-task learning are not fully understood. In this work, we identify a set of three conditions of the multi-task optimization landscape that cause detrimental gradient interference, and develop a simple yet general approach for avoiding such interference between task gradients. We propose a form of gradient surgery that projects a task's gradient onto the normal plane of the gradient of any other task that has a conflicting gradient. On a series of challenging multi-task supervised and multi-task RL problems, this approach leads to substantial gains in efficiency and performance. Further, it is model-agnostic and can be combined with previously-proposed multi-task architectures for enhanced performance.

updated: Tue Dec 22 2020 00:35:46 GMT+0000 (UTC)

published: Sun Jan 19 2020 06:33:47 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト