Distribution Matching for Heterogeneous Multi-Task Learning: a Large-scale Face Study

Dimitrios Kollias; Viktoriia Sharmanska; Stefanos Zafeiriou

異種マルチタスク学習のための分布マッチング：大規模な顔の研究

マルチタスク学習は、DNNなどの共有学習アルゴリズムによって複数のタスクを共同で学習する方法論として登場しました。 MTLは、検討中のタスクが関連しているという仮定に基づいています。したがって、共有された知識を活用して、個々のタスクのパフォーマンスを向上させます。タスクは一般に同種であると見なされます。つまり、同じタイプの問題を参照します。さらに、MTLは通常、タスク間で完全または部分的に重複するグラウンドトゥルースアノテーションに基づいています。この作業では、異種MTLを扱い、同時に検出、分類、および回帰の問題に対処します。弱く監視された方法で、ほとんど、または重複しない注釈を含むタスクを共同トレーニングする手段として、タスクの関連性を調査します。タスクの関連性は、事前の専門知識を通じて、またはデータ主導の研究を通じて、MTLに導入されます。予測の分布のマッチングを介して、タスク間で知識交換が可能になる、新しい分布マッチングアプローチを提案します。このアプローチに基づいて、すべての顔の行動タスクを共同で学習することにより、大規模な顔分析の最初のフレームワークであるFaceBehaviorNetを構築します。以下のケーススタディを開発します。i）継続的な影響の推定、アクションユニットの検出、基本的な感情の認識。 ii）属性の検出、顔の識別。タスクの関連性を介した共同トレーニングが負の転送を軽減することを示します。 FaceBehaviorNetは、顔の行動のすべての側面をカプセル化する機能を学習するため、複合感情認識など、トレーニングされたタスク以外のタスクを実行するために、ゼロショット/少数ショットの学習を実行します。 10のデータベースを利用して非常に大規模な実験的研究を実施することにより、トレーニングで使用されていないものも含め、すべてのタスクとすべてのデータベースで最先端のアプローチを大幅に上回っていることを示しています。。

Multi-Task Learning has emerged as a methodology in which multiple tasks are jointly learned by a shared learning algorithm, such as a DNN. MTL is based on the assumption that the tasks under consideration are related; therefore it exploits shared knowledge for improving performance on each individual task. Tasks are generally considered to be homogeneous, i.e., to refer to the same type of problem. Moreover, MTL is usually based on ground truth annotations with full, or partial overlap across tasks. In this work, we deal with heterogeneous MTL, simultaneously addressing detection, classification & regression problems. We explore task-relatedness as a means for co-training, in a weakly-supervised way, tasks that contain little, or even non-overlapping annotations. Task-relatedness is introduced in MTL, either explicitly through prior expert knowledge, or through data-driven studies. We propose a novel distribution matching approach, in which knowledge exchange is enabled between tasks, via matching of their predictions' distributions. Based on this approach, we build FaceBehaviorNet, the first framework for large-scale face analysis, by jointly learning all facial behavior tasks. We develop case studies for: i) continuous affect estimation, action unit detection, basic emotion recognition; ii) attribute detection, face identification. We illustrate that co-training via task relatedness alleviates negative transfer. Since FaceBehaviorNet learns features that encapsulate all aspects of facial behavior, we conduct zero-/few-shot learning to perform tasks beyond the ones that it has been trained for, such as compound emotion recognition. By conducting a very large experimental study, utilizing 10 databases, we illustrate that our approach outperforms, by large margins, the state-of-the-art in all tasks and in all databases, even in these which have not been used in its training.

updated: Sat May 08 2021 22:26:52 GMT+0000 (UTC)

published: Sat May 08 2021 22:26:52 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト