BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning

Zhi Hou; Baosheng Yu; Dacheng Tao

Despite the success of deep neural networks, deep representation learning still faces many challenges arising from data scarcity, such as data imbalance, unseen distributions, and domain shift. To address these issues, a variety of methods have been devised that explore sample relationships in a vanilla way (i.e., from the perspective of either the input or the loss function), but they fail to exploit the internal structure of deep neural networks for learning sample relationships. Motivated by this, we propose to equip deep neural networks themselves with the ability to learn sample relationships from each mini-batch. Specifically, we introduce a batch transformer module, BatchFormer, which is applied to the batch dimension of each mini-batch to implicitly explore sample relationships during training. In this way, the proposed method enables different samples to collaborate; e.g., head-class samples can also contribute to the learning of tail classes in long-tailed recognition. Furthermore, to mitigate the gap between training and testing, we share the classifier between the features with and without BatchFormer during training, so that the module can be removed at test time. We perform extensive experiments on over ten datasets, and the proposed method achieves significant improvements on different data scarcity applications without any bells and whistles, including long-tailed recognition, compositional zero-shot learning, domain generalization, and contrastive learning. Code will be made publicly available at https://github.com/zhihou7/BatchFormer.
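
The mechanism described in the abstract can be illustrated with a minimal PyTorch sketch. This is an illustration under stated assumptions, not the authors' released implementation: the class name `BatchFormerHead`, the use of a single `nn.TransformerEncoderLayer`, and the values chosen for the number of heads, feed-forward width, and dropout are all hypothetical. The sketch shows the two ideas from the abstract: attention runs across the batch dimension (the mini-batch is treated as a sequence of samples), and one classifier is shared between the features with and without the module, so the module can be skipped at test time.

```python
import torch
import torch.nn as nn


class BatchFormerHead(nn.Module):
    """Hypothetical classification head illustrating the BatchFormer idea."""

    def __init__(self, feat_dim, num_classes, num_heads=4, dropout=0.5):
        super().__init__()
        # Assumed configuration: one encoder layer; hyperparameters are illustrative.
        self.encoder = nn.TransformerEncoderLayer(
            d_model=feat_dim,
            nhead=num_heads,
            dim_feedforward=feat_dim,
            dropout=dropout,
        )
        # A single classifier shared by both feature streams.
        self.classifier = nn.Linear(feat_dim, num_classes)

    def forward(self, feats, labels=None):
        # feats: [N, C] backbone features for a mini-batch of N samples.
        if not self.training:
            # At test time the batch transformer is bypassed entirely.
            return self.classifier(feats), labels

        # With the default layout (seq_len, batch, dim), reshaping [N, C] to
        # [N, 1, C] makes the N samples the "sequence", so self-attention
        # mixes information across samples of the mini-batch.
        mixed = self.encoder(feats.unsqueeze(1)).squeeze(1)

        # Classify both the original and the batch-mixed features with the
        # same classifier; labels are duplicated to match.
        all_feats = torch.cat([feats, mixed], dim=0)
        all_labels = None if labels is None else torch.cat([labels, labels], dim=0)
        return self.classifier(all_feats), all_labels


# Usage sketch with random data (shapes only; no real dataset is assumed).
head = BatchFormerHead(feat_dim=16, num_classes=5)
feats = torch.randn(8, 16)
labels = torch.randint(0, 5, (8,))

head.train()
logits, targets = head(feats, labels)
loss = nn.functional.cross_entropy(logits, targets)

head.eval()
test_logits, _ = head(feats)
```

Because the classifier sees both feature streams during training, it remains valid for features that never passed through the encoder, which is what allows the module to be dropped at inference in this sketch.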

updated: Tue Mar 29 2022 12:48:13 GMT+0000 (UTC)

published: Thu Mar 03 2022 05:31:33 GMT+0000 (UTC)

arXiv
