End-to-End Supervised Multilabel Contrastive Learning

Ahmad Sajedi; Samir Khaki; Konstantinos N. Plataniotis; Mahdi S. Hosseini

エンドツーエンドの教師ありマルチラベル対照学習

マルチラベル表現の学習は、オブジェクトカテゴリ間のラベルの依存関係、または正/負のサンプルの固有の不均衡などのデータ関連の問題に関連する可能性がある、困難な問題として認識されています。最近の進歩は、モデルおよびデータ中心の観点からこれらの課題に対処しています。モデル中心では、ラベル相関は外部モデル設計 (グラフ CNN など) によって取得され、トレーニング用の帰納的バイアスが組み込まれます。ただし、エンドツーエンドのトレーニングフレームワークを設計できていないため、計算が非常に複雑になります。逆に、データ中心では、ラベルの依存関係を無視しながら、データセットの現実的な性質が分類を改善するために考慮されます。このペーパーでは、モデル中心設計とデータ中心設計の両方の欠点に対処するために、KMCL (カーネルベースの多重ラベル対照学習) と呼ばれる新しいエンドツーエンドのトレーニングフレームワークを提案します。 KMCL はまず、埋め込まれた特徴をガウス RKHS の指数カーネルの混合物に変換します。次に、(a) カーネル表現を再構築するための再構成損失、(b) 固有の不均衡問題に対処するための非対称分類損失、および (c) ラベル相関を捕捉するためのコントラスト損失で構成される客観的損失の符号化が続きます。 KMCL は、低い計算フットプリントを維持しながら、特徴エンコーダーの不確実性をモデル化します。 SOTA メソッドに対する KMCL の一貫した改善を示すために、画像分類タスクに関して広範な実験が行われています。 PyTorch の実装は https://github.com/mahdihosseini/KMCL で提供されています。

Multilabel representation learning is recognized as a challenging problem that can be associated with either label dependencies between object categories or data-related issues such as the inherent imbalance of positive/negative samples. Recent advances address these challenges from model- and data-centric viewpoints. In model-centric, the label correlation is obtained by an external model designs (e.g., graph CNN) to incorporate an inductive bias for training. However, they fail to design an end-to-end training framework, leading to high computational complexity. On the contrary, in data-centric, the realistic nature of the dataset is considered for improving the classification while ignoring the label dependencies. In this paper, we propose a new end-to-end training framework -- dubbed KMCL (Kernel-based Mutlilabel Contrastive Learning) -- to address the shortcomings of both model- and data-centric designs. The KMCL first transforms the embedded features into a mixture of exponential kernels in Gaussian RKHS. It is then followed by encoding an objective loss that is comprised of (a) reconstruction loss to reconstruct kernel representation, (b) asymmetric classification loss to address the inherent imbalance problem, and (c) contrastive loss to capture label correlation. The KMCL models the uncertainty of the feature encoder while maintaining a low computational footprint. Extensive experiments are conducted on image classification tasks to showcase the consistent improvements of KMCL over the SOTA methods. PyTorch implementation is provided in https://github.com/mahdihosseini/KMCL.

updated: Sat Jul 08 2023 12:46:57 GMT+0000 (UTC)

published: Sat Jul 08 2023 12:46:57 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト