Information Theory-Guided Heuristic Progressive Multi-View Coding

Jiangmeng Li; Hang Gao; Wenwen Qiang; Changwen Zheng

Multi-view representation learning aims to capture comprehensive information from multiple views of a shared context. Recent works intuitively apply contrastive learning to different views in a pairwise manner, an approach that still leaves room for improvement in three respects. First, view-specific noise is not filtered out when learning view-shared representations. Second, fake negative pairs, in which the negative term actually belongs to the same class as the positive, are treated the same as real negative pairs. Third, measuring the similarities between terms evenly might interfere with optimization. Importantly, few works study the theoretical framework of generalized self-supervised multi-view learning, especially for more than two views. To this end, we rethink the existing multi-view learning paradigm from the perspective of information theory and propose a novel information-theoretical framework for generalized multi-view learning. Guided by it, we build a multi-view coding method with a three-tier progressive architecture, namely Information theory-guided hierarchical Progressive Multi-view Coding (IPMC). In the distribution-tier, IPMC aligns the distribution between views to reduce view-specific noise. In the set-tier, IPMC constructs self-adjusted contrasting pools, which are adaptively modified by a view filter. Lastly, in the instance-tier, we adopt a designed unified loss to learn representations and reduce gradient interference. Theoretically and empirically, we demonstrate the superiority of IPMC over state-of-the-art methods.
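For orientation, the pairwise contrastive baseline that the abstract critiques can be sketched as follows. This is a minimal NumPy illustration under stated assumptions, not the IPMC method: it assumes a standard InfoNCE objective with cosine similarity, and the function names (`info_nce`, `pairwise_multiview_loss`), the `temperature` value, and the embedding shapes are all hypothetical choices made for the example.

```python
import itertools

import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products are cosine similarities."""
    return x / np.maximum(np.linalg.norm(x, axis=-1, keepdims=True), eps)


def info_nce(anchor, other, temperature=0.1):
    """InfoNCE loss (assumed baseline objective, not taken from the paper).

    Row i of `other` is the positive for row i of `anchor`; every other row
    is treated as a negative with equal weight.
    """
    logits = l2_normalize(anchor) @ l2_normalize(other).T / temperature  # (N, N)
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-np.mean(np.diag(log_prob)))


def pairwise_multiview_loss(views, temperature=0.1):
    """Average symmetric InfoNCE over every unordered pair of views.

    `views` is a list of (N, D) arrays, one per view, with aligned rows.
    Returns the mean loss and the number of view pairs that were contrasted.
    """
    pairs = list(itertools.combinations(range(len(views)), 2))
    total = 0.0
    for i, j in pairs:
        total += info_nce(views[i], views[j], temperature)
        total += info_nce(views[j], views[i], temperature)
    return total / (2 * len(pairs)), len(pairs)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    views = [rng.normal(size=(8, 16)) for _ in range(4)]  # 4 hypothetical views
    loss, num_pairs = pairwise_multiview_loss(views)
    print(f"pairs contrasted: {num_pairs}, mean loss: {loss:.3f}")
```

Two properties of this sketch correspond to points raised in the abstract. Every non-matching row counts as a negative with equal weight, so a sample from the same class as the anchor is pushed away just like a true negative. The number of contrasted pairs also grows as V(V-1)/2 with the number of views V, and nothing in the objective separates view-specific noise from view-shared content.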

updated: Wed Aug 23 2023 08:49:54 GMT+0000 (UTC)

published: Mon Aug 21 2023 07:19:47 GMT+0000 (UTC)

arXiv
