DICNet: Deep Instance-Level Contrastive Network for Double Incomplete Multi-View Multi-Label Classification

Chengliang Liu; Jie Wen; Xiaoling Luo; Chao Huang; Zhihao Wu; Yong Xu

DICNet: 二重不完全マルチビューマルチラベル分類のためのディープインスタンスレベル対照ネットワーク

近年、マルチビューマルチラベル学習は、広範な研究熱意を呼び起こしています。ただし、現実世界のマルチビューマルチラベルデータは、データ収集と手動注釈の不確実な要因により、一般的に不完全です。これは、マルチビュー機能が欠落していることが多いだけでなく、ラベルの完全性も満足するのが難しいことを意味します。 .二重不完全マルチビューマルチラベル分類問題に対処するために、深いインスタンスレベルの対照的なネットワーク、つまり DICNet を提案します。従来の方法とは異なり、DICNet はディープニューラルネットワークを活用して、浅いレベルの機能ではなく、サンプルの高レベルのセマンティック表現を活用することに重点を置いています。まず、スタックされたオートエンコーダーを利用してエンドツーエンドのマルチビュー機能抽出フレームワークを構築し、サンプルのビュー固有の表現を学習します。さらに、コンセンサス表現能力を向上させるために、不完全なインスタンスレベルの対照的学習スキームを導入して、エンコーダーが複数のビューのコンセンサス情報をより適切に抽出し、マルチビュー加重融合モジュールを使用して意味的特徴の識別を強化するように導きます。 .全体として、当社の DICNet は、マルチビューマルチラベルデータの一貫した識別表現をキャプチャし、欠落したビューや欠落したラベルの悪影響を回避することに長けています。 5 つのデータセットに対して実施された広範な実験により、当社の方法が他の最先端の方法よりも優れていることが検証されました。

In recent years, multi-view multi-label learning has aroused extensive research enthusiasm. However, multi-view multi-label data in the real world is commonly incomplete due to the uncertain factors of data collection and manual annotation, which means that not only multi-view features are often missing, and label completeness is also difficult to be satisfied. To deal with the double incomplete multi-view multi-label classification problem, we propose a deep instance-level contrastive network, namely DICNet. Different from conventional methods, our DICNet focuses on leveraging deep neural network to exploit the high-level semantic representations of samples rather than shallow-level features. First, we utilize the stacked autoencoders to build an end-to-end multi-view feature extraction framework to learn the view-specific representations of samples. Furthermore, in order to improve the consensus representation ability, we introduce an incomplete instance-level contrastive learning scheme to guide the encoders to better extract the consensus information of multiple views and use a multi-view weighted fusion module to enhance the discrimination of semantic features. Overall, our DICNet is adept in capturing consistent discriminative representations of multi-view multi-label data and avoiding the negative effects of missing views and missing labels. Extensive experiments performed on five datasets validate that our method outperforms other state-of-the-art methods.

updated: Thu Mar 23 2023 03:09:11 GMT+0000 (UTC)

published: Wed Mar 15 2023 04:24:01 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト