General Partial Label Learning via Dual Bipartite Graph Autoencoder

Brian Chen; Bo Wu; Alireza Zareian; Hanwang Zhang; Shih-Fu Chang

We formulate a practical yet challenging problem: General Partial Label Learning (GPLL). Compared to the traditional Partial Label Learning (PLL) problem, GPLL relaxes the supervision assumption from instance-level (a label set partially labels an instance) to group-level: 1) a label set partially labels a group of instances, where the within-group instance-label link annotations are missing, and 2) cross-group links are allowed, meaning that instances in a group may be partially linked to the label set of another group. Such ambiguous group-level supervision is more practical in real-world scenarios because no additional instance-level annotation is required. One example is face naming in videos, where a group consists of the faces in a frame and is labeled by the set of names in the corresponding caption. In this paper, we propose a novel graph convolutional network (GCN) called Dual Bipartite Graph Autoencoder (DB-GAE) to tackle the label ambiguity challenge of GPLL. First, we exploit cross-group correlations to represent the instance groups as dual bipartite graphs, one within-group and one cross-group, which reciprocally complement each other to resolve the linking ambiguities. Second, we design a GCN autoencoder to encode and decode them, where the decodings are taken as the refined results. It is worth noting that DB-GAE is self-supervised and transductive, as it uses only the group-level supervision and needs no separate offline training stage. Extensive experiments on two real-world datasets demonstrate that DB-GAE significantly outperforms the best baseline, by more than 0.159 in absolute F1-score and 24.8% in accuracy. We further offer analysis on various levels of label ambiguity.
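To make the group-level supervision concrete, the following is a minimal sketch of how the two candidate bipartite graphs might be laid out as instance-by-label matrices. It is an illustration only, not the DB-GAE implementation: the function name `build_candidate_graphs`, the binary 0/1 encoding, and in particular the rule that cross-group candidates are all labels that appear in other groups' label sets but not in the instance's own are assumptions made here, since the abstract does not specify how cross-group links are chosen.

```python
from typing import List, Set, Tuple

Matrix = List[List[int]]


def build_candidate_graphs(
    groups: List[List[int]],
    label_sets: List[Set[str]],
    labels: List[str],
) -> Tuple[Matrix, Matrix]:
    """Build two binary instance-by-label candidate matrices.

    Hypothetical helper for illustration. Instances are assumed to be
    numbered 0..n-1 and to belong to exactly one group each.
    """
    col = {name: j for j, name in enumerate(labels)}
    n = sum(len(members) for members in groups)
    within = [[0] * len(labels) for _ in range(n)]
    cross = [[0] * len(labels) for _ in range(n)]
    all_labels = set().union(*label_sets)

    for members, own in zip(groups, label_sets):
        for i in members:
            # Within-group: every label in the group's own label set
            # is a candidate for every instance in that group.
            for name in own:
                within[i][col[name]] = 1
            # Cross-group (assumed rule): labels seen only in other
            # groups' label sets are also kept as candidates.
            for name in all_labels - own:
                cross[i][col[name]] = 1
    return within, cross


# Face-naming toy example: two frames, each with two detected faces.
groups = [[0, 1], [2, 3]]                  # face indices per frame
label_sets = [{"Alice", "Bob"}, {"Bob"}]   # names in each caption
labels = ["Alice", "Bob"]

within, cross = build_candidate_graphs(groups, label_sets, labels)
```

In this toy case the faces in the second frame are linked to "Bob" within their group and to "Alice" only through the cross-group graph, which is the kind of ambiguity a model such as DB-GAE would then have to resolve.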

updated: Thu Sep 09 2021 14:40:19 GMT+0000 (UTC)

published: Sun Jan 05 2020 19:00:41 GMT+0000 (UTC)

arXiv
