Select and Calibrate the Low-confidence: Dual-Channel Consistency based Graph Convolutional Networks

Shuhao Shi; Jian Chen; Kai Qiao; Shuai Yang; Linyuan Wang; Bin Yan

低信頼性の選択と校正：デュアルチャネル整合性ベースのグラフ畳み込みネットワーク

グラフ畳み込みネットワーク（GCN）は、ノード分類タスクで優れた結果を達成しましたが、低いラベル率でのモデルのパフォーマンスは依然として不十分です。グラフの半教師あり学習（SSL）のこれまでの研究では、ネットワーク予測を使用してソフト疑似ラベルを生成したり、メッセージの伝播を指示したりすることに焦点が当てられていました。提案されたデュアルチャネル整合性ベースのグラフ畳み込みネットワーク（DCC-GCN）は、デュアルチャネルを使用してノードの特徴とトポロジ構造から埋め込みを抽出し、デュアルチャネル整合性に基づいて信頼性の高い低信頼性と高信頼性のサンプル選択を実現します。さらに、デュアルチャネルの一貫性に基づいて取得された信頼性の低いサンプルは精度が低く、モデルのパフォーマンスを制約していることを確認しました。信頼度の低いサンプルを無視した以前の研究とは異なり、近隣の信頼度の高いサンプルを使用して、信頼度の低いサンプルの特徴の埋め込みを調整します。私たちの実験では、DCC-GCNが信頼性の低いサンプルと信頼性の高いサンプルをより正確に区別でき、信頼性の低いサンプルの精度を大幅に向上できることが示されています。ベンチマークデータセットで広範な実験を実施し、DCC-GCNがさまざまなラベルレートで最先端のベースラインよりも大幅に優れていることを実証しました。

The Graph Convolutional Networks (GCNs) have achieved excellent results in node classification tasks, but the model's performance at low label rates is still unsatisfactory. Previous studies in Semi-Supervised Learning (SSL) for graph have focused on using network predictions to generate soft pseudo-labels or instructing message propagation, which inevitably contains the incorrect prediction due to the over-confident in the predictions. Our proposed Dual-Channel Consistency based Graph Convolutional Networks (DCC-GCN) uses dual-channel to extract embeddings from node features and topological structures, and then achieves reliable low-confidence and high-confidence samples selection based on dual-channel consistency. We further confirmed that the low-confidence samples obtained based on dual-channel consistency were low in accuracy, constraining the model's performance. Unlike previous studies ignoring low-confidence samples, we calibrate the feature embeddings of the low-confidence samples by using the neighborhood's high-confidence samples. Our experiments have shown that the DCC-GCN can more accurately distinguish between low-confidence and high-confidence samples, and can also significantly improve the accuracy of low-confidence samples. We conducted extensive experiments on the benchmark datasets and demonstrated that DCC-GCN is significantly better than state-of-the-art baselines at different label rates.

updated: Sun May 08 2022 01:35:28 GMT+0000 (UTC)

published: Sun May 08 2022 01:35:28 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト