Boosting-GNN: Boosting Algorithm for Graph Networks on Imbalanced Node Classification

S. Shi; Kai Qiao; Shuai Yang; L. Wang; J. Chen; Bin Yan

Boosting-GNN：不均衡なノード分類でのグラフネットワークのブースティングアルゴリズム

グラフニューラルネットワーク（GNN）は、グラフデータの表現に広く使用されています。ただし、既存の研究では理想的なバランスの取れたデータセットのみが考慮されており、不均衡なデータセットが考慮されることはめったにありません。不均衡なデータセットを処理する再サンプリング、再重み付け、合成サンプルなどの従来の方法は、GNNでは適用できなくなりました。この論文は、ブースティング中の基本分類器としてGNNを使用するBoosting-GNNと呼ばれるアンサンブルモデルを提案します。 Boosting-GNNでは、以前の分類器によって正しく分類されなかったトレーニングサンプルに高い重みが設定されるため、分類の精度と信頼性が向上します。さらに、転移学習は、計算コストを削減し、フィッティング能力を高めるために使用されます。実験結果は、提案されたBoosting-GNNモデルが、GCN、GraphSAGE、GAT、SGC、N-GCN、および合成不均衡データセットでの最先端のリウェイトおよびリサンプリング方法よりも優れたパフォーマンスを達成し、平均パフォーマンスが4.5％向上することを示しています。

The Graph Neural Network (GNN) has been widely used for graph data representation. However, the existing researches only consider the ideal balanced dataset, and the imbalanced dataset is rarely considered. Traditional methods such as resampling, reweighting, and synthetic samples that deal with imbalanced datasets are no longer applicable in GNN. This paper proposes an ensemble model called Boosting-GNN, which uses GNNs as the base classifiers during boosting. In Boosting-GNN, higher weights are set for the training samples that are not correctly classified by the previous classifier, thus achieving higher classification accuracy and better reliability. Besides, transfer learning is used to reduce computational cost and increase fitting ability. Experimental results indicate that the proposed Boosting-GNN model achieves better performance than GCN, GraphSAGE, GAT, SGC, N-GCN, and most advanced reweighting and resampling methods on synthetic imbalanced datasets, with an average performance improvement of 4.5%

updated: Sun May 08 2022 01:16:54 GMT+0000 (UTC)

published: Tue May 25 2021 02:43:31 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト