Modeling Global Distribution for Federated Learning with Label Distribution Skew

Tao Sheng; Chengchao Shen; Yuan Liu; Yeyu Ou; Zhe Qu; Jianxin Wang

Federated learning trains deep models jointly across decentralized data sources, which can significantly reduce the risk of privacy leakage. In the more general case, however, the label distributions differ across clients, a condition known as "label distribution skew". Applying conventional federated learning directly, without accounting for label distribution skew, significantly hurts the performance of the global model. To address this, we propose a novel federated learning method, named FedMGD, that alleviates the performance degradation caused by label distribution skew. FedMGD introduces a global Generative Adversarial Network to model the global data distribution without access to local datasets, so the global model can be trained on global information about the data distribution without privacy leakage. Experimental results demonstrate that the proposed method significantly outperforms the state of the art on several public benchmarks. Code is available at https://github.com/Sheng-T/FedMGD.
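To make "label distribution skew" concrete, the sketch below splits a labeled dataset across clients so that each client sees a different mix of classes. It is an illustration only and is not taken from the FedMGD code: the per-class Dirichlet split is an assumed, commonly used way to simulate this kind of skew, and the names `partition_by_label_skew`, `label_histogram`, and the parameter `alpha` are hypothetical.

```python
import random
from collections import Counter


def dirichlet(alpha, k, rng):
    """Sample a symmetric Dirichlet(alpha) vector of length k via normalized Gamma draws."""
    draws = [rng.gammavariate(alpha, 1.0) for _ in range(k)]
    total = sum(draws)
    if total == 0.0:  # guard against underflow for very small alpha
        return [1.0 / k] * k
    return [d / total for d in draws]


def partition_by_label_skew(labels, num_clients, alpha, seed=0):
    """Assign sample indices to clients with skewed label proportions.

    Assumption (not from the paper): for each class, a Dirichlet(alpha)
    vector decides what share of that class each client receives.
    Smaller alpha gives stronger skew.
    """
    rng = random.Random(seed)

    # Group sample indices by class label.
    by_class = {}
    for idx, y in enumerate(labels):
        by_class.setdefault(y, []).append(idx)

    clients = [[] for _ in range(num_clients)]
    for y in sorted(by_class):
        idxs = by_class[y]
        rng.shuffle(idxs)
        shares = dirichlet(alpha, num_clients, rng)

        # Turn cumulative shares into cut points over this class's samples.
        start, cum = 0, 0.0
        for c in range(num_clients):
            cum += shares[c]
            if c == num_clients - 1:
                end = len(idxs)  # last client takes the remainder
            else:
                end = min(len(idxs), max(start, int(round(cum * len(idxs)))))
            clients[c].extend(idxs[start:end])
            start = end
    return clients


def label_histogram(labels, indices):
    """Count how many samples of each class a client holds."""
    return Counter(labels[i] for i in indices)


if __name__ == "__main__":
    # Toy dataset: 1000 samples, 10 balanced classes.
    toy_labels = [i % 10 for i in range(1000)]
    parts = partition_by_label_skew(toy_labels, num_clients=5, alpha=0.1)
    for cid, part in enumerate(parts):
        print(cid, sorted(label_histogram(toy_labels, part).items()))
```

With a small `alpha`, most clients end up holding only a few classes, which is the setting in which the abstract says conventional federated learning degrades. A large `alpha` moves the split toward balanced label proportions on every client.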

updated: Sat Dec 17 2022 14:46:01 GMT+0000 (UTC)

published: Sat Dec 17 2022 14:46:01 GMT+0000 (UTC)

arXiv
