Fed-FSNet: Mitigating Non-I.I.D. Federated Learning via Fuzzy Synthesizing Network

Jingcai Guo; Song Guo; Jie Zhang; Ziming Liu

Federated learning (FL) has recently emerged as a promising privacy-preserving distributed machine learning framework. It aims to collaboratively learn a shared global model by training locally on edge devices and aggregating the local models into a global one, without sharing raw data centrally on the cloud server. However, because local data are highly heterogeneous across edge devices (Non-I.I.D. data), FL can easily yield a global model that produces more strongly shifted gradients on local datasets, which degrades model performance or even prevents convergence during training. In this paper, we propose a novel FL training framework, dubbed Fed-FSNet, that uses a properly designed Fuzzy Synthesizing Network (FSNet) to mitigate Non-I.I.D. FL at the source. Concretely, we maintain an edge-agnostic hidden model on the cloud server to estimate a less accurate but direction-aware inversion of the global model. The hidden model can then fuzzily synthesize several mimic I.I.D. data samples (sample features) conditioned only on the global model, and these can be shared across edge devices to steer FL training toward faster and better convergence. Moreover, since the synthesizing process neither accesses the parameters/updates of local models nor analyzes individual local model outputs, our framework still preserves the privacy of FL. Experimental results on several FL benchmarks demonstrate that our method significantly mitigates the Non-I.I.D. issue and outperforms other representative methods.
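As a rough illustration of the "aggregating local models into a global one" step mentioned in the abstract, the following minimal sketch shows a sample-size-weighted average of local parameters in the style of FedAvg. This is the generic FL aggregation step, not the Fed-FSNet method itself; the abstract does not specify the aggregation rule, and the function name `fedavg`, the toy parameter vectors, and the sample counts are all hypothetical.

```python
import numpy as np

def fedavg(local_weights, num_samples):
    """Sample-size-weighted average of local parameter vectors.

    Assumption: a FedAvg-style rule; the source does not state which
    aggregation rule Fed-FSNet uses.
    """
    sizes = np.asarray(num_samples, dtype=float)
    coeffs = sizes / sizes.sum()        # each device's share of the total data
    stacked = np.stack(local_weights)   # shape: (num_devices, num_params)
    return coeffs @ stacked             # weighted sum over devices

# Three hypothetical edge devices holding different amounts of local data.
local_weights = [
    np.array([1.0, 0.0]),
    np.array([0.0, 1.0]),
    np.array([1.0, 1.0]),
]
num_samples = [10, 30, 60]

global_weights = fedavg(local_weights, num_samples)
print(global_weights)
```

Only model parameters and sample counts reach the server in this sketch, never raw data. Under Non-I.I.D. data the local parameter vectors can point in quite different directions, which is the situation the abstract describes as shifted gradients.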

updated: Tue Apr 25 2023 08:45:05 GMT+0000 (UTC)

published: Sun Aug 21 2022 18:40:51 GMT+0000 (UTC)

arXiv
