Supernet Training for Federated Image Classification under System Heterogeneity

Taehyeon Kim; Se-Young Yun

Efficiently deploying deep neural networks across many devices with differing resource constraints, particularly edge devices, is one of the most challenging problems when data privacy must also be preserved. Conventional approaches have evolved in two directions: improving a single global model while keeping each client's heterogeneous local training data decentralized (i.e., data heterogeneity; Federated Learning (FL)), or training an overarching network that supports diverse architectural settings to serve heterogeneous systems with different computational capabilities (i.e., system heterogeneity; Neural Architecture Search). However, few studies have considered both directions simultaneously. This paper proposes the federation of supernet training (FedSup) framework to address both scenarios at once: clients send and receive a supernet that contains all possible architectures sampled from itself. The approach is inspired by the observation that averaging parameters during model aggregation in FL is similar to weight sharing in supernet training. Thus, the proposed FedSup framework combines the weight-sharing approach widely used for training single-shot models with FL averaging (FedAvg). Furthermore, we develop an efficient algorithm (E-FedSup) that sends a sub-model to clients at the broadcast stage to reduce communication costs and training overhead, and that includes several strategies to enhance supernet training in the FL environment. We verify the proposed approach with extensive empirical evaluations. The resulting framework is also shown to be robust to data and model heterogeneity on several standard benchmarks.
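
The analogy between FL parameter averaging and supernet weight sharing can be illustrated with a small toy sketch. This is not the authors' implementation: the slicing rule (a sub-model is the leading `width` rows of a layer), the equal client weighting, the coverage-based averaging, and the names `extract_submodel`, `local_update`, and `aggregate` are all assumptions made for illustration. It mimics the E-FedSup idea of broadcasting only a sub-model to each client and then averaging the returned weights into the shared supernet.

```python
import numpy as np

def extract_submodel(supernet, width):
    """Hypothetical weight-sharing rule: a sub-model reuses the leading
    `width` output rows of every supernet layer."""
    return {name: w[:width].copy() for name, w in supernet.items()}

def local_update(submodel, delta):
    """Stand-in for local training: shift every weight by `delta`."""
    return {name: w + delta for name, w in submodel.items()}

def aggregate(supernet, client_submodels):
    """FedAvg-style step (equal client weights assumed): each supernet row
    becomes the mean over the clients whose sub-model covered that row.
    Rows no client trained keep their previous values."""
    updated = {}
    for name, w in supernet.items():
        total = np.zeros_like(w)
        count = np.zeros(w.shape[0])
        for sub in client_submodels:
            k = sub[name].shape[0]
            total[:k] += sub[name]
            count[:k] += 1
        covered = count > 0
        out = w.copy()
        out[covered] = total[covered] / count[covered][:, None]
        updated[name] = out
    return updated

# One layer with 4 output rows and 3 inputs, initialised to zero.
supernet = {"fc": np.zeros((4, 3))}

# A weak client trains a width-2 sub-model, a strong client the full width.
client_a = local_update(extract_submodel(supernet, 2), 1.0)
client_b = local_update(extract_submodel(supernet, 4), 3.0)

supernet = aggregate(supernet, [client_a, client_b])
print(supernet["fc"][:, 0])  # rows 0-1 are averaged over both clients, rows 2-3 come from client_b only
```

In this toy setup the rows shared by both sub-models are averaged across clients, while rows trained by only one client are taken from that client alone, which is the sense in which aggregation and weight sharing coincide. A real system would weight clients by local dataset size and slice input dimensions consistently across layers.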

updated: Thu Oct 06 2022 03:00:50 GMT+0000 (UTC)

published: Fri Jun 03 2022 02:21:01 GMT+0000 (UTC)

arXiv
