No Fear of Heterogeneity: Classifier Calibration for Federated Learning with Non-IID Data

Mi Luo; Fei Chen; Dapeng Hu; Yifan Zhang; Jian Liang; Jiashi Feng

A central challenge in training classification models in real-world federated systems is learning with non-IID data. To cope with this, most existing works either enforce regularization in local optimization or improve the model aggregation scheme at the server. Other works share public datasets or synthesized samples to supplement the training of under-represented classes, or introduce a certain level of personalization. Though effective, they lack a deep understanding of how data heterogeneity affects each layer of a deep classification model. In this paper, we bridge this gap by performing an experimental analysis of the representations learned by different layers. Our observations are surprising: (1) there exists a greater bias in the classifier than in other layers, and (2) the classification performance can be significantly improved by post-calibrating the classifier after federated training. Motivated by these findings, we propose a novel and simple algorithm called Classifier Calibration with Virtual Representations (CCVR), which adjusts the classifier using virtual representations sampled from an approximated Gaussian mixture model. Experimental results demonstrate that CCVR achieves state-of-the-art performance on popular federated learning benchmarks including CIFAR-10, CIFAR-100, and CINIC-10. We hope that our simple yet effective method can shed some light on future research on federated learning with non-IID data.
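The calibration idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes one Gaussian per class fitted on pooled feature vectors, with no federated aggregation of statistics, and a plain softmax classifier re-trained by full-batch gradient descent. All function names, the toy data, and the hyperparameters are hypothetical.

```python
import numpy as np

def fit_class_gaussians(features, labels, num_classes):
    """Estimate a (mean, covariance) pair per class from feature vectors."""
    stats = {}
    for c in range(num_classes):
        x = features[labels == c]
        if len(x) < 2:  # too few samples to estimate a covariance
            continue
        cov = np.cov(x, rowvar=False) + 1e-6 * np.eye(x.shape[1])  # small ridge
        stats[c] = (x.mean(axis=0), cov)
    return stats

def sample_virtual_representations(stats, per_class, rng):
    """Draw labelled virtual features from the per-class Gaussians."""
    xs, ys = [], []
    for c, (mean, cov) in stats.items():
        xs.append(rng.multivariate_normal(mean, cov, size=per_class))
        ys.append(np.full(per_class, c))
    return np.concatenate(xs), np.concatenate(ys)

def calibrate_classifier(W, b, x, y, lr=0.1, epochs=300):
    """Re-train only the softmax classifier (W, b) on virtual features."""
    W, b = W.copy(), b.copy()
    onehot = np.eye(W.shape[1])[y]
    for _ in range(epochs):
        logits = x @ W + b
        logits -= logits.max(axis=1, keepdims=True)  # numerical stability
        probs = np.exp(logits)
        probs /= probs.sum(axis=1, keepdims=True)
        grad = (probs - onehot) / len(x)  # gradient of mean cross-entropy
        W -= lr * (x.T @ grad)
        b -= lr * grad.sum(axis=0)
    return W, b

def accuracy(W, b, x, y):
    return float((np.argmax(x @ W + b, axis=1) == y).mean())

# Toy demo: 2-D "features" for 3 classes and a classifier biased to class 0.
rng = np.random.default_rng(0)
means = np.array([[-3.0, -3.0], [3.0, -3.0], [0.0, 3.0]])
labels = np.repeat(np.arange(3), 100)
features = means[labels] + 0.5 * rng.standard_normal((300, 2))

W0 = np.zeros((2, 3))
b0 = np.array([5.0, 0.0, 0.0])  # stands in for a classifier skewed by non-IID training

stats = fit_class_gaussians(features, labels, num_classes=3)
vx, vy = sample_virtual_representations(stats, per_class=200, rng=rng)
W1, b1 = calibrate_classifier(W0, b0, vx, vy)

acc_before = accuracy(W0, b0, features, labels)
acc_after = accuracy(W1, b1, features, labels)
print(acc_before, acc_after)
```

In this sketch only the final linear layer is updated while the features stay fixed, which mirrors the abstract's description of post-calibrating the classifier after federated training.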

updated: Wed Jun 09 2021 12:02:29 GMT+0000 (UTC)

published: Wed Jun 09 2021 12:02:29 GMT+0000 (UTC)

arXiv
