Exploring One-shot Semi-supervised Federated Learning with A Pre-trained Diffusion Model

Mingzhao Yang; Shangchao Su; Bin Li; Xiangyang Xue

Federated learning is a privacy-preserving collaborative learning approach. Recent studies have proposed a semi-supervised federated learning setting to handle the common real-world scenario in which labeled data resides on the server and unlabeled data on the clients. However, existing methods still face challenges such as high communication costs, a heavy training burden on client devices, and distribution differences between the server and the clients. In this paper, we introduce powerful pre-trained diffusion models into federated learning and propose FedDISC, a Federated Diffusion Inspired Semi-supervised Co-training method, to address these challenges. Specifically, we first extract prototypes from the labeled data on the server and send them to the clients. The clients then use these prototypes to predict pseudo-labels for their local data, and compute cluster centroids and domain-specific features to represent their personalized distributions. After adding noise, the clients send these features and the corresponding pseudo-labels back to the server. The server uses a pre-trained diffusion model to conditionally generate pseudo-samples that follow the client distributions, and trains an aggregated model on them. Our method requires no local training and involves only forward inference on the clients. Extensive experiments on DomainNet, Openimage, and NICO++ demonstrate that FedDISC effectively addresses the one-shot semi-supervised problem on Non-IID clients and outperforms the compared SOTA methods. We also show through visualization that FedDISC has a negligible chance of leaking privacy-sensitive client information.
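The prototype and pseudo-labeling steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that features have already been extracted by some frozen encoder, that a prototype is the per-class mean feature, that pseudo-labels are assigned by cosine similarity, and that each pseudo-class is summarized by a single centroid perturbed with Gaussian noise. The function names, the `noise_std` parameter, and the toy data are all hypothetical. The domain-specific features and the diffusion-based generation on the server are omitted.

```python
import math
import random
from collections import defaultdict


def mean_vector(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def server_prototypes(features, labels):
    """Server side (assumed form): one prototype per class, the mean feature."""
    by_class = defaultdict(list)
    for f, y in zip(features, labels):
        by_class[y].append(f)
    return {y: mean_vector(fs) for y, fs in by_class.items()}


def client_pseudo_labels(features, prototypes):
    """Client side: assign each local feature to the most similar prototype."""
    return [
        max(prototypes, key=lambda y: cosine(f, prototypes[y]))
        for f in features
    ]


def client_upload(features, prototypes, noise_std=0.01, seed=0):
    """Client side: pseudo-label, compute one centroid per pseudo-class
    (a simplification of the cluster centroids in the abstract), and add
    Gaussian noise before sending. No local training takes place."""
    rng = random.Random(seed)
    pseudo = client_pseudo_labels(features, prototypes)
    by_class = defaultdict(list)
    for f, y in zip(features, pseudo):
        by_class[y].append(f)
    upload = {}
    for y, fs in by_class.items():
        centroid = mean_vector(fs)
        upload[y] = [c + rng.gauss(0.0, noise_std) for c in centroid]
    return pseudo, upload


# Toy 2-D features standing in for encoder outputs (hypothetical data).
server_feats = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
server_labels = ["cat", "cat", "dog", "dog"]
protos = server_prototypes(server_feats, server_labels)

client_feats = [[0.8, 0.2], [0.2, 0.7], [1.0, 0.1]]
pseudo, upload = client_upload(client_feats, protos)
```

In this sketch the client runs only forward computations and communicates once, which mirrors the one-shot, training-free client role the abstract describes. The server would then condition a pre-trained diffusion model on the uploaded noisy features and pseudo-labels, a step left out here.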

updated: Sat May 06 2023 14:22:33 GMT+0000 (UTC)

published: Sat May 06 2023 14:22:33 GMT+0000 (UTC)

arXiv
