Federated Adaptive Prompt Tuning for Multi-domain Collaborative Learning

Shangchao Su; Mingzhao Yang; Bin Li; Xiangyang Xue

マルチドメイン共同学習のためのフェデレーテッド・アダプティブ・プロンプト・チューニング

フェデレーテッドラーニング (FL) を使用すると、複数のクライアントがデータを開示することなく、グローバルモデルを共同でトレーニングできます。以前の研究では、完全なモデルパラメーターのトレーニングが必要になることがよくありました。しかし、強力な事前トレーニング済みモデルの出現により、FL で学習可能なパラメーターを減らしても、より高いパフォーマンスを達成できるようになりました。この論文では、CLIP のような強力な基盤モデルを使用したマルチドメイン協調画像分類のためのフェデレーテッド適応プロンプトチューニングアルゴリズム FedAPT を提案します。直接的なフェデレーテッドプロンプトチューニングと比較して、私たちの中心的なアイデアは、各テストサンプルの特定のドメイン知識を適応的に解き放ち、パーソナライズされたプロンプトを提供することです。このアイデアを実装するために、メタプロンプト、適応ネットワーク、およびいくつかのキーで構成される適応プロンプトチューニングモジュールを設計します。サーバーはキーのセットをランダムに生成し、一意のキーを各クライアントに割り当てます。次に、すべてのクライアントが、ローカルデータセットと凍結されたキーを使用して、グローバルアダプティブネットワークとメタプロンプトを協力してトレーニングします。最終的に、グローバル集約モデルは、各テストサンプルのドメイン機能に基づいて、パーソナライズされたプロンプトを CLIP に割り当てることができます。私たちは、教師ありと教師なしの 2 つの異なる設定にわたって、2 つのマルチドメイン画像分類データセットに対して広範な実験を実行しました。結果は、FedAPT が完全にトレーニングされたモデルのパラメーター数の 10% 未満でより優れたパフォーマンスを達成できること、およびグローバルモデルがさまざまなクライアントドメインで同時に良好なパフォーマンスを発揮できることを示しています。

Federated learning (FL) enables multiple clients to collaboratively train a global model without disclosing their data. Previous researches often require training the complete model parameters. However, the emergence of powerful pre-trained models makes it possible to achieve higher performance with fewer learnable parameters in FL. In this paper, we propose a federated adaptive prompt tuning algorithm, FedAPT, for multi-domain collaborative image classification with powerful foundation models, like CLIP. Compared with direct federated prompt tuning, our core idea is to adaptively unlock specific domain knowledge for each test sample in order to provide them with personalized prompts. To implement this idea, we design an adaptive prompt tuning module, which consists of a meta prompt, an adaptive network, and some keys. The server randomly generates a set of keys and assigns a unique key to each client. Then all clients cooperatively train the global adaptive network and meta prompt with the local datasets and the frozen keys. Ultimately, the global aggregation model can assign a personalized prompt to CLIP based on the domain features of each test sample. We perform extensive experiments on two multi-domain image classification datasets across two different settings - supervised and unsupervised. The results show that FedAPT can achieve better performance with less than 10% of the number of parameters of the fully trained model, and the global model can perform well in diverse client domains simultaneously.

updated: Thu Aug 31 2023 05:11:10 GMT+0000 (UTC)

published: Tue Nov 15 2022 03:10:05 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト