Model Extraction Attacks on Split Federated Learning

Jingtao Li; Adnan Siraj Rakin; Xing Chen; Li Yang; Zhezhi He; Deliang Fan; Chaitali Chakrabarti

分割連合学習に対するモデル抽出攻撃

Federated Learning (FL) は、複数のクライアントとサーバーを含む一般的な共同学習スキームです。 FL はクライアントのデータの保護に重点を置いていますが、知的財産 (IP) の脅威に対して非常に脆弱であることが判明しています。 FL は定期的にモデルパラメータを収集して配布するため、フリーライダーが最新のモデルをダウンロードして、モデルの IP を盗むことができます。リソースに制約のあるクライアントでのトレーニングをサポートする FL の最近のバリアントである Split Federated Learning (SFL) は、モデルを 2 つに分割し、モデルの一部をクライアント (クライアント側モデル) に提供し、残りの部分をサーバーに提供します (サーバー側モデル)。したがって、SFL は設計によりモデルの漏洩を防ぎます。さらに、予測クエリをブロックすることで、従来のモデル抽出 (ME) 攻撃などの高度な IP 脅威に対する耐性を持たせることができます。 SFL は IP 保護を提供するという点では FL よりも優れていますが、依然として脆弱です。このホワイトペーパーでは、SFL の脆弱性を明らかにし、悪意のあるクライアントがサーバー側から勾配情報をクエリすることで ME 攻撃を開始する方法を示します。勾配の使用とデータの仮定が異なる ME 攻撃の 5 つのバリアントを提案します。実際のケースでは、提案された ME 攻撃が SFL に対して非常にうまく機能することを示します。たとえば、サーバー側モデルに 5 つのレイヤーがある場合、提案された ME 攻撃は、CIFAR-10 の VGG-11 で 2% 未満の精度低下で 90% 以上の精度を達成できます。

Federated Learning (FL) is a popular collaborative learning scheme involving multiple clients and a server. FL focuses on protecting clients' data but turns out to be highly vulnerable to Intellectual Property (IP) threats. Since FL periodically collects and distributes the model parameters, a free-rider can download the latest model and thus steal model IP. Split Federated Learning (SFL), a recent variant of FL that supports training with resource-constrained clients, splits the model into two, giving one part of the model to clients (client-side model), and the remaining part to the server (server-side model). Thus SFL prevents model leakage by design. Moreover, by blocking prediction queries, it can be made resistant to advanced IP threats such as traditional Model Extraction (ME) attacks. While SFL is better than FL in terms of providing IP protection, it is still vulnerable. In this paper, we expose the vulnerability of SFL and show how malicious clients can launch ME attacks by querying the gradient information from the server side. We propose five variants of ME attack which differs in the gradient usage as well as in the data assumptions. We show that under practical cases, the proposed ME attacks work exceptionally well for SFL. For instance, when the server-side model has five layers, our proposed ME attack can achieve over 90% accuracy with less than 2% accuracy degradation with VGG-11 on CIFAR-10.

updated: Mon Mar 13 2023 20:21:51 GMT+0000 (UTC)

published: Mon Mar 13 2023 20:21:51 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト