Fishing for User Data in Large-Batch Federated Learning via Gradient Magnification

Yuxin Wen; Jonas Geiping; Liam Fowl; Micah Goldblum; Tom Goldstein

勾配倍率による大規模なバッチ連合学習におけるユーザーデータのフィッシング

連合学習（FL）は、プライバシーと効率性が約束されているため、急速に人気が高まっています。以前の作業では、勾配更新からユーザーデータを回復することにより、FLパイプラインのプライバシーの脆弱性が明らかになりました。ただし、既存の攻撃では、1）バッチサイズが非常に小さいおもちゃの設定が必要であるか、2）非現実的で目立つアーキテクチャの変更が必要であるため、現実的な設定に対処できません。既存の攻撃を劇的に高めて、アーキテクチャを変更せずに、任意の大きなサイズのバッチで動作する新しい戦略を紹介します。モデルにとらわれない戦略では、ユーザーに送信されるモデルパラメータを変更するだけで済みます。これは、多くのシナリオで現実的な脅威モデルです。クロスデバイスとクロスサイロの両方の連合学習で忠実度の高いデータ抽出を取得し、大規模な設定に挑戦する戦略を示します。

Federated learning (FL) has rapidly risen in popularity due to its promise of privacy and efficiency. Previous works have exposed privacy vulnerabilities in the FL pipeline by recovering user data from gradient updates. However, existing attacks fail to address realistic settings because they either 1) require toy settings with very small batch sizes, or 2) require unrealistic and conspicuous architecture modifications. We introduce a new strategy that dramatically elevates existing attacks to operate on batches of arbitrarily large size, and without architectural modifications. Our model-agnostic strategy only requires modifications to the model parameters sent to the user, which is a realistic threat model in many scenarios. We demonstrate the strategy in challenging large-scale settings, obtaining high-fidelity data extraction in both cross-device and cross-silo federated learning.

updated: Sun Jun 19 2022 23:21:42 GMT+0000 (UTC)

published: Tue Feb 01 2022 17:26:11 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト