privGAN: Protecting GANs from membership inference attacks at low cost

Sumit Mukherjee; Yixi Xu; Anusua Trivedi; Juan Lavista Ferres

privGAN：GANをメンバーシップ推論攻撃から低コストで保護

Generative Adversarial Networks（GAN）により、元のデータセットを解放せずにデータを共有するための実行可能なアプローチが合成画像の解放になりました。このような合成データは、分類器のトレーニングなど、元のデータセットの共有を必要とするさまざまなダウンストリームタスクに使用できることが示されています。ただし、最近の研究では、GANモデルとその総合的に生成されたデータを使用して、データセット全体といくつかの補助情報にアクセスできる敵対者がトレーニングセットのメンバーシップを推測できることが示されています。この問題を軽減するための現在のアプローチ（DPGANなど）では、元の非プライベートGANよりも生成されるサンプルの品質が劇的に低下します。ここでは、新しいGANアーキテクチャ（privGAN）を開発します。ここで、ジェネレーターは、判別子をだますだけでなく、メンバーシップ推論攻撃を防御するようにもトレーニングされます。新しいメカニズムは、このモードの攻撃に対する保護を提供すると同時に、ダウンストリームパフォーマンスの損失を無視できる程度にします。さらに、私たちのアルゴリズムは、トレーニングセットへの過剰適合を明示的に防ぐことが示されています。これは、保護が非常に効果的である理由を説明しています。このペーパーの主な貢献は次のとおりです。i）追加のハイパーパラメータ調整やアーキテクチャの選択なしでプライバシーを保護する方法で合成データを生成できる新しいGANアーキテクチャを提案します。ii）privGAN損失関数の最適なソリューションの理論的理解を提供します。、iii）いくつかのベンチマークデータセットに対するホワイトボックスおよびブラックボックス攻撃に対するモデルの有効性を示します。iv）3つの一般的なベンチマークデータセットで、privGANによって生成された合成画像が非パフォーマンスと比較すると、ダウンストリームパフォーマンスの損失を無視できることを示します。 -プライベートGAN。

Generative Adversarial Networks (GANs) have made releasing of synthetic images a viable approach to share data without releasing the original dataset. It has been shown that such synthetic data can be used for a variety of downstream tasks such as training classifiers that would otherwise require the original dataset to be shared. However, recent work has shown that the GAN models and their synthetically generated data can be used to infer the training set membership by an adversary who has access to the entire dataset and some auxiliary information. Current approaches to mitigate this problem (such as DPGAN) lead to dramatically poorer generated sample quality than the original non--private GANs. Here we develop a new GAN architecture (privGAN), where the generator is trained not only to cheat the discriminator but also to defend membership inference attacks. The new mechanism provides protection against this mode of attack while leading to negligible loss in downstream performances. In addition, our algorithm has been shown to explicitly prevent overfitting to the training set, which explains why our protection is so effective. The main contributions of this paper are: i) we propose a novel GAN architecture that can generate synthetic data in a privacy preserving manner without additional hyperparameter tuning and architecture selection, ii) we provide a theoretical understanding of the optimal solution of the privGAN loss function, iii) we demonstrate the effectiveness of our model against several white and black--box attacks on several benchmark datasets, iv) we demonstrate on three common benchmark datasets that synthetic images generated by privGAN lead to negligible loss in downstream performance when compared against non--private GANs.

updated: Sun Dec 13 2020 18:27:26 GMT+0000 (UTC)

published: Tue Dec 31 2019 20:47:21 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト