Self-Supervised Pretraining for Differentially Private Learning

Arash Asadian; Evan Weidner; Lei Jiang

差分プライベート学習のための自己管理型事前トレーニング

自己教師あり事前トレーニング (SSP) が、画像分類で利用可能な公開データセットのサイズに関係なく、差分プライバシー (DP) を使用したディープラーニングに対するスケーラブルなソリューションであることを示します。パブリックデータセットの不足に直面した場合、SSP によって 1 つの画像のみで生成された特徴により、プライベート分類器は、同じプライバシーバジェットの下で、学習されていない手作りの特徴よりもはるかに優れたユーティリティを取得できることを示します。中規模または大規模なサイズのパブリックデータセットが利用可能な場合、SSP によって生成された機能は、同じプライベートバジェットでさまざまな複雑なプライベートデータセットのラベルを使用してトレーニングされた機能よりもはるかに優れています。また、複数の DP 対応トレーニングフレームワークを比較して、SSP によって生成された機能でプライベート分類器をトレーニングしました。最後に、ϵ=3 の場合、非自明な ImageNet-1K データセットの 25.3% という重要なユーティリティを報告します。ソースコードは https://github.com/UnchartedRLab/SSP にあります。

We demonstrate self-supervised pretraining (SSP) is a scalable solution to deep learning with differential privacy (DP) regardless of the size of available public datasets in image classification. When facing the lack of public datasets, we show the features generated by SSP on only one single image enable a private classifier to obtain much better utility than the non-learned handcrafted features under the same privacy budget. When a moderate or large size public dataset is available, the features produced by SSP greatly outperform the features trained with labels on various complex private datasets under the same private budget. We also compared multiple DP-enabled training frameworks to train a private classifier on the features generated by SSP. Finally, we report a non-trivial utility 25.3% of a private ImageNet-1K dataset when ϵ=3. Our source code can be found at https://github.com/UnchartedRLab/SSP.

updated: Fri Aug 05 2022 16:21:04 GMT+0000 (UTC)

published: Tue Jun 14 2022 19:30:23 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト