Privacy-Preserving CNN Training with Transfer Learning

John Chiang

転移学習を使用したプライバシー保護の CNN トレーニング

この論文では、単なる準同型暗号化 (HE) 技術に基づいてプライバシーを保護する CNN トレーニングを実装するための実用的なソリューションを紹介します。私たちの知る限り、これはこの問題を解決することに成功した最初の試みであり、これまでにこの目標を達成した研究はありません。このタスクを達成するには、いくつかの手法を組み合わせます。(1) 転移学習を使用すると、プライバシーを保護した CNN トレーニングを準同型ニューラルネットワークトレーニング、さらにはマルチクラスロジスティック回帰 (MLR) トレーニングに減らすことができます。 (2) Quadratic Gradient と呼ばれるより高速な勾配バリアントを介して、収束速度における最先端のパフォーマンスを備えた MLR の強化された勾配法がこの研究に適用され、高いパフォーマンスが実現されます。 (3) 数学における変換の考え方を利用して、暗号化ドメインの近似ソフトマックス関数をシグモイド関数の近似に変換します。この変更に合わせて、二乗尤度誤差と呼ばれる新しいタイプの損失関数が開発されました。 (4) Volley Revolver という名前のシンプルだが柔軟な行列エンコーディング手法を使用して暗号文のデータフローを管理します。これは準同型 CNN トレーニング全体を完了するための重要な要素です。私たちの作業を実装するための完全な実行可能な C++ コードは、https://github.com/petitioner/HE.CNNtraininghttps://github.com/petitioner/HE.CNNtraining にあります。転移学習用の事前トレーニング済みモデルとして REGNET\_X\_400MF を選択します。最初の 128 個の MNIST トレーニング画像をトレーニングデータとして使用し、MNIST テストデータセット全体をテストデータとして使用します。クライアントは 6 つの暗号文をクラウドにアップロードするだけでよく、64 個の vCPU を備えたクラウド上で 2 回の反復を実行するのに約 21 分かかり、精度は 21.49% になります。

In this paper, we present a practical solution to implement privacy-preserving CNN training based on mere Homomorphic Encryption (HE) technique. To our best knowledge, this is the first attempt successfully to crack this nut and no work ever before has achieved this goal. Several techniques combine to accomplish the task:: (1) with transfer learning, privacy-preserving CNN training can be reduced to homomorphic neural network training, or even multiclass logistic regression (MLR) training; (2) via a faster gradient variant called Quadratic Gradient, an enhanced gradient method for MLR with a state-of-the-art performance in convergence speed is applied in this work to achieve high performance; (3) we employ the thought of transformation in mathematics to transform approximating Softmax function in the encryption domain to the approximation of the Sigmoid function. A new type of loss function termed Squared Likelihood Error has been developed alongside to align with this change.; and (4) we use a simple but flexible matrix-encoding method named Volley Revolver to manage the data flow in the ciphertexts, which is the key factor to complete the whole homomorphic CNN training. The complete, runnable C++ code to implement our work can be found at: https://github.com/petitioner/HE.CNNtraininghttps://github.com/petitioner/HE.CNNtraining. We select REGNET\_X\_400MF as our pre-trained model for transfer learning. We use the first 128 MNIST training images as training data and the whole MNIST testing dataset as the testing data. The client only needs to upload 6 ciphertexts to the cloud and it takes ∼21 mins to perform 2 iterations on a cloud with 64 vCPUs, resulting in a precision of 21.49%.

updated: Tue May 09 2023 20:51:07 GMT+0000 (UTC)

published: Fri Apr 07 2023 18:21:30 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト