Learning to Linearize Deep Neural Networks for Secure and Efficient Private Inference

Souvik Kundu; Shunlin Lu; Yuke Zhang; Jacqueline Liu; Peter A. Beerel

安全で効率的なプライベート推論のためのディープニューラルネットワークの線形化の学習

既存のディープニューラルネットワークの ReLU 非線形演算の数が多いため、レイテンシ効率の高いプライベート推論 (PI) には適していません。 ReLU 操作を削減するための既存の手法では、多くの場合、手作業が必要となり、精度が大幅に犠牲になります。このホワイトペーパーでは、最初に、非線形レイヤーの ReLU 感度の新しい測定法を提示し、同じものを識別するための時間のかかる手動作業を軽減できるようにします。この感度に基づいて、特定の ReLU バジェットに対して、レイヤーごとの ReLU カウントを自動的に割り当て、各レイヤーのアクティベーションマップの ReLU の場所を決定し、大幅に少ない ReLU でモデルをトレーニングする 3 段階のトレーニング方法である SENet を提示します。レイテンシーと通信効率の高い PI が得られる可能性があります。さまざまなデータセットで複数のモデルを使用した実験的評価では、既存の代替手段と比較して、ReLU の削減と分類精度の向上の両方の点で、SENet の優れたパフォーマンスが示されています。特に、SENet は、同様の精度を実現しながら、必要な ReLU を最大で 2 分の 1 まで削減できるモデルを生成できます。同様の ReLU 予算の場合、SENet は、CIFAR-100 で評価された分類精度が最大 2.32% 向上したモデルを生成できます。

The large number of ReLU non-linearity operations in existing deep neural networks makes them ill-suited for latency-efficient private inference (PI). Existing techniques to reduce ReLU operations often involve manual effort and sacrifice significant accuracy. In this paper, we first present a novel measure of non-linearity layers' ReLU sensitivity, enabling mitigation of the time-consuming manual efforts in identifying the same. Based on this sensitivity, we then present SENet, a three-stage training method that for a given ReLU budget, automatically assigns per-layer ReLU counts, decides the ReLU locations for each layer's activation map, and trains a model with significantly fewer ReLUs to potentially yield latency and communication efficient PI. Experimental evaluations with multiple models on various datasets show SENet's superior performance both in terms of reduced ReLUs and improved classification accuracy compared to existing alternatives. In particular, SENet can yield models that require up to ~2x fewer ReLUs while yielding similar accuracy. For a similar ReLU budget SENet can yield models with ~2.32% improved classification accuracy, evaluated on CIFAR-100.

updated: Mon Jan 23 2023 03:33:38 GMT+0000 (UTC)

published: Mon Jan 23 2023 03:33:38 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト