PocketNet: Extreme Lightweight Face Recognition Network using Neural Architecture Search and Multi-Step Knowledge Distillation

Fadi Boutros; Patrick Siebke; Marcel Klemt; Naser Damer; Florian Kirchbuchner; Arjan Kuijper

Deep neural networks have rapidly become the mainstream method for face recognition (FR). However, their extremely large number of parameters limits the deployment of such models on embedded and low-end devices. In this work, we present an extremely lightweight and accurate FR solution, namely PocketNet. We utilize neural architecture search (NAS) to develop a new family of lightweight face-specific architectures. We additionally propose a novel training paradigm based on knowledge distillation (KD), multi-step KD, in which knowledge is distilled from the teacher model to the student model at different stages of training maturity. We conduct a detailed ablation study demonstrating both the soundness of using NAS for the specific task of FR rather than general object classification and the benefits of our proposed multi-step KD. We present an extensive experimental evaluation and comparison with state-of-the-art (SOTA) compact FR models on nine different benchmarks, including large-scale evaluation benchmarks such as IJB-B, IJB-C, and MegaFace. PocketNets have consistently advanced the SOTA FR performance on these nine mainstream benchmarks when considering the same level of model compactness. With 0.92M parameters, our smallest network, PocketNetS-128, achieved results highly competitive with recent SOTA compact models that contain up to 4M parameters.
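The multi-step KD idea described in the abstract can be illustrated with a minimal sketch. The abstract only states that knowledge is distilled at different stages of training maturity; everything concrete below is an assumption made for illustration: that the teacher is saved as evenly spaced checkpoints, that the student's training is split into equally long phases, that the distillation term is a mean squared error between embeddings, and the hypothetical names `teacher_stage_for_epoch`, `mse_distillation_loss`, `total_loss`, and `kd_weight`. It is not the authors' implementation.

```python
from typing import Sequence


def teacher_stage_for_epoch(epoch: int, total_epochs: int, num_stages: int) -> int:
    """Map a student epoch to the index of the teacher checkpoint to distill from.

    Assumption (not from the abstract): the teacher was saved at `num_stages`
    points of its own training, and the student's training is split into
    equally long phases, one per teacher checkpoint.
    """
    if not 0 <= epoch < total_epochs:
        raise ValueError("epoch must be in [0, total_epochs)")
    return min(epoch * num_stages // total_epochs, num_stages - 1)


def mse_distillation_loss(student_emb: Sequence[float],
                          teacher_emb: Sequence[float]) -> float:
    """Mean squared error between a student and a teacher embedding.

    Assumption: the distillation signal is a feature-level MSE term.
    """
    if len(student_emb) != len(teacher_emb):
        raise ValueError("embeddings must have the same dimension")
    squared = ((s - t) ** 2 for s, t in zip(student_emb, teacher_emb))
    return sum(squared) / len(student_emb)


def total_loss(task_loss: float,
               student_emb: Sequence[float],
               teacher_emb: Sequence[float],
               kd_weight: float = 1.0) -> float:
    """Combine the recognition loss with the weighted distillation term."""
    return task_loss + kd_weight * mse_distillation_loss(student_emb, teacher_emb)


# Which teacher checkpoint the student sees in each of 8 epochs, with 4 stages.
schedule = [teacher_stage_for_epoch(e, total_epochs=8, num_stages=4) for e in range(8)]
```

In this sketch the student starts by matching an early, less mature teacher and moves to later checkpoints as its own training progresses, so the distillation target is never far ahead of what the student can currently represent.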

updated: Mon Dec 13 2021 15:16:22 GMT+0000 (UTC)

published: Tue Aug 24 2021 13:19:08 GMT+0000 (UTC)

arXiv

