Towards On-Device Face Recognition in Body-worn Cameras

Ali Almadan; Ajita Rattani

身に着けているカメラのデバイス上の顔認識に向けて

IDの認識に関連する顔認識テクノロジーは、情報収集、法執行、監視、および消費者向けアプリケーションで広く採用されています。最近、この技術はスマートフォンや身に着けているカメラ（BWC）に移植されました。身に着けているカメラの顔認識技術は、監視、状況認識、および警官の安全を維持するために使用されます。身に着けているカメラを使用した顔認識には、ほんの一握りの学術研究しかありません。最近の研究では、身に着けているカメラを使用して取得したBWCFace顔画像データセットを収集し、顔識別のためにResNet-50モデルを評価しました。ただし、リソースの制約で身に着けているカメラや顔画像に関連するプライバシーの懸念をリアルタイムで推測するには、デバイス上の顔認識が必要です。この目的のために、この研究では、身に着けているカメラを使用して顔を識別するために、軽量のMobileNet-V2、EfficientNet-B0、LightCNN-9、およびLightCNN-29モデルを評価します。実験は、公開されているBWCfaceデータセットで実行されます。リアルタイム推論は、3つのモバイルデバイスで評価されます。比較分析は、パフォーマンスとモデルサイズの間のトレードオフを評価するために、6つの手作りの機能とともに重量のあるVGG-16モデルとResNet-50モデルを使用して行われます。実験結果は、軽量LightCNN-29の最大ランク1精度と最高性能のResNet-50との差は1.85％であり、モデルパラメータの削減は23.49Mであることを示唆しています。ディープモデルのほとんどは、ランク5とランク10で同様のパフォーマンスを獲得しました。 LightCNNの推論時間は、モバイルデバイス上の他のモデルより2.1倍高速です。 LightCNN-29とランク1のローカル位相量子化（LPQ）記述子の間で、14％という最小のパフォーマンスの違いが見られます。ほとんどの実験設定では、軽量のLightCNNモデルは、ほとんどのモデルと比較して、精度とモデルサイズの間で最良のトレードオフを提供しました。

Face recognition technology related to recognizing identities is widely adopted in intelligence gathering, law enforcement, surveillance, and consumer applications. Recently, this technology has been ported to smartphones and body-worn cameras (BWC). Face recognition technology in body-worn cameras is used for surveillance, situational awareness, and keeping the officer safe. Only a handful of academic studies exist in face recognition using the body-worn camera. A recent study has assembled BWCFace facial image dataset acquired using a body-worn camera and evaluated the ResNet-50 model for face identification. However, for real-time inference in resource constraint body-worn cameras and privacy concerns involving facial images, on-device face recognition is required. To this end, this study evaluates lightweight MobileNet-V2, EfficientNet-B0, LightCNN-9 and LightCNN-29 models for face identification using body-worn camera. Experiments are performed on a publicly available BWCface dataset. The real-time inference is evaluated on three mobile devices. The comparative analysis is done with heavy-weight VGG-16 and ResNet-50 models along with six hand-crafted features to evaluate the trade-off between the performance and model size. Experimental results suggest the difference in maximum rank-1 accuracy of lightweight LightCNN-29 over best-performing ResNet-50 is 1.85% and the reduction in model parameters is 23.49M. Most of the deep models obtained similar performances at rank-5 and rank-10. The inference time of LightCNNs is 2.1x faster than other models on mobile devices. The least performance difference of 14% is noted between LightCNN-29 and Local Phase Quantization (LPQ) descriptor at rank-1. In most of the experimental settings, lightweight LightCNN models offered the best trade-off between accuracy and the model size in comparison to most of the models.

updated: Wed Apr 07 2021 22:24:57 GMT+0000 (UTC)

published: Wed Apr 07 2021 22:24:57 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト