BinaryCoP: Binary Neural Network-based COVID-19 Face-Mask Wear and Positioning Predictor on Edge Devices

Nael Fasfous; Manoj-Rohit Vemparala; Alexander Frickenstein; Lukas Frickenstein; Walter Stechele

Face masks have long been used in many areas of everyday life to protect against the inhalation of hazardous fumes and particles. They also offer an effective solution in healthcare for bi-directional protection against airborne diseases. Wearing and positioning the mask correctly is essential for its function. Convolutional neural networks (CNNs) offer an excellent solution for face recognition and for classifying correct mask wearing and positioning. In the context of the ongoing COVID-19 pandemic, such algorithms can be used at entrances to corporate buildings, airports, shopping areas, and other indoor locations to mitigate the spread of the virus. These application scenarios impose major challenges on the underlying compute platform. The inference hardware must be cheap, small, and energy efficient, while providing sufficient memory and compute power to execute accurate CNNs at a reasonably low latency. To maintain the data privacy of the public, all processing must remain on the edge device, without any communication with cloud servers. To address these challenges, we present a low-power binary neural network classifier for correct face-mask wear and positioning. The classification task is implemented on an embedded FPGA, performing high-throughput binary operations. Classification can run at up to roughly 6400 frames per second, which readily supports multi-camera and speed-gate setups as well as statistics collection in crowd settings. When deployed at a single entrance or gate, the idle power consumption is reduced to 1.6 W, improving the battery life of the device. We achieve an accuracy of up to 98% for four wearing positions of the MaskedFace-Net dataset. To maintain equivalent classification accuracy for all face structures, skin tones, hair types, and mask types, the algorithms are tested for their ability to generalize the relevant features over all subjects using the Grad-CAM approach.
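The abstract does not spell out how the "high-throughput binary operations" are computed. As a rough illustration only, binary neural networks commonly constrain weights and activations to {+1, -1}, so that a dot product reduces to a bitwise XNOR followed by a population count. The sketch below shows that general idea in plain Python; the function names (`binarize`, `pack_bits`, `xnor_popcount_dot`) are hypothetical and are not taken from the paper or its FPGA implementation.

```python
def binarize(values):
    """Map real values to {+1, -1} by sign (zero is mapped to +1 by assumption)."""
    return [1 if v >= 0 else -1 for v in values]


def pack_bits(signs):
    """Pack a {+1, -1} vector into an integer: bit i is 1 where signs[i] == +1."""
    word = 0
    for i, s in enumerate(signs):
        if s == 1:
            word |= 1 << i
    return word


def xnor_popcount_dot(a_bits, b_bits, n):
    """Dot product of two packed {+1, -1} vectors of length n.

    XNOR marks the positions where the signs agree. Each agreement
    contributes +1 and each disagreement -1, so the result is
    2 * matches - n.
    """
    mask = (1 << n) - 1                 # keep only the n valid bit positions
    agree = ~(a_bits ^ b_bits) & mask   # XNOR restricted to n bits
    matches = bin(agree).count("1")     # population count
    return 2 * matches - n


# Illustrative values only, not data from the paper.
activations = binarize([0.5, -1.2, 3.0, -0.1])
weights = binarize([1.0, 2.0, -0.5, -4.0])
result = xnor_popcount_dot(pack_bits(activations), pack_bits(weights), 4)
reference = sum(a * w for a, w in zip(activations, weights))
```

Here `result` and `reference` agree, which is the property that lets hardware replace multiply-accumulate units with much cheaper XNOR gates and popcount logic.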

updated: Sat Feb 06 2021 00:14:06 GMT+0000 (UTC)

published: Sat Feb 06 2021 00:14:06 GMT+0000 (UTC)

arXiv
