Mask or Non-Mask? Robust Face Mask Detector via Triplet-Consistency Representation Learning

Chun-Wei Yang; Thanh-Hai Phung; Hong-Han Shuai; Wen-Huang Cheng

In the absence of vaccines or medicines to stop COVID-19, wearing a face mask is one of the effective ways to slow the spread of the coronavirus and reduce the overloading of healthcare systems. Mandating face masks or coverings in public areas, however, requires additional human resources, and the monitoring work is tedious and attention-intensive. One promising way to automate the monitoring process is to leverage existing object detection models to detect faces with and without masks. Security officers then no longer have to stare at monitoring devices or crowds, and only have to deal with the alerts triggered by the detection of faces without masks. Existing object detection models usually focus on designing CNN-based network architectures for extracting discriminative features. However, the training datasets for face mask detection are small, and the difference between faces with and without masks is subtle. In this paper, we therefore propose a face mask detection framework that uses a context attention module to give a feed-forward convolutional neural network effective attention through adaptive feature refinement with attention maps. We further propose an anchor-free detector with Triplet-Consistency Representation Learning, which integrates a consistency loss and a triplet loss to deal with the small-scale training data and the similarity between masks and occlusions. Extensive experimental results show that our method outperforms other state-of-the-art methods. The source code is released for public download, to help improve public health, at https://github.com/wei-1006/MaskFaceDetection.
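The abstract does not give the form of the two losses, so the following is only a minimal sketch of how a triplet loss and a consistency loss are commonly combined, not the authors' formulation. It assumes the textbook triplet margin loss on feature vectors and a mean-squared consistency term between predictions for two views of the same image. The function names, the `margin` and `weight` parameters, and the additive weighting are all hypothetical choices made for illustration.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Textbook triplet margin loss: max(0, d(a, p) - d(a, n) + margin).

    It is zero once the negative is at least `margin` farther from the
    anchor than the positive is.
    """
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)


def consistency_loss(pred_a, pred_b):
    """Mean squared difference between predictions on two views of one image.

    Assumption: the paper's consistency term may be defined differently.
    """
    return sum((x - y) ** 2 for x, y in zip(pred_a, pred_b)) / len(pred_a)


def combined_loss(anchor, positive, negative, pred_a, pred_b, margin=1.0, weight=1.0):
    """Hypothetical weighted sum of the two terms (illustrative only)."""
    return (triplet_loss(anchor, positive, negative, margin)
            + weight * consistency_loss(pred_a, pred_b))
```

In this sketch the triplet term would pull features of the same class (for example, two masked faces) together and push a different class (for example, a face occluded by a hand) away, while the consistency term penalizes disagreement between predictions on two augmented views, which is one common way to get more signal from a small training set.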

updated: Fri Oct 01 2021 16:44:06 GMT+0000 (UTC)

published: Fri Oct 01 2021 16:44:06 GMT+0000 (UTC)

arXiv
