YOLO5Face: Why Reinventing a Face Detector

Delong Qi; Weijun Tan; Qi Yao; Jingfeng Liu

YOLO5Face：顔検出器を再発明する理由

近年、畳み込みニューラルネットワークを使用した顔検出で大きな進歩が見られました。多くの顔検出器は顔を検出するために指定された設計を使用しますが、顔検出は一般的なオブジェクト検出タスクとして扱います。 YOLOv5オブジェクト検出器に基づいて顔検出器を実装し、それをYOLO5Faceと呼びます。 YOLOv5にいくつかの重要な変更を加え、顔検出用に最適化します。これらの変更には、5ポイントのランドマーク回帰ヘッドの追加、バックボーンの入力でのステムブロックの使用、SPPでのより小さなサイズのカーネルの使用、およびPANブロックでのP6出力の追加が含まれます。最高のパフォーマンスを実現する特大モデルから、組み込みデバイスやモバイルデバイスでのリアルタイム検出用の超小型モデルまで、さまざまなモデルサイズの検出器を設計しています。 WiderFaceデータセットの実験結果は、VGA画像で、当社の顔検出器が、より複雑な指定の顔検出器を超えて、ほぼすべてのEasy、Medium、およびHardサブセットで最先端のパフォーマンスを達成できることを示しています。コードはhttps://github.com/deepcam-cn/yolov5-faceで入手できます。

Tremendous progress has been made on face detection in recent years using convolutional neural networks. While many face detectors use designs designated for detecting faces, we treat face detection as a generic object detection task. We implement a face detector based on the YOLOv5 object detector and call it YOLO5Face. We make a few key modifications to the YOLOv5 and optimize it for face detection. These modifications include adding a five-point landmark regression head, using a stem block at the input of the backbone, using smaller-size kernels in the SPP, and adding a P6 output in the PAN block. We design detectors of different model sizes, from an extra-large model to achieve the best performance to a super small model for real-time detection on an embedded or mobile device. Experiment results on the WiderFace dataset show that on VGA images, our face detectors can achieve state-of-the-art performance in almost all the Easy, Medium, and Hard subsets, exceeding the more complex designated face detectors. The code is available at https://github.com/deepcam-cn/yolov5-face

updated: Thu Dec 02 2021 04:40:13 GMT+0000 (UTC)

published: Thu May 27 2021 03:54:38 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト