A Dataset and Application for Facial Recognition of Individual Gorillas in Zoo Environments

Otto Brookes; Tilo Burghardt

動物園環境における個々のゴリラの顔認識のためのデータセットとアプリケーション

ブリストル動物園の7つのニシローランドゴリラの軍隊全体に5k以上の顔の境界ボックスの注釈が付いたビデオデータセットを提案しました。このデータセットのトレーニングでは、動物園環境で個々のゴリラを顔で認識するタスクに関する標準的な深層学習パイプラインを実装および評価します。基本的なYOLOv3を利用したアプリケーションは、単一フレームのみを使用する場合、92％mAPで識別を実行できることを示します。短いトラックレット間での検出による追跡の関連付けとID投票により、97％mAPの堅牢なパフォーマンスが向上します。動物園環境の研究機能を充実させるために簡単に利用できるように、コード、ビデオデータセット、重み、およびグラウンドトゥルースアノテーションをdata.bris.ac.ukで公開しています。

We put forward a video dataset with 5k+ facial bounding box annotations across a troop of 7 western lowland gorillas at Bristol Zoo Gardens. Training on this dataset, we implement and evaluate a standard deep learning pipeline on the task of facially recognising individual gorillas in a zoo environment. We show that a basic YOLOv3-powered application is able to perform identifications at 92% mAP when utilising single frames only. Tracking-by-detection-association and identity voting across short tracklets yields an improved robust performance of 97% mAP. To facilitate easy utilisation for enriching the research capabilities of zoo environments, we publish the code, video dataset, weights, and ground-truth annotations at data.bris.ac.uk.

updated: Tue Dec 08 2020 19:23:22 GMT+0000 (UTC)

published: Tue Dec 08 2020 19:23:22 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト