One Metric to Measure them All: Localisation Recall Precision (LRP) for Evaluating Visual Detection Tasks

Kemal Oksuz; Baris Can Cam; Sinan Kalkan; Emre Akbas

それらすべてを測定するための1つのメトリック：視覚的検出タスクを評価するためのローカリゼーション再現率（LRP）

視覚的検出タスクのパフォーマンス指標として広く使用されているにもかかわらず、平均精度（AP）は、ローカリゼーションの品質、（ii）解釈可能性、（iii）計算に関する設計選択の堅牢性、および信頼スコアなしの出力への適用性を反映する点で制限されています。。パノプティコンセグメンテーションを評価するために提案された尺度であるパノプティコン品質（PQ）（Kirillov et al。、2019）は、これらの制限を受けませんが、パノプティコンセグメンテーションに限定されます。この論文では、すべての視覚的検出タスクのパフォーマンス指標として、ローカリゼーション再現率（LRP）エラーを提案します。 LRPエラー。当初はOksuzらによってオブジェクト検出のためにのみ提案されました。（2018）、前述の制限に悩まされることはなく、すべての視覚的検出タスクに適用できます。また、視覚検出器を評価し、展開に最適なしきい値を取得するために、信頼スコアに対して取得された最小LRPエラーとしてOptimal LRP（oLRP）エラーを紹介します。 LRPとAPおよびPQの詳細な比較分析を提供し、7つの視覚的検出タスク（オブジェクト検出、キーポイント検出、インスタンスセグメンテーション、パノラマセグメンテーション、視覚的関係検出、ゼロ）からのほぼ100の最先端の視覚的検出器を使用します-10個のデータセット（つまり、異なるCOCOバリアント、LVIS、Open Images、Pascal、ILSVRC）を使用したショット検出および一般化されたゼロショット検出）。LRPが対応するものよりも豊富で識別力のある情報を提供することを経験的に示します。コードはhttps://github.com/kemaloksuz/LRPで入手できます-エラー

Despite being widely used as a performance measure for visual detection tasks, Average Precision (AP) is limited in reflecting localisation quality, (ii) interpretability and (iii) robustness to the design choices regarding its computation, and its applicability to outputs without confidence scores. Panoptic Quality (PQ), a measure proposed for evaluating panoptic segmentation (Kirillov et al., 2019), does not suffer from these limitations but is limited to panoptic segmentation. In this paper, we propose Localisation Recall Precision (LRP) Error as the performance measure for all visual detection tasks. LRP Error, initially proposed only for object detection by Oksuz et al. (2018), does not suffer from the aforementioned limitations and is applicable to all visual detection tasks. We also introduce Optimal LRP (oLRP) Error as the minimum LRP error obtained over confidence scores to evaluate visual detectors and obtain optimal thresholds for deployment. We provide a detailed comparative analysis of LRP with AP and PQ, and use nearly 100 state-of-the-art visual detectors from seven visual detection tasks (i.e. object detection, keypoint detection, instance segmentation, panoptic segmentation, visual relationship detection, zero-shot detection and generalised zero-shot detection) using ten datasets (i.e. different COCO variants, LVIS, Open Images, Pascal, ILSVRC) to empirically show that LRP provides richer and more discriminative information than its counterparts. Code available at: https://github.com/kemaloksuz/LRP-Error

updated: Tue Jul 13 2021 06:24:35 GMT+0000 (UTC)

published: Sat Nov 21 2020 11:20:42 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト