Why Accuracy Is Not Enough: The Need for Consistency in Object Detection

Caleb Tung; Abhinav Goel; Fischer Bordwell; Nick Eliopoulos; Xiao Hu; George K. Thiruvathukal; Yung-Hsiang Lu

精度が十分でない理由：オブジェクト検出における一貫性の必要性

オブジェクト検出器は、多くの最新のコンピュータビジョンアプリケーションに不可欠です。ただし、最先端のオブジェクト検出器でさえ完璧ではありません。人間の目に似ている2つの画像では、カメラセンサーのノイズや照明の変化などの小さな画像の歪みのために、同じ検出器が異なる予測を行う可能性があります。この問題は不整合と呼ばれます。既存の精度メトリックは不整合を適切に説明しておらず、この領域での同様の作業は、人工的な画像の歪みの改善のみを対象としています。したがって、非人工ビデオフレームを使用して、フレーム全体でオブジェクト検出の一貫性を経時的に測定する方法を提案します。この方法を使用して、最新のオブジェクト検出器の一貫性が、Multiple Object Tracking Challengeのさまざまなビデオデータセットで83.2％から97.1％の範囲であることを示します。 .WEBP画像圧縮やアンシャープマスキングなどの画像歪み補正を適用すると、精度を損なうことなく、一貫性を最大5.1％向上させることができることを示して結論を下します。

Object detectors are vital to many modern computer vision applications. However, even state-of-the-art object detectors are not perfect. On two images that look similar to human eyes, the same detector can make different predictions because of small image distortions like camera sensor noise and lighting changes. This problem is called inconsistency. Existing accuracy metrics do not properly account for inconsistency, and similar work in this area only targets improvements on artificial image distortions. Therefore, we propose a method to use non-artificial video frames to measure object detection consistency over time, across frames. Using this method, we show that the consistency of modern object detectors ranges from 83.2% to 97.1% on different video datasets from the Multiple Object Tracking Challenge. We conclude by showing that applying image distortion corrections like .WEBP Image Compression and Unsharp Masking can improve consistency by as much as 5.1%, with no loss in accuracy.

updated: Thu Jul 28 2022 05:51:18 GMT+0000 (UTC)

published: Thu Jul 28 2022 05:51:18 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト