Object Detection in the DCT Domain: is Luminance the Solution?

Benjamin Deguerre; Clement Chatelain; Gilles Gasso

DCTドメインでのオブジェクト検出：輝度はソリューションですか？

画像内のオブジェクト検出は、前例のないパフォーマンスに達しています。最先端の手法は、顕著な特徴を抽出し、対象のオブジェクトを囲む境界ボックスを予測する深いアーキテクチャに依存しています。これらのメソッドは基本的にRGB画像で実行されます。ただし、RGB画像は、保存と転送効率のために、取得デバイスによって圧縮されることがよくあります。したがって、それらの解凍はオブジェクト検出器に必要です。効率を上げるために、このペーパーでは、画像の圧縮表現を利用して、制約されたリソース条件で使用可能なオブジェクト検出を実行することを提案します。具体的には、JPEG画像に焦点を当て、JPEGノルムの特性に関して新しく設計された検出アーキテクチャの徹底的な分析を提案します。これにより、標準のRGBベースのアーキテクチャと比較して、1.7倍高速化されますが、検出パフォーマンスは5.5％しか低下しません。さらに、私たちの経験的調査結果は、圧縮されたJPEG情報の一部、つまり輝度成分のみが、完全な入力メソッドの検出精度と一致するために必要な場合があることを示しています。

Object detection in images has reached unprecedented performances. The state-of-the-art methods rely on deep architectures that extract salient features and predict bounding boxes enclosing the objects of interest. These methods essentially run on RGB images. However, the RGB images are often compressed by the acquisition devices for storage purpose and transfer efficiency. Hence, their decompression is required for object detectors. To gain in efficiency, this paper proposes to take advantage of the compressed representation of images to carry out object detection usable in constrained resources conditions. Specifically, we focus on JPEG images and propose a thorough analysis of detection architectures newly designed in regard of the peculiarities of the JPEG norm. This leads to a ×1.7 speed up in comparison with a standard RGB-based architecture, while only reducing the detection performance by 5.5%. Additionally, our empirical findings demonstrate that only part of the compressed JPEG information, namely the luminance component, may be required to match detection accuracy of the full input methods.

updated: Wed Jul 14 2021 08:09:24 GMT+0000 (UTC)

published: Wed Jun 10 2020 08:43:40 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト