G-CAME: Gaussian-Class Activation Mapping Explainer for Object Detectors

Quoc Khanh Nguyen; Truong Thanh Hung Nguyen; Vo Thanh Khang Nguyen; Van Binh Truong; Quoc Hung Cao

G-CAME: オブジェクト検出器のためのガウスクラスの活性化マッピングの説明

現在、画像内の物体検出のためのディープニューラルネットワークが非常に普及しています。ただし、これらのネットワークは複雑であるため、ユーザーはこれらのオブジェクトがモデルによって検出される理由を理解するのが難しいと感じています。我々は、物体検出モデルの説明として顕著性マップを生成する Gaussian Class Activation Mapping Explainer (G-CAME) を提案しました。 G-CAME は、選択したレイヤーのアクティベーションマップをガウスカーネルと組み合わせて使用し、予測ボックスの画像内の重要な領域を強調表示する CAM ベースの方法と考えることができます。他の領域ベースの方法と比較して、G-CAME はオブジェクトを説明するのに非常に短い時間がかかるため、時間の制約を超えることができます。また、MS-COCO 2017 データセットに対して YOLOX を使用してメソッドを定性的および定量的に評価し、G-CAME を 2 段階の Faster-RCNN モデルに適用するように導きました。

Nowadays, deep neural networks for object detection in images are very prevalent. However, due to the complexity of these networks, users find it hard to understand why these objects are detected by models. We proposed Gaussian Class Activation Mapping Explainer (G-CAME), which generates a saliency map as the explanation for object detection models. G-CAME can be considered a CAM-based method that uses the activation maps of selected layers combined with the Gaussian kernel to highlight the important regions in the image for the predicted box. Compared with other Region-based methods, G-CAME can transcend time constraints as it takes a very short time to explain an object. We also evaluated our method qualitatively and quantitatively with YOLOX on the MS-COCO 2017 dataset and guided to apply G-CAME into the two-stage Faster-RCNN model.

updated: Tue Jun 06 2023 04:30:18 GMT+0000 (UTC)

published: Tue Jun 06 2023 04:30:18 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト