DetectorGuard: Provably Securing Object Detectors against Localized Patch Hiding Attacks

Chong Xiang; Prateek Mittal

DetectorGuard：ローカライズされたパッチ隠蔽攻撃からオブジェクト検出器を確実に保護します

最先端のオブジェクト検出器は、攻撃者が小さな敵対パッチを導入して、検出器が顕著なオブジェクトの検出を見逃すような、局所的なパッチ隠蔽攻撃に対して脆弱です。パッチ攻撃者は、敵対的なパッチを印刷して被害者のオブジェクトに添付することにより、物理的な世界の攻撃を実行できます。この論文では、局所的なパッチ隠蔽攻撃に対して確実に堅牢な検出器を構築するための最初の一般的なフレームワークであるDetectorGuardを提案します。まず、ロバストな画像分類研究の最近の進歩を利用して、ロバストなオブジェクト検出にロバストな画像分類器を適応させることができるかどうかを尋ねることを目指しています。残念ながら、タスクの違いにより、堅牢な画像分類器から単純に適合されたオブジェクト検出器は、1）敵対的な設定で必ずしも堅牢であるとは限らないか、2）クリーンな設定で適切なパフォーマンスを維持することさえあります。高性能のロバストなオブジェクト検出器を構築するために、オブジェクト性を説明する戦略を提案します。ロバストな画像分類器を適応させて、すべての画像位置のオブジェクト性を予測し、従来のオブジェクト検出器によって予測された境界ボックスを使用して各オブジェクト性を説明します。すべてのオブジェクト性が十分に説明されている場合は、従来のオブジェクト検出器によって行われた予測を出力します。それ以外の場合は、攻撃アラートを発行します。特に、1）敵対的な設定では、認証されたオブジェクトに対するDetectorGuardのエンドツーエンドの堅牢性を正式に証明します。つまり、脅威モデル内のパッチを隠す攻撃者に対して、オブジェクトを検出するか、アラートをトリガーします。 2）クリーンな設定では、最先端の物体検出器とほぼ同じ性能を発揮します。 PASCAL VOC、MS COCO、およびKITTIデータセットに対する評価は、DetectorGuardが、クリーンなパフォーマンスのごくわずかなコスト（<1％）で、ローカライズされたパッチ隠蔽攻撃に対して最初の証明可能な堅牢性を実現することをさらに示しています。

State-of-the-art object detectors are vulnerable to localized patch hiding attacks where an adversary introduces a small adversarial patch to make detectors miss the detection of salient objects. The patch attacker can carry out a physical-world attack by printing and attaching an adversarial patch to the victim object. In this paper, we propose DetectorGuard, the first general framework for building provably robust detectors against localized patch hiding attacks. To start with, we aim to take advantage of recent advancements of robust image classification research by asking: can we adapt robust image classifiers for robust object detection? Unfortunately, due to their task difference, an object detector naively adapted from a robust image classifier 1) may not necessarily be robust in the adversarial setting or 2) even maintain decent performance in the clean setting. To build a high-performance robust object detector, we propose an objectness explaining strategy: we adapt a robust image classifier to predict objectness for every image location and then explain each objectness using the bounding boxes predicted by a conventional object detector. If all objectness is well explained, we output the predictions made by the conventional object detector; otherwise, we issue an attack alert. Notably, 1) in the adversarial setting, we formally prove the end-to-end robustness of DetectorGuard on certified objects, i.e., it either detects the object or triggers an alert, against any patch hiding attacker within our threat model; 2) in the clean setting, we have almost the same performance as state-of-the-art object detectors. Our evaluation on the PASCAL VOC, MS COCO, and KITTI datasets further demonstrates that DetectorGuard achieves the first provable robustness against localized patch hiding attacks at a negligible cost (<1%) of clean performance.

updated: Sun May 09 2021 13:14:02 GMT+0000 (UTC)

published: Fri Feb 05 2021 02:02:21 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト