SLAP: Improving Physical Adversarial Examples with Short-Lived Adversarial Perturbations

Giulio Lovisotto; Henry Turner; Ivo Sluganovic; Martin Strohmeier; Ivan Martinovic

Research into adversarial examples (AE) has developed rapidly, yet static adversarial patches are still the main technique for conducting attacks in the real world, despite being obvious, semi-permanent and unmodifiable once deployed. In this paper, we propose Short-Lived Adversarial Perturbations (SLAP), a novel technique that allows adversaries to realize physically robust real-world AE by using a light projector. Attackers can project a specifically crafted adversarial perturbation onto a real-world object, transforming it into an AE. This gives the adversary greater control over the attack than adversarial patches do: (i) projections can be dynamically turned on and off or modified at will, and (ii) projections do not suffer from the locality constraint imposed by patches, making them harder to detect. We study the feasibility of SLAP in the self-driving scenario, targeting both object detection and traffic sign recognition tasks, with a focus on the detection of stop signs. We conduct experiments in a variety of ambient light conditions, including outdoors, showing that in non-bright settings the proposed method generates AE that are extremely robust, causing misclassifications on state-of-the-art networks with up to 99% success rate across a variety of angles and distances. We also demonstrate that SLAP-generated AE do not present the detectable behaviours seen in adversarial patches and therefore bypass SentiNet, a physical AE detection method. We evaluate other defences, including an adaptive defender using adversarial learning, which is able to reduce the attack's effectiveness by up to 80% even in favourable attacker conditions.

updated: Wed Jan 06 2021 16:17:39 GMT+0000 (UTC)

published: Wed Jul 08 2020 14:11:21 GMT+0000 (UTC)

arXiv
