Unsupervised Adversarial Attacks on Deep Feature-based Retrieval with GAN

Guoping Zhao; Mingyu Zhang; Jiajun Liu; Ji-Rong Wen

GANを用いた深層特徴量に基づく検索に対する教師なし敵対的攻撃

ディープニューラルネットワーク(DNN)ベースの画像分類モデルは、悪意を持って構築された敵対的な例に対して脆弱であることが研究で明らかになっている。しかし、DNNベースの画像検索モデルがこのような攻撃にどのような影響を受けるかについては、これまでほとんど研究されていなかった。本論文では、深層特徴量に基づく画像検索システムを攻撃するために、Unsupervised Adversarial Attacks with Generative Adversarial Networks (UAA-GAN)を導入する。UAA-GANは教師なし学習モデルであり、学習に必要なデータは少量のラベル付けされていないデータのみである。一度学習されると、クエリ画像に対してクエリ固有の摂動を生成し、敵対的なクエリを形成する。中心となる考え方は、付加された摂動は人間にはほとんど感知できないが、深い特徴空間の中でクエリを元の位置から遠ざけるのに効果的であることを保証することである。UAA-GANは、画像検索、人物再識別、顔検索など、深層特徴量に基づく様々なアプリケーションシナリオで動作する。経験的には、クエリ画像の視覚的な変化を伴わずに検索性能を低下させることが示された。UAA-GANで生成された敵対的な例は、視覚的に重要でない領域(例えば、背景や空)ではなく、人物の主要な体の部分、支配的な構造パターンやテクスチャ、エッジなど、画像のテクスチャや特徴的な領域に微妙な擾乱を取り入れる傾向があるため、あまり区別がつかない。このような傾向は、モデルが画像検索システムと人間の目の両方をもてあそぶ方法を実際に学んだことを示している。

Studies show that Deep Neural Network (DNN)-based image classification models are vulnerable to maliciously constructed adversarial examples. However, little effort has been made to investigate how DNN-based image retrieval models are affected by such attacks. In this paper, we introduce Unsupervised Adversarial Attacks with Generative Adversarial Networks (UAA-GAN) to attack deep feature-based image retrieval systems. UAA-GAN is an unsupervised learning model that requires only a small amount of unlabeled data for training. Once trained, it produces query-specific perturbations for query images to form adversarial queries. The core idea is to ensure that the attached perturbation is barely perceptible to human yet effective in pushing the query away from its original position in the deep feature space. UAA-GAN works with various application scenarios that are based on deep features, including image retrieval, person Re-ID and face search. Empirical results show that UAA-GAN cripples retrieval performance without significant visual changes in the query images. UAA-GAN generated adversarial examples are less distinguishable because they tend to incorporate subtle perturbations in textured or salient areas of the images, such as key body parts of human, dominant structural patterns/textures or edges, rather than in visually insignificant areas (e.g., background and sky). Such tendency indicates that the model indeed learned how to toy with both image retrieval systems and human eyes.

updated: Fri Jul 12 2019 15:23:36 GMT+0000 (UTC)

published: Fri Jul 12 2019 15:23:36 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト