Brain Programming is Immune to Adversarial Attacks: Towards Accurate and Robust Image Classification using Symbolic Learning

Gerardo Ibarra-Vazquez; Gustavo Olague; Mariana Chan-Ley; Cesar Puente; Carlos Soubervielle-Montalvo

脳プログラミングは敵対的攻撃に耐性がある：記号学習を使用した正確でロバストな画像分類に向けて

近年、人間の視覚にはほとんど見えない入力画像への小さな変更という形での敵対的攻撃（AA）に対するディープ畳み込みニューラルネットワーク（DCNN）の脆弱性に関するセキュリティ上の懸念により、予測は信頼できません。したがって、新しい分類器を開発するときは、正確なスコアに加えて、敵対的な例にロバスト性を提供する必要があります。この作品では、アートメディアの分類の複雑な問題に対するAAの影響の比較研究を行います。これには、アートワークの優れたコレクションを分類するための機能の高度な分析が含まれます。コンピュータービジョン、4つの最先端のDCNNモデル（AlexNet、VGG、ResNet、ResNet101）、および脳プログラミング（BP）アルゴリズムからの視覚的な単語アプローチの一般的なバッグをテストしました。この研究では、精度を使用してアルゴリズムのパフォーマンスを分析します。さらに、敵対的な例とクリーンな画像の精度比を使用して、堅牢性を測定します。さらに、結果を裏付けるために、各分類器の予測の信頼度の統計分析を提案します。高速勾配符号法で計算された敵対的な例を使用して、BP予測の変化が2％未満であることを確認します。また、複数ピクセル攻撃を考慮すると、BPは7つのクラスのうち4つを変更なしで取得し、残りは予測で最大4％の誤差で取得しました。最後に、BPは、変更なしで敵対パッチを使用し、残りの3つのクラスについて1％の変動で4つのカテゴリも取得します。さらに、統計分析は、BPの予測の信頼度が、すべての実験でクリーンな画像と摂動された画像の各ペアで有意差がないことを示しました。これらの結果は、アートメディア分類でのパフォーマンスが提案された摂動によって損なわれたDCNNおよび手作りの特徴法と比較した敵対的な例に対するBPの堅牢性を証明しています。

In recent years, the security concerns about the vulnerability of Deep Convolutional Neural Networks (DCNN) to Adversarial Attacks (AA) in the form of small modifications to the input image almost invisible to human vision make their predictions untrustworthy. Therefore, it is necessary to provide robustness to adversarial examples in addition to an accurate score when developing a new classifier. In this work, we perform a comparative study of the effects of AA on the complex problem of art media categorization, which involves a sophisticated analysis of features to classify a fine collection of artworks. We tested a prevailing bag of visual words approach from computer vision, four state-of-the-art DCNN models (AlexNet, VGG, ResNet, ResNet101), and the Brain Programming (BP) algorithm. In this study, we analyze the algorithms' performance using accuracy. Besides, we use the accuracy ratio between adversarial examples and clean images to measure robustness. Moreover, we propose a statistical analysis of each classifier's predictions' confidence to corroborate the results. We confirm that BP predictions' change was below 2% using adversarial examples computed with the fast gradient sign method. Also, considering the multiple pixel attack, BP obtained four out of seven classes without changes and the rest with a maximum error of 4% in the predictions. Finally, BP also gets four categories using adversarial patches without changes and for the remaining three classes with a variation of 1%. Additionally, the statistical analysis showed that the predictions' confidence of BP were not significantly different for each pair of clean and perturbed images in every experiment. These results prove BP's robustness against adversarial examples compared to DCNN and handcrafted features methods, whose performance on the art media classification was compromised with the proposed perturbations.

updated: Mon Mar 01 2021 23:49:26 GMT+0000 (UTC)

published: Mon Mar 01 2021 23:49:26 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト