Enhanced Regularizers for Attributional Robustness

Anindya Sarkar; Anirban Sarkar; Vineeth N Balasubramanian

Deep neural networks are the default choice of learning models for computer vision tasks. Extensive work has been carried out in recent years on explaining deep models for vision tasks such as classification. However, recent work has shown that these models can produce substantially different attribution maps for two very similar images, which raises serious questions about their trustworthiness. To address this issue, we propose a robust attribution training strategy to improve the attributional robustness of deep neural networks. Our method carefully analyzes the requirements for attributional robustness and introduces two new regularizers that preserve a model's attribution map during attacks. It surpasses state-of-the-art attributional robustness methods by a margin of approximately 3% to 9% in terms of attributional robustness measures on several datasets, including MNIST, FMNIST, Flower and GTSRB.

updated: Mon Dec 28 2020 18:18:39 GMT+0000 (UTC)

published: Mon Dec 28 2020 18:18:39 GMT+0000 (UTC)

arXiv
