The mathematics of adversarial attacks in AI -- Why deep learning is unstable despite the existence of stable neural networks

Alexander Bastounis; Anders C Hansen; Verner Vlačić

AIにおける敵対的攻撃の数学-安定したニューラルネットワークが存在するにもかかわらず深層学習が不安定である理由

ディープラーニング（DL）の前例のない成功により、分類の問題に関しては挑戦されません。ただし、現在のDL方法論が普遍的に不安定なニューラルネットワーク（NN）を生成することは十分に確立されています。不安定性の問題は、いわゆる敵対的攻撃に関する膨大な文献を伴う膨大な研究努力を引き起こしましたが、問題の解決策はありませんでした。私たちの論文は、次の数学的パラドックスを証明するため、問題の解決策がなかった理由を取り上げています。固定アーキテクチャの分類問題のためのニューラルネットワークのトレーニングに基づくトレーニング手順では、不正確または不安定なニューラルネットワークが生成されます（正確な場合） -同じ分類問題に対して正確で安定したニューラルネットワークが存在することが証明されているにもかかわらず。重要なのは、安定した正確なニューラルネットワークは入力に応じて可変次元でなければならないということです。特に、可変次元は安定性に必要な条件です。私たちの結果は、正確で安定したニューラルネットワークが存在するというパラドックスを示していますが、最新のアルゴリズムはそれらを計算しません。これは疑問を投げかけます：望ましい特性を持つニューラルネットワークの存在が証明できれば、それらを計算するアルゴリズムも見つけることができますか？数学では、証明可能な存在が計算可能性を意味する場合がありますが、これはニューラルネットワークの場合でしょうか？ニューラルネットワークが標準コスト関数を使用した標準最適化問題の近似最小値として存在する可能性があることを示すため、逆のことが当てはまります。ただし、ランダム化アルゴリズムでは、1/2よりも良い確率でニューラルネットワークを計算することはできません。

The unprecedented success of deep learning (DL) makes it unchallenged when it comes to classification problems. However, it is well established that the current DL methodology produces universally unstable neural networks (NNs). The instability problem has caused an enormous research effort -- with a vast literature on so-called adversarial attacks -- yet there has been no solution to the problem. Our paper addresses why there has been no solution to the problem, as we prove the following mathematical paradox: any training procedure based on training neural networks for classification problems with a fixed architecture will yield neural networks that are either inaccurate or unstable (if accurate) -- despite the provable existence of both accurate and stable neural networks for the same classification problems. The key is that the stable and accurate neural networks must have variable dimensions depending on the input, in particular, variable dimensions is a necessary condition for stability. Our result points towards the paradox that accurate and stable neural networks exist, however, modern algorithms do not compute them. This yields the question: if the existence of neural networks with desirable properties can be proven, can one also find algorithms that compute them? There are cases in mathematics where provable existence implies computability, but will this be the case for neural networks? The contrary is true, as we demonstrate how neural networks can provably exist as approximate minimisers to standard optimisation problems with standard cost functions, however, no randomised algorithm can compute them with probability better than 1/2.

updated: Mon Sep 13 2021 16:19:25 GMT+0000 (UTC)

published: Mon Sep 13 2021 16:19:25 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト