Can stable and accurate neural networks be computed? -- On the barriers of deep learning and Smale's 18th problem

Matthew J. Colbrook; Vegard Antun; Anders C. Hansen

Deep learning (DL) has had unprecedented success and is now entering scientific computing with full force. However, current DL methods typically suffer from instability, even when universal approximation properties guarantee the existence of stable neural networks (NNs). We address this paradox by demonstrating basic well-conditioned problems in scientific computing where one can prove the existence of NNs with great approximation qualities, yet no algorithm, even a randomised one, can train (or compute) such an NN. For any positive integers K > 2 and L, there are cases where simultaneously: (a) no randomised training algorithm can compute an NN correct to K digits with probability greater than 1/2; (b) there exists a deterministic training algorithm that computes an NN with K-1 correct digits, but any such (even randomised) algorithm needs arbitrarily many training data; (c) there exists a deterministic training algorithm that computes an NN with K-2 correct digits using no more than L training samples. These results imply a classification theory describing conditions under which (stable) NNs with a given accuracy can be computed by an algorithm. We begin this theory by establishing sufficient conditions for the existence of algorithms that compute stable NNs in inverse problems. We introduce Fast Iterative REstarted NETworks (FIRENETs), which we both prove and numerically verify are stable. Moreover, we prove that only O(|log(ϵ)|) layers are needed for an ϵ-accurate solution to the inverse problem.
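
To give a feel for the O(|log(ϵ)|) layer count, the following minimal sketch counts how many stages are needed when each stage shrinks the error by a fixed factor. This is only an illustration of logarithmic scaling, not the FIRENET construction: the function name `stages_needed`, the contraction factor `rho = 0.5`, and the initial error `err0 = 1.0` are assumptions made here and do not come from the abstract.

```python
def stages_needed(eps, rho=0.5, err0=1.0):
    """Count the stages required to push the error down to eps.

    Assumption (hypothetical, for illustration only): every stage
    multiplies the current error by a fixed factor rho with 0 < rho < 1.
    """
    if not (0.0 < rho < 1.0):
        raise ValueError("rho must lie strictly between 0 and 1")
    if eps <= 0.0:
        raise ValueError("eps must be positive")
    stages = 0
    err = err0
    while err > eps:
        err *= rho      # one stage: error contracts geometrically
        stages += 1
    return stages


if __name__ == "__main__":
    # Squaring the accuracy target (1e-3 -> 1e-6) only doubles the count.
    for eps in (1e-3, 1e-6):
        print(eps, stages_needed(eps))
```

Under this assumed geometric contraction the count grows like log(1/ϵ) / log(1/rho), so each additional correct digit costs a fixed number of extra stages rather than a multiplicative increase.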

updated: Thu Apr 15 2021 17:09:49 GMT+0000 (UTC)

published: Wed Jan 20 2021 19:04:17 GMT+0000 (UTC)

arXiv
