Robustness to Transformations Across Categories: Is Robustness To Transformations Driven by Invariant Neural Representations?

Hojin Jang; Syed Suleman Abbas Zaidi; Xavier Boix; Neeraj Prasad; Sharon Gilad-Gutnick; Shlomit Ben-Ami; Pawan Sinha

カテゴリを超えた変換に対する堅牢性: 変換に対する堅牢性は不変の神経表現によって駆動されますか?

深層畳み込みニューラルネットワーク (DCNN) は、変換 (ブラーやノイズなど) がトレーニングセットに含まれている場合に、そのオブジェクトを認識する優れた堅牢性を実証しています。このような堅牢性を説明する仮説は、DCNN が画像が変換されても変更されない不変のニューラル表現を開発するというものです。ただし、変換に対するロバスト性は不変性とは異なるプロパティで達成できる可能性があるため、この仮説がどの程度当てはまるかは未解決の問題です。ネットワークの一部を、変換された画像または変換されていない画像を認識するように特殊化することができます。この論文では、不変ニューラル表現がトレーニング分布を超えた変換に対する堅牢性を促進することを利用して、不変ニューラル表現が出現する条件を調査します。具体的には、トレーニング中に一部のオブジェクトカテゴリのみが変換されるトレーニングパラダイムを分析し、DCNN が変換されていないカテゴリ全体の変換に対して堅牢であるかどうかを評価します。最先端の DCNN を使用した結果は、不変ニューラル表現が常に変換に対するロバスト性を高めるわけではないことを示しています。ネットワークは、不変ニューラル表現がない場合でも、トレーニング中に変換されたカテゴリに対してロバスト性を示します。不変性は、トレーニングセット内の変換されたカテゴリの数が増加した場合にのみ現れます。この現象は、オブジェクトの空間配置の変更を伴う回転や細線化などの幾何学的変換よりも、ぼかしやハイパスフィルターなどの局所的な変換でより顕著になります。私たちの結果は、深層学習における不変の神経表現とそれが自発的に現れる条件のより良い理解に貢献します。

Deep Convolutional Neural Networks (DCNNs) have demonstrated impressive robustness to recognize objects under transformations (eg. blur or noise) when these transformations are included in the training set. A hypothesis to explain such robustness is that DCNNs develop invariant neural representations that remain unaltered when the image is transformed. However, to what extent this hypothesis holds true is an outstanding question, as robustness to transformations could be achieved with properties different from invariance, eg. parts of the network could be specialized to recognize either transformed or non-transformed images. This paper investigates the conditions under which invariant neural representations emerge by leveraging that they facilitate robustness to transformations beyond the training distribution. Concretely, we analyze a training paradigm in which only some object categories are seen transformed during training and evaluate whether the DCNN is robust to transformations across categories not seen transformed. Our results with state-of-the-art DCNNs indicate that invariant neural representations do not always drive robustness to transformations, as networks show robustness for categories seen transformed during training even in the absence of invariant neural representations. Invariance only emerges as the number of transformed categories in the training set is increased. This phenomenon is much more prominent with local transformations such as blurring and high-pass filtering than geometric transformations such as rotation and thinning, which entail changes in the spatial arrangement of the object. Our results contribute to a better understanding of invariant neural representations in deep learning and the conditions under which it spontaneously emerges.

updated: Wed Jun 14 2023 22:34:29 GMT+0000 (UTC)

published: Tue Jun 30 2020 21:18:08 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト