Shortcut Learning Through the Lens of Early Training Dynamics

Nihal Murali; Aahlad Manas Puli; Ke Yu; Rajesh Ranganath; Kayhan Batmanghelich

初期のトレーニングダイナミクスのレンズを通した近道学習

ディープニューラルネットワーク (DNN) は、展開中に DNN の一般化を損なうショートカットパターンを学習する傾向があります。ショートカット学習は、特に DNN が安全性が重要なドメインに適用される場合に懸念されます。この論文は、トレーニングプロセス中の内部ニューロンの学習ダイナミクスのレンズを通してショートカット学習をよりよく理解することを目的としています.より具体的には、次の観察を行います。(1) 以前の研究では、近道を疑似相関と同義として扱っていましたが、すべての疑似相関が近道であるとは限らないことを強調します。ショートカットは、コア機能よりも「簡単」な偽の機能のみであることを示しています。 (2) この前提に基づいて構築し、インスタンス難易度メソッド (予測深度など) を使用して「簡単」を定量化し、トレーニングフェーズ中にこの動作を識別します。 (3) 使用されているネットワークアーキテクチャに関係なく、DNN の初期層の学習ダイナミクスを観察することでショートカット学習を検出できることを経験的に示します。言い換えれば、トレーニングの早い段階で DNN の初期層によって学習された簡単な機能は、潜在的なショートカットです。シミュレートされた医療画像データと実際の医療画像データに関する主張を検証し、予測深度と V 使用可能情報のような情報理論的概念との間の理論的なつながりを示すことにより、仮説の経験的成功を正当化します。最後に、私たちの実験は、トレーニング中に精度プロットのみを監視するのは不十分であることを示しています (機械学習パイプラインでは一般的です)。

Deep Neural Networks (DNNs) are prone to learn shortcut patterns that damage the generalization of the DNN during deployment. Shortcut Learning is concerning, particularly when the DNNs are applied to safety-critical domains. This paper aims to better understand shortcut learning through the lens of the learning dynamics of the internal neurons during the training process. More specifically, we make the following observations: (1) While previous works treat shortcuts as synonymous with spurious correlations, we emphasize that not all spurious correlations are shortcuts. We show that shortcuts are only those spurious features that are "easier" than the core features. (2) We build upon this premise and use instance difficulty methods (like Prediction Depth) to quantify "easy" and to identify this behavior during the training phase. (3) We empirically show that shortcut learning can be detected by observing the learning dynamics of the DNN's early layers, irrespective of the network architecture used. In other words, easy features learned by the initial layers of a DNN early during the training are potential shortcuts. We verify our claims on simulated and real medical imaging data and justify the empirical success of our hypothesis by showing the theoretical connections between Prediction Depth and information-theoretic concepts like V-usable information. Lastly, our experiments show the insufficiency of monitoring only accuracy plots during training (as is common in machine learning pipelines), and we highlight the need for monitoring early training dynamics using example difficulty metrics.

updated: Sat Feb 18 2023 14:37:46 GMT+0000 (UTC)

published: Sat Feb 18 2023 14:37:46 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト