Neural Capacitance: A New Perspective of Neural Network Selection via Edge Dynamics

Chunheng Jiang; Tejaswini Pedapati; Pin-Yu Chen; Yizhou Sun; Jianxi Gao

ニューラル容量：エッジダイナミクスによるニューラルネットワーク選択の新しい視点

ダウンストリームタスクに適した事前トレーニング済みニューラルネットワークを特定するための効率的なモデル選択は、深層学習における基本的でありながら困難なタスクです。現在の慣行では、パフォーマンス予測のためのモデルトレーニングに高額な計算コストが必要です。この論文では、トレーニング中のシナプス接続（エッジ）の支配ダイナミクスを分析することにより、ニューラルネットワーク選択のための新しいフレームワークを提案します。私たちのフレームワークは、ニューラルネットワークトレーニング中のバックプロパゲーションがシナプス接続の動的進化と同等であるという事実に基づいて構築されています。したがって、収束ニューラルネットワークは、これらのエッジで構成されるネットワークシステムの平衡状態に関連付けられます。この目的のために、ニューラルネットワークG_AをG_Aのそれらのエッジで定義された有向線グラフG_Bに変換して、ネットワークマッピングϕを構築します。次に、少数の初期トレーニング結果のみを使用して、ダウンストリームタスクでのG_Aの一般化機能を普遍的にキャプチャする予測尺度として、神経容量メトリックβ_effを導出します。フレームワークの微調整パフォーマンスを評価するために、17の一般的な事前トレーニング済みImageNetモデルとCIFAR10、CIFAR100、SVHN、Fashion MNIST、Birdsを含む5つのベンチマークデータセットを使用して広範な実験を実施しました。私たちの神経容量メトリックは、初期のトレーニング結果のみに基づいたモデル選択の強力な指標であり、最先端の方法よりも効率的であることが示されています。

Efficient model selection for identifying a suitable pre-trained neural network to a downstream task is a fundamental yet challenging task in deep learning. Current practice requires expensive computational costs in model training for performance prediction. In this paper, we propose a novel framework for neural network selection by analyzing the governing dynamics over synaptic connections (edges) during training. Our framework is built on the fact that back-propagation during neural network training is equivalent to the dynamical evolution of synaptic connections. Therefore, a converged neural network is associated with an equilibrium state of a networked system composed of those edges. To this end, we construct a network mapping ϕ, converting a neural network G_A to a directed line graph G_B that is defined on those edges in G_A. Next, we derive a neural capacitance metric β_ eff as a predictive measure universally capturing the generalization capability of G_A on the downstream task using only a handful of early training results. We carried out extensive experiments using 17 popular pre-trained ImageNet models and five benchmark datasets, including CIFAR10, CIFAR100, SVHN, Fashion MNIST and Birds, to evaluate the fine-tuning performance of our framework. Our neural capacitance metric is shown to be a powerful indicator for model selection based only on early training results and is more efficient than state-of-the-art methods.

updated: Fri Jan 14 2022 21:18:24 GMT+0000 (UTC)

published: Tue Jan 11 2022 20:53:15 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト