The mechanism underlying successful deep learning

Yarden Tzach; Yuval Meir; Ofek Tevet; Ronit D. Gross; Shiri Hodassman; Roni Vardi; Ido Kanter

ディープラーニングの成功を支えるメカニズム

ディープアーキテクチャは、数十または数百の畳み込み層 (CL) で構成され、これらの層は、いくつかの完全接続 (FC) 層と、複雑な分類タスクの可能なラベルを表す出力層で終わります。既存のディープラーニング (DL) の理論的根拠によれば、最初の CL は生データから局所的な特徴を明らかにし、その後の層は洗練された分類に必要なより高いレベルの特徴を段階的に抽出します。この記事では、DL の成功の基礎となるメカニズムを定量化するための効率的な 3 段階の手順を紹介します。まず、成功率 (SR) を最大化するようにディープアーキテクチャがトレーニングされます。次に、最初のいくつかの CL の重みが固定され、出力に接続された連結された新しい FC 層のみがトレーニングされ、その結果、層とともに進行する SR が生成されます。最後に、トレーニングされた FC 重みは、単一のフィルターから出てくるものを除いて沈黙され、入力ラベルと平均化された出力フィールドの間の相関行列を使用してこのフィルターの機能を定量化できるようになり、明確に定義された定量化可能な特徴のセットが取得されます。。各フィルターは基本的に、入力ラベルとは独立した単一の出力ラベルを選択します。これにより、高い SR が防止されるようです。ただし、直感に反して、考えられる出力ラベルの小さなサブセットが識別されます。この機能は、基礎となる DL メカニズムの重要な部分であり、レイヤーを使用することで徐々にシャープになり、その結果、信号対雑音比と SR が向上します。定量的には、このメカニズムは VGG-16、VGG-6、および AVGG-16 によって例示されます。 DL の基礎となる提案されたメカニズムは、各フィルターの品質を識別するための正確なツールを提供し、DL の SR、計算の複雑さ、レイテンシーを改善するための追加手順を指示することが期待されます。

Deep architectures consist of tens or hundreds of convolutional layers (CLs) that terminate with a few fully connected (FC) layers and an output layer representing the possible labels of a complex classification task. According to the existing deep learning (DL) rationale, the first CL reveals localized features from the raw data, whereas the subsequent layers progressively extract higher-level features required for refined classification. This article presents an efficient three-phase procedure for quantifying the mechanism underlying successful DL. First, a deep architecture is trained to maximize the success rate (SR). Next, the weights of the first several CLs are fixed and only the concatenated new FC layer connected to the output is trained, resulting in SRs that progress with the layers. Finally, the trained FC weights are silenced, except for those emerging from a single filter, enabling the quantification of the functionality of this filter using a correlation matrix between input labels and averaged output fields, hence a well-defined set of quantifiable features is obtained. Each filter essentially selects a single output label independent of the input label, which seems to prevent high SRs; however, it counterintuitively identifies a small subset of possible output labels. This feature is an essential part of the underlying DL mechanism and is progressively sharpened with layers, resulting in enhanced signal-to-noise ratios and SRs. Quantitatively, this mechanism is exemplified by the VGG-16, VGG-6, and AVGG-16. The proposed mechanism underlying DL provides an accurate tool for identifying each filter's quality and is expected to direct additional procedures to improve the SR, computational complexity, and latency of DL.

updated: Mon May 29 2023 13:28:43 GMT+0000 (UTC)

published: Mon May 29 2023 13:28:43 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト