Auto-tuning of Deep Neural Networks by Conflicting Layer Removal

David Peer; Sebastian Stabinger; Antonio Rodriguez-Sanchez

Designing neural network architectures is a challenging task, and knowing which specific layers of a model must be adapted to improve performance remains largely a mystery. In this paper, we introduce a novel methodology to identify layers that decrease the test accuracy of trained models. These conflicting layers can be detected as early as the beginning of training. We prove that, in the worst case, such a layer can lead to a network that cannot be trained at all. We provide a theoretical analysis of the origin of the layers that lower overall network performance, complemented by an extensive empirical evaluation. More precisely, we identify the layers that worsen performance because they produce what we call conflicting training bundles. We show that around 60% of the layers of trained residual networks can be completely removed from the architecture with no significant increase in test error. We further present a novel neural architecture search (NAS) algorithm that identifies conflicting layers at the beginning of training. Architectures found by our auto-tuning algorithm achieve competitive accuracy compared with more complex state-of-the-art architectures, while drastically reducing memory consumption and inference time for different computer vision tasks. The source code is available at https://github.com/peerdavid/conflicting-bundles
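
The abstract names conflicting training bundles without defining them. As an illustration only, the sketch below assumes one plausible reading: a bundle is a set of training samples that a layer maps to numerically indistinguishable outputs, and the bundle is conflicting when those samples carry different labels. Under that reading, later layers cannot separate such samples. The function name `find_conflicting_bundles`, the `resolution` parameter, and the toy data are hypothetical and are not taken from the paper or its repository.

```python
from collections import defaultdict


def find_conflicting_bundles(activations, labels, resolution=1e-3):
    """Return groups of sample indices that share an activation but not a label.

    Assumption (not from the abstract): two samples belong to the same bundle
    when their layer outputs are equal after rounding to `resolution`.
    """
    bundles = defaultdict(list)
    for idx, vec in enumerate(activations):
        # Quantize each activation so near-identical outputs share one key.
        key = tuple(round(v / resolution) for v in vec)
        bundles[key].append(idx)

    # Keep only bundles whose members disagree on the label.
    return [
        members
        for members in bundles.values()
        if len({labels[i] for i in members}) > 1
    ]


# Toy outputs of a single layer for four samples (hypothetical data).
activations = [
    [0.5, 1.2],
    [0.5, 1.2],  # identical to sample 0
    [0.9, 0.1],
    [0.9, 0.1],  # identical to sample 2
]
labels = [0, 1, 2, 2]

conflicts = find_conflicting_bundles(activations, labels)
print(conflicts)  # [[0, 1]]
```

Samples 2 and 3 also collapse to the same output, but they share a label, so in this sketch they form a bundle that is not conflicting.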

updated: Sun Mar 07 2021 11:51:55 GMT+0000 (UTC)

published: Sun Mar 07 2021 11:51:55 GMT+0000 (UTC)

arXiv
