Training Deep Networks from Zero to Hero: avoiding pitfalls and going beyond

Moacir Antonelli Ponti; Fernando Pereira dos Santos; Leo Sampaio Ferraz Ribeiro; Gabriel Biscaro Cavallari

ゼロからヒーローへのディープネットワークのトレーニング：落とし穴を避け、それを超える

ディープニューラルネットワークのトレーニングは、実世界のデータでは難しい場合があります。モデルをブラックボックスとして使用すると、転移学習を使用した場合でも、小さなデータセットや特定のアプリケーションに関しては、一般化が不十分になったり、結果が確定しなくなったりする可能性があります。このチュートリアルでは、モデルを改善するための基本的な手順と最近のオプション、特に教師あり学習について説明しますが、これに限定されません。これは、チャレンジのデータセットほど準備が整っていないデータセットや、注釈が不足しているデータセットや小さなデータの下で特に役立ちます。基本的な手順について説明します。データの準備、最適化、伝達学習だけでなく、変圧器モジュール、代替畳み込み層、活性化関数、広くて深いネットワークの使用などの最近のアーキテクチャの選択、およびカリキュラム、対照的、自己などのトレーニング手順についても説明します。 -教師あり学習。

Training deep neural networks may be challenging in real world data. Using models as black-boxes, even with transfer learning, can result in poor generalization or inconclusive results when it comes to small datasets or specific applications. This tutorial covers the basic steps as well as more recent options to improve models, in particular, but not restricted to, supervised learning. It can be particularly useful in datasets that are not as well-prepared as those in challenges, and also under scarce annotation and/or small data. We describe basic procedures: as data preparation, optimization and transfer learning, but also recent architectural choices such as use of transformer modules, alternative convolutional layers, activation functions, wide and deep networks, as well as training procedures including as curriculum, contrastive and self-supervised learning.

updated: Mon Sep 06 2021 21:31:42 GMT+0000 (UTC)

published: Mon Sep 06 2021 21:31:42 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト