To Ensemble or Not Ensemble: When does End-To-End Training Fail?

Andrew M. Webb; Charles Reynolds; Wenlin Chen; Henry Reeve; Dan-Andrei Iliescu; Mikel Lujan; Gavin Brown

アンサンブルするかしないか：エンドツーエンドのトレーニングが失敗するのはいつですか？

エンドツーエンドトレーニング（E2E）は、複雑なディープネットワークアーキテクチャをトレーニングするためにますます人気が高まっています。興味深い質問は、この傾向が続くかどうかです。E2Eトレーニングに明確な失敗例はありますか？ネットワークのアンサンブルをトレーニングするE2Eの特定のケースについて、この質問を詳細に検討します。私たちの戦略は、ネットワークの独立したトレーニングから完全なE2Eトレーニングまで、2つの極端な間で勾配をスムーズにブレンドすることです。過剰パラメーター化モデルをE2Eでトレーニングできない明確な失敗のケースを見つけました。驚くべき結果として、アンサンブルシステムでもE2Eシステムでも、2つのシステムの間に最適値が存在する場合があります。作品はまた、ドロップアウトへのリンクを明らかにし、アンサンブルの多様性と多分岐ネットワークの性質についての疑問を提起します。

End-to-End training (E2E) is becoming more and more popular to train complex Deep Network architectures. An interesting question is whether this trend will continue-are there any clear failure cases for E2E training? We study this question in depth, for the specific case of E2E training an ensemble of networks. Our strategy is to blend the gradient smoothly in between two extremes: from independent training of the networks, up to to full E2E training. We find clear failure cases, where over-parameterized models cannot be trained E2E. A surprising result is that the optimum can sometimes lie in between the two, neither an ensemble or an E2E system. The work also uncovers links to Dropout, and raises questions around the nature of ensemble diversity and multi-branch networks.

updated: Thu Aug 06 2020 09:48:03 GMT+0000 (UTC)

published: Tue Feb 12 2019 14:56:06 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト