Intra-Ensemble in Neural Networks

Yuan Gao; Zixiang Cai; Lei Yu

ニューラルネットワークのイントラアンサンブル

モデルのパフォーマンスの向上は、ディープラーニングを含む機械学習で常に重要な問題です。ただし、スタンドアロンのニューラルネットワークは、より多くの層を積み重ねるとき、常にわずかな影響を受けます。同時に、アンサンブルはモデルのパフォーマンスをさらに向上させるための便利なテクニックです。それでも、アンサンブル用にいくつかの独立したディープニューラルネットワークをトレーニングするには、複数のリソースが必要です。もしそうなら、アンサンブルを1つのニューラルネットワークのみで利用することは可能ですか？この作業では、1つのニューラルネットワーク内で複数のサブネットワークを同時にトレーニングする確率的チャネル再結合操作を使用したエンドツーエンドのアンサンブル戦略であるイントラアンサンブルを提案します。パラメータの大部分は相互に共有されているため、追加のパラメータサイズはわずかです。一方、確率的チャネルの再結合により、サブネットワークの多様性が大幅に向上し、最終的にアンサンブルのパフォーマンスが向上します。広範な実験とアブレーション研究により、さまざまな種類のデータセットとネットワークアーキテクチャに対するイントラアンサンブルの適用性が証明されています。

Improving model performance is always the key problem in machine learning including deep learning. However, stand-alone neural networks always suffer from marginal effect when stacking more layers. At the same time, ensemble is an useful technique to further enhance model performance. Nevertheless, training several independent deep neural networks for ensemble costs multiple resources. If so, is it possible to utilize ensemble in only one neural network? In this work, we propose Intra-Ensemble, an end-to-end ensemble strategy with stochastic channel recombination operations to train several sub-networks simultaneously within one neural network. Additional parameter size is marginal since the majority of parameters are mutually shared. Meanwhile, stochastic channel recombination significantly increases the diversity of sub-networks, which finally enhances ensemble performance. Extensive experiments and ablation studies prove the applicability of intra-ensemble on various kinds of datasets and network architectures.

updated: Sun May 10 2020 02:09:23 GMT+0000 (UTC)

published: Tue Apr 09 2019 04:53:17 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト