MixMo: Mixing Multiple Inputs for Multiple Outputs via Deep Subnetworks

Alexandre Rame; Remy Sun; Matthieu Cord

MixMo：ディープサブネットワークを介した複数の出力のための複数の入力の混合

最近の戦略では、単一のベースネットワーク内に同時に多様なサブネットワークを適合させることにより、「無料」でアンサンブルを実現しました。トレーニング中の主なアイデアは、各サブネットワークが同時に提供される複数の入力のうちの1つだけを分類することを学習することです。ただし、これらの複数の入力を最適に混合する方法の問題は、これまで研究されていません。この論文では、マルチ入力マルチ出力ディープサブネットワークを学習するための新しい一般化されたフレームワークであるMixMoを紹介します。私たちの主な動機は、以前のアプローチに隠されていた次善の加算操作を、より適切な混合メカニズムに置き換えることです。そのために、成功した混合サンプルデータの拡張からインスピレーションを得ています。機能のバイナリミキシング（特にCutMixの長方形パッチを使用）は、サブネットワークをより強力で多様にすることで結果を向上させることを示しています。 CIFAR-100およびTinyImageNetデータセットでの画像分類の最新技術を改善します。実装が容易なモデルは、推論やメモリのオーバーヘッドなしに、データ拡張ディープアンサンブルを大幅に上回ります。機能を操作し、大規模なネットワークの表現力をより有効に活用することで、以前の作業を補完する新しい研究ラインを開きます。

Recent strategies achieved ensembling "for free" by fitting concurrently diverse subnetworks inside a single base network. The main idea during training is that each subnetwork learns to classify only one of the multiple inputs simultaneously provided. However, the question of how to best mix these multiple inputs has not been studied so far. In this paper, we introduce MixMo, a new generalized framework for learning multi-input multi-output deep subnetworks. Our key motivation is to replace the suboptimal summing operation hidden in previous approaches by a more appropriate mixing mechanism. For that purpose, we draw inspiration from successful mixed sample data augmentations. We show that binary mixing in features - particularly with rectangular patches from CutMix - enhances results by making subnetworks stronger and more diverse. We improve state of the art for image classification on CIFAR-100 and Tiny ImageNet datasets. Our easy to implement models notably outperform data augmented deep ensembles, without the inference and memory overheads. As we operate in features and simply better leverage the expressiveness of large networks, we open a new line of research complementary to previous works.

updated: Tue Aug 24 2021 11:11:06 GMT+0000 (UTC)

published: Wed Mar 10 2021 15:31:02 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト