Anytime Inference with Distilled Hierarchical Neural Ensembles

Adria Ruiz; Jakob Verbeek

蒸留された階層的ニューラルアンサンブルによるいつでも推論

ディープニューラルネットワークでの推論は計算コストが高くなる可能性があり、計算量または入力データの量が時間とともに変化するmscenariosでは、いつでも推論できるネットワークが重要です。このようなネットワークでは、推論プロセスを中断して結果をより速く提供したり、より正確な結果を取得し続けたりすることができます。複数のネットワークのアンサンブルを階層ツリー構造に埋め込み、中間層を共有する新しいフレームワークであるHierarchical Neural Ensembles（HNE）を提案します。 HNEでは、アンサンブル内のモデルを多かれ少なかれ評価することにより、オンザフライで推論の複雑さを制御します。 2番目の貢献は、小さなアンサンブルの予測精度を高めるための新しい階層的蒸留法です。このアプローチは、アンサンブルのネストされた構造を活用して、個々のモデル全体に精度と多様性を最適に割り当てます。私たちの実験は、以前のいつでも推論モデルと比較して、HNEがCIFAR-10 / 100およびImageNetデータセットで最先端の精度と計算上のトレードオフを提供することを示しています。

Inference in deep neural networks can be computationally expensive, and networks capable of anytime inference are important in mscenarios where the amount of compute or quantity of input data varies over time. In such networks the inference process can interrupted to provide a result faster, or continued to obtain a more accurate result. We propose Hierarchical Neural Ensembles (HNE), a novel framework to embed an ensemble of multiple networks in a hierarchical tree structure, sharing intermediate layers. In HNE we control the complexity of inference on-the-fly by evaluating more or less models in the ensemble. Our second contribution is a novel hierarchical distillation method to boost the prediction accuracy of small ensembles. This approach leverages the nested structure of our ensembles, to optimally allocate accuracy and diversity across the individual models. Our experiments show that, compared to previous anytime inference models, HNE provides state-of-the-art accuracy-computate trade-offs on the CIFAR-10/100 and ImageNet datasets.

updated: Mon Dec 14 2020 07:26:50 GMT+0000 (UTC)

published: Tue Mar 03 2020 12:13:38 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト