Any-Precision Deep Neural Networks

Haichao Yu; Haoxiang Li; Honghui Shi; Thomas S. Huang; Gang Hua

We present any-precision deep neural networks (DNNs), trained with a new method that lets the learned DNNs be flexible in numerical precision during inference. At runtime, the same model can be set directly to different bit-widths by truncating the least significant bits, supporting a dynamic trade-off between speed and accuracy. When all layers are set to a low bit-width, we show that the model achieves accuracy comparable to dedicated models trained at the same precision. This property facilitates flexible deployment of deep learning models in real-world applications, where trade-offs between model accuracy and runtime efficiency are often sought. Previous literature presents solutions for training models at individual, fixed efficiency/accuracy trade-off points, but how to produce a model that is flexible in runtime precision is largely unexplored. When the demand for an efficiency/accuracy trade-off varies over time, or even changes dynamically at runtime, it is infeasible to retrain models accordingly, and the storage budget may forbid keeping multiple models. Our proposed framework achieves this flexibility without performance degradation. More importantly, we demonstrate that this achievement is agnostic to model architectures and applicable to multiple vision tasks. Our code is released at https://github.com/SHI-Labs/Any-Precision-DNNs.
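
The idea of setting one stored model to a lower bit-width by truncating least significant bits can be illustrated with a small sketch. This is not the authors' implementation (see their repository for that): the function names are hypothetical, the min-max uniform quantizer is an assumption made for illustration, and the training method that makes the truncated model accurate is not shown.

```python
def quantize_uniform(values, bits=8):
    """Map floats to unsigned integer codes in [0, 2**bits - 1].

    Assumption: a plain min-max uniform quantizer, chosen for illustration.
    """
    lo, hi = min(values), max(values)
    levels = (1 << bits) - 1
    scale = (hi - lo) / levels if hi > lo else 1.0
    codes = [round((v - lo) / scale) for v in values]
    return codes, lo, scale


def truncate_codes(codes, stored_bits, target_bits):
    """Keep the target_bits most significant bits of each stored code."""
    if not 1 <= target_bits <= stored_bits:
        raise ValueError("target_bits must be between 1 and stored_bits")
    shift = stored_bits - target_bits
    # A right shift drops the `shift` least significant bits.
    return [c >> shift for c in codes]


def dequantize(codes, lo, scale, stored_bits, target_bits):
    """Map truncated codes back to floats at the centre of each bucket."""
    shift = stored_bits - target_bits
    # Each truncated code covers 2**shift consecutive stored codes.
    half_span = ((1 << shift) - 1) / 2
    return [lo + ((c << shift) + half_span) * scale for c in codes]
```

Under these assumptions, a weight list would be quantized once with `quantize_uniform(weights, bits=8)`, and the same stored codes could then be passed to `truncate_codes(codes, 8, k)` for any `k` from 1 to 8 at inference time, with no second copy of the model kept.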

updated: Fri Jan 15 2021 08:13:10 GMT+0000 (UTC)

published: Sun Nov 17 2019 21:35:32 GMT+0000 (UTC)

arXiv
