Dynamic Resolution Network

Mingjian Zhu; Kai Han; Enhua Wu; Qiulin Zhang; Ying Nie; Zhenzhong Lan; Yunhe Wang

動的解像度ネットワーク

深い畳み込みニューラルネットワーク（CNN）は、精度の理由から、多くの場合、学習可能なパラメーターが多数ある高度な設計になっています。それらをモバイルデバイスに展開するための高額なコストを軽減するために、最近の作業では、事前定義されたアーキテクチャの冗長性を掘り起こすために多大な努力が払われています。それにもかかわらず、最新のCNNの入力解像度の冗長性は十分に調査されていません。つまり、入力画像の解像度は固定されています。この論文では、与えられた画像を正確に予測するための最小解像度が、同じニューラルネットワークを使用して異なることを観察します。この目的のために、入力解像度が各入力サンプルに基づいて動的に決定される新しい動的解像度ネットワーク（DRNet）を提案します。ここでは、計算コストがごくわずかな解像度予測子が調査され、目的のネットワークと共同で最適化されます。具体的には、予測子は、各画像の元の認識精度を保持し、さらには超えることができる最小の解像度を学習します。推論中に、各入力画像は、全体的な計算負荷を最小限に抑えるために、予測された解像度にサイズ変更されます。次に、いくつかのベンチマークネットワークとデータセットで広範な実験を行います。結果は、DRNetを既製のネットワークアーキテクチャに組み込んで、計算の複雑さを大幅に軽減できることを示しています。たとえば、DR-ResNet-50は、ImageNet上の元のResNet-50と比較して、約34％の計算削減で同様のパフォーマンスを実現し、10％の計算削減で1.4％の精度向上を実現します。

Deep convolutional neural networks (CNNs) are often of sophisticated design with numerous learnable parameters for the accuracy reason. To alleviate the expensive costs of deploying them on mobile devices, recent works have made huge efforts for excavating redundancy in pre-defined architectures. Nevertheless, the redundancy on the input resolution of modern CNNs has not been fully investigated, i.e., the resolution of input image is fixed. In this paper, we observe that the smallest resolution for accurately predicting the given image is different using the same neural network. To this end, we propose a novel dynamic-resolution network (DRNet) in which the input resolution is determined dynamically based on each input sample. Wherein, a resolution predictor with negligible computational costs is explored and optimized jointly with the desired network. Specifically, the predictor learns the smallest resolution that can retain and even exceed the original recognition accuracy for each image. During the inference, each input image will be resized to its predicted resolution for minimizing the overall computation burden. We then conduct extensive experiments on several benchmark networks and datasets. The results show that our DRNet can be embedded in any off-the-shelf network architecture to obtain a considerable reduction in computational complexity. For instance, DR-ResNet-50 achieves similar performance with an about 34% computation reduction, while gains 1.4% accuracy increase with 10% computation reduction compared to the original ResNet-50 on ImageNet.

updated: Sun Oct 17 2021 03:05:32 GMT+0000 (UTC)

published: Sat Jun 05 2021 13:48:33 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト