Spirit Distillation: Precise Real-time Prediction with Insufficient Data

Zhiyuan Wu; Hong Qi; Yu Jiang; Chupeng Cui; Zongmin Yang; Xinhui Xue

Recent trends demonstrate the effectiveness of deep neural networks (DNNs) applied to the task of environment perception in autonomous driving systems. While large-scale and complete data can train fine DNNs, collecting it is always difficult, expensive, and time-consuming. In addition, because recognition must run in real time, both accuracy and efficiency are critically important. To alleviate the conflict between insufficient data and the high computational cost of DNNs, we propose a new training framework named Spirit Distillation (SD). It extends the ideas of fine-tuning-based transfer learning (FTT) and feature-based knowledge distillation. By allowing the student to mimic its teacher in feature extraction, the gap in general features between the teacher and student networks is bridged. We also propose the Image Party distillation enhancement method (IP), which shuffles images from various domains and randomly selects a few as a mini-batch. With this approach, overfitting of the student network to the general features of the teacher network can be easily avoided. Persuasive experiments and discussions are conducted on CityScapes with the aid of COCO2017 and KITTI. Results demonstrate improved segmentation performance (mIOU and high-precision accuracy increase by 1.4% and 8.2% respectively, with 78.2% output variance) and yield a precise compact network with only 41.8% of the FLOPs (see Fig. 1). This paper is a pioneering work on knowledge distillation applied to few-shot learning. The proposed methods significantly reduce the dependence of DNN training on data and improve the robustness of DNNs when facing rare situations, while satisfying the real-time requirement. We provide important technical support for the advancement of scene perception technology for autonomous driving.
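
The two mechanisms named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not specify the distillation loss or the sampling details, so the mean-squared-error loss, the function names `image_party_batch` and `feature_mimic_loss`, and the toy image identifiers are all assumptions made for illustration.

```python
import random
from typing import Dict, List, Sequence, Tuple


def image_party_batch(
    domains: Dict[str, Sequence[str]],
    batch_size: int,
    rng: random.Random,
) -> List[Tuple[str, str]]:
    """Pool images from every domain, shuffle them, and draw one mini-batch.

    Assumption: each image is represented by a string identifier; a real
    pipeline would load tensors instead.
    """
    pool = [(name, img) for name, imgs in domains.items() for img in imgs]
    rng.shuffle(pool)          # mix the domains together
    return pool[:batch_size]   # randomly selected mini-batch


def feature_mimic_loss(
    student_feat: Sequence[float],
    teacher_feat: Sequence[float],
) -> float:
    """Mean squared error between student and teacher features.

    Assumption: MSE stands in for whatever feature-matching loss the
    paper actually uses.
    """
    if len(student_feat) != len(teacher_feat):
        raise ValueError("feature vectors must have the same length")
    squared = ((s - t) ** 2 for s, t in zip(student_feat, teacher_feat))
    return sum(squared) / len(student_feat)


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical image identifiers for the three datasets named above.
    domains = {
        "cityscapes": ["c0", "c1", "c2"],
        "coco2017": ["o0", "o1"],
        "kitti": ["k0"],
    }
    print(image_party_batch(domains, batch_size=4, rng=rng))
    print(feature_mimic_loss([0.0, 0.0], [1.0, 3.0]))
```

In a training loop, each mixed-domain mini-batch would be passed through the frozen teacher and the student feature extractors, and the student would be updated to reduce the mimic loss. Mixing domains is the part intended to keep the student from overfitting to the teacher's general features on a single dataset.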

updated: Thu Mar 25 2021 10:23:30 GMT+0000 (UTC)

published: Thu Mar 25 2021 10:23:30 GMT+0000 (UTC)

arXiv
