Paraphrasing Complex Network: Network Compression via Factor Transfer

Jangho Kim; SeongUk Park; Nojun Kwak

Many researchers have sought model compression methods that reduce the size of a deep neural network (DNN) with minimal performance degradation, so that DNNs can be used in embedded systems. Among these methods, knowledge transfer trains a student network with the help of a stronger teacher network. In this paper, we propose a novel knowledge transfer method that uses convolutional operations to paraphrase the teacher's knowledge and to translate it for the student. This is done by two convolutional modules, called a paraphraser and a translator. The paraphraser is trained in an unsupervised manner to extract the teacher factors, which are defined as paraphrased information of the teacher network. The translator, located at the student network, extracts the student factors and helps to translate the teacher factors by mimicking them. We observed that a student network trained with the proposed factor transfer method outperforms ones trained with conventional knowledge transfer methods.
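The paraphraser and translator described in the abstract can be pictured with a small NumPy sketch. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not give layer shapes, activations, or the loss, so the 1x1 convolutions, the leaky ReLU slope, the mean-squared reconstruction objective, and the L1 distance between L2-normalized factors are all choices made here for illustration, and every function name is hypothetical.

```python
import numpy as np

def conv1x1(x, w):
    # x: (batch, c_in, height, width); w: (c_out, c_in).
    # A 1x1 convolution is a per-pixel linear map over channels (assumed simplification).
    return np.einsum("oc,bchw->bohw", w, x)

def leaky_relu(x, slope=0.1):
    # Slope value is an assumption for this sketch.
    return np.where(x > 0, x, slope * x)

def paraphrase(teacher_feat, w_enc):
    # Paraphraser encoder: teacher feature map -> teacher factor.
    return leaky_relu(conv1x1(teacher_feat, w_enc))

def reconstruction_loss(teacher_feat, w_enc, w_dec):
    # Unsupervised objective (assumed): rebuild the teacher features from the factor.
    recon = leaky_relu(conv1x1(paraphrase(teacher_feat, w_enc), w_dec))
    return float(np.mean((teacher_feat - recon) ** 2))

def translate(student_feat, w_tr):
    # Translator: student feature map -> student factor.
    return leaky_relu(conv1x1(student_feat, w_tr))

def l2_normalize(factor, eps=1e-8):
    # Flatten each sample and scale it to (approximately) unit L2 norm.
    flat = factor.reshape(factor.shape[0], -1)
    return flat / (np.linalg.norm(flat, axis=1, keepdims=True) + eps)

def factor_transfer_loss(teacher_factor, student_factor):
    # Mimicking term (assumed form): mean L1 distance between normalized factors.
    diff = l2_normalize(teacher_factor) - l2_normalize(student_factor)
    return float(np.mean(np.abs(diff)))

# Toy shapes, chosen arbitrarily for the demonstration.
rng = np.random.default_rng(0)
teacher_feat = rng.normal(size=(2, 8, 4, 4))   # teacher has 8 channels
student_feat = rng.normal(size=(2, 4, 4, 4))   # student has 4 channels
w_enc = rng.normal(size=(4, 8))                # paraphraser encoder weights
w_dec = rng.normal(size=(8, 4))                # paraphraser decoder weights
w_tr = rng.normal(size=(4, 4))                 # translator weights

teacher_factor = paraphrase(teacher_feat, w_enc)
student_factor = translate(student_feat, w_tr)
print("reconstruction loss:", reconstruction_loss(teacher_feat, w_enc, w_dec))
print("factor transfer loss:", factor_transfer_loss(teacher_factor, student_factor))
```

In a real training loop the paraphraser weights would be fitted first on the reconstruction objective and then frozen, after which the student and translator would be trained on the task loss plus a weighted factor transfer term. That ordering follows the abstract's description; the weighting itself is not specified there.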

updated: Wed Jul 22 2020 12:42:18 GMT+0000 (UTC)

published: Wed Feb 14 2018 07:46:15 GMT+0000 (UTC)

arXiv
