Distilling Knowledge via Knowledge Review

Pengguang Chen; Shu Liu; Hengshuang Zhao; Jiaya Jia

知識レビューによる知識の抽出

知識の蒸留は、生徒のネットワークのパフォーマンスを大幅に向上させることを目的として、教師のネットワークから生徒のネットワークに知識を転送します。以前の方法は、主に、有効性を向上させるために、同じレベルの特徴間で特徴変換と損失関数を提案することに焦点を合わせています。教師と生徒のネットワーク間の接続パスのクロスレベルの要因をさまざまに調査し、その重要性を明らかにします。知識蒸留で初めて、クロスステージ接続パスが提案されます。私たちの新しいレビューメカニズムは効果的で構造的にシンプルです。最終的に設計されたネストされたコンパクトなフレームワークは、ごくわずかな計算オーバーヘッドを必要とし、さまざまなタスクで他のメソッドよりも優れています。この方法を分類、オブジェクト検出、およびインスタンスセグメンテーションタスクに適用します。それらのすべては、大幅な学生ネットワークのパフォーマンスの向上を目撃しています。コードはhttps://github.com/Jia-Research-Lab/ReviewKDで入手できます。

Knowledge distillation transfers knowledge from the teacher network to the student one, with the goal of greatly improving the performance of the student network. Previous methods mostly focus on proposing feature transformation and loss functions between the same level's features to improve the effectiveness. We differently study the factor of connection path cross levels between teacher and student networks, and reveal its great importance. For the first time in knowledge distillation, cross-stage connection paths are proposed. Our new review mechanism is effective and structurally simple. Our finally designed nested and compact framework requires negligible computation overhead, and outperforms other methods on a variety of tasks. We apply our method to classification, object detection, and instance segmentation tasks. All of them witness significant student network performance improvement. Code is available at https://github.com/Jia-Research-Lab/ReviewKD

updated: Mon Apr 19 2021 04:36:24 GMT+0000 (UTC)

published: Mon Apr 19 2021 04:36:24 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト