Learning by Distillation: A Self-Supervised Learning Framework for Optical Flow Estimation

Pengpeng Liu; Michael R. Lyu; Irwin King; Jia Xu

蒸留による学習: オプティカルフロー推定のための自己教師あり学習フレームワーク

オプティカルフローを学習するための知識蒸留アプローチである DistillFlow を紹介します。 DistillFlow は、複数の教師モデルと 1 つの生徒モデルをトレーニングします。生徒モデルの入力に難しい変換が適用され、幻覚によるオクルージョンと信頼性の低い予測が生成されます。次に、自己監視学習フレームワークが構築されます。教師モデルからの信頼できる予測は、自信のない予測のオプティカルフローを学習するように生徒モデルをガイドする注釈として提供されます。自己監視学習フレームワークにより、非遮蔽ピクセルだけでなく、非遮蔽ピクセルについても、ラベルのないデータからオプティカルフローを効果的に学習できます。 DistillFlow は、KITTI データセットと Sintel データセットの両方で、最先端の教師なし学習パフォーマンスを実現します。私たちの自己監視型事前トレーニングモデルは、教師あり微調整のための優れた初期化も提供し、合成データに基づく事前トレーニングに大きく依存する現在の教師あり学習方法とは対照的に、代替トレーニングパラダイムを提案します。執筆時点で、微調整されたモデルは、KITTI 2015 ベンチマークですべての単眼メソッドの中で 1 位にランクされ、Sintel Final ベンチマークで公開されているすべてのメソッドよりも優れています。さらに重要なことに、フレームワークの一般化、通信の一般化、およびデータセット間の一般化という 3 つの側面で DistillFlow の一般化機能を示します。

We present DistillFlow, a knowledge distillation approach to learning optical flow. DistillFlow trains multiple teacher models and a student model, where challenging transformations are applied to the input of the student model to generate hallucinated occlusions as well as less confident predictions. Then, a self-supervised learning framework is constructed: confident predictions from teacher models are served as annotations to guide the student model to learn optical flow for those less confident predictions. The self-supervised learning framework enables us to effectively learn optical flow from unlabeled data, not only for non-occluded pixels, but also for occluded pixels. DistillFlow achieves state-of-the-art unsupervised learning performance on both KITTI and Sintel datasets. Our self-supervised pre-trained model also provides an excellent initialization for supervised fine-tuning, suggesting an alternate training paradigm in contrast to current supervised learning methods that highly rely on pre-training on synthetic data. At the time of writing, our fine-tuned models ranked 1st among all monocular methods on the KITTI 2015 benchmark, and outperform all published methods on the Sintel Final benchmark. More importantly, we demonstrate the generalization capability of DistillFlow in three aspects: framework generalization, correspondence generalization and cross-dataset generalization.

updated: Tue Jun 08 2021 09:13:34 GMT+0000 (UTC)

published: Tue Jun 08 2021 09:13:34 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト