Weakly-Supervised Domain Adaptation of Deep Regression Trackers via Reinforced Knowledge Distillation

Matteo Dunnhofer; Niki Martinel; Christian Micheloni

強化された知識蒸留による深回帰トラッカーの弱教師ありドメイン適応

深回帰トラッカーは、利用可能な最速の追跡アルゴリズムの1つであるため、リアルタイムのロボットアプリケーションに適しています。ただし、分布のシフトと過剰適合のため、多くのドメインでは精度が不十分です。この論文では、そのようなクラスのトラッカーのドメイン適応のための最初の方法論を提示することにより、そのような制限を克服します。ラベル付けの労力を減らすために、強化学習を使用して弱い教師ありをスカラーアプリケーション依存の時間遅延フィードバックとして表現する、弱教師あり適応戦略を提案します。同時に、知識の蒸留は、学習の安定性を保証し、より強力であるが低速のトラッカーから知識を圧縮および転送するために使用されます。 5つの異なるロボットビジョンドメインでの広範な実験は、私たちの方法論の関連性を示しています。リアルタイムの速度は、組み込みデバイスとGPUのないマシンで達成されますが、精度は重要な結果に達します。

Deep regression trackers are among the fastest tracking algorithms available, and therefore suitable for real-time robotic applications. However, their accuracy is inadequate in many domains due to distribution shift and overfitting. In this paper we overcome such limitations by presenting the first methodology for domain adaption of such a class of trackers. To reduce the labeling effort we propose a weakly-supervised adaptation strategy, in which reinforcement learning is used to express weak supervision as a scalar application-dependent and temporally-delayed feedback. At the same time, knowledge distillation is employed to guarantee learning stability and to compress and transfer knowledge from more powerful but slower trackers. Extensive experiments on five different robotic vision domains demonstrate the relevance of our methodology. Real-time speed is achieved on embedded devices and on machines without GPUs, while accuracy reaches significant results.

updated: Fri Mar 26 2021 14:37:33 GMT+0000 (UTC)

published: Fri Mar 26 2021 14:37:33 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト