Alpha-Refine: Boosting Tracking Performance by Precise Bounding Box Estimation

Bin Yan; Xinyu Zhang; Dong Wang; Huchuan Lu; Xiaoyun Yang

Alpha-Refine：正確なバウンディングボックス推定による追跡パフォーマンスの向上

ビジュアルオブジェクトトラッキングは、特定のターゲットのバウンディングボックスを正確に推定することを目的としています。これは、変形やオクルージョンなどの要因による困難な問題です。最近の多くのトラッカーは、バウンディングボックス推定の品質を向上させるために多段階追跡戦略を採用しています。これらの方法では、最初にターゲットを大まかに特定し、次の段階で初期予測を調整します。ただし、既存のアプローチは依然として精度が制限されており、異なるステージを結合すると、メソッドの転送可能性が大幅に制限されます。この作品は、ベーストラッカーのボックス推定品質を大幅に向上させることができるAlpha-Refine（AR）と呼ばれる斬新で柔軟かつ正確な改良モジュールを提案します。一連の設計オプションを検討することにより、改良を成功させる秘訣は、可能な限り詳細な空間情報を抽出して維持することであると結論付けます。この原理に従って、Alpha-Refineは、ピクセル単位の相関、コーナー予測ヘッド、および補助マスクヘッドをコアコンポーネントとして採用しています。複数のベーストラッカーを使用したTrackingNet、LaSOT、GOT-10K、およびVOT2020ベンチマークの包括的な実験は、私たちのアプローチがわずかな追加の遅延でベーストラッカーのパフォーマンスを大幅に改善することを示しています。提案されたAlpha-Refineメソッドは、一連の強化されたトラッカーにつながります。その中で、ARSiamRPN（AR強化SiamRPNpp）とARDiMP50（ARstrengthened DiMP50）は、効率と精度のトレードオフを実現し、ARDiMPsuper（AR強化DiMP-super）はリアルタイムの速度で非常に競争力のあるパフォーマンス。コードと事前トレーニング済みモデルは、https：//github.com/MasterBin-IIAU/AlphaRefineで入手できます。

Visual object tracking aims to precisely estimate the bounding box for the given target, which is a challenging problem due to factors such as deformation and occlusion. Many recent trackers adopt the multiple-stage tracking strategy to improve the quality of bounding box estimation. These methods first coarsely locate the target and then refine the initial prediction in the following stages. However, existing approaches still suffer from limited precision, and the coupling of different stages severely restricts the method's transferability. This work proposes a novel, flexible, and accurate refinement module called Alpha-Refine (AR), which can significantly improve the base trackers' box estimation quality. By exploring a series of design options, we conclude that the key to successful refinement is extracting and maintaining detailed spatial information as much as possible. Following this principle, Alpha-Refine adopts a pixel-wise correlation, a corner prediction head, and an auxiliary mask head as the core components. Comprehensive experiments on TrackingNet, LaSOT, GOT-10K, and VOT2020 benchmarks with multiple base trackers show that our approach significantly improves the base trackers' performance with little extra latency. The proposed Alpha-Refine method leads to a series of strengthened trackers, among which the ARSiamRPN (AR strengthened SiamRPNpp) and the ARDiMP50 (ARstrengthened DiMP50) achieve good efficiency-precision trade-off, while the ARDiMPsuper (AR strengthened DiMP-super) achieves very competitive performance at a real-time speed. Code and pretrained models are available at https://github.com/MasterBin-IIAU/AlphaRefine.

updated: Mon Mar 29 2021 03:53:00 GMT+0000 (UTC)

published: Sat Dec 12 2020 13:33:25 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト