Dynamic Anchor Learning for Arbitrary-Oriented Object Detection

Qi Ming; Zhiqiang Zhou; Lingjuan Miao; Hongwei Zhang; Linhao Li

任意指向のオブジェクト検出のための動的アンカー学習

自然シーン、航空写真、リモートセンシング画像などに任意の物体が広く出現するため、任意の物体検出が注目されています。多くの現在の回転検出器は、グラウンドトゥルースボックスとの空間的位置合わせを実現するためにさまざまな方向のアンカーを多数使用します。次に、Intersection-over-Union（IoU）を適用して、トレーニングの正と負の候補をサンプリングします。ただし、一部のネガティブサンプルでは正確なローカリゼーションを実現できる一方で、選択したポジティブアンカーは回帰後の正確な検出を常に保証できるとは限りません。これは、IoUを介したアンカーの品質評価が適切ではないことを示しており、これにより、分類の信頼性とローカリゼーションの精度の間に不整合が生じます。本論文では、動的アンカー学習（DAL）法を提案します。これは、新しく定義された一致度を利用して、アンカーのローカリゼーションの可能性を包括的に評価し、より効率的なラベル割り当てプロセスを実行します。このようにして、検出器は高品質のアンカーを動的に選択して正確なオブジェクト検出を実現でき、分類と回帰の間の相違が緩和されます。新たに導入されたDALにより、水平方向のプリセットアンカーが少ないだけで、任意の方向のオブジェクトに対して優れた検出パフォーマンスを実現します。 3つのリモートセンシングデータセットHRSC2016、DOTA、UCAS-AOD、およびシーンテキストデータセットICDAR 2015の実験結果は、私たちの方法がベースラインモデルと比較して大幅な改善を達成していることを示しています。その上、私たちのアプローチは、水平バウンドボックスを使用したオブジェクト検出にも普遍的です。コードとモデルはhttps://github.com/ming71/DALで入手できます。

Arbitrary-oriented objects widely appear in natural scenes, aerial photographs, remote sensing images, etc., thus arbitrary-oriented object detection has received considerable attention. Many current rotation detectors use plenty of anchors with different orientations to achieve spatial alignment with ground truth boxes, then Intersection-over-Union (IoU) is applied to sample the positive and negative candidates for training. However, we observe that the selected positive anchors cannot always ensure accurate detections after regression, while some negative samples can achieve accurate localization. It indicates that the quality assessment of anchors through IoU is not appropriate, and this further lead to inconsistency between classification confidence and localization accuracy. In this paper, we propose a dynamic anchor learning (DAL) method, which utilizes the newly defined matching degree to comprehensively evaluate the localization potential of the anchors and carry out a more efficient label assignment process. In this way, the detector can dynamically select high-quality anchors to achieve accurate object detection, and the divergence between classification and regression will be alleviated. With the newly introduced DAL, we achieve superior detection performance for arbitrary-oriented objects with only a few horizontal preset anchors. Experimental results on three remote sensing datasets HRSC2016, DOTA, UCAS-AOD as well as a scene text dataset ICDAR 2015 show that our method achieves substantial improvement compared with the baseline model. Besides, our approach is also universal for object detection using horizontal bound box. The code and models are available at https://github.com/ming71/DAL.

updated: Tue Dec 15 2020 13:18:28 GMT+0000 (UTC)

published: Tue Dec 08 2020 01:30:06 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト