CFC-Net: A Critical Feature Capturing Network for Arbitrary-Oriented Object Detection in Remote Sensing Images

Qi Ming; Lingjuan Miao; Zhiqiang Zhou; Yunpeng Dong

Object detection in optical remote sensing images is an important and challenging task. In recent years, methods based on convolutional neural networks have made good progress. However, because objects vary widely in scale, aspect ratio, and orientation, detection performance is difficult to improve further. In this paper, we discuss the role of discriminative features in object detection and propose a Critical Feature Capturing Network (CFC-Net) that improves detection accuracy in three respects: building powerful feature representations, refining preset anchors, and optimizing label assignment. Specifically, we first decouple the classification and regression features, and then construct robust critical features adapted to each task through the Polarization Attention Module (PAM). Using the extracted discriminative regression features, the Rotation Anchor Refinement Module (R-ARM) refines the localization of preset horizontal anchors to obtain superior rotation anchors. Next, the Dynamic Anchor Learning (DAL) strategy adaptively selects high-quality anchors based on their ability to capture critical features. The proposed framework creates more powerful semantic representations for objects in remote sensing images and achieves high-performance real-time object detection. Experimental results on three remote sensing datasets, HRSC2016, DOTA, and UCAS-AOD, show that our method achieves superior detection performance compared with many state-of-the-art approaches. Code and models are available at https://github.com/ming71/CFC-Net.
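To make the motivation for rotation anchors concrete, the following minimal sketch computes how much of a horizontal (axis-aligned) box is actually covered by an elongated object rotated by a given angle. It is an illustration of the geometry only and is not taken from the CFC-Net code; the function names `horizontal_box_of` and `fill_ratio` and the example dimensions are assumptions chosen for this example.

```python
import math


def horizontal_box_of(w: float, h: float, theta_deg: float) -> tuple[float, float]:
    """Width and height of the tightest axis-aligned box around a
    w x h rectangle rotated by theta_deg degrees about its center."""
    t = math.radians(theta_deg)
    c, s = abs(math.cos(t)), abs(math.sin(t))
    return w * c + h * s, w * s + h * c


def fill_ratio(w: float, h: float, theta_deg: float) -> float:
    """Fraction of the enclosing horizontal box occupied by the object."""
    box_w, box_h = horizontal_box_of(w, h, theta_deg)
    return (w * h) / (box_w * box_h)


if __name__ == "__main__":
    # Hypothetical ship-like object: 100 x 20 pixels.
    for angle in (0, 15, 30, 45):
        print(f"{angle:>2} deg: fill ratio = {fill_ratio(100, 20, angle):.3f}")
```

For the hypothetical 100 x 20 object, the ratio falls from 1.0 at 0 degrees to roughly 0.28 at 45 degrees, so most of a horizontal box around a slanted, elongated object is background. This is one reason a detector may benefit from refining horizontal anchors into rotated ones before regression.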

updated: Mon Aug 16 2021 07:32:15 GMT+0000 (UTC)

published: Mon Jan 18 2021 02:31:09 GMT+0000 (UTC)

arXiv
