Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence

Xue Yang; Xiaojiang Yang; Jirui Yang; Qi Ming; Wentao Wang; Qi Tian; Junchi Yan

カルバック・ライブラーダイバージェンスによる回転物体検出のための高精度境界ボックスの学習

既存の回転オブジェクト検出器は、水平検出パラダイムがよく発達した領域に進化したため、ほとんどが水平検出パラダイムから継承されています。ただし、これらの検出器は、現在の回帰損失の設計の制限により、特に大きなアスペクト比を持つオブジェクトの場合、高精度の検出で顕著に機能することは困難です。水平検出は回転オブジェクト検出の特殊なケースであるという観点から、この論文では、回転と水平検出の関係の観点から、回転回帰損失の設計を誘導パラダイムから演手法に変更する動機を与えています。重要な課題の 1 つは、回転回帰損失の結合されたパラメーターを変調する方法であることを示します。これは、推定されたパラメーターが、適応的かつ相乗的な方法で動的関節最適化中に互いに影響を与える可能性があるためです。具体的には、最初に回転したバウンディングボックスを 2 次元ガウス分布に変換し、次にガウス分布間のカルバック・ライブラーダイバージェンス (KLD) を回帰損失として計算します。各パラメーターの勾配を分析することにより、KLD (およびその派生物) がオブジェクトの特性に従ってパラメーター勾配を動的に調整できることを示します。アスペクト比に応じて、角度パラメータの重要度 (グラデーションの重み) を調整します。わずかな角度エラーが大きなアスペクト比のオブジェクトの精度を大幅に低下させるため、このメカニズムは高精度の検出に不可欠です。さらに重要なことは、KLD がスケール不変であることを証明したことです。さらに、KLD 損失が水平検出の一般的な l_n-norm 損失に縮退できることを示します。異なる検出器を使用した 7 つのデータセットの実験結果は、一貫した優位性を示しており、コードは https://github.com/yangxue0827/RotationDetection で入手できます。

Existing rotated object detectors are mostly inherited from the horizontal detection paradigm, as the latter has evolved into a well-developed area. However, these detectors are difficult to perform prominently in high-precision detection due to the limitation of current regression loss design, especially for objects with large aspect ratios. Taking the perspective that horizontal detection is a special case for rotated object detection, in this paper, we are motivated to change the design of rotation regression loss from induction paradigm to deduction methodology, in terms of the relation between rotation and horizontal detection. We show that one essential challenge is how to modulate the coupled parameters in the rotation regression loss, as such the estimated parameters can influence to each other during the dynamic joint optimization, in an adaptive and synergetic way. Specifically, we first convert the rotated bounding box into a 2-D Gaussian distribution, and then calculate the Kullback-Leibler Divergence (KLD) between the Gaussian distributions as the regression loss. By analyzing the gradient of each parameter, we show that KLD (and its derivatives) can dynamically adjust the parameter gradients according to the characteristics of the object. It will adjust the importance (gradient weight) of the angle parameter according to the aspect ratio. This mechanism can be vital for high-precision detection as a slight angle error would cause a serious accuracy drop for large aspect ratios objects. More importantly, we have proved that KLD is scale invariant. We further show that the KLD loss can be degenerated into the popular l_n-norm loss for horizontal detection. Experimental results on seven datasets using different detectors show its consistent superiority, and codes are available at https://github.com/yangxue0827/RotationDetection.

updated: Thu Jun 03 2021 14:29:19 GMT+0000 (UTC)

published: Thu Jun 03 2021 14:29:19 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト