Boosting R-CNN: Reweighting R-CNN Samples by RPN's Error for Underwater Object Detection

Pinhao Song; Hong Liu; Linhui Dai; Tao Wang; Zhan Chen

R-CNNのブースト：水中オブジェクト検出のためのRPNのエラーによるR-CNNサンプルの再重み付け

複雑な水中環境は、不均衡な光条件、低コントラスト、閉塞、水生生物の模倣など、物体検出に新たな課題をもたらします。このような状況では、水中カメラによってキャプチャされたオブジェクトはあいまいになり、一般的な検出器はこれらのあいまいなオブジェクトで失敗することがよくあります。この作業は、不確実性モデリングとハードサンプルマイニングの2つの観点から問題を解決することを目的としています。 3つの主要コンポーネントで構成されるブースティングR-CNNという名前の2ステージ水中検出器を提案します。最初に、RetinaRPNという名前の新しい地域提案ネットワークが提案されます。これは、高品質の提案を提供し、オブジェクトの事前確率をモデル化するために、オブジェクト性と不確実性のIoU予測を考慮します。次に、確率的推論パイプラインを導入して、第1段階の事前不確実性と第2段階の分類スコアを組み合わせて、最終的な検出スコアをモデル化します。最後に、ブースティングリウェイトという名前の新しいハードサンプルマイニング方法を提案します。具体的には、領域提案ネットワークがサンプルのオブジェクト事前確率を誤って計算する場合、再重み付けを増やすと、トレーニング中のR-CNNヘッド内のサンプルの分類損失が増加し、正確に推定された事前確率を持つ簡単なサンプルの損失が減少します。これにより、第2段階の堅牢な検出ヘッドを得ることができます。推論段階では、R-CNNには、パフォーマンスを向上させるために最初の段階のエラーを修正する機能があります。 2つの水中データセットと2つの一般的なオブジェクト検出データセットに関する包括的な実験は、私たちの方法の有効性と堅牢性を示しています。

Complicated underwater environments bring new challenges to object detection, such as unbalanced light conditions, low contrast, occlusion, and mimicry of aquatic organisms. Under these circumstances, the objects captured by the underwater camera will become vague, and the generic detectors often fail on these vague objects. This work aims to solve the problem from two perspectives: uncertainty modeling and hard example mining. We propose a two-stage underwater detector named boosting R-CNN, which comprises three key components. First, a new region proposal network named RetinaRPN is proposed, which provides high-quality proposals and considers objectness and IoU prediction for uncertainty to model the object prior probability. Second, the probabilistic inference pipeline is introduced to combine the first-stage prior uncertainty and the second-stage classification score to model the final detection score. Finally, we propose a new hard example mining method named boosting reweighting. Specifically, when the region proposal network miscalculates the object prior probability for a sample, boosting reweighting will increase the classification loss of the sample in the R-CNN head during training, while reducing the loss of easy samples with accurately estimated priors. Thus, a robust detection head in the second stage can be obtained. During the inference stage, the R-CNN has the capability to rectify the error of the first stage to improve the performance. Comprehensive experiments on two underwater datasets and two generic object detection datasets demonstrate the effectiveness and robustness of our method.

updated: Fri Sep 16 2022 10:55:28 GMT+0000 (UTC)

published: Tue Jun 28 2022 03:29:20 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト