Generating Positive Bounding Boxes for Balanced Training of Object Detectors

Kemal Oksuz; Baris Can Cam; Emre Akbas; Sinan Kalkan

オブジェクト検出器のバランスの取れたトレーニングのための正の境界ボックスの生成

2段階の深層オブジェクト検出器は、最初の段階で一連の関心領域（RoI）を生成し、2番目の段階で、提案されたRoIからグラウンドトゥルース（GT）ボックスと十分に重なるオブジェクトを特定します。 2番目の段階は、関連するGTボックスとの低い交差（IoU）を持つRoIへのバイアスに悩まされることが知られています。この問題に対処するために、まず、特定のIoUしきい値を超えて特定の参照ボックスとオーバーラップする境界ボックス（BB）を生成するサンプリング方法を提案します。次に、このBB生成メソッドを使用して、第2ステージの任意の空間またはIoU分布に従ってRoIを生成する正のRoI（pRoI）ジェネレーターを開発します。 pRoIジェネレーターは、ハードサンプルマイニングやプライムサンプリングなどの肯定的な例に対して、他のサンプリング方法をシミュレートできることを示します。ジェネレーターを分析ツールとして使用して、（i）IoU不均衡がパフォーマンスに悪影響を与えること、（ii）ハードポジティブサンプルマイニングが特定の入力IoU分布に対してのみパフォーマンスを改善すること、（iii）前景クラス間の不均衡を示すパフォーマンスに悪影響を及ぼし、バッチレベルで軽減できること。最後に、pRoIジェネレーターを使用してより高速なR-CNNをトレーニングし、従来のトレーニングと比較して、Pascal VOCおよびMS COCOデータセットのIoUが高くなるようにトレーニングすると、低IoUでより良いまたは同等のパフォーマンスが得られ、大幅な改善が得られます。コードはhttps://github.com/kemaloksuz/BoundingBoxGeneratorで入手できます。

Two-stage deep object detectors generate a set of regions-of-interest (RoI) in the first stage, then, in the second stage, identify objects among the proposed RoIs that sufficiently overlap with a ground truth (GT) box. The second stage is known to suffer from a bias towards RoIs that have low intersection-over-union (IoU) with the associated GT boxes. To address this issue, we first propose a sampling method to generate bounding boxes (BB) that overlap with a given reference box more than a given IoU threshold. Then, we use this BB generation method to develop a positive RoI (pRoI) generator that produces RoIs following any desired spatial or IoU distribution, for the second-stage. We show that our pRoI generator is able to simulate other sampling methods for positive examples such as hard example mining and prime sampling. Using our generator as an analysis tool, we show that (i) IoU imbalance has an adverse effect on performance, (ii) hard positive example mining improves the performance only for certain input IoU distributions, and (iii) the imbalance among the foreground classes has an adverse effect on performance and that it can be alleviated at the batch level. Finally, we train Faster R-CNN using our pRoI generator and, compared to conventional training, obtain better or on-par performance for low IoUs and significant improvements when trained for higher IoUs for Pascal VOC and MS COCO datasets. The code is available at: https://github.com/kemaloksuz/BoundingBoxGenerator.

updated: Fri Jun 19 2020 07:50:03 GMT+0000 (UTC)

published: Sat Sep 21 2019 05:27:15 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト