Towards Few-Shot Open-Set Object Detection

Binyi Su; Hua Zhang; Jingzhi Li; Zhong Zhou

少数ショットオープンセットオブジェクト検出に向けて

オープンセットオブジェクト検出 (OSOD) は、動的な世界で既知のカテゴリを検出し、未知のオブジェクトを識別することを目的としており、大きな注目を集めています。ただし、以前のアプローチでは、この問題はデータが豊富な状況でのみ考慮され、ショット数の少ないシーンは無視されていました。このホワイトペーパーでは、すべての既知のクラスを検出し、未知のクラスを識別しながら、少数のサンプルに基づいて検出器を迅速にトレーニングすることを目的とした少数ショットオープンセットオブジェクト検出 (FSOSOD) のソリューションを探します。このタスクの主な課題は、ほとんどのトレーニングサンプルがモデルを既知のクラスにオーバーフィットさせ、結果としてオープンセットのパフォーマンスが低下することです。この問題に取り組むために、Few-shOt Open-set Detector (FOOD) という名前の新しい FSOSOD アルゴリズムを提案します。これには、新しいクラス重みスパース化分類器 (CWSC) と新しい未知のデカップリング学習器 (UDL) が含まれています。オーバーフィッティングを防ぐために、CWSC はすべてのクラスのロジット予測のために正規化された重みの一部をランダムにスパースしてから、クラスとその隣接クラスの間の相互適応性を減らします。同時に、UDL は未知のクラスのトレーニングを分離し、モデルがコンパクトな未知の決定境界を形成できるようにします。したがって、未知のオブジェクトは、トレーニング用の疑似未知サンプルなしで信頼確率で識別できます。数ショットのシーンでいくつかの最先端の OSOD メソッドと私たちの方法を比較し、私たちの方法が VOC-COCO データセット設定のすべてのショットで未知のクラスの再現率を 5%-9% 改善することを観察します。

Open-set object detection (OSOD) aims to detect the known categories and identify unknown objects in a dynamic world, which has achieved significant attentions. However, previous approaches only consider this problem in data-abundant conditions, while neglecting the few-shot scenes. In this paper, we seek a solution for the few-shot open-set object detection (FSOSOD), which aims to quickly train a detector based on few samples while detecting all known classes and identifying unknown classes. The main challenge for this task is that few training samples induce the model to overfit on the known classes, resulting in a poor open-set performance. We propose a new FSOSOD algorithm to tackle this issue, named Few-shOt Open-set Detector (FOOD), which contains a novel class weight sparsification classifier (CWSC) and a novel unknown decoupling learner (UDL). To prevent over-fitting, CWSC randomly sparses parts of the normalized weights for the logit prediction of all classes, and then decreases the co-adaptability between the class and its neighbors. Alongside, UDL decouples training the unknown class and enables the model to form a compact unknown decision boundary. Thus, the unknown objects can be identified with a confidence probability without any pseudo-unknown samples for training. We compare our method with several state-of-the-art OSOD methods in few-shot scenes and observe that our method improves the recall of unknown classes by 5%-9% across all shots in VOC-COCO dataset setting.

updated: Fri Dec 09 2022 12:32:47 GMT+0000 (UTC)

published: Fri Oct 28 2022 09:02:32 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト