Towards Accurate Localization by Instance Search

Yi-Geng Hong; Hui-Chu Xiao; Wan-Lei Zhao

インスタンス検索による正確なローカリゼーションに向けて

視覚的なオブジェクトのローカリゼーションは、一連のオブジェクト検出タスクの重要なステップです。文献では、主流の強力に監視されたフレームワークを使用して、高いローカリゼーション精度が達成されています。ただし、このようなメソッドにはオブジェクトレベルの注釈が必要であり、不明なカテゴリのオブジェクトを検出することはできません。弱く監視された方法も同様の問題に直面します。この論文では、インスタンス検索によって返されるランクリスト上で正確なオブジェクトのローカリゼーションを実現するために、自己ペースの学習フレームワークが提案されています。提案されたフレームワークは、クエリとそれに対応する上位の検索結果からターゲットインスタンスを徐々にマイニングします。クエリとランクリスト内の画像の間で共通のインスタンスが共有されるため、オブジェクトカテゴリが何であるかを知らなくても、ターゲットのビジュアルインスタンスを正確にローカライズできます。インスタンス検索でローカリゼーションを実行することに加えて、数ショットのオブジェクト検出の問題も同じフレームワークで対処されます。両方のタスクで、最先端の方法よりも優れたパフォーマンスが見られます。

Visual object localization is the key step in a series of object detection tasks. In the literature, high localization accuracy is achieved with the mainstream strongly supervised frameworks. However, such methods require object-level annotations and are unable to detect objects of unknown categories. Weakly supervised methods face similar difficulties. In this paper, a self-paced learning framework is proposed to achieve accurate object localization on the rank list returned by instance search. The proposed framework mines the target instance gradually from the queries and their corresponding top-ranked search results. Since a common instance is shared between the query and the images in the rank list, the target visual instance can be accurately localized even without knowing what the object category is. In addition to performing localization on instance search, the issue of few-shot object detection is also addressed under the same framework. Superior performance over state-of-the-art methods is observed on both tasks.

updated: Sat Aug 07 2021 14:58:34 GMT+0000 (UTC)

published: Sun Jul 11 2021 10:03:31 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト