Detecting out-of-context objects using contextual cues

Manoj Acharya; Anirban Roy; Kaushik Koneripalli; Susmit Jha; Christopher Kanan; Ajay Divakaran

This paper presents an approach to detecting out-of-context (OOC) objects in an image. Given an image with a set of objects, our goal is to determine whether an object is inconsistent with the scene context and to localize the OOC object with a bounding box. In this work, we consider commonly explored contextual relations such as co-occurrence relations, the relative size of an object with respect to other objects, and the position of the object in the scene. We posit that contextual cues are useful for determining object labels for in-context objects, and that inconsistent contextual cues are detrimental to determining object labels for out-of-context objects. To realize this hypothesis, we propose a graph contextual reasoning network (GCRN) to detect OOC objects. GCRN consists of two separate graphs that predict object labels based on the contextual cues in the image: 1) a representation graph to learn object features based on the neighboring objects and 2) a context graph to explicitly capture contextual cues from the neighboring objects. GCRN explicitly captures the contextual cues to improve the detection of in-context objects and to identify objects that violate contextual relations. To evaluate our approach, we create a large-scale dataset by adding OOC object instances to the COCO images. We also evaluate on the recent OCD benchmark. Our results show that GCRN outperforms competitive baselines in detecting OOC objects and in correctly detecting in-context objects.
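To make the three contextual relations named in the abstract concrete, the following is a minimal sketch of how co-occurrence, relative size, and relative position could be computed for a pair of detected objects. It is not GCRN and not the authors' implementation: the co-occurrence table, its values, the function names, and the simple averaging rule used to flag the least supported object are all assumptions made for illustration.

```python
import math

# Hypothetical co-occurrence scores between label pairs.
# The values are invented for illustration, not taken from COCO statistics.
CO_OCCURRENCE = {
    frozenset(["car", "road"]): 0.9,
    frozenset(["bed", "road"]): 0.05,
    frozenset(["bed", "car"]): 0.02,
}


def box_area(box):
    """Area of a box given as (x1, y1, x2, y2)."""
    x1, y1, x2, y2 = box
    return max(0.0, x2 - x1) * max(0.0, y2 - y1)


def box_center(box):
    """Center point of a box given as (x1, y1, x2, y2)."""
    x1, y1, x2, y2 = box
    return ((x1 + x2) / 2.0, (y1 + y2) / 2.0)


def pairwise_cues(obj_a, obj_b, image_size):
    """Compute the three cue types for an ordered pair of objects."""
    width, height = image_size
    co = CO_OCCURRENCE.get(frozenset([obj_a["label"], obj_b["label"]]), 0.0)
    # Log area ratio: 0 means equal size, positive means obj_a is larger.
    area_a = max(box_area(obj_a["box"]), 1e-9)
    area_b = max(box_area(obj_b["box"]), 1e-9)
    log_size_ratio = math.log(area_a / area_b)
    # Center offset normalized by the image dimensions.
    ax, ay = box_center(obj_a["box"])
    bx, by = box_center(obj_b["box"])
    offset = ((ax - bx) / width, (ay - by) / height)
    return {"co_occurrence": co, "log_size_ratio": log_size_ratio, "offset": offset}


def context_support(objects, index, image_size):
    """Mean co-occurrence of one object with every other object in the image."""
    others = [o for j, o in enumerate(objects) if j != index]
    if not others:
        return 0.0
    total = sum(
        pairwise_cues(objects[index], o, image_size)["co_occurrence"] for o in others
    )
    return total / len(others)


def least_supported(objects, image_size):
    """Index of the object with the weakest contextual support (assumed OOC candidate)."""
    scores = [context_support(objects, i, image_size) for i in range(len(objects))]
    return scores.index(min(scores))


# Toy scene: a bed placed on a road next to a car.
objects = [
    {"label": "road", "box": (0, 300, 640, 480)},
    {"label": "car", "box": (100, 250, 300, 350)},
    {"label": "bed", "box": (400, 260, 600, 360)},
]
ooc_index = least_supported(objects, (640, 480))
print(objects[ooc_index]["label"])
```

In this toy scene the bed receives the weakest support from its neighbors, so it is the candidate flagged as out of context. A learned model such as the one described above would replace the hand-written table and the averaging rule with features and relations learned from data; the sketch only shows what kind of pairwise information the cues carry.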

updated: Fri Feb 11 2022 23:15:01 GMT+0000 (UTC)

published: Fri Feb 11 2022 23:15:01 GMT+0000 (UTC)

arXiv
