CueCAn: Cue Driven Contextual Attention For Identifying Missing Traffic Signs on Unconstrained Roads

Varun Gupta; Anbumani Subramanian; C. V. Jawahar; Rohit Saluja

Unconstrained Asian roads often have poor infrastructure, which affects overall road safety. Missing traffic signs are a regular occurrence on such roads. Missing or non-existent object detection has previously been studied for locating missing curbs and for estimating reasonable regions for pedestrians in road scene images. Such methods analyze task-specific cues for a single object type. In this paper, we present the first and most challenging video dataset for missing objects, covering multiple types of traffic signs whose cues are visible in the scenes although the signs themselves are absent. We refer to it as the Missing Traffic Signs Video Dataset (MTSVD). MTSVD is challenging compared to previous works in two respects: i) the traffic signs are generally not present in the vicinity of their cues, and ii) the traffic sign cues are diverse and unique. MTSVD is also the first publicly available missing object dataset. To train models for identifying missing signs, we complement our dataset with 10K traffic sign tracks, with 40 percent of the traffic signs having cues visible in the scenes. To identify missing signs, we propose Cue-driven Contextual Attention units (CueCAn), which we incorporate in our model encoder. We first train the encoder to classify the presence of traffic sign cues and then train the entire segmentation model end-to-end to localize missing traffic signs. Quantitative and qualitative analyses show that CueCAn significantly improves the performance of base models.
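The two-stage training order described in the abstract can be written down as a small schedule. The following is only a schematic sketch under stated assumptions, not the authors' implementation: the part names ("encoder", "cue_head", "decoder"), the stage names, and the choice to drop the classification head in the second stage are all hypothetical, since the abstract states only that the encoder is first trained to classify cue presence and that the entire segmentation model is then trained end-to-end.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    name: str          # hypothetical stage label
    objective: str     # what the stage optimizes, paraphrased from the abstract
    trainable: tuple   # hypothetical names of the model parts updated in this stage


def cue_driven_schedule():
    """Return the two training stages in the order the abstract gives them.

    All names are illustrative assumptions; the abstract does not name
    the model parts or say whether a classification head is kept later.
    """
    return (
        # Stage 1: the encoder (with its attention units) learns cue presence.
        Stage("cue_pretraining",
              "classify whether traffic sign cues are present",
              ("encoder", "cue_head")),
        # Stage 2: the whole segmentation model is trained end-to-end.
        Stage("end_to_end",
              "localize missing traffic signs via segmentation",
              ("encoder", "decoder")),
    )


def trainable_parts(stage_name):
    """Look up which (hypothetical) parts receive updates in a given stage."""
    for stage in cue_driven_schedule():
        if stage.name == stage_name:
            return set(stage.trainable)
    raise KeyError(stage_name)
```

In this sketch the encoder appears in both stages, which reflects the one point the abstract does make explicit: the encoder pretrained on cue classification is reused, not frozen or discarded, when the segmentation model is trained end-to-end.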

updated: Sun Mar 05 2023 11:06:20 GMT+0000 (UTC)

published: Sun Mar 05 2023 11:06:20 GMT+0000 (UTC)

arXiv
