A Saliency-Guided Street View Image Inpainting Framework for Efficient Last-Meters Wayfinding

Chuanbo Hu; Shan Jia; Fan Zhang; Xin Li

Global Positioning Systems (GPS) have played a crucial role in various navigation applications. Nevertheless, localizing the exact destination within the last few meters remains an important but unresolved problem. Limited by GPS positioning accuracy, navigation systems typically show users the vicinity of a destination rather than its exact location. Street view images (SVI) in maps, as an immersive media technology, have served as an aid that shows the physical environment for human last-meters wayfinding. However, due to the large diversity of geographic contexts and acquisition conditions, captured SVI often contain various distracting objects (e.g., pedestrians and vehicles), which draw human visual attention away from efficiently finding the destination in the last few meters. To address this problem, we highlight the importance of reducing visual distraction in image-based wayfinding by proposing a saliency-guided image inpainting framework. It aims to redirect human visual attention from distracting objects to destination-related objects for more efficient and accurate wayfinding in the last meters. Specifically, a context-aware distracting object detection method driven by deep salient object detection is designed to extract distracting objects from three semantic levels in SVI. We then employ a large-mask inpainting method with fast Fourier convolutions to remove the detected distracting objects. Experimental results with both qualitative and quantitative analysis show that our saliency-guided inpainting method not only achieves high perceptual quality in street view images but also redirects human visual attention to focus more on static location-related objects than on distracting ones. A human-based evaluation also supports the effectiveness of our method in improving the efficiency of locating the target destination.
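The step between detection and inpainting can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a per-pixel saliency map and a per-pixel semantic label map are already available, and it marks as "distracting" any pixel that is both salient and labelled with a distracting class. The function name `distracting_mask`, the class ids, the threshold, and the dilation step are all hypothetical choices made for illustration; the abstract's three semantic levels are not modelled here.

```python
import numpy as np

# Hypothetical class ids, for illustration only (e.g. "person" and "car"
# in some labelling scheme); the paper's label set is not stated here.
DISTRACTING_CLASSES = (11, 13)


def distracting_mask(saliency, labels, distracting_classes=DISTRACTING_CLASSES,
                     threshold=0.5, dilate=1):
    """Return a boolean mask of pixels to hand to an inpainting model.

    saliency: float array of shape (H, W), values assumed to lie in [0, 1].
    labels:   int array of shape (H, W) with one semantic class id per pixel.
    """
    salient = saliency >= threshold
    distracting = np.isin(labels, list(distracting_classes))
    mask = salient & distracting

    # Grow the mask by `dilate` pixels (4-neighbourhood) so that object
    # borders are also covered when the region is later filled in.
    for _ in range(dilate):
        padded = np.pad(mask, 1, mode="constant", constant_values=False)
        mask = (padded[1:-1, 1:-1]
                | padded[:-2, 1:-1] | padded[2:, 1:-1]
                | padded[1:-1, :-2] | padded[1:-1, 2:])
    return mask
```

The resulting mask would then be passed, together with the image, to a large-mask inpainting model; that step depends on the chosen model and is not shown.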

updated: Thu Nov 17 2022 13:40:37 GMT+0000 (UTC)

published: Sat May 14 2022 00:16:38 GMT+0000 (UTC)

arXiv
