Improving Feature-based Visual Localization by Geometry-Aided Matching

Hailin Yu; Youji Feng; Weicai Ye; Mingxuan Jiang; Hujun Bao; Guofeng Zhang

ジオメトリ支援マッチングによる機能ベースのビジュアルローカリゼーションの改善

2D と 3D の対応がカメラの姿勢の精度を決定する上で重要な役割を果たしている視覚的な位置特定では、特徴の一致が非常に重要です。十分に分散された 2D-3D 対応の十分な数は、ノイズによる正確な姿勢推定に不可欠です。ただし、既存の 2D-3D フィーチャマッチング方法は、フィーチャ空間で最近傍を見つけ、手作りのヒューリスティックを使用して外れ値を削除することに依存しているため、潜在的な一致が見逃されたり、正しい一致が除外されたりする可能性があります。この作業では、Geometry-Aided Matching (GAM) と呼ばれる新しい方法を提案します。これは、外観情報と幾何学的コンテキストの両方を組み込んで、この問題に対処し、2D-3D フィーチャマッチングを改善します。 GAM は、高い精度を維持しながら、2D-3D マッチの再現率を大幅に向上させることができます。 GAM を新しい階層的なビジュアルローカリゼーションパイプラインに適用し、GAM がローカリゼーションの堅牢性と精度を効果的に改善できることを示します。広範な実験により、GAM は手作りのヒューリスティックや学習ベースラインよりも実際の一致を見つけることができることが示されています。私たちが提案するローカリゼーション手法は、複数のビジュアルローカリゼーションデータセットで最先端の結果を達成します。ケンブリッジランドマークデータセットの実験では、私たちの方法が既存の最先端の方法よりも優れており、最高の方法よりも 6 倍高速であることが示されています。ソースコードは https://github.com/openxrlab/xrlocalization で入手できます。

Feature matching is crucial in visual localization, where 2D-3D correspondence plays a major role in determining the accuracy of camera pose. A sufficient number of well-distributed 2D-3D correspondences is essential for accurate pose estimation due to noise. However, existing 2D-3D feature matching methods rely on finding nearest neighbors in the feature space and removing outliers using hand-crafted heuristics, which may lead to potential matches being missed or the correct matches being filtered out. In this work, we propose a novel method called Geometry-Aided Matching (GAM), which incorporates both appearance information and geometric context to address this issue and to improve 2D-3D feature matching. GAM can greatly boost the recall of 2D-3D matches while maintaining high precision. We apply GAM to a new hierarchical visual localization pipeline and show that GAM can effectively improve the robustness and accuracy of localization. Extensive experiments show that GAM can find more real matches than hand-crafted heuristics and learning baselines. Our proposed localization method achieves state-of-the-art results on multiple visual localization datasets. Experiments on Cambridge Landmarks dataset show that our method outperforms the existing state-of-the-art methods and is six times faster than the top-performed method. The source code is available at https://github.com/openxrlab/xrlocalization.

updated: Sun Mar 05 2023 12:12:53 GMT+0000 (UTC)

published: Wed Nov 16 2022 07:02:12 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト