Soft Expectation and Deep Maximization for Image Feature Detection

Alexander Mai; Allen Yang; Dominique E. Meyer

画像特徴検出のためのソフトな期待と深い最大化

多くのマルチビュージオメトリアルゴリズムのアプリケーションの中心は、複数の視点間の一致点の抽出であり、カメラポーズの推定や3D再構築などの古典的なタスクを可能にします。何十年にもわたって、これらのポイントを特徴付ける多くのアプローチが、手作業で調整された外観モデルと最近ではデータ駆動型の学習方法に基づいて提案されてきました。 SEDMを提案します。これは、質問を反転し、最初に繰り返し可能な3Dポイントを探し、次に検出器をトレーニングしてそれらを画像空間にローカライズする、反復的な半教師あり学習プロセスです。私たちの手法は、期待値最大化（EM）の1つとして問題を提起します。この場合、検出器が3Dポイントを特定する可能性が、最大化される目的関数です。シーンのジオメトリを利用して、これらの3Dポイントの位置の推定を洗練し、期待ステップ中に新しい疑似グラウンドトゥルースを生成し、最大化ステップでこの疑似グラウンドトゥルースを予測するように検出器をトレーニングします。検出器を、視覚的位置特定、スパース3D再構成、および平均マッチング精度の標準ベンチマークに適用します。私たちの結果は、SEDMを使用してトレーニングされたこの新しいモデルは、シーン内の基になる3Dポイントをより適切にローカライズでき、SuperPointと比較した場合の平均SfM品質を-0.15±0.11、R2D2と比較した場合の平均再投影エラーを-0.38±0.23改善できることを示しています。

Central to the application of many multi-view geometry algorithms is the extraction of matching points between multiple viewpoints, enabling classical tasks such as camera pose estimation and 3D reconstruction. Over the decades, many approaches that characterize these points have been proposed based on hand-tuned appearance models and more recently data-driven learning methods. We propose SEDM, an iterative semi-supervised learning process that flips the question and first looks for repeatable 3D points, then trains a detector to localize them in image space. Our technique poses the problem as one of expectation maximization (EM), where the likelihood of the detector locating the 3D points is the objective function to be maximized. We utilize the geometry of the scene to refine the estimates of the location of these 3D points and produce a new pseudo ground truth during the expectation step, then train a detector to predict this pseudo ground truth in the maximization step. We apply our detector to standard benchmarks in visual localization, sparse 3D reconstruction, and mean matching accuracy. Our results show that this new model trained using SEDM is able to better localize the underlying 3D points in a scene, improving mean SfM quality by -0.15±0.11 mean reprojection error when compared to SuperPoint or -0.38±0.23 when compared to R2D2.

updated: Wed Apr 21 2021 00:35:32 GMT+0000 (UTC)

published: Wed Apr 21 2021 00:35:32 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト