Adversarial Networks for Camera Pose Regression and Refinement

Mai Bui; Christoph Baur; Nassir Navab; Slobodan Ilic; Shadi Albarqouni

Despite recent advances in direct camera pose regression using neural networks, accurately estimating the camera pose from a single RGB image remains a challenging task. To address this problem, we introduce a novel framework based, at its core, on the idea of implicitly learning the joint distribution of RGB images and their corresponding camera poses using a discriminator network and adversarial learning. Our method not only regresses the camera pose from a single image but also offers a solely RGB-based solution for camera pose refinement using the discriminator network. Further, we show that our method can effectively be used to optimize the predicted camera poses and thus improve the localization accuracy. We validate the proposed method on the publicly available 7-Scenes dataset, where it improves upon the results of direct camera pose regression methods.
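The abstract does not say how the discriminator is used for refinement, so the following is only a toy sketch of one plausible reading: hold a discriminator fixed and move the predicted pose in the direction that raises its score for the given image. Everything here is an assumption for illustration. `toy_discriminator`, `numerical_gradient`, and `refine_pose` are hypothetical names, the "discriminator" is a hand-written function rather than a trained network, the image is replaced by a feature vector that already encodes the true position, and the pose is reduced to a 3D translation with rotation left out.

```python
import math


def toy_discriminator(image_feature, pose):
    """Hypothetical stand-in for a trained discriminator D(image, pose).

    Returns a score in (0, 1] that is highest when the pose agrees with the
    image. Assumption: the 'image feature' is simply the true position,
    which a real network would have to infer from pixels.
    """
    sq_dist = sum((p - f) ** 2 for p, f in zip(pose, image_feature))
    return math.exp(-sq_dist)


def numerical_gradient(score_fn, pose, eps=1e-5):
    """Central finite-difference gradient of score_fn with respect to pose."""
    grad = []
    for i in range(len(pose)):
        up = list(pose)
        down = list(pose)
        up[i] += eps
        down[i] -= eps
        grad.append((score_fn(up) - score_fn(down)) / (2 * eps))
    return grad


def refine_pose(image_feature, initial_pose, steps=200, lr=0.5):
    """Gradient ascent on the discriminator score.

    Only the pose is updated; the discriminator itself stays fixed.
    """
    pose = list(initial_pose)

    def score_fn(candidate):
        return toy_discriminator(image_feature, candidate)

    for _ in range(steps):
        grad = numerical_gradient(score_fn, pose)
        pose = [p + lr * g for p, g in zip(pose, grad)]
    return pose


def position_error(pose, reference):
    """Euclidean distance between two positions."""
    return math.sqrt(sum((p - r) ** 2 for p, r in zip(pose, reference)))


if __name__ == "__main__":
    true_position = [1.0, 2.0, 0.5]      # made-up ground truth
    regressed = [1.3, 1.7, 0.6]          # made-up output of a pose regressor
    refined = refine_pose(true_position, regressed)
    print("error before:", position_error(regressed, true_position))
    print("error after: ", position_error(refined, true_position))
```

In this toy setting the refined position ends up closer to the reference than the initial estimate, which mirrors the abstract's claim only in spirit. A real system would take gradients through a learned network with automatic differentiation and would need to handle orientation as well as position.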

updated: Sun Oct 27 2019 21:17:06 GMT+0000 (UTC)

published: Fri Mar 15 2019 16:32:51 GMT+0000 (UTC)

arXiv
