LU-NeRF: Scene and Pose Estimation by Synchronizing Local Unposed NeRFs

Zezhou Cheng; Carlos Esteves; Varun Jampani; Abhishek Kar; Subhransu Maji; Ameesh Makadia

A critical obstacle to deploying NeRF models broadly in the wild is their reliance on accurate camera poses. Consequently, there is growing interest in extending NeRF models to jointly optimize camera poses and scene representation, offering an alternative to off-the-shelf SfM pipelines, which have well-understood failure modes. Existing approaches to unposed NeRF operate under restrictive assumptions, such as a prior pose distribution or a coarse pose initialization, which makes them less effective in a general setting. In this work, we propose a novel approach, LU-NeRF, that jointly estimates camera poses and neural radiance fields with relaxed assumptions on pose configuration. Our approach operates in a local-to-global manner: we first optimize over local subsets of the data, dubbed mini-scenes, and LU-NeRF estimates local pose and geometry for this challenging few-shot task. The mini-scene poses are then brought into a global reference frame through a robust pose synchronization step, after which a final global optimization of pose and scene can be performed. We show that our LU-NeRF pipeline outperforms prior attempts at unposed NeRF without making restrictive assumptions on the pose prior. This allows us to operate in the general SE(3) pose setting, unlike the baselines. Our results also indicate that our model can be complementary to feature-based SfM pipelines, as it compares favorably to COLMAP on low-texture and low-resolution images.
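As a rough illustration of what bringing mini-scene poses into a global reference frame involves, the sketch below chains relative poses along a breadth-first spanning tree of the mini-scene graph. It is a deliberately naive stand-in and not the robust synchronization step of LU-NeRF: it does no outlier rejection and does not average over redundant edges. The function names (`propagate_poses`, `invert_rigid`, `rot_z`) and the pose convention (4x4 world-from-camera matrices, with a relative pose defined as T_ij = T_i^{-1} T_j) are assumptions made for this example only.

```python
from collections import deque
import math


def matmul(a, b):
    """Multiply two 4x4 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def invert_rigid(t):
    """Invert a rigid transform [R | p]: the inverse is [R^T | -R^T p]."""
    rt = [[t[j][i] for j in range(3)] for i in range(3)]
    p = [t[i][3] for i in range(3)]
    new_p = [-sum(rt[i][k] * p[k] for k in range(3)) for i in range(3)]
    return [rt[i] + [new_p[i]] for i in range(3)] + [[0.0, 0.0, 0.0, 1.0]]


def identity():
    return [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]


def rot_z(theta, tx=0.0, ty=0.0, tz=0.0):
    """Hypothetical helper: rotation about z by theta plus a translation."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0.0, tx],
            [s, c, 0.0, ty],
            [0.0, 0.0, 1.0, tz],
            [0.0, 0.0, 0.0, 1.0]]


def propagate_poses(relative, root=0):
    """Chain relative poses T_ij = T_i^{-1} T_j outward from `root`.

    `relative` maps an edge (i, j) to its 4x4 relative pose. The root is
    fixed to the identity, which resolves the global gauge freedom.
    Nodes not connected to the root are left out of the result.
    """
    neighbors = {}
    for (i, j), t_ij in relative.items():
        neighbors.setdefault(i, []).append((j, t_ij))
        # The reverse edge uses the inverse measurement.
        neighbors.setdefault(j, []).append((i, invert_rigid(t_ij)))

    global_poses = {root: identity()}
    queue = deque([root])
    while queue:
        i = queue.popleft()
        for j, t_ij in neighbors.get(i, []):
            if j not in global_poses:
                global_poses[j] = matmul(global_poses[i], t_ij)
                queue.append(j)
    return global_poses
```

With noise-free relative poses this recovers the global poses exactly, up to the choice of root frame. With noisy or outlier-contaminated estimates, errors accumulate along the tree, which is why a robust synchronization over all edges, as the abstract describes, is preferable in practice.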

updated: Thu Jun 08 2023 17:56:22 GMT+0000 (UTC)

published: Thu Jun 08 2023 17:56:22 GMT+0000 (UTC)

arXiv
