Neural Groundplans: Persistent Neural Scene Representations from a Single Image

Prafull Sharma; Ayush Tewari; Yilun Du; Sergey Zakharov; Rares Ambrus; Adrien Gaidon; William T. Freeman; Fredo Durand; Joshua B. Tenenbaum; Vincent Sitzmann

ニューラルグラウンドプラン: 単一の画像からの永続的なニューラルシーン表現

シーンの 2D 画像観測を永続的な 3D シーン表現にマッピングする方法を提示し、シーンの可動コンポーネントと不動コンポーネントの斬新なビュー合成ともつれた表現を可能にします。ビジョンとロボット工学で一般的に使用される鳥瞰図 (BEV) 表現に動機付けられて、永続的でメモリ効率の高いシーン表現として、条件付きニューラルグラウンドプラン、地面に沿った 2D フィーチャグリッドを提案します。私たちの方法は、微分可能なレンダリングを使用して、ラベル付けされていないマルチビュー観測から自己教師付きでトレーニングされ、オクルージョンされた領域のジオメトリと外観を完成させることを学習します。さらに、トレーニング時にマルチビュービデオを活用して、テスト時に単一の画像からシーンの静的コンポーネントと可動コンポーネントを個別に再構築する方法を学習できることを示します。可動オブジェクトを個別に再構築する機能により、オブジェクト中心の 3D 表現の抽出、新しいビュー合成、インスタンスレベルのセグメンテーション、3D バウンディングボックス予測、シーン編集など、単純なヒューリスティックを使用してさまざまなダウンストリームタスクが可能になります。これは、効率的な 3D シーン理解モデルのバックボーンとしてのニューラルグラウンドプランの価値を強調しています。

We present a method to map 2D image observations of a scene to a persistent 3D scene representation, enabling novel view synthesis and disentangled representation of the movable and immovable components of the scene. Motivated by the bird's-eye-view (BEV) representation commonly used in vision and robotics, we propose conditional neural groundplans, ground-aligned 2D feature grids, as persistent and memory-efficient scene representations. Our method is trained self-supervised from unlabeled multi-view observations using differentiable rendering, and learns to complete geometry and appearance of occluded regions. In addition, we show that we can leverage multi-view videos at training time to learn to separately reconstruct static and movable components of the scene from a single image at test time. The ability to separately reconstruct movable objects enables a variety of downstream tasks using simple heuristics, such as extraction of object-centric 3D representations, novel view synthesis, instance-level segmentation, 3D bounding box prediction, and scene editing. This highlights the value of neural groundplans as a backbone for efficient 3D scene understanding models.

updated: Mon Apr 10 2023 00:49:55 GMT+0000 (UTC)

published: Fri Jul 22 2022 17:41:24 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト