Voxurf: Voxel-based Efficient and Accurate Neural Surface Reconstruction

Tong Wu; Jiaqi Wang; Xingang Pan; Xudong Xu; Christian Theobalt; Ziwei Liu; Dahua Lin

Voxurf: ボクセルベースの効率的かつ正確な神経表面再構築

神経表面再構成は、多視点画像に基づいて正確な 3D 表面を再構成することを目的としています。ニューラルボリュームレンダリングに基づく以前の方法では、ほとんどの場合、MLP を使用して完全に暗黙的なモデルをトレーニングします。これは通常、1 つのシーンに何時間ものトレーニングを必要とします。最近の取り組みでは、学習可能なボセルグリッドを使用して重要な情報を記憶することにより、最適化を加速する明示的なボリューム表現を調査しています。ただし、既存のボクセルベースの方法は、SDF ベースのボリュームレンダリングスキームと組み合わせた場合でも、きめの細かいジオメトリの再構築に苦労することがよくあります。これは、1) ボクセルグリッドが、細かいジオメトリの学習を容易にする色ジオメトリ依存性を破る傾向があること、および 2) 制約の少ないボクセルグリッドが空間的一貫性を欠き、局所的最小値に対して脆弱であるためであることを明らかにします。この作業では、効率的で正確なボクセルベースの表面再構成アプローチである Voxurf を紹介します。 Voxurf は、1) コヒーレントな粗い形状を達成し、細かいディテールを連続して回復する 2 段階のトレーニング手順、2) 色ジオメトリ依存性を維持するデュアルカラーネットワーク、および 3) 階層ジオメトリを含む、いくつかの主要な設計によって前述の問題に対処します。ボクセル間の情報伝播を促進する機能。広範な実験により、Voxurf が高効率と高品質を同時に達成することが示されています。 DTU ベンチマークでは、Voxurf は、以前の完全に暗黙的な方法と比較して、トレーニングを 20 倍高速化し、より高い再構成品質を実現しています。

Neural surface reconstruction aims to reconstruct accurate 3D surfaces based on multi-view images. Previous methods based on neural volume rendering mostly train a fully implicit model with MLPs, which typically require hours of training for a single scene. Recent efforts explore the explicit volumetric representation to accelerate the optimization via memorizing significant information with learnable voxel grids. However, existing voxel-based methods often struggle in reconstructing fine-grained geometry, even when combined with an SDF-based volume rendering scheme. We reveal that this is because 1) the voxel grids tend to break the color-geometry dependency that facilitates fine-geometry learning, and 2) the under-constrained voxel grids lack spatial coherence and are vulnerable to local minima. In this work, we present Voxurf, a voxel-based surface reconstruction approach that is both efficient and accurate. Voxurf addresses the aforementioned issues via several key designs, including 1) a two-stage training procedure that attains a coherent coarse shape and recovers fine details successively, 2) a dual color network that maintains color-geometry dependency, and 3) a hierarchical geometry feature to encourage information propagation across voxels. Extensive experiments show that Voxurf achieves high efficiency and high quality at the same time. On the DTU benchmark, Voxurf achieves higher reconstruction quality with a 20x training speedup compared to previous fully implicit methods.

updated: Tue Oct 04 2022 12:24:43 GMT+0000 (UTC)

published: Fri Aug 26 2022 14:48:02 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト