DiViNeT: 3D Reconstruction from Disparate Views via Neural Template Regularization

Aditya Vora; Akshay Gadi Patil; Hao Zhang

DiViNet: ニューラルテンプレートの正則化による異種ビューからの 3D 再構成

我々は、わずか 3 つの異なる RGB 画像を入力として受け取る、ボリュームレンダリングベースの神経表面再構成法を提案します。私たちの重要なアイデアは、表面事前分布として機能する一連のニューラルテンプレートを学習することで、著しく位置が悪く、まばらなビュー間に大きなギャップが残る再構成を正規化することです。私たちの手法は DiViNet と名付けられ、2 つの段階で動作します。最初の段階では、3D の監視なしで、さまざまなシーンにわたって 3D ガウス関数の形式でテンプレートを学習します。再構築段階では、予測されたテンプレートがアンカーとして機能し、まばらな領域上にサーフェスを「縫い合わせる」のに役立ちます。私たちのアプローチが表面ジオメトリを完成させるだけでなく、いくつかの異なる入力ビューから合理的な範囲で表面の詳細を再構築できることを実証します。 DTU および BlendedMVS データセットでは、このような疎なビューが存在する場合、私たちのアプローチは既存の手法の中で最高の再構成品質を実現し、密なビューが入力として使用される場合、競合する手法と同等かそれ以上のパフォーマンスを発揮します。

We present a volume rendering-based neural surface reconstruction method that takes as few as three disparate RGB images as input. Our key idea is to regularize the reconstruction, which is severely ill-posed and leaving significant gaps between the sparse views, by learning a set of neural templates that act as surface priors. Our method coined DiViNet, operates in two stages. The first stage learns the templates, in the form of 3D Gaussian functions, across different scenes, without 3D supervision. In the reconstruction stage, our predicted templates serve as anchors to help "stitch" the surfaces over sparse regions. We demonstrate that our approach is not only able to complete the surface geometry but also reconstructs surface details to a reasonable extent from few disparate input views. On the DTU and BlendedMVS datasets, our approach achieves the best reconstruction quality among existing methods in the presence of such sparse views, and performs on par, if not better, with competing methods when dense views are employed as inputs.

updated: Thu Jun 15 2023 04:08:25 GMT+0000 (UTC)

published: Wed Jun 07 2023 18:05:14 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト