Generative Occupancy Fields for 3D Surface-Aware Image Synthesis

Xudong Xu; Xingang Pan; Dahua Lin; Bo Dai

3D表面認識画像合成のための生成的占有フィールド

生成ラディアンスフィールドの出現により、3D対応の画像合成の開発が大幅に促進されました。放射輝度フィールドでの累積レンダリングプロセスにより、グラデーションがボリューム全体に分散されるため、これらの生成モデルのトレーニングがはるかに簡単になりますが、オブジェクトの表面が拡散します。その間、放射輝度フィールドと比較して、占有表現は本質的に決定論的な表面を保証することができます。ただし、生成モデルに占有表現を直接適用すると、トレーニング中に、オブジェクトサーフェスに配置されたスパース勾配のみを受け取り、最終的に収束の問題が発生します。本論文では、生成的放射性場に基づく新しいモデルである生成的占有場（GOF）を提案し、その訓練収束を妨げることなくコンパクトな物体表面を学習することができる。 GOFの重要な洞察は、放射輝度フィールドでの累積レンダリングから、学習されたサーフェスがますます正確になるにつれて、サーフェスポイントのみを使用したレンダリングへの専用の移行です。このように、GOFは2つの表現のメリットを統合されたフレームワークに組み合わせています。実際には、放射輝度フィールドと行進から占有表現への開始のトレーニング時間遷移は、レンダリングプロセスのサンプリング領域をボリューム全体から表面周辺の最小の隣接領域に徐々に縮小することによってGOFで実現されます。複数のデータセットでの包括的な実験を通じて、GOFが3Dの一貫性を備えた高品質の画像を合成し、同時にコンパクトで滑らかなオブジェクト表面を学習できることを示します。コード、モデル、およびデモビデオは、https：//sheldontsui.github.io/projects/GOFで入手できます。

The advent of generative radiance fields has significantly promoted the development of 3D-aware image synthesis. The cumulative rendering process in radiance fields makes training these generative models much easier since gradients are distributed over the entire volume, but leads to diffused object surfaces. In the meantime, compared to radiance fields occupancy representations could inherently ensure deterministic surfaces. However, if we directly apply occupancy representations to generative models, during training they will only receive sparse gradients located on object surfaces and eventually suffer from the convergence problem. In this paper, we propose Generative Occupancy Fields (GOF), a novel model based on generative radiance fields that can learn compact object surfaces without impeding its training convergence. The key insight of GOF is a dedicated transition from the cumulative rendering in radiance fields to rendering with only the surface points as the learned surface gets more and more accurate. In this way, GOF combines the merits of two representations in a unified framework. In practice, the training-time transition of start from radiance fields and march to occupancy representations is achieved in GOF by gradually shrinking the sampling region in its rendering process from the entire volume to a minimal neighboring region around the surface. Through comprehensive experiments on multiple datasets, we demonstrate that GOF can synthesize high-quality images with 3D consistency and simultaneously learn compact and smooth object surfaces. Code, models, and demo videos are available at https://sheldontsui.github.io/projects/GOF

updated: Mon Nov 01 2021 14:20:43 GMT+0000 (UTC)

published: Mon Nov 01 2021 14:20:43 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト