BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-scale Scene Rendering

Yuanbo Xiangli; Linning Xu; Xingang Pan; Nanxuan Zhao; Anyi Rao; Christian Theobalt; Bo Dai; Dahua Lin

BungeeNeRF：極端なマルチスケールシーンレンダリングのためのプログレッシブニューラルラディアンスフィールド

神経放射輝度フィールド（NeRF）は、通常は単一のスケールで、3Dオブジェクトと制御されたシーンのモデリングで卓越したパフォーマンスを実現しました。この作業では、画像の大きな変化が大幅に異なるスケールで観察されるマルチスケールのケースに焦点を当てます。このシナリオは、都市の概要をキャプチャする衛星レベルから、建築の複雑な詳細を示す地上レベルの画像まで、さまざまなビューを持つ都市のシーンなど、実際の3D環境に広く存在します。また、風景や繊細なMinecraftの3Dモデルでも一般的に識別できます。これらのシーン内の広い範囲の表示位置は、非常に異なる詳細レベルのマルチスケールレンダリングを生成します。これは、神経放射輝度フィールドに大きな課題をもたらし、妥協した結果にバイアスをかけます。これらの問題に対処するために、大幅に変化するスケールにわたって詳細レベルのレンダリングを実現するプログレッシブニューラルラディアンスフィールドであるBungeeNeRFを紹介します。遠方のビューを浅いベースブロックに合わせるところから始めて、トレーニングが進むにつれて、新しいブロックが追加され、ますます近づくビューの新しい詳細に対応します。この戦略は、NeRFの位置エンコーディング入力の高周波チャネルを段階的にアクティブにし、トレーニングが進むにつれて、より複雑な詳細を連続的に展開します。複数のデータソース（都市モデル、合成、およびドローンでキャプチャされたデータ）のビューが大幅に異なる多様なマルチスケールシーンのモデリングにおけるBungeeNeRFの優位性と、さまざまな詳細レベルでの高品質レンダリングのサポートを示します。

Neural radiance fields (NeRF) has achieved outstanding performance in modeling 3D objects and controlled scenes, usually under a single scale. In this work, we focus on multi-scale cases where large changes in imagery are observed at drastically different scales. This scenario vastly exists in real-world 3D environments, such as city scenes, with views ranging from satellite level that captures the overview of a city, to ground level imagery showing complex details of an architecture; and can also be commonly identified in landscape and delicate minecraft 3D models. The wide span of viewing positions within these scenes yields multi-scale renderings with very different levels of detail, which poses great challenges to neural radiance field and biases it towards compromised results. To address these issues, we introduce BungeeNeRF, a progressive neural radiance field that achieves level-of-detail rendering across drastically varied scales. Starting from fitting distant views with a shallow base block, as training progresses, new blocks are appended to accommodate the emerging details in the increasingly closer views. The strategy progressively activates high-frequency channels in NeRF's positional encoding inputs and successively unfolds more complex details as the training proceeds. We demonstrate the superiority of BungeeNeRF in modeling diverse multi-scale scenes with drastically varying views on multiple data sources (city models, synthetic, and drone captured data) and its support for high-quality rendering in different levels of detail.

updated: Mon Jul 25 2022 05:03:26 GMT+0000 (UTC)

published: Fri Dec 10 2021 13:16:21 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト