PVD-AL: Progressive Volume Distillation with Active Learning for Efficient Conversion Between Different NeRF Architectures

Shuangkang Fang; Yufeng Wang; Yi Yang; Weixin Xu; Heng Wang; Wenrui Ding; Shuchang Zhou

Neural Radiance Fields (NeRF) have been widely adopted as a practical and versatile representation for 3D scenes, facilitating various downstream tasks. However, the different architectures, including plain Multi-Layer Perceptrons (MLPs), tensors, low-rank tensors, hashtables, and their compositions, each come with trade-offs. For instance, hashtable-based representations allow faster rendering but lack clear geometric meaning, which makes spatial-relation-aware editing challenging. To address this limitation and maximize the potential of each architecture, we propose Progressive Volume Distillation with Active Learning (PVD-AL), a systematic distillation method that enables any-to-any conversion between different architectures. PVD-AL decomposes each structure into two parts and progressively performs distillation from shallower to deeper volume representations, leveraging effective information retrieved from the rendering process. Additionally, a three-level active learning technique provides continuous feedback during the distillation process, leading to high-performing distilled models. We present empirical evidence validating our method on multiple benchmark datasets. For example, PVD-AL can distill an MLP-based model from a hashtable-based model 10 to 20 times faster, and with 0.8 dB to 2 dB higher PSNR, than training the NeRF model from scratch. Moreover, PVD-AL permits the fusion of diverse features among distinct structures, enabling models with multiple editing properties and providing a more efficient model to meet real-time requirements. Project website: http://sk-fun.fun/PVD-AL.
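To put the reported 0.8 dB to 2 dB PSNR gain in perspective, the sketch below computes PSNR using its standard definition and converts a gain in dB into the corresponding reduction in mean squared error. It is an illustration only and not code from the paper: the function names `psnr` and `mse_ratio_for_gain` are hypothetical, and pixel intensities are assumed to be normalized to [0, 1].

```python
import math


def psnr(reference, rendered, max_value=1.0):
    """PSNR in dB between two equal-length sequences of pixel intensities.

    Assumes intensities lie in [0, max_value] (here normalized to 1.0).
    """
    if len(reference) != len(rendered):
        raise ValueError("inputs must have the same length")
    if not reference:
        raise ValueError("inputs must not be empty")
    # Mean squared error over all pixels.
    mse = sum((a - b) ** 2 for a, b in zip(reference, rendered)) / len(reference)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)


def mse_ratio_for_gain(gain_db):
    """Factor by which the MSE shrinks for a given PSNR gain in dB."""
    return 10.0 ** (gain_db / 10.0)


if __name__ == "__main__":
    for gain in (0.8, 2.0):
        ratio = mse_ratio_for_gain(gain)
        print(f"{gain} dB gain -> MSE divided by {ratio:.3f}")
```

Under this definition, a gain of 0.8 dB corresponds to dividing the mean squared error by about 1.20, and a gain of 2 dB to dividing it by about 1.58, that is, roughly 17% to 37% lower error.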

updated: Sat Apr 08 2023 13:59:18 GMT+0000 (UTC)

published: Sat Apr 08 2023 13:59:18 GMT+0000 (UTC)

arXiv
