Stylizing 3D Scene via Implicit Representation and HyperNetwork

Pei-Ze Chiang; Meng-Shiun Tsai; Hung-Yu Tseng; Wei-sheng Lai; Wei-Chen Chiu

陰的表現とハイパーネットワークによる3Dシーンのスタイリング

この作業では、3Dシーンの定型化の問題に対処することを目指しています。つまり、任意の新しいビュー角度でシーンの定型化された画像を生成します。簡単な解決策は、既存の新しいビュー合成と画像/ビデオスタイルの転送アプローチを組み合わせることです。これにより、結果がぼやけたり、外観に一貫性がなくなったりすることがよくあります。神経放射輝度フィールド（NeRF）法の高品質な結果に触発されて、希望のスタイルで新しいビューを直接レンダリングするための共同フレームワークを提案します。私たちのフレームワークは、2つのコンポーネントで構成されています。ニューラルラディアンスフィールドモデルを使用した3Dシーンの暗黙的な表現と、スタイル情報をシーン表現に転送するハイパーネットワークです。特に、暗黙の表現モデルはシーンをジオメトリと外観のブランチに解きほぐし、ハイパーネットワークは参照スタイルの画像から外観のブランチのパラメータを予測することを学習します。トレーニングの難しさとメモリの負担を軽減するために、2段階のトレーニング手順とパッチサブサンプリングアプローチを提案して、神経放射輝度フィールドモデルでスタイルとコンテンツの損失を最適化します。最適化後、私たちのモデルは、任意のスタイルで任意のビュー角度で一貫した新しいビューをレンダリングできます。定量的評価と被験者研究の両方が、提案された方法が、異なるビューにわたって一貫した外観を持つ忠実な様式化結果を生成することを実証しました。

In this work, we aim to address the 3D scene stylization problem - generating stylized images of the scene at arbitrary novel view angles. A straightforward solution is to combine existing novel view synthesis and image/video style transfer approaches, which often leads to blurry results or inconsistent appearance. Inspired by the high quality results of the neural radiance fields (NeRF) method, we propose a joint framework to directly render novel views with the desired style. Our framework consists of two components: an implicit representation of the 3D scene with the neural radiance field model, and a hypernetwork to transfer the style information into the scene representation. In particular, our implicit representation model disentangles the scene into the geometry and appearance branches, and the hypernetwork learns to predict the parameters of the appearance branch from the reference style image. To alleviate the training difficulties and memory burden, we propose a two-stage training procedure and a patch sub-sampling approach to optimize the style and content losses with the neural radiance field model. After optimization, our model is able to render consistent novel views at arbitrary view angles with arbitrary style. Both quantitative evaluation and human subject study have demonstrated that the proposed method generates faithful stylization results with consistent appearance across different views.

updated: Sat Jun 05 2021 13:46:59 GMT+0000 (UTC)

published: Thu May 27 2021 09:11:30 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト