Learn how to Prune Pixels for Multi-view Neural Image-based Synthesis

Marta Milovanović; Enzo Tartaglione; Marco Cagnazzo; Félix Henry

マルチビューニューラルイメージベースの合成のためにピクセルをプルーニングする方法を学ぶ

画像ベースのレンダリング技術は、複数の入力画像のセットを指定して斬新なビューを生成するため、ユーザーの没入型体験の中核を成しています。それらは客観的および主観的な品質の点で優れたパフォーマンスを示しているため、研究コミュニティはそれらの改善に多大な努力を払っています.ただし、受信側でレンダリングする必要がある大量のデータは、限られた帯域幅環境でのアプリケーションを妨げたり、リアルタイムアプリケーションでの使用を妨げたりします。レンダリングされたビューに関する各入力ピクセルの重要性を調べ、無関係なピクセルの使用を避ける、入力ピクセルプルーニングの方法である LeHoPP を紹介します。画像ベースのレンダリングネットワークを再トレーニングしなくても、私たちのアプローチは、合成品質とピクセルレートの間の適切なトレードオフを示しています。一般的なニューラルレンダリングフレームワークでテストすると、他のプルーニングベースラインと比較して、LeHoPP は平均で 0.9 dB から 3.6 dB の間で向上します。

Image-based rendering techniques stand at the core of an immersive experience for the user, as they generate novel views given a set of multiple input images. Since they have shown good performance in terms of objective and subjective quality, the research community devotes great effort to their improvement. However, the large volume of data necessary to render at the receiver's side hinders applications in limited bandwidth environments or prevents their employment in real-time applications. We present LeHoPP, a method for input pixel pruning, where we examine the importance of each input pixel concerning the rendered view, and we avoid the use of irrelevant pixels. Even without retraining the image-based rendering network, our approach shows a good trade-off between synthesis quality and pixel rate. When tested in the general neural rendering framework, compared to other pruning baselines, LeHoPP gains between 0.9 dB and 3.6 dB on average.

updated: Fri May 05 2023 14:29:24 GMT+0000 (UTC)

published: Fri May 05 2023 14:29:24 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト