Extending Neural P-frame Codecs for B-frame Coding

Reza Pourreza; Taco S Cohen

BフレームコーディングのためのニューラルPフレームコーデックの拡張

ほとんどのニューラルビデオコーデックはPフレームコーディング（過去のフレームから各フレームを予測）に対応していますが、このホワイトペーパーではBフレーム圧縮（過去と将来の両方の参照フレームを使用してフレームを予測）に対応しています。当社のBフレームソリューションは、既存のPフレーム方式に基づいています。その結果、Bフレームコーディング機能を既存のニューラルコーデックに簡単に追加できます。 Bフレームコーディング方法の基本的な考え方は、2つの参照フレームを補間して1つの参照フレームを生成し、それを既存のPフレームコーデックと一緒に使用して入力Bフレームをエンコードすることです。私たちの調査によると、補間されたフレームは、通常行われているように前のフレームを使用する場合と比較して、Pフレームコーデックの参照としてはるかに優れています。私たちの結果は、既存のPフレームコーデックで提案された方法を使用すると、同じビデオ品質を生成しながら、Pフレームコーデックと比較してUVGデータセットのビットレートを28.5％節約できることを示しています。

While most neural video codecs address P-frame coding (predicting each frame from past ones), in this paper we address B-frame compression (predicting frames using both past and future reference frames). Our B-frame solution is based on the existing P-frame methods. As a result, B-frame coding capability can easily be added to an existing neural codec. The basic idea of our B-frame coding method is to interpolate the two reference frames to generate a single reference frame and then use it together with an existing P-frame codec to encode the input B-frame. Our studies show that the interpolated frame is a much better reference for the P-frame codec compared to using the previous frame as is usually done. Our results show that using the proposed method with an existing P-frame codec can lead to 28.5%saving in bit-rate on the UVG dataset compared to the P-frame codec while generating the same video quality.

updated: Thu Aug 05 2021 05:39:33 GMT+0000 (UTC)

published: Tue Mar 30 2021 21:25:35 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト