Predictive Coding For Animation-Based Video Compression

Goluck Konuko; Stéphane Lathuilière; Giuseppe Valenzise

アニメーションベースのビデオ圧縮のための予測コーディング

私たちは、会議タイプのアプリケーション向けにビデオを効率的に圧縮するという問題に取り組みます。私たちは、画像アニメーションに基づいた最近のアプローチに基づいて構築しています。このアプローチは、まばらなキーポイントのコンパクトなセットで顔の動きを表現することにより、非常に低いビットレートで良好な再構成品質を達成できます。ただし、これらの方法ではビデオをフレームごとにエンコードします。つまり、各フレームが参照フレームから再構築されるため、帯域幅が大きい場合には再構築の品質が制限されます。代わりに、画像アニメーションを予測子として使用し、実際のターゲットフレームに関する残差を符号化する予測符号化方式を提案します。残差は予測的な方法でコード化できるため、時間的な依存関係が効率的に除去されます。私たちの実験では、トーキングヘッドビデオのデータセットで、HEVC ビデオ標準と比較して 70% 以上、VVC と比較して 30% 以上の大幅なビットレートの向上が示されました。

We address the problem of efficiently compressing video for conferencing-type applications. We build on recent approaches based on image animation, which can achieve good reconstruction quality at very low bitrate by representing face motions with a compact set of sparse keypoints. However, these methods encode video in a frame-by-frame fashion, i.e. each frame is reconstructed from a reference frame, which limits the reconstruction quality when the bandwidth is larger. Instead, we propose a predictive coding scheme which uses image animation as a predictor, and codes the residual with respect to the actual target frame. The residuals can be in turn coded in a predictive manner, thus removing efficiently temporal dependencies. Our experiments indicate a significant bitrate gain, in excess of 70% compared to the HEVC video standard and over 30% compared to VVC, on a datasetof talking-head videos

updated: Sun Jul 09 2023 14:40:54 GMT+0000 (UTC)

published: Sun Jul 09 2023 14:40:54 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト