AltFreezing for More General Video Face Forgery Detection

Zhendong Wang; Jianmin Bao; Wengang Zhou; Weilun Wang; Houqiang Li

より一般的なビデオの顔偽造検出のための AltFreezing

既存の顔偽造検出モデルは、空間的アーチファクト（生成アーチファクト、ブレンディングなど）または主に時間的アーチファクト（ちらつき、不連続性など）のみを検出することによって偽画像を識別しようとします。ドメイン外のアーティファクトに直面すると、パフォーマンスが大幅に低下する可能性があります。この論文では、顔の偽造検出のために 1 つのモデルで空間的アーチファクトと時間的アーチファクトの両方をキャプチャすることを提案します。シンプルなアイデアは、時空間モデル (3D ConvNet) を活用することです。しかし、あるタイプのアーティファクトに簡単に依存し、他のタイプを無視する可能性があることがわかりました。この問題に対処するために、より一般的な顔の偽造を検出するための AltFreezing と呼ばれる新しいトレーニング戦略を紹介します。 AltFreezing の目的は、モデルが空間的アーチファクトと時間的アーチファクトの両方を検出できるようにすることです。これは、時空間ネットワークの重みを、空間関連と時間関連の 2 つのグループに分類します。次に、モデルが空間的および時間的特徴を学習して本物のビデオと偽のビデオを区別できるように、トレーニングプロセス中に 2 つの重みグループが交互にフリーズされます。さらに、偽造検出モデルの一般化機能を向上させるために、さまざまなビデオレベルのデータ拡張手法を導入します。広範な実験により、私たちのフレームワークは、目に見えない操作やデータセットへの一般化の点で既存の方法よりも優れていることが示されています。コードは https://github.com/ZhendongWang6/AltFreezing で入手できます。

Existing face forgery detection models try to discriminate fake images by detecting only spatial artifacts (e.g., generative artifacts, blending) or mainly temporal artifacts (e.g., flickering, discontinuity). They may experience significant performance degradation when facing out-domain artifacts. In this paper, we propose to capture both spatial and temporal artifacts in one model for face forgery detection. A simple idea is to leverage a spatiotemporal model (3D ConvNet). However, we find that it may easily rely on one type of artifact and ignore the other. To address this issue, we present a novel training strategy called AltFreezing for more general face forgery detection. The AltFreezing aims to encourage the model to detect both spatial and temporal artifacts. It divides the weights of a spatiotemporal network into two groups: spatial-related and temporal-related. Then the two groups of weights are alternately frozen during the training process so that the model can learn spatial and temporal features to distinguish real or fake videos. Furthermore, we introduce various video-level data augmentation methods to improve the generalization capability of the forgery detection model. Extensive experiments show that our framework outperforms existing methods in terms of generalization to unseen manipulations and datasets. Code is available at https: //github.com/ZhendongWang6/AltFreezing.

updated: Mon Jul 17 2023 08:24:58 GMT+0000 (UTC)

published: Mon Jul 17 2023 08:24:58 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト