Boosting CNN-based primary quantization matrix estimation of double JPEG images via a classification-like architecture

Benedetta Tondi; Andrea Costranzo; Dequ Huang; Bin Li

分類のようなアーキテクチャを介したダブルJPEG画像のCNNベースの一次量子化行列推定のブースト

二重JPEG圧縮画像の一次量子化行列を推定することは、画像の過去の履歴に関する重要な情報を推測できるため、画像フォレンジックにおいて関連する重要な問題です。さらに、異なる画像領域にわたる一次量子化行列の不整合を使用して、二重JPEG改ざん画像のスプライシングをローカライズできます。従来のモデルベースのアプローチは、第1と第2の圧縮品質の関係、およびJPEGグリッドの配置に関する特定の仮定の下で機能します。最近、多種多様な条件下で機能することができる深層学習ベースの推定器が提案されており、ほとんどの場合、調整された既存の方法よりも優れています。この方法は、標準的な回帰問題として推定を解くようにトレーニングされた畳み込みニューラルネットワーク（CNN）に基づいています。量子化係数の整数性を利用することにより、本論文では、直喩分類アーキテクチャに頼ることによって推定を実行する深層学習手法を提案します。 CNNは、推定の精度と平均二乗誤差（MSE）の両方を考慮した損失関数でトレーニングされます。結果は、統計分析、特に深層学習回帰に基づく最先端の方法と比較して、提案された手法の優れたパフォーマンスを確認します。さらに、第2の圧縮グリッドと第1の圧縮の位置合わせ、および前者と第2の圧縮のJPEG品質の組み合わせに関して、一般的な動作条件下で機能する方法の能力は、これらが実際のアプリケーションに非常に関連しています。情報は事前に不明です。

Estimating the primary quantization matrix of double JPEG compressed images is a problem of relevant importance in image forensics since it allows to infer important information about the past history of an image. In addition, the inconsistencies of the primary quantization matrices across different image regions can be used to localize splicing in double JPEG tampered images. Traditional model-based approaches work under specific assumptions on the relationship between the first and second compression qualities and on the alignment of the JPEG grid. Recently, a deep learning-based estimator capable to work under a wide variety of conditions has been proposed, that outperforms tailored existing methods in most of the cases. The method is based on a Convolutional Neural Network (CNN) that is trained to solve the estimation as a standard regression problem. By exploiting the integer nature of the quantization coefficients, in this paper, we propose a deep learning technique that performs the estimation by resorting to a simil-classification architecture. The CNN is trained with a loss function that takes into account both the accuracy and the Mean Square Error (MSE) of the estimation. Results confirm the superior performance of the proposed technique, compared to the state-of-the art methods based on statistical analysis and, in particular, deep learning regression. Moreover, the capability of the method to work under general operative conditions, regarding the alignment of the second compression grid with the one of first compression and the combinations of the JPEG qualities of former and second compression, is very relevant in practical applications, where these information are unknown a priori.

updated: Wed Mar 17 2021 19:54:31 GMT+0000 (UTC)

published: Tue Dec 01 2020 13:20:11 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト