On the robustness of JPEG post-compression to resampling factor estimation

doi:10.1016/j.sigpro.2019.107371

Signal Processing

Volume 168, March 2020, 107371

https://doi.org/10.1016/j.sigpro.2019.107371 Get rights and content

Highlights

•
We first find the coupling effect between post-JPEG compression and resampling in form of frequency mixing, named as octant symmetric aliasing peaks(OSAP).
•
We show that OSAP is non-negligible to the factor estimation of post-JPEG resampling.
•
The relationship between the location of OSAP and resampling factor and is proved by an approximate model, which is consistent with statistical results on test image sets.
•
We proposed a method based on the resampling feature enhanced by OSAP to estimate the factor of post-JPEG resampling.
•
Experimental results demonstrate that the method is effective and efficient.

Abstract

The research on image resampling detection is urgently called for digital image forensics recently. In this paper, the forensics of resampling operation with JPEG compression as post-process is addressed, namely post-JPEG resampling. We first discover the coupling effect between resampling and post-JPEG compression, which is shown as features of frequency mixing in spectrum and named as octant symmetric aliasing peaks(OSAP). The frequency location of OSAP depends on the resampling factor. We verify that the feature of OSAP is prone to be more prominent than resampling when quality factor (QF) of post-JPEG is low enough. By addressing the frequency response of post-JPEG compression, an approximate model of OSAP is derived. The prior knowledge of OSAP can help to enhance the resampling feature, based on which a new method of resampling factor estimation is proposed. In the proposed method, the location of OSAP is estimated from local maximum of spectrum with strong symmetry, and a candidate set of resampling factor is derived from OSAP. The optimal estimation among candidates is chosen in a heuristic way, which weakens the feature of OSAP appropriately. Experiments implemented on natural image set show the effectiveness of proposed method.

Introduction

Digital image has become the most popular kind of data in our daily life. Yet the integrity and authenticity of digital image are heavily affected by the various editing softwares distributed on the internet. The forged images can bring harmful impacts on our daily life and society in some situations, e.g., journalism and criminal investigation. There is a great demand for automatic detector of forged image, where the field of digital image forensics concentrate on [1], [2], [3], [4], [5], [6], [7], [8], [9], [10], [11], [12]. Among them, resampling forensics is one of the most important topic, because malicious tampering is usually performed by geometrically adapting the new image elements to the original scene [13], [14], [15], [16]. Such adaption may require the employment of spatial transformations(e.g., scaling, rotation and shearing), which is actually achieved by numerical interpolation and resampling. Hence the detection of resampling can be a clue to reveal malicious tampering.

In recent years, resampling forensics has become a hot topic, with many papers published. Most of them are based on spectrum analysis of periodic resampling feature. Popescu and Farid [1] gave the first blind detector of resampling, along with strict mathematical proof. In their method, the inspected image is processed by a complicate EM estimator to get a p-map, and peaks in the Fourier spectrum of p-map imply a periodic trace left by resampling. In Gallagher’s research [5] the EM estimator was replaced by two-ordered difference, and peaks refer to resampling also exist in the variance spectrum of differenced signal. Kirchner [7] proved the equivalence between [1] and [5], and an optimal filter was also proposed as the substitute of two-order difference. Except the detection of resampling feature, these methods could also estimate the parameter of resampling, but were limited to resize resampling. Mahdian and Saic [8] first considered the parameter estimation of rotation resampling, using radon transformation of the two-dimensional differenced signal. The transformed one-dimensional signal with radon angle equal to rotation angle will show the strongest periodic property. Wei et al. [10] took the advantage of phase differences in each row of rotation image to differentiate rotation from resizing, and the rotation angle could be directly calculated from the one-dimensional spectrum. In the aspect of parameter estimation, the methods based on the location of spectrum peaks inevitably couldn’t distinguish the aliasing causing by undersampling. The estimation of resampling parameter usually has more than one possible values. To deal with this problem, some methods were not based on Fourier spectrum but other mathematical approach, for example, SVD [17], [18] and cyclic correlation spectrum [12], [19]. Some other methods [20], [21] considered not only the location of peaks but also the energy distribution of hole spectrum, and SVM is used to distinguish the aliasing of parameter estimation. Furthermore, the utilization of machine learning could help to estimate the kind of kernel [22] and enhance the robustness of detector [23], [24].

JPEG is the most popular image format in the world, and tampered image is often stored in JPEG format [25], [26]. Besides, common digital camera has two kinds of output formats including JPEG. So it is reasonable to consider the operation chain consist of resampling operation and JPEG compression, which have three kinds of combination:

(1)
post-JPEG resampling, where the untampered image is in lossless-compression format and the tampered one is in JPEG format.
(2)
pre-JPEG resampling, where the untampered image is in JPEG format and the tampered one is in lossless-compression format.
(3)
double-JPEG resampling, where both the untampered and tampered image are in JPEG format.

Gallagher [5] first discussed the case of post-JEPG resampling. He proposed that when no resampling is involved, the noise introduced by JPEG have periodic variance, which consists with the block size of lossy compression. The spectrum of JPEG image would show spectral peaks in fixed frequency $ω_{jp}^{(n)} = \frac{n}{8}$ . And the amplitude of peaks in $ω_{jp}^{(n)}$ depends on the quality factor(QF) of JPEG and the power spectrum of the uncompressed image, which have on closed-form expression. For post-JPEG resampling, the spectrum is simply considered as the linear superposition of spectral lines corresponding to JPEG and resampling, respectively. Kirchner and Gloe [27] proposed that the post-JPEG would weaken the feature of resampling. They also study the case of pre-JPEG resampling, which shows that the spectral peaks refer to pre-JPEG would be shifted by the following resampling operation. By detecting the shift JPEG peaks, the resampling factor can be estimated. This theory was inherited by Bianchi and Piva [28] in the study of double-JPEG resampling, which ignored the peak of resampling and considered the spectrum as the linear superposition of pre-JPEG and post-JPEG. Nevertheless, there is a problem that the peak of pre-JPEG may couldn’t be differentiated from resampling only by the amplitude. This problem is well solved by Liu et al. [2] with the help of rank statistics. There are two difficulties of post-JPEG resampling. Firstly, the compression process will weaken the trace of resampling and even disappear. Secondly, some suspicious peaks will appear in the spectrum of post-JPEG image, which could not be distinguished from the resampling peaks.

In this paper, we concentrate on forensics of post-JPEG resampling. We verify that the confusion between these suspicious peaks and resampling peaks is the main difficulty for resample factor estimation. For fixed parameters of post-JPEG resampling, the location of suspicious peaks for different images is the same, and any two suspicious peaks are symmetric about the JPEG peaks. Because the JPEG peaks locate in frequences of 8/k, these suspicious peaks are named as octant symmetric aliasing peaks(OSAP). The symmetric property of OSAP implies a coupling effect between JPEG compression and resampling operation, shown as frequency mixing in the spectrum. Using some mathematical approximation, we derive a model of OSAP in the general framework of stochastic signal processing. This model help us to understand the nonlinear behavior of the amplitude of OSAP when resampling factor changes. Finally, the knowledge of OSAP is introduced into spectrum based method to boost the resampling feature. The confusion between OSAP and resampling peaks is solved in a heuristic way, which changes the phase for each row of inspect image in proper degree.

The main contributions of this work include:

(1)
We first find the coupling effect between post-JPEG compression and resampling in form of frequency mixing, named as octant symmetric aliasing peaks(OSAP). We show that OSAP is nonnegligible to the factor estimation of post-JPEG resampling. The relationship between the location of OSAP and resampling factor and is proved by a approximate model, which is consistent with statistical results on test image sets.
(2)
We proposed a method based on the resampling feature enhanced by OSAP to estimate the factor of post-JPEG resampling. The proposed method performs better than all state-of-the-art methods.

The remainder of this paper is organized as follows. Section 2 reviews the general model of resampling forensics, then in Section 3 we give the exact formula of OSAP and tries to prove it with appropriate mathematical approximation. With the help of OSAP, an automatic factor estimation method is proposed in Section 4. Section 5 shows the experiment results of the proposed method and discusses some implementation issues. Eventually, the concluding remarks and future works are given in Section 6.

Section snippets

Model of resampling

This section describes the model of image resampling and the traditional methods of resampling detection. Without loss of generality, we only consider the luminance channel. Because all of the related operations are separable in each space dimension, an analysis of one dimensional signal is enough. A natural image is considered as a sequence of infinite length, denoted as $g_{0} (n) : Z \to R$ . As described in [12], the procedure of resampling includes three steps: (1) reconstruction (2) warping (3)

Introduction of OSAP

In the previous subsection, we show that there are characteristic peaks at $ω_{rs}^{(n, 1)}$ and $ω_{rs}^{(n, 2)}$ . Specifically, if the interpolation kernel is not bilinear [12], the peaks corresponding to high order harmonic would not be prominent in the spectrum. There should be two peaks in the locations of $ω_{rs}^{(1, 1)}$ and $ω_{rs}^{(1, 2)}$ . However, the operation of post-JPEG compression will induce more prominent peaks in the variance spectrum. One example is shown in Fig. 2, which is calculated by the method in [5]

Proposed method

In this section a heuristic approach is proposed to estimate the true resampling factor from OSAP. Note that this approach is not a detecting method but a factor-estimation method for post-JPEG resampling. The proposed algorithm can be summarized by the following steps:

(1)
Calculating the variance spectrum S(ω) and choosing a set of peaks as candidates Ω^c.
(2)
Selecting OSAP from the candidates set Ω^c.
(3)
Estimating the optimal resampling factor from OSAP.

Experiment methodology

To evaluate the performance of the proposed method, two experiments are conducted over a large groups of test sets. These test sets originate from 500 different uncompressed images captured by Nikon cameras, which belong to the Dresden Image Database [36], in consistent with the previous research.

Considering that the image is converted into YUV space before JPEG compression, and the quantization coefficients to luminance channel are smaller than other two channels, the original color image is

Conclusion

In this paper, we concentrated on the resampling factor estimation with post-JEPG compression, and a practical algorithm was proposed. The trace related to post-JEPG compression was analyzed under the model of cyclostationary process, which shows strong nonlinear property. Specifically, in the variance spectrum of tamperd images, this nonlinear property could be characterized as octant symmetric aliasing peak(OSAP), the third kind of peaks which differentiate from JPEG peaks and resampling

Declaration of Competing Interest

We wish to confirm that there are no known conflicts of interest associated with this publication and there has been no significant financial support for this work that could have influenced its outcome.

Acknowledgements

This work is supported by the National Natural Science Foundation of China (No. U1736118), the Key Areas R&D Program of Guangdong (No. 2019B010136002), the Key Scientific Research Program of Guangzhou (No. 201804020068), the Natural Science Foundation of Guangdong (No. 2016A030313350), the Special Funds for Science and Technology Development of Guangdong (No. 2016KZ010103).

References (37)

Z. Xie et al.
Copy-move detection of digital audio based on multi-feature decision
J. Inf. Secur. Appl.
(2018)
C. Lin et al.
Copy-move forgery detection using combined features and transitive matching
Multimed. Tools Appl.
(2018)
D. Vázquez-Padín et al.
A random matrix approach to the forensic analysis of upscaled images
IEEE Trans. Inf. Forensics Secur.
(2017)
B. Bayar et al.
On the robustness of constrained convolutional neural networks to JPEG post-compression for image resampling detection
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
(2017)
T. Bianchi et al.
Reverse engineering of double JPEG compression in the presence of image resizing
International Workshop on Information Forensics and Security
(2012)
A.C. Popescu et al.
Exposing digital forgeries by detecting traces of resampling
IEEE Trans. Signal Process.
(2005)
X. Liu et al.
Downscaling factor estimation on pre-JPEG compressed images
IEEE Trans. Circuits Syst. Video Technol.
(2019)
G. Cao et al.
Contrast enhancement-based forensics in digital images
IEEE Trans. Inf. Forensics Secur.
(2014)
X. Liu et al.
Image deblocking detection based on a convolutional neural network
IEEE Access
(2019)
A.C. Gallagher
Detection of linear and cubic interpolation in JPEG compressed images
Canadian Conference on Computer and Robot Vision
(2005)

X. Liu et al.

Scaling factor estimation on JPEG compressed images by cyclostationarity analysis

Multimed. Tools Appl.

(2018)

M. Kirchner

Fast and reliable resampling detection by spectral analysis of fixed linear predictor residue

ACM Workshop on Multimedia and Security

(2008)

B. Mahdian et al.

Blind authentication using periodic properties of interpolation

IEEE Trans. Inf. Forensics Secur.

(2008)

F. Zhang et al.

Natural image deblurring based on L0-regularization and kernel shape optimization

Multimed. Tools Appl.

(2018)

W. Wei et al.

Estimation of image rotation angle using interpolation-related spectral signatures with application to blind detection of image forgery

IEEE Trans. Inf. Forensics Secur.

(2010)

H. Xiao et al.

Defocus blur detection based on multiscale SVD fusion in gradient domain

J. Vis. Commun. Image Represent.

(2019)

C. Chen et al.

Blind forensics of successive geometric transformations in digital images using spectral method: theory and applications

IEEE Trans. Image Process.

(2017)

X. Huang et al.

Fast and effective copy-move detection of digital audio based on auto segment

International Journal of Digital Crime and Forensics (IJDCF)

(2019)

Cited by (13)

An exhaustive measurement of re-sampling detection in lossy compressed images using deep learning approach
2024, Engineering Applications of Artificial Intelligence
The detection of resampling in digital images is critical for image authentication, but performance can be challenging when dealing with lossy compression. This study proposes an efficient feature extraction technique for detecting resampling (i.e., tampering) in post-JPEG compressed images. Our approach combines compression clues with resampling clues and feeds them to various traditional machine learning (ML) algorithms such as logistic regression, K-nearest neighbours (K-NN), support vector machine (SVM), decision tree (DT), and random forest (RF) to detect and classify doctored images in the re-compression scenario. We propose and evaluate feed-forward deep neural networks (DNN) and 1D convolutional neural networks (CNN) based on evaluation parameters such as accuracy, recall, precision, and F1 score, comparing them with the aforementioned traditional ML algorithms. Our results show that the RF and one-dimensional (1D) CNN are the most efficient models for this task. Furthermore, the 1D CNN outperforms the state-of-the-art techniques, particularly in the most challenging case of downscaling in lossy JPEG compressed images. Our proposed method demonstrates promising results for resampling detection in post-JPEG compressed images, which can be helpful in various image authentication applications.
An ensemble learning approach for resampling forgery detection using Markov process
2023, Applied Soft Computing
Resampling is an extremely well-known technique performed for Image forgery detection. It includes the changes in the content of a picture in terms of rotation, stretching/zooming, and shrinking, to frame a forged picture that is a localized forgery in comparison to the original picture. With the wrong intention, resampling forgery has been increased day by day, and its negative impact has been increased in criminology, law enforcement, forensics, research etc. Accordingly, the interest in the algorithm of image resampling forgery detection is significantly developed in image forensics. In this paper, a novel image resampling forgery detection technique has been proposed. In the proposed technique, two types of Markov feature with spatial and Discrete Cosine Transform domains have been extracted to recognize the resampling operation. The spatial domain gives the information for the distribution of the pixels and DCT gives the edge information. Further, these Markov features are consolidated. Due to high dimensionality hard thresholding technique is used for reducing the dimensionality. Then, these Markov features are applied to the set of models of different classifiers. With the utilization of classifiers, weighted majority voting values have been calculated during the ensemble classification. Unlike the other techniques, these weighted voting boundaries have been consequently balanced during the training process until the best accuracy has been obtained. However, it is very difficult to get best accuracy so for getting best accuracy this research needs to do lots of iterations and trained the dataset. For the comparative study very few research has been found for this resampling forgery technique with different interpolation techniques and classifier. Still, comparison has been done with some latest research work. The comparative analysis shows that the proposed ensemble learning-based algorithm provides the best outcomes with the accuracy of 99.12% for bicubic, 98.89% for bilinear, and 98.23% for lanczos3 kernel with considerably less complexity and high speed in comparison to prior techniques which are using single support vector machine for classification. Moreover, the proposed algorithm also detects a very low probability of error of 0.44% and detects the type of interpolation kernel, size of the forgery, and the type of resampling, whether it is up sampling and down sampling, using Graphical User Interface which has not been detected previously with multiple forgery detection.
Contrastive Learning based Multi-task Network for Image Manipulation Detection
2022, Signal Processing
Citation Excerpt :
Nevertheless, most of these methods only handle a single type of manipulation, resulting in hardly work on a real-world scenario. With evolution of deep learning in forensics field [21–23], a significant number of deep learning based methods have been proposed to address earlier issues. However, in order to achieve localization capabilities, some methods usually rely on heavy, time-consuming pre- and/or post-processing, e.g., patch extraction [17], expectation-maximization [20], feature clustering [15,24], segmentation [25], etc.
The popularity of image editing techniques and user-friendly editing software have seriously reduced the authenticity of the images. Detection and localization of image manipulations are becoming urgent problems to be solved. Although many existing solutions attempt to address these problems, most works can only solve one specific type of manipulations. Furthermore, some methods need heavy, time-consuming preprocessings and/or postprocessings to localize tampered region, resulting in disconnection and under-optimization of the model. In this paper, a contrastive learning based multi-task network is proposed for the localization of multiple image manipulations. Multi-scale tampered patch classifications and pixel-wise tampered region semantic segmentation are integrated into an end-to-end multi-task network. The consistency of different region statistical properties is measured by contrastive learning to enhance the feature representation ability of the proposed network, improving the performance of tampered patch detection. Various scale tampered patch detections cooperate to localize the tampered region boundaries from coarse to fine. Prediction Pyramid composed of different scale patch detection results provides comprehensive guidance for pixel-wise semantic segmentation of the tampered region. Experimental results on four standard image manipulation datasets demonstrate the better performance of the proposed model.
Upscaling factor estimation on pre-JPEG compressed images based on difference histogram of spectral peaks
2021, Signal Processing: Image Communication
Citation Excerpt :
Moreover, he adopted the maximum likelihood method to estimate 1-D signals and proposed a piecewise linear interpolation for resampling factor estimation [25]. More related works are mentioned in [26–29]. In social networks, JPEG is one of the most widely used format for digital images.
Image is one of the most widely used information carrier exchanged in the Internet, which raises a problem of privacy leakage. Private images are vulnerable to be intercepted and altered by an attacker, violating the owner’s privacy. When an image is tampered maliciously, it is often necessary to perform geometric transformations such as scaling to hide the traces of tampering, introducing resampling traces. In the last two decades, spectral analysis is the most commonly used method for resampling detection. However, since JPEG compression severely interferes the statistical characteristics of resampled images and introduces blocking artifacts, the robustness is really poor for most classical spectrum-based methods in the presence of JPEG compression. In this paper, we propose a method to estimate the upscaling factors of upscaled images in the presence of JPEG compression. A comprehensive analysis in spectrum of scaled images is given. We find that both the location and their difference of spectral peaks in the spectrum of the upscaled pre-JPEG images are related to the upscaling factor. Hence, we adopt the difference histogram of spectral peaks to screen candidate upscaling factors and obtain the final estimation by additional verification step according to the location of the spectral peaks. The experimental results demonstrate the effectiveness of the proposed method.
Identification of deep network generated images using disparities in color components
2020, Signal Processing
Citation Excerpt :
Due to the fact that fabricating a fake image would inevitably introduce some traces, the key to the identification is to analyze and extract features that represent the corresponding traces. For example, the quantization artifacts are used in JPEG image forensics [17,18], the joint artifacts left by different image operations can be used to determine the operation chains [19–21], the splicing inconsistences are exploited to locate tampered image regions [22,23], and the displaying/imaging distortions are utilized in face spoofing detection [24,25]. Recently, some works have been developed to identify fake images generated by deep networks.
With the powerful deep network architectures, such as generative adversarial networks, one can easily generate photorealistic images. Although the generated images are not dedicated for fooling human or deceiving biometric authentication systems, research communities and public media have shown great concerns on the security issues caused by these images. This paper addresses the problem of identifying deep network generated (DNG) images. Taking the differences between camera imaging and DNG image generation into considerations, we analyze the disparities between DNG images and real images in different color components. We observe that the DNG images are more distinguishable from real ones in the chrominance components, especially in the residual domain. Based on these observations, we propose a feature set to capture color image statistics for identifying DNG images. Additionally, we evaluate several detection situations, including the training-testing data are matched or mismatched in image sources or generative models and detection with only real images. Extensive experimental results show that the proposed method can accurately identify DNG images and outperforms existing methods when the training and testing data are mismatched. Moreover, when the GAN model is unknown, our methods also achieves good performance with one-class classification by using only real images for training.
Median filtering detection based on multiple-residual learning with attention fusion
2023, Guangdianzi Jiguang/Journal of Optoelectronics Laser

View all citing articles on Scopus

View full text

On the robustness of JPEG post-compression to resampling factor estimation

Highlights

Abstract

Introduction

Section snippets

Model of resampling

Introduction of OSAP

Proposed method

Experiment methodology

Conclusion

Declaration of Competing Interest

Acknowledgements

J. Inf. Secur. Appl.

Multimed. Tools Appl.

IEEE Trans. Inf. Forensics Secur.

Exposing digital forgeries by detecting traces of resampling

IEEE Trans. Signal Process.

Downscaling factor estimation on pre-JPEG compressed images

IEEE Trans. Circuits Syst. Video Technol.

Contrast enhancement-based forensics in digital images

IEEE Trans. Inf. Forensics Secur.

Image deblocking detection based on a convolutional neural network

IEEE Access

Detection of linear and cubic interpolation in JPEG compressed images

Canadian Conference on Computer and Robot Vision

Scaling factor estimation on JPEG compressed images by cyclostationarity analysis

Multimed. Tools Appl.

Fast and reliable resampling detection by spectral analysis of fixed linear predictor residue

ACM Workshop on Multimedia and Security

Blind authentication using periodic properties of interpolation

IEEE Trans. Inf. Forensics Secur.

Natural image deblurring based on L0-regularization and kernel shape optimization

Multimed. Tools Appl.

Estimation of image rotation angle using interpolation-related spectral signatures with application to blind detection of image forgery

IEEE Trans. Inf. Forensics Secur.

Defocus blur detection based on multiscale SVD fusion in gradient domain

J. Vis. Commun. Image Represent.

Blind forensics of successive geometric transformations in digital images using spectral method: theory and applications

IEEE Trans. Image Process.

Fast and effective copy-move detection of digital audio based on auto segment

International Journal of Digital Crime and Forensics (IJDCF)