Elsevier

Digital Investigation

Volume 9, Issue 2, November 2012, Pages 151-159
Digital Investigation

A MCEA based passive forensics scheme for detecting frame-based video tampering

https://doi.org/10.1016/j.diin.2012.07.002Get rights and content

Abstract

Without the use of digital signature or digital watermark, video passive forensics only utilizes the statistical characteristics of digital video to verify its integrity and authenticity. For frame-based video tampering, it usually suffers from double MPEG compression. In this paper, a motion-compensated edge artifact (MCEA) based passive forensics scheme is proposed for detecting frame-based video manipulation. It exploits the MCEA difference between adjacent P frames, and the decision is made by judging whether there are any spikes in the Fourier transform domain after double MPEG compression. Experimental results show that the proposed approach is effective for frame-based tampering, such as adding/deleting frames and GOP structure change, and can predict the GOP structure of original video.

Introduction

With the wide availability of digital video camera and the prevalence of video sharing websites, digital videos are playing important roles in our daily life. Meanwhile, it is becoming much easier to manipulate and tamper digital video without leaving any visual clues with the continuous development of advanced video editing tools (Rocha et al., 2011). As a consequence, various video forgery operations for malicious purposes are more common than ever. There is an urgent need to develop effective forensics techniques for exposing those malicious video manipulations (Chuang et al., 2011). The conventional active methods must embed digital signature or digital watermark into video data in advance to verify its origin or authenticity. Passive video forensics aims at providing tools to support blind investigation because it utilizes only the statistical characteristics of digital video itself. Therefore, passive video forensics does not assume any a-priori knowledge about the original video, which appeals the research efforts in the field of information security.

Digital video can be regarded as an extension of digital image in the time axis. Though there are many works about digital image forensics, the research on digital video forensics is still in its infancy. The reasons are summarized as follows. First, the tampering of digital video is more sophisticated and time-consuming than digital image. Furthermore, due to the large amount of video data, it is usually encoded before storage and transmission. As a result, it is more difficult for video forensics. Second, since digital video has an additional temporal dimension, this brings some forgery operations specific to digital video, such as frame-based tampering. In this paper, we put emphasis on the passive forensics for detecting frame-based tampering.

For an MPEG video, it is usually re-saved in MPEG format after tampering operations. This leads to the so-called double MPEG compression in video forensics. In the literature, there are already several kinds of approaches for detecting double MPEG compression. The most representative algorithm proposed by Wang and Farid (2006) exploits the static and temporal artifacts introduced by double MPEG compression. I frame is viewed as a static image which is similarly subjected to JPEG encoding, and the double JPEG compression detection algorithm is directly extended to double MPEG compression detection. In the temporal domain, it has been stated that motion compensation errors for P-frames are a function over time exhibiting a periodic pattern after frame deletions and recompression. However, this property can only be exploited with some constrains: The number of deleted frames must be multiple times of frame number in a GOP (Group of Picture), and the GOP structure must be kept during tampering. For the detection of GOP structure change in video tampering, Qin et al. (2010) propose a blind forensics technique based on GOP abnormality. It utilizes the Fourier analysis of motion errors. It is effective for the detection of video splicing. Luo et al. (2008) present a feature curve to reveal the compression history of an MPEG video file with a given GOP structure, and use the temporal patterns of block artifacts as evidence to detect tampering. Su and Zhang (2009) utilizes the motion-compensated edge artifacts (MCEA) for the exposing of digital video forgery. However, it needs a hard threshold factor α to detect frame-deleting forgery. Moreover, at least three P frames must be deleted. This seriously constrains its adaptability in practice.

MPEG-2 video system adopts a hybrid coding structure, which integrates these three classical techniques: prediction coding, transform coding and entropy coding. When coarse quantization is combined with motion compensation prediction, the blocking artifacts propagate from I-frames into subsequent frames and accumulate. This will cause structured high frequency noise. The MCEA involves high frequency noise within those blocks in every P frame. In one GOP, the P frames' MCEAs are non-decreasing. By observation, we found that the frame-based forgery operations, such as adding frames, deleting frames or changing the GOP structure, will make the MCEAs of adjacent P frames larger, and they are shown as a periodic characteristics. In this paper, a MCEA based passive forensics scheme is proposed for frame-based video tampering. It is in fact an improved algorithm on Su's work (Su and Zhang, 2009) for detecting double MPEG compression. The block diagram of the proposed approach is illustrated in Fig. 1. The MCEA difference sequences between adjacent P frames are exploited to judge whether there are any spikes in the Fourier transform domain after double MPEG compression. The main contribution of the proposed approach is that it overcomes the shortcomings of the hard threshold in the Su's work (Su and Zhang, 2009). It can not only detect the frame adding/deleting operations, but also is effective for the forensics of GOP structure change.

The rest of this paper is organized as follows. In Section 2, the MPEG-2 video codec and re-compression process are briefly introduced. Section 3 discusses the calculation of the P frame's MCEA and its application for forensics. Section 3 presents the proposed video tampering detection algorithm. Experimental results are reported in Section 5, and conclusions and future work are given in Section 6.

Section snippets

MPEG-2 codec and double MPEG compression

MPEG videos are compressed by removing both the temporal redundancy and spatial redundancy. In the general MPEG architecture, there are three types of frames in video encoding: I (intra-coded) frame, P (forward predictive coded) frame and B (bi-directionally predictive coded) frame. Let N be the total number of frames in a given GOP structure, and M be the minimum distance between P-frames. For example, the GOP structure shown in Fig. 2 can be represented as (N = 12, M = 3).

Due to the

The calculation of MCEA and its application in forensics

For typical in video codec, when coarse quantization is combined with motion compensation prediction, the blocking artifacts propagate from I-frames into subsequent frames. It causes structured high frequency noise that is no longer located at block boundaries. These kind of motion-compensated edge artifacts (MCEA) are referred to be false edges, and their energies accumulate in each GOP (Leontaris et al., 2007). As a no-reference video quality metrics, MCEA is proposed to measure the

The FFT of MCEA difference

For one GOP (N, M), the number of P frames Np is N/M − 1. According to the Formula (6), the MCEA values of all the P frames can be calculated, and a difference sequence ΔM is defined as the difference of MCEA between adjacent P frames.ΔM=MCEAiMCEAi+1,i(1,Np1)

For all the GOP groups in a sequence, their difference sequence dM can be computed. The change of dM is relatively steady. However, the double recompression after frame manipulation or changing the GOP structure brings greater motion

Experimental results

In order to verify the effectiveness of the proposed approach, four typical test video sequences are selected for experiments (Testing samples). They are Carphone, Container, Hall and Mobile (in CIF and QCIF format). Among them, Container and Hall represent those video sequences with nearly static background or simple motions, whereas Carphone and Mobile represents those video sequences with acute motion. The MPEG-2 codec by MPEG Software Simulation Group (MSSG) http://www.mpeg.org/MPEG/video

Conclusions

In this paper, a MCEA-based passive forensics scheme is proposed for frame-based video tampering. It exploits the MCEA difference between adjacent P frames, and judges whether there are any spikes in the Fourier transform domain after double MPEG compression. Experimental results on several test sequences show that the proposed scheme is effective for the forensics of deleting integer multiple M in GOP (N, M) and the GOP structure change in double MPEG compression. Furthermore, it can deduce

Acknowledgments

This work was supported in part by the National Natural Science Foundation of China (61072122) and the Special Prophase Project on National Basic Research Program of China (2010CB334706), Key Project of Hunan Provincial Natural Science Foundation (11JJ2053) and the Program for New Century Excellent Talents in University (NCET-11-0134).

References (10)

  • Weihong Chuang et al.

    Exploiting compression effects for improved source camera identification using strongly compressed video

  • Athanasios Leontaris et al.

    Quality evaluation of motion-compensated edge artifacts in compressed video

    IEEE Transactions on Image Processing

    (2007)
  • Weiqi Luo et al.

    MPEG recompression detection based on block artifacts

  • Yunlong Qin et al.

    Blind detection of video sequence montage based on GOP abnormality

    Chinese Journal of Electronic

    (2010)
  • Anderson Rocha et al.

    Vision of the unseen: current trends and challenges in digital image and video forensics

    ACM Computing Surveys

    (2011)
There are more references available in the full text version of this article.

Cited by (54)

  • Video frame deletion detection based on time–frequency analysis

    2022, Journal of Visual Communication and Image Representation
  • Differentiating synthetic and optical zooming for passive video forgery detection: An anti-forensic perspective

    2019, Digital Investigation
    Citation Excerpt :

    These factors contribute false positives in normal camera zoom in (Hyun et al., 2013) and (Singh and Aggarwal, 2017). In the case of tampered videos with synthetic zooming, the investigator may discard the results of passive video tampering detection methods (Su et al., 2009; Dong et al., 2012; Abbasi Aghamaleki and Behrad, 2017; Kingra et al., 2017), in the belief that they are genuine frames due to camera zooming, leading to false negatives. A method for detecting zoomed frames in videos under investigation is discussed in (Sitara and Mehtre, 2017) for reducing false positives due to genuine camera zooming.

  • Detection of inter-frame forgeries in digital videos

    2018, Forensic Science International
    Citation Excerpt :

    Gironi et al. [30] extended the work in Ref. [21] for detecting frame deletion and frame insertion. Frame deletion detection using motion compensated edge artifact (MCEA) is discussed in Refs. [31,32]. Su et al. [33] utilized periodic artifacts in DCT coefficients of recompressed P and B frames which arises from frame shift due to frame deletion or frame insertion.

View all citing articles on Scopus
View full text