Skip to main content
Log in

A framework for estimating geometric distortions in video copies based on visual-audio fingerprints

  • Original Paper
  • Published:
Signal, Image and Video Processing Aims and scope Submit manuscript

Abstract

Spatio-temporal alignments and estimation of distortion model between pirate and master video contents are prerequisites, in order to approximate the illegal capture location in a theater. State-of-the-art techniques are exploiting only visual features of videos for the alignment and distortion model estimation of watermarked sequences, while few efforts are made toward acoustic features and non-watermarked video contents. To solve this, we propose a distortion model estimation framework based on multimodal signatures, which fully integrates several components: Compact representation of a video using visual-audio fingerprints derived from Speeded Up Robust Features and Mel-Frequency Cepstral Coefficients; Segmentation-based bipartite matching scheme to obtain accurate temporal alignments; Stable frame pairs extraction followed by filtering policies to achieve geometric alignments; and distortion model estimation in terms of homographic matrix. Experiments on camcorded datasets demonstrate the promising results of the proposed framework compared to the reference methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  1. Economic consequences of movie piracy, CMPDA-Feb 2011 report

  2. Delannay, D., de Roover, C., Macq, B.: Temporal alignment of video sequences for watermarking. In: SPIE 15th Annual Symposium on Electronic Imaging, USA, 5020, pp. 481–492 (2003)

  3. Cheng, H.: Temporal registration of video sequences. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, China, pp. 489–492 (2003)

  4. Chupeau, B., Oisel, L., Jouet, P.: Temporal video registration for watermark detection. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, France, pp. 157–160 (2006)

  5. Chen, L., Stentiford, F.W.M.: Video sequence matching based on temporal ordinal measurements. Elsevier Pattern Recognit. Lett. 29, 1824–1831 (2008)

    Article  Google Scholar 

  6. Lee, Y.Y., Kim, C., Lee, S.: Video frame matching algorithm using dynamic programming. In: Proceedings of SPIE and IS and T Journal of Electronic Imaging 18, 1–3 (2009)

  7. Baudry, S., Chupeau, B., Lef\(\grave{\rm e}\)bvre, F.: Adaptive video fingerprints for accurate temporal registration. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 1786–1789 (2010)

  8. Delannay, D., Delaigle, F., Demarty, H., Barlaud, M.: Compensation of geometrical deformations for watermark extraction in digital cinema applications. In: Proceedings of SPIE Electronic Imaging, 4314, pp. 149–157 (2001)

  9. Chupeau, B., Massoudi, A., Lef\(\grave{e}\)bvre, F.: Automatic Estimation and Compensation of Geometric Distortions in Video Copies. In: Proceedings of SPIE, Visual Communication and Image Proceesing, vol. 6508, USA (2007)

  10. Bay, H., Tuytelaars, T., Gool, L.V.: SURF: Speeded up robust features. Computer Vision and Image Understanding, pp. 346–359 (2008)

  11. Yang, G., Chen, N., Jiang, Q.: A robust hashing algorithm based on SURF for video copy detection. Elsevier Comput. Secur. 31, 33–39 (2012)

    Article  Google Scholar 

  12. Hartley, R.I., Zisserman, A.: Multiple View Geometry in Computer Vision. Cambridge University Press, Cambridge (2004)

    Book  MATH  Google Scholar 

  13. Rabiner, L., Juang, B.H.: Fundamentals of Speech Recognition. Prentice Hall Signal Processing Series, Englewood Cliffs (1993)

    Google Scholar 

  14. Roopalakshmi, R., Reddy, G.R.M.: A novel approach to video copy detection using audio fingerprints and PCA. Elsevier Procedia Comput. Sci. 5, 149–156 (2011). doi:10.1016/j.procs.2011.07.021

    Article  Google Scholar 

  15. Chartrand, G.: Introductory graph Theory. Courier Dover Publications, NY (1977)

    Google Scholar 

  16. Goemans, M.X.: Lecture Notes on Bipartite Matching. Massachusetts Institute of Technology (2007)

  17. Kuhn, H.: The Hungarian method for the assignment problem. Naval Res. Logist. 2, 83–97 (1955)

    Article  Google Scholar 

  18. Roopalakshmi, R., Reddy, G.R.M.: A novel spatio-temporal registration framework for video copy localization based on multimodal features. Elsevier Signal Processing (2012). doi:10.1016/j.sigpro.2012.06.004

  19. Lee, M.J., Kim, K.S., Lee, H.K.: Digital cinema watermarking for estimating the position of the pirate. IEEE Trans. Multimed. 12(7), 605–621 (2010)

    Article  Google Scholar 

Download references

Acknowledgments

This research work is supported by Department of Science & Technology (DST) of Government of India under research grant no. SR/WOS-A/ET-48/2010.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to R. Roopalakshmi.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Roopalakshmi, R., Reddy, G.R.M. A framework for estimating geometric distortions in video copies based on visual-audio fingerprints. SIViP 9, 201–210 (2015). https://doi.org/10.1007/s11760-013-0424-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11760-013-0424-7

Keywords

Navigation