A framework for estimating geometric distortions in video copies based on visual-audio fingerprints

Roopalakshmi, R.; Reddy, G. Ram Mohana

doi:10.1007/s11760-013-0424-7

A framework for estimating geometric distortions in video copies based on visual-audio fingerprints

Original Paper
Published: 01 February 2013

Volume 9, pages 201–210, (2015)
Cite this article

Signal, Image and Video Processing Aims and scope Submit manuscript

R. Roopalakshmi¹ &
G. Ram Mohana Reddy¹

379 Accesses
3 Citations
Explore all metrics

Abstract

Spatio-temporal alignments and estimation of distortion model between pirate and master video contents are prerequisites, in order to approximate the illegal capture location in a theater. State-of-the-art techniques are exploiting only visual features of videos for the alignment and distortion model estimation of watermarked sequences, while few efforts are made toward acoustic features and non-watermarked video contents. To solve this, we propose a distortion model estimation framework based on multimodal signatures, which fully integrates several components: Compact representation of a video using visual-audio fingerprints derived from Speeded Up Robust Features and Mel-Frequency Cepstral Coefficients; Segmentation-based bipartite matching scheme to obtain accurate temporal alignments; Stable frame pairs extraction followed by filtering policies to achieve geometric alignments; and distortion model estimation in terms of homographic matrix. Experiments on camcorded datasets demonstrate the promising results of the proposed framework compared to the reference methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Efficient video copy detection using multi-modality and dynamic path search

Article 14 May 2014

A robust and low-cost video fingerprint extraction method for copy detection

Article 29 November 2016

A spectrogram-based audio fingerprinting system for content-based copy detection

Article 21 November 2015

References

Economic consequences of movie piracy, CMPDA-Feb 2011 report
Delannay, D., de Roover, C., Macq, B.: Temporal alignment of video sequences for watermarking. In: SPIE 15th Annual Symposium on Electronic Imaging, USA, 5020, pp. 481–492 (2003)
Cheng, H.: Temporal registration of video sequences. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, China, pp. 489–492 (2003)
Chupeau, B., Oisel, L., Jouet, P.: Temporal video registration for watermark detection. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, France, pp. 157–160 (2006)
Chen, L., Stentiford, F.W.M.: Video sequence matching based on temporal ordinal measurements. Elsevier Pattern Recognit. Lett. 29, 1824–1831 (2008)
Article Google Scholar
Lee, Y.Y., Kim, C., Lee, S.: Video frame matching algorithm using dynamic programming. In: Proceedings of SPIE and IS and T Journal of Electronic Imaging 18, 1–3 (2009)
Baudry, S., Chupeau, B., Lef\(\grave{\rm e}\)bvre, F.: Adaptive video fingerprints for accurate temporal registration. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 1786–1789 (2010)
Delannay, D., Delaigle, F., Demarty, H., Barlaud, M.: Compensation of geometrical deformations for watermark extraction in digital cinema applications. In: Proceedings of SPIE Electronic Imaging, 4314, pp. 149–157 (2001)
Chupeau, B., Massoudi, A., Lef\(\grave{e}\)bvre, F.: Automatic Estimation and Compensation of Geometric Distortions in Video Copies. In: Proceedings of SPIE, Visual Communication and Image Proceesing, vol. 6508, USA (2007)
Bay, H., Tuytelaars, T., Gool, L.V.: SURF: Speeded up robust features. Computer Vision and Image Understanding, pp. 346–359 (2008)
Yang, G., Chen, N., Jiang, Q.: A robust hashing algorithm based on SURF for video copy detection. Elsevier Comput. Secur. 31, 33–39 (2012)
Article Google Scholar
Hartley, R.I., Zisserman, A.: Multiple View Geometry in Computer Vision. Cambridge University Press, Cambridge (2004)
Book MATH Google Scholar
Rabiner, L., Juang, B.H.: Fundamentals of Speech Recognition. Prentice Hall Signal Processing Series, Englewood Cliffs (1993)
Google Scholar
Roopalakshmi, R., Reddy, G.R.M.: A novel approach to video copy detection using audio fingerprints and PCA. Elsevier Procedia Comput. Sci. 5, 149–156 (2011). doi:10.1016/j.procs.2011.07.021
Article Google Scholar
Chartrand, G.: Introductory graph Theory. Courier Dover Publications, NY (1977)
Google Scholar
Goemans, M.X.: Lecture Notes on Bipartite Matching. Massachusetts Institute of Technology (2007)
Kuhn, H.: The Hungarian method for the assignment problem. Naval Res. Logist. 2, 83–97 (1955)
Article Google Scholar
Roopalakshmi, R., Reddy, G.R.M.: A novel spatio-temporal registration framework for video copy localization based on multimodal features. Elsevier Signal Processing (2012). doi:10.1016/j.sigpro.2012.06.004
Lee, M.J., Kim, K.S., Lee, H.K.: Digital cinema watermarking for estimating the position of the pirate. IEEE Trans. Multimed. 12(7), 605–621 (2010)
Article Google Scholar

Download references

Acknowledgments

This research work is supported by Department of Science & Technology (DST) of Government of India under research grant no. SR/WOS-A/ET-48/2010.

Author information

Authors and Affiliations

National Institute of Technology Karnataka (NITK), Surathkal, Mangalore, 575025, India
R. Roopalakshmi & G. Ram Mohana Reddy

Authors

R. Roopalakshmi
View author publications
You can also search for this author in PubMed Google Scholar
G. Ram Mohana Reddy
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to R. Roopalakshmi.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Roopalakshmi, R., Reddy, G.R.M. A framework for estimating geometric distortions in video copies based on visual-audio fingerprints. SIViP 9, 201–210 (2015). https://doi.org/10.1007/s11760-013-0424-7

Download citation

Received: 19 July 2012
Revised: 04 January 2013
Accepted: 04 January 2013
Published: 01 February 2013
Issue Date: January 2015
DOI: https://doi.org/10.1007/s11760-013-0424-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A framework for estimating geometric distortions in video copies based on visual-audio fingerprints

Abstract

Access this article

Similar content being viewed by others

Efficient video copy detection using multi-modality and dynamic path search

A robust and low-cost video fingerprint extraction method for copy detection

A spectrogram-based audio fingerprinting system for content-based copy detection

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A framework for estimating geometric distortions in video copies based on visual-audio fingerprints

Abstract

Access this article

Similar content being viewed by others

Efficient video copy detection using multi-modality and dynamic path search

A robust and low-cost video fingerprint extraction method for copy detection

A spectrogram-based audio fingerprinting system for content-based copy detection

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation