Codebook-Based Near-Duplicate Video Detection

Hernández, Guillermo; Arrieta, Angélica González; Novais, Paulo; Rodríguez, Sara

doi:10.1007/978-3-030-87869-6_27

Guillermo Hernández¹⁹,
Angélica González Arrieta¹⁹,
Paulo Novais²⁰ &
…
Sara Rodríguez¹⁹

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1401))

Included in the following conference series:

International Workshop on Soft Computing Models in Industrial and Environmental Applications

1000 Accesses

Abstract

In the current context of monetization of multimedia content, it is common to see the appearance of edited replicas of popular videos to take advantage of the momentum of those. In this work, several parameters of near-duplicate video detection systems based on codebooks are studied using techniques from the field of information retrieval. As a result, a system with high average precision, usually higher than 85%, is obtained. Several hyperparameters of the system, such as the aggregation mechanisms and the retrieval model, are analyzed, thus adjusting the system for optimal performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 219.00; Price excludes VAT (USA)

Softcover Book: USD 279.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Soha, M., McDowell, Z.J.: Monetizing a meme: Youtube, content id, and the harlem shake. Soc. Media Soc. 2(1), 2056305115623801 (2016)
Google Scholar
More than 500 hours of content are now being uploaded to youtube every minute - tubefilter. https://www.tubefilter.com/2019/05/07/number-hours-video-uploaded-to-youtube-per-minute/. (Accessed on 2021-04-29)
Press - youtube. https://www.youtube.com/intl/en/about/press/. Accessed 29 Apr 2021
Wu, X., Ngo, C.-W., Hauptmann, A.G., Tan, H.-K.: Real-time near-duplicate elimination for web video search with content and context. IEEE Trans. Multimedia 11(2), 196–207 (2009)
Article Google Scholar
Wu, X., Hauptmann, A.G., Ngo, C.-W.: Practical elimination of near-duplicates from web video search. In: Proceedings of the 15th ACM International Conference on Multimedia, pp. 218–227. ACM (2007)
Google Scholar
Li, T., Nian, F., Wu, X., Gao, Q., Lu, Y.: Efficient video copy detection using multi-modality and dynamic path search. Multimedia Syst. 22(1), 29–39 (2014). https://doi.org/10.1007/s00530-014-0387-8
Article Google Scholar
Guzman-Zavaleta, Z.J., Feregrino-Uribe, C., Morales-Sandoval, M., Menendez-Ortiz, A.: A robust and low-cost video fingerprint extraction method for copy detection. Multimedia Tools Appl. 76(22), 24143–24163 (2016). https://doi.org/10.1007/s11042-016-4168-6
Article Google Scholar
Guzman-Zavaleta, Z.J., Feregrino-Uribe, C.: Partial-copy detection of non-simulated videos using learning at decision level. Multimedia Tools Appl. 78(2), 2427–2446 (2018). https://doi.org/10.1007/s11042-018-6345-2
Article Google Scholar
Hu, Y., Lu, X.: Learning spatial-temporal features for video copy detection by the combination of CNN and RNN. J. Vis. Commun. Image Representation 55, 21–29 (2018)
Article Google Scholar
Zhang, X., Xie, Y., Luan, X., He, J., Zhang, L., Wu, L.: Video copy detection based on deep CNN features and graph-based sequence matching. Wirel. Pers. Commun. 103(1), 401–416 (2018)
Article Google Scholar
Law-To, J., Buisson, O., Gouet-Brunet, V., Boujemaa, N.: Vicopt: a robust system for content-based video copy detection in large databases. Multimedia Syst. 15(6), 337–353 (2009)
Article Google Scholar
Liu, H., Zhao, Q., Wang, H., Lv, P., Chen, Y.: An image-based near-duplicate video retrieval and localization using improved edit distance. Multimedia Tools Appl. 76(22), 24435–24456 (2017)
Article Google Scholar
Liao, K., Liu, G.: An efficient content based video copy detection using the sample based hierarchical adaptive k-means clustering. J. Intell. Inf. Syst. 44(1), 133–158 (2014). https://doi.org/10.1007/s10844-014-0332-5
Article Google Scholar
Su, P.-C., Wu, C.-S.: Efficient copy detection for compressed digital videos by spatial and temporal feature extraction. Multimedia Tools Appl. 76(1), 1331–1353 (2015). https://doi.org/10.1007/s11042-015-3132-1
Article Google Scholar
Boukhari, A., Serir, A.: Weber binarized statistical image features (WBSIF) based video copy detection. J. Vis. Commun. Image Representation 34, 50–64 (2016)
Article Google Scholar
Kordopatis-Zilos, G., Papadopoulos, S., Patras, I., Kompatsiaris, Y.: Near-duplicate video retrieval by aggregating intermediate CNN layers. In: Amsaleg, L., Guðmundsson, G.Þ, Gurrin, C., Jónsson, B.Þ, Satoh, S. (eds.) MMM 2017. LNCS, vol. 10132, pp. 251–263. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-51811-4_21
Chapter Google Scholar
Panagiotakis, C., Doulamis, A., Tziritas, G.: Equivalent key frames selection based on ISO-content principles. IEEE Trans. Circ. Syst. Video Technol. 19(3), 447–451 (2009)
Article Google Scholar
Kumar, M., Paul, A., Kavitha, J., Arockia, P., Rani, J.: Key-frame extraction techniques: a review. Recent Pat. Comput. Sci. 11(1), 3–16 (2018)
Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Google Scholar
Sculley, D.: Web-scale k-means clustering. In: Proceedings of the 19th International Conference on World Wide Web, pp. 1177–1178. ACM (2010)
Google Scholar
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manage. 24(5), 513–523 (1988)
Article Google Scholar
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison-Wesley (1999)
Google Scholar

Download references

Acknowledgments

One of the authors (G.H.) gratefully acknowledges the Junta de Castilla y León and the European Regional Development Fund for financial support.

Author information

Authors and Affiliations

Bisite Research Group, University of Salamanca, Salamanca, Spain
Guillermo Hernández, Angélica González Arrieta & Sara Rodríguez
ALGORITMI Centre, University of Minho, Guimarães, Portugal
Paulo Novais

Authors

Guillermo Hernández
View author publications
You can also search for this author in PubMed Google Scholar
Angélica González Arrieta
View author publications
You can also search for this author in PubMed Google Scholar
Paulo Novais
View author publications
You can also search for this author in PubMed Google Scholar
Sara Rodríguez
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Guillermo Hernández .

Editor information

Editors and Affiliations

Faculty of Engineering, University of Deusto, Bilbao, Spain
Hugo Sanjurjo González
Faculty of Engineering, University of Deusto, Bilbao, Spain
Iker Pastor López
Faculty of Engineering, University of Deusto, Bilbao, Spain
Pablo García Bringas
Department of Industrial Engineering, University of A Coruña, Ferrol, Spain
Héctor Quintián
BISITE Research Group, University of Salamanca, Salamanca, Spain
Emilio Corchado

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hernández, G., Arrieta, A.G., Novais, P., Rodríguez, S. (2022). Codebook-Based Near-Duplicate Video Detection. In: Sanjurjo González, H., Pastor López, I., García Bringas, P., Quintián, H., Corchado, E. (eds) 16th International Conference on Soft Computing Models in Industrial and Environmental Applications (SOCO 2021). SOCO 2021. Advances in Intelligent Systems and Computing, vol 1401. Springer, Cham. https://doi.org/10.1007/978-3-030-87869-6_27

Download citation

DOI: https://doi.org/10.1007/978-3-030-87869-6_27
Published: 23 September 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87868-9
Online ISBN: 978-3-030-87869-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics