Abstract
In the current context of monetization of multimedia content, it is common to see the appearance of edited replicas of popular videos to take advantage of the momentum of those. In this work, several parameters of near-duplicate video detection systems based on codebooks are studied using techniques from the field of information retrieval. As a result, a system with high average precision, usually higher than 85%, is obtained. Several hyperparameters of the system, such as the aggregation mechanisms and the retrieval model, are analyzed, thus adjusting the system for optimal performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Soha, M., McDowell, Z.J.: Monetizing a meme: Youtube, content id, and the harlem shake. Soc. Media Soc. 2(1), 2056305115623801 (2016)
More than 500 hours of content are now being uploaded to youtube every minute - tubefilter. https://www.tubefilter.com/2019/05/07/number-hours-video-uploaded-to-youtube-per-minute/. (Accessed on 2021-04-29)
Press - youtube. https://www.youtube.com/intl/en/about/press/. Accessed 29 Apr 2021
Wu, X., Ngo, C.-W., Hauptmann, A.G., Tan, H.-K.: Real-time near-duplicate elimination for web video search with content and context. IEEE Trans. Multimedia 11(2), 196–207 (2009)
Wu, X., Hauptmann, A.G., Ngo, C.-W.: Practical elimination of near-duplicates from web video search. In: Proceedings of the 15th ACM International Conference on Multimedia, pp. 218–227. ACM (2007)
Li, T., Nian, F., Wu, X., Gao, Q., Lu, Y.: Efficient video copy detection using multi-modality and dynamic path search. Multimedia Syst. 22(1), 29–39 (2014). https://doi.org/10.1007/s00530-014-0387-8
Guzman-Zavaleta, Z.J., Feregrino-Uribe, C., Morales-Sandoval, M., Menendez-Ortiz, A.: A robust and low-cost video fingerprint extraction method for copy detection. Multimedia Tools Appl. 76(22), 24143–24163 (2016). https://doi.org/10.1007/s11042-016-4168-6
Guzman-Zavaleta, Z.J., Feregrino-Uribe, C.: Partial-copy detection of non-simulated videos using learning at decision level. Multimedia Tools Appl. 78(2), 2427–2446 (2018). https://doi.org/10.1007/s11042-018-6345-2
Hu, Y., Lu, X.: Learning spatial-temporal features for video copy detection by the combination of CNN and RNN. J. Vis. Commun. Image Representation 55, 21–29 (2018)
Zhang, X., Xie, Y., Luan, X., He, J., Zhang, L., Wu, L.: Video copy detection based on deep CNN features and graph-based sequence matching. Wirel. Pers. Commun. 103(1), 401–416 (2018)
Law-To, J., Buisson, O., Gouet-Brunet, V., Boujemaa, N.: Vicopt: a robust system for content-based video copy detection in large databases. Multimedia Syst. 15(6), 337–353 (2009)
Liu, H., Zhao, Q., Wang, H., Lv, P., Chen, Y.: An image-based near-duplicate video retrieval and localization using improved edit distance. Multimedia Tools Appl. 76(22), 24435–24456 (2017)
Liao, K., Liu, G.: An efficient content based video copy detection using the sample based hierarchical adaptive k-means clustering. J. Intell. Inf. Syst. 44(1), 133–158 (2014). https://doi.org/10.1007/s10844-014-0332-5
Su, P.-C., Wu, C.-S.: Efficient copy detection for compressed digital videos by spatial and temporal feature extraction. Multimedia Tools Appl. 76(1), 1331–1353 (2015). https://doi.org/10.1007/s11042-015-3132-1
Boukhari, A., Serir, A.: Weber binarized statistical image features (WBSIF) based video copy detection. J. Vis. Commun. Image Representation 34, 50–64 (2016)
Kordopatis-Zilos, G., Papadopoulos, S., Patras, I., Kompatsiaris, Y.: Near-duplicate video retrieval by aggregating intermediate CNN layers. In: Amsaleg, L., Guðmundsson, G.Þ, Gurrin, C., Jónsson, B.Þ, Satoh, S. (eds.) MMM 2017. LNCS, vol. 10132, pp. 251–263. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-51811-4_21
Panagiotakis, C., Doulamis, A., Tziritas, G.: Equivalent key frames selection based on ISO-content principles. IEEE Trans. Circ. Syst. Video Technol. 19(3), 447–451 (2009)
Kumar, M., Paul, A., Kavitha, J., Arockia, P., Rani, J.: Key-frame extraction techniques: a review. Recent Pat. Comput. Sci. 11(1), 3–16 (2018)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Sculley, D.: Web-scale k-means clustering. In: Proceedings of the 19th International Conference on World Wide Web, pp. 1177–1178. ACM (2010)
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manage. 24(5), 513–523 (1988)
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison-Wesley (1999)
Acknowledgments
One of the authors (G.H.) gratefully acknowledges the Junta de Castilla y León and the European Regional Development Fund for financial support.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Hernández, G., Arrieta, A.G., Novais, P., Rodríguez, S. (2022). Codebook-Based Near-Duplicate Video Detection. In: Sanjurjo González, H., Pastor López, I., García Bringas, P., Quintián, H., Corchado, E. (eds) 16th International Conference on Soft Computing Models in Industrial and Environmental Applications (SOCO 2021). SOCO 2021. Advances in Intelligent Systems and Computing, vol 1401. Springer, Cham. https://doi.org/10.1007/978-3-030-87869-6_27
Download citation
DOI: https://doi.org/10.1007/978-3-030-87869-6_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87868-9
Online ISBN: 978-3-030-87869-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)