Abstract
In recent years, the amount of video data has been increasing explosively, and the requirements for video summarization technology have also increased. Video summarization is a summary of the video. By browsing the video summarization, users can quickly understand the content of the video. The traditional video summarization algorithms extract the global features of the video frames to form video summarization. However, these algorithms have obvious disadvantages. Therefore, we propose a method to generate video summarization by fusing the global and local features of video frames, and clustering video frames by DBSCAN algorithm. By comparing with the video summarization manually selected by multiple users, we achieve better results on OVP and YouTube datasets than previous algorithms.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Basavarajaiah, M., Sharma, P.: GVSUM: generic video summarization using deep visual features. Multimed. Tools Appl. 1–18 (2021)
Chamasemani, F.F., Affendey, L.S., Mustapha, N., Khalid, F.: Video abstraction using density-based clustering algorithm. Vis. Comput. 34(10), 1299–1314 (2017). https://doi.org/10.1007/s00371-017-1432-3
Kannappan, S., Liu, Y., Tiddeman, B.: Human consistency evaluation of static video summaries. Multimed. Tools Appl. 78(9), 12281–12306 (2018). https://doi.org/10.1007/s11042-018-6772-0
Fu, Y., Liu, H., Cheng, Y., et al.: Key-frame selection in WCE video based on shot detection. In: IEEE Intelligent Control and Automation, pp. 5030–5034 (2012)
Jiang, M., Sadka, A., Crookes, D.: Advances in video summarization and skimming. In: Grgic, M., Delac, K., Ghanbari, M., (eds.) Recent Advances in Multimedia Signal Processing and Communications, pp. 27–50. Springer, Berlin (2009). https://doi.org/10.1007/978-3-642-02900-4_2
Zhang, Q., Yu, S.-P., Zhou, D.-S., Wei, X.-P.: An efficient method of key-frame extraction based on a cluster algorithm. J. Hum. Kinet. 39, 5–13 (2013)
Mei, S., Guan, G., Wang, Z., Wan, S., He, M., Feng, D.D.: Video summarization via minimum sparse reconstruction. Pattern Recogn. 48, 522–533 (2015)
de Avila, S.E.F., Lopes, A.P.B., da Luz, A., de Albuquerque Araújo, A.: VSUMM: a mechanism designed to produce static video summaries and a novel evaluation method. Pattern Recogn. Lett. 32, 56–68 (2011)
Zhang, K., Chao, W.-L., Sha, F., Grauman, K.: Video summarization with long short-term memory. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9911, pp. 766–782. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46478-7_47
Trémeau, A., Tominaga, S., Plataniotis, K.N.: Color in image and video processing: most recent trends and future research directions. EURASIP J. Image Video Process. 2008, 1–26 (2008)
James, I.S.P., Angeline, D.M.D.: HSV color histogram based content based image retrieval. Digit. Image Process. 4, 440–443 (2012)
Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement. arXiv preprint arXiv:1804.02767 (2018)
Bochkovskiy, A., Wang, C.-Y., Liao, H.-Y.M.: YOLOv4: optimal speed and accuracy of object detection. arXiv preprint (2020)
Suykens, J.K., Van Gestel, T., Vandewalle, J., De Moor, B.: A support vector machine formulation to PCA analysis and its kernel version. IEEE Trans. Neural Netw. Learn. Syst. 14, 447–450 (2003)
MacQueen, J.: Some methods for classification and analysis of multivariate observations. University of California Press (1967)
Aljarah, I., Faris, H., Mirjalili, S.: Evolutionary Data Clustering: Algorithms and Applications. The University of Jordan, Amman (2021)
DeMenthon, D., Kobla, V., Doermann, D.: Video summarization by curve simplification. In: Multimedia, pp. 211–218 (1998)
Mundur, P., Rao, Y., Yesha, Y.: Keyframe-based video summarization using Delaunay clustering. Int. J. Digit. Libr. 6, 219–232 (2006)
Furini, M., Geraci, F., Montangero, M., Pellegrini, M.: STIMO: STIll and MOving video storyboard for the web scenario. Multimed. Tools Appl. 46, 47–69 (2010)
Acknowledgment
This work was supported in part by the National Natural Science Foundation of China under Grants 61976079 & 61672203, in part by Anhui Natural Science Funds for Distinguished Young Scholar under Grant 170808J08, and in part by Anhui Key Research and Development Program under Grant 202004a05020039.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Tian, WD., Cheng, XY., He, B., Zhao, ZQ. (2021). VISFF: An Approach for Video Summarization Based on Feature Fusion. In: Huang, DS., Jo, KH., Li, J., Gribova, V., Hussain, A. (eds) Intelligent Computing Theories and Application. ICIC 2021. Lecture Notes in Computer Science(), vol 12837. Springer, Cham. https://doi.org/10.1007/978-3-030-84529-2_4
Download citation
DOI: https://doi.org/10.1007/978-3-030-84529-2_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-84528-5
Online ISBN: 978-3-030-84529-2
eBook Packages: Computer ScienceComputer Science (R0)