VISFF: An Approach for Video Summarization Based on Feature Fusion

Tian, Wei-Dong; Cheng, Xiao-Yu; He, Bin; Zhao, Zhong-Qiu

doi:10.1007/978-3-030-84529-2_4

VISFF: An Approach for Video Summarization Based on Feature Fusion

Wei-Dong Tian¹³,
Xiao-Yu Cheng¹³,
Bin He¹³ &
…
Zhong-Qiu Zhao¹³

Conference paper
First Online: 09 August 2021

1346 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12837))

Abstract

In recent years, the amount of video data has been increasing explosively, and the requirements for video summarization technology have also increased. Video summarization is a summary of the video. By browsing the video summarization, users can quickly understand the content of the video. The traditional video summarization algorithms extract the global features of the video frames to form video summarization. However, these algorithms have obvious disadvantages. Therefore, we propose a method to generate video summarization by fusing the global and local features of video frames, and clustering video frames by DBSCAN algorithm. By comparing with the video summarization manually selected by multiple users, we achieve better results on OVP and YouTube datasets than previous algorithms.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Basavarajaiah, M., Sharma, P.: GVSUM: generic video summarization using deep visual features. Multimed. Tools Appl. 1–18 (2021)
Google Scholar
Chamasemani, F.F., Affendey, L.S., Mustapha, N., Khalid, F.: Video abstraction using density-based clustering algorithm. Vis. Comput. 34(10), 1299–1314 (2017). https://doi.org/10.1007/s00371-017-1432-3
Article Google Scholar
Kannappan, S., Liu, Y., Tiddeman, B.: Human consistency evaluation of static video summaries. Multimed. Tools Appl. 78(9), 12281–12306 (2018). https://doi.org/10.1007/s11042-018-6772-0
Article Google Scholar
Fu, Y., Liu, H., Cheng, Y., et al.: Key-frame selection in WCE video based on shot detection. In: IEEE Intelligent Control and Automation, pp. 5030–5034 (2012)
Google Scholar
Jiang, M., Sadka, A., Crookes, D.: Advances in video summarization and skimming. In: Grgic, M., Delac, K., Ghanbari, M., (eds.) Recent Advances in Multimedia Signal Processing and Communications, pp. 27–50. Springer, Berlin (2009). https://doi.org/10.1007/978-3-642-02900-4_2
Zhang, Q., Yu, S.-P., Zhou, D.-S., Wei, X.-P.: An efficient method of key-frame extraction based on a cluster algorithm. J. Hum. Kinet. 39, 5–13 (2013)
Article Google Scholar
Mei, S., Guan, G., Wang, Z., Wan, S., He, M., Feng, D.D.: Video summarization via minimum sparse reconstruction. Pattern Recogn. 48, 522–533 (2015)
Article Google Scholar
de Avila, S.E.F., Lopes, A.P.B., da Luz, A., de Albuquerque Araújo, A.: VSUMM: a mechanism designed to produce static video summaries and a novel evaluation method. Pattern Recogn. Lett. 32, 56–68 (2011)
Google Scholar
Zhang, K., Chao, W.-L., Sha, F., Grauman, K.: Video summarization with long short-term memory. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9911, pp. 766–782. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46478-7_47
Chapter Google Scholar
Trémeau, A., Tominaga, S., Plataniotis, K.N.: Color in image and video processing: most recent trends and future research directions. EURASIP J. Image Video Process. 2008, 1–26 (2008)
Google Scholar
James, I.S.P., Angeline, D.M.D.: HSV color histogram based content based image retrieval. Digit. Image Process. 4, 440–443 (2012)
Google Scholar
Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement. arXiv preprint arXiv:1804.02767 (2018)
Bochkovskiy, A., Wang, C.-Y., Liao, H.-Y.M.: YOLOv4: optimal speed and accuracy of object detection. arXiv preprint (2020)
Google Scholar
Suykens, J.K., Van Gestel, T., Vandewalle, J., De Moor, B.: A support vector machine formulation to PCA analysis and its kernel version. IEEE Trans. Neural Netw. Learn. Syst. 14, 447–450 (2003)
Article Google Scholar
MacQueen, J.: Some methods for classification and analysis of multivariate observations. University of California Press (1967)
Google Scholar
Aljarah, I., Faris, H., Mirjalili, S.: Evolutionary Data Clustering: Algorithms and Applications. The University of Jordan, Amman (2021)
Google Scholar
DeMenthon, D., Kobla, V., Doermann, D.: Video summarization by curve simplification. In: Multimedia, pp. 211–218 (1998)
Google Scholar
Mundur, P., Rao, Y., Yesha, Y.: Keyframe-based video summarization using Delaunay clustering. Int. J. Digit. Libr. 6, 219–232 (2006)
Article Google Scholar
Furini, M., Geraci, F., Montangero, M., Pellegrini, M.: STIMO: STIll and MOving video storyboard for the web scenario. Multimed. Tools Appl. 46, 47–69 (2010)
Article Google Scholar

Download references

Acknowledgment

This work was supported in part by the National Natural Science Foundation of China under Grants 61976079 & 61672203, in part by Anhui Natural Science Funds for Distinguished Young Scholar under Grant 170808J08, and in part by Anhui Key Research and Development Program under Grant 202004a05020039.

Author information

Authors and Affiliations

School of Computer Science and Information Engineering, Hefei University of Technology, Hefei, 230009, China
Wei-Dong Tian, Xiao-Yu Cheng, Bin He & Zhong-Qiu Zhao

Authors

Wei-Dong Tian
View author publications
You can also search for this author in PubMed Google Scholar
Xiao-Yu Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Bin He
View author publications
You can also search for this author in PubMed Google Scholar
Zhong-Qiu Zhao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wei-Dong Tian .

Editor information

Editors and Affiliations

Tongji University, Shanghai, China
De-Shuang Huang
University of Ulsan, Ulsan, Korea (Republic of)
Kang-Hyun Jo
Shenzhen University, Shenzhen, China
Jianqiang Li
Far Eastern Branch of the Russian Academy of Sciences, Vladivostok, Russia
Valeriya Gribova
Department of Computer Science, Liverpool John Moores University, Liverpool, UK
Abir Hussain

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tian, WD., Cheng, XY., He, B., Zhao, ZQ. (2021). VISFF: An Approach for Video Summarization Based on Feature Fusion. In: Huang, DS., Jo, KH., Li, J., Gribova, V., Hussain, A. (eds) Intelligent Computing Theories and Application. ICIC 2021. Lecture Notes in Computer Science(), vol 12837. Springer, Cham. https://doi.org/10.1007/978-3-030-84529-2_4

Download citation

DOI: https://doi.org/10.1007/978-3-030-84529-2_4
Published: 09 August 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-84528-5
Online ISBN: 978-3-030-84529-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics