Abstract
To improve faster browsing and more efficient content indexing of huge video collections, video summarization has emerged as an important area of research for the multimedia community. One of the mechanisms to generate video summaries is to extract keyframes which represent the most important content of the video. However, there are still some problems like image imperfection and noise interference, which seriously affect the performance of keyframe selection. Aiming at above problems, in this paper, we propose a linear reconstruction framework to summarize the videos. The first model in our framework seeks the most informative keyframes (base vectors) using the structure sparsity of the \(\ell _{21}\) norm regularization, to represent all the frames as the linear combination of them in a video. Furthermore, we also propose another more robust model via \(\ell _{21}\) norm based loss function to suppress the outlier, and form the joint sparsity with \(\ell _{21}\) norm regularization. For the optimization, we design two efficient algorithms for two proposed models respectively. Finally the extensive experiments on real world video datesets are presented to show the effectiveness of the proposed framework.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Elkhattabi, Z., Tabii, Y., Benkaddour, A.: Video summarization: techniques and applications. Int. J. Comput. Electr. Autom. Control Inf. Eng. 9(4), 928–933 (2015)
Khosla, A., Hamid, R., Lin, C.J., Sundaresan, N.: Large-scale video summarization using web-image priors. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2698–2705 (2013)
Ren, J., Jiang, J., Feng, Y.: Activity-driven content adaptation for effective video summarization. J. Vis. Commun. Image Represent. 21(8), 930–938 (2010)
Ren, J., Jiang, J.: Hierarchical modeling and adaptive clustering for real-time summarization of rush videos. IEEE Trans. Multimed. 11(5), 906–917 (2009)
Ejaz, N., Tariq, T., Balik, S.: Adaptive key frame extraction for video summarization using an aggregating mechanism. J. Vis. Commun. Image Represent. 23, 1031–1040 (2012)
Yu, K., Bi, J., Tresp, V.: Active learning via transductive experimental design. In: International Conference on Machine Learning, pp. 1081–1088. ACM (2006)
Elhamifar, E., Sapiro, G., Vidal, R.: See all by looking at a few: sparse modeling for finding representative objects. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1600–1607. IEEE (2012)
Lin, Z., Chen, M., Ma, Y.: The augmented lagrange multiplier method for exact recovery of corrupted low-rank matrices. arXiv preprint arXiv:1009.5055 (2010)
Liu, G., Lin, Z., Yu, Y.: Robust subspace segmentation by low-rank representation. In: International Conference on Machine Learning, pp. 663–670 (2010)
Nie, F., Wang, H., Huang, H., Ding, C.: Early active learning via robust representation and structured sparsity. In: International Joint Conference on Artificial Intelligence, pp. 1572–1578 (2013)
Dang, C., Radha, H.: RPCA-KFE: key frame extraction for video using robust principal component analysis. IEEE Trans. Image Process. 24(11), 3742–3753 (2015)
Kim, G., Sigal, L., Xing, E.: Joint summarization of large-scale collections of web images and videos for storyline reconstruction. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 4225–4232. IEEE (2014)
Acknowledgments
This work was supported by the Key Natural Science Project of Anhui Provincial Education Department (KJ2018A0023), the Guangdong Province Science and Technology Plan Projects (2017B010110011), the Anhui Key Research and Development Plan (1804a09020101), the National Basic Research Program (973 Program) of China (2015CB351705), the National Natural Science Foundation of China (61906002, 61402002, 61876002 and 61860206004) and 2018 College Students Innovation and Entrepreneurship Training Program (201810357352).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Sheng, M., Shi, J., Sun, D., Ding, Z., Luo, B. (2020). Adaptive Video Summarization via Robust Representation and Structured Sparsity. In: Ren, J., et al. Advances in Brain Inspired Cognitive Systems. BICS 2019. Lecture Notes in Computer Science(), vol 11691. Springer, Cham. https://doi.org/10.1007/978-3-030-39431-8_19
Download citation
DOI: https://doi.org/10.1007/978-3-030-39431-8_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-39430-1
Online ISBN: 978-3-030-39431-8
eBook Packages: Computer ScienceComputer Science (R0)