Adaptive Video Summarization via Robust Representation and Structured Sparsity

Sheng, Manjin; Shi, Jiayu; Sun, Dengdi; Ding, Zhuanlian; Luo, Bin

doi:10.1007/978-3-030-39431-8_19

Manjin Sheng¹⁶,
Jiayu Shi¹⁶,
Dengdi Sun¹⁶,
Zhuanlian Ding¹⁷ &
…
Bin Luo¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11691))

Included in the following conference series:

International Conference on Brain Inspired Cognitive Systems

1144 Accesses

Abstract

To improve faster browsing and more efficient content indexing of huge video collections, video summarization has emerged as an important area of research for the multimedia community. One of the mechanisms to generate video summaries is to extract keyframes which represent the most important content of the video. However, there are still some problems like image imperfection and noise interference, which seriously affect the performance of keyframe selection. Aiming at above problems, in this paper, we propose a linear reconstruction framework to summarize the videos. The first model in our framework seeks the most informative keyframes (base vectors) using the structure sparsity of the \(\ell _{21}\) norm regularization, to represent all the frames as the linear combination of them in a video. Furthermore, we also propose another more robust model via \(\ell _{21}\) norm based loss function to suppress the outlier, and form the joint sparsity with \(\ell _{21}\) norm regularization. For the optimization, we design two efficient algorithms for two proposed models respectively. Finally the extensive experiments on real world video datesets are presented to show the effectiveness of the proposed framework.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Elkhattabi, Z., Tabii, Y., Benkaddour, A.: Video summarization: techniques and applications. Int. J. Comput. Electr. Autom. Control Inf. Eng. 9(4), 928–933 (2015)
Google Scholar
Khosla, A., Hamid, R., Lin, C.J., Sundaresan, N.: Large-scale video summarization using web-image priors. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2698–2705 (2013)
Google Scholar
Ren, J., Jiang, J., Feng, Y.: Activity-driven content adaptation for effective video summarization. J. Vis. Commun. Image Represent. 21(8), 930–938 (2010)
Article Google Scholar
Ren, J., Jiang, J.: Hierarchical modeling and adaptive clustering for real-time summarization of rush videos. IEEE Trans. Multimed. 11(5), 906–917 (2009)
Article Google Scholar
Ejaz, N., Tariq, T., Balik, S.: Adaptive key frame extraction for video summarization using an aggregating mechanism. J. Vis. Commun. Image Represent. 23, 1031–1040 (2012)
Article Google Scholar
Yu, K., Bi, J., Tresp, V.: Active learning via transductive experimental design. In: International Conference on Machine Learning, pp. 1081–1088. ACM (2006)
Google Scholar
Elhamifar, E., Sapiro, G., Vidal, R.: See all by looking at a few: sparse modeling for finding representative objects. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1600–1607. IEEE (2012)
Google Scholar
Lin, Z., Chen, M., Ma, Y.: The augmented lagrange multiplier method for exact recovery of corrupted low-rank matrices. arXiv preprint arXiv:1009.5055 (2010)
Liu, G., Lin, Z., Yu, Y.: Robust subspace segmentation by low-rank representation. In: International Conference on Machine Learning, pp. 663–670 (2010)
Google Scholar
Nie, F., Wang, H., Huang, H., Ding, C.: Early active learning via robust representation and structured sparsity. In: International Joint Conference on Artificial Intelligence, pp. 1572–1578 (2013)
Google Scholar
http://www.open-video.org/
Dang, C., Radha, H.: RPCA-KFE: key frame extraction for video using robust principal component analysis. IEEE Trans. Image Process. 24(11), 3742–3753 (2015)
Article MathSciNet Google Scholar
Kim, G., Sigal, L., Xing, E.: Joint summarization of large-scale collections of web images and videos for storyline reconstruction. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 4225–4232. IEEE (2014)
Google Scholar

Download references

Acknowledgments

This work was supported by the Key Natural Science Project of Anhui Provincial Education Department (KJ2018A0023), the Guangdong Province Science and Technology Plan Projects (2017B010110011), the Anhui Key Research and Development Plan (1804a09020101), the National Basic Research Program (973 Program) of China (2015CB351705), the National Natural Science Foundation of China (61906002, 61402002, 61876002 and 61860206004) and 2018 College Students Innovation and Entrepreneurship Training Program (201810357352).

Author information

Authors and Affiliations

School of Computer Science and Technology, Anhui University, Hefei, 230601, China
Manjin Sheng, Jiayu Shi, Dengdi Sun & Bin Luo
School of Internet, Anhui University, Hefei, 230039, China
Zhuanlian Ding

Authors

Manjin Sheng
View author publications
You can also search for this author in PubMed Google Scholar
Jiayu Shi
View author publications
You can also search for this author in PubMed Google Scholar
Dengdi Sun
View author publications
You can also search for this author in PubMed Google Scholar
Zhuanlian Ding
View author publications
You can also search for this author in PubMed Google Scholar
Bin Luo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Dengdi Sun .

Editor information

Editors and Affiliations

University of Strathclyde, Glasgow, UK
Jinchang Ren
Edinburgh Napier University, Edinburgh, UK
Amir Hussain
Guangdong Polytechnic Normal University, Guangzhou, China
Huimin Zhao
Xi’an Jiaotong-Liverpool University, Suzhou, China
Kaizhu Huang
Northwestern Polytechnical University, Xi'an, China
Jiangbin Zheng
Guangdong Polytechnic Normal University, Guangzhou, China
Jun Cai
Guangdong Polytechnic Normal University, Guangzhou, China
Rongjun Chen
Guangdong Polytechnic Normal University, Guangzhou, China
Yinyin Xiao

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sheng, M., Shi, J., Sun, D., Ding, Z., Luo, B. (2020). Adaptive Video Summarization via Robust Representation and Structured Sparsity. In: Ren, J., et al. Advances in Brain Inspired Cognitive Systems. BICS 2019. Lecture Notes in Computer Science(), vol 11691. Springer, Cham. https://doi.org/10.1007/978-3-030-39431-8_19

Download citation

DOI: https://doi.org/10.1007/978-3-030-39431-8_19
Published: 01 February 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-39430-1
Online ISBN: 978-3-030-39431-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics