Abstract
Video summarization is an effective way to browse videos quickly and to relieve the pressure of video storage. However, traditional algorithms adapt poorly to unstructured videos, because scene changes in such videos are not obvious and the algorithms ignore video structure. This paper therefore proposes an auto-encoder-based summarization algorithm for unstructured videos. The structure of each video is detected by an auto-encoder, and both the interestingness and the representativeness of each video segment are predicted from the segment's reconstruction errors. The most interesting and representative summary is then generated within a limited summary length. Experimental results show that the proposed algorithm outperforms state-of-the-art methods.
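The abstract does not say how the summary is assembled under the length limit. One common way to frame this kind of constrained selection is a 0/1 knapsack over video segments, and the sketch below illustrates that framing only; it is not the authors' method. The function name `select_segments`, the per-segment `scores` (standing in for a combined interestingness and representativeness value), the integer `lengths`, and the `budget` are all hypothetical.

```python
from typing import List, Sequence, Tuple


def select_segments(
    scores: Sequence[float],
    lengths: Sequence[int],
    budget: int,
) -> Tuple[float, List[int]]:
    """Choose segments maximizing total score within a length budget.

    Assumption (not from the paper): selection is modeled as a 0/1 knapsack
    solved by dynamic programming. `lengths` are positive integers, e.g.
    segment durations in frames or seconds.
    """
    if len(scores) != len(lengths):
        raise ValueError("scores and lengths must have the same size")
    n = len(scores)
    # best[i][b]: best total score using the first i segments within length b.
    best = [[0.0] * (budget + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        length, score = lengths[i - 1], scores[i - 1]
        for b in range(budget + 1):
            best[i][b] = best[i - 1][b]  # option 1: skip segment i-1
            if length <= b:
                candidate = best[i - 1][b - length] + score  # option 2: take it
                if candidate > best[i][b]:
                    best[i][b] = candidate
    # Walk the table backwards to recover which segments were taken.
    chosen: List[int] = []
    b = budget
    for i in range(n, 0, -1):
        if best[i][b] != best[i - 1][b]:
            chosen.append(i - 1)
            b -= lengths[i - 1]
    chosen.reverse()
    return best[n][budget], chosen


if __name__ == "__main__":
    # Hypothetical scores and lengths for four segments, with a budget of 9.
    total, picked = select_segments([0.9, 0.4, 0.7, 0.2], [5, 3, 4, 2], 9)
    print(picked, round(total, 3))
```

The dynamic program runs in time proportional to the number of segments times the budget, so it suits budgets expressed in seconds or coarse frame counts. A greedy score-per-length heuristic would be a cheaper but non-optimal alternative.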
Acknowledgements
This work was partially supported by the National Natural Science Foundation of China (No. 61370121) and the National Hi-Tech Research and Development Program (863 Program) of China (No. 2014AA015102).
Cite this article
Han, MX., Hu, HM., Liu, Y. et al. An auto-encoder-based summarization algorithm for unstructured videos. Multimed Tools Appl 76, 25039–25056 (2017). https://doi.org/10.1007/s11042-017-4485-4
DOI: https://doi.org/10.1007/s11042-017-4485-4