VISCOM: A robust video summarization approach using color co-occurrence matrices

Mussel Cirne, Marcos Vinicius; Pedrini, Helio

doi:10.1007/s11042-016-4300-7

VISCOM: A robust video summarization approach using color co-occurrence matrices

Published: 06 January 2017

Volume 77, pages 857–875, (2018)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Marcos Vinicius Mussel Cirne¹ &
Helio Pedrini¹

741 Accesses
25 Citations
Explore all metrics

Abstract

Video summarization techniques have allowed the content analysis of large volumes of digital video sequences of different categories, such as movies, documentaries, lectures, sports, surveillance, and news. This paper proposes and evaluates a novel video summarization approach called VISCOM, which is based on color co-occurrence matrices to describe the video frames and generate a synopsis with the most representative frames. Experiments conducted on two different data sets of various genres demonstrate the effectiveness of the proposed method in terms of quality. The resulting video summaries are compared against several others using a specific quantitative evaluation metric, producing competitive outcomes among the evaluated methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

KSUMM: A Compressed Domain Technique for Video Summarization Using Partial Decoding of Videos

Video summarization via exploring the global and local importance

Article 03 January 2018

Tongling Hu & Zechao Li

A Domain Independent Approach to Video Summarization

References

Ajmal M, Ashraf MH, Shakir M, Abbas Y, Shah FA (2012) Video summarization: techniques and classification. In: International Conference on Computer Vision and Graphics. Springer, Warsaw, Poland, pp 1–13
Almeida J, Leite NJ, Torres RS (2012) VISON: VIdeo summarization for ONline applications. Pattern Recogn Lett 33(4):397–409
Article Google Scholar
Amel AM, Abdessalem BA, Abdellatif M (2010) Video shot boundary detection using motion activity descriptor. J Telecommun 2(1):54–59
Google Scholar
Angadi S, Naik V (2014) Entropy Based Fuzzy C Means Clustering and Key Frame Extraction for Sports Video Summarization. In: IEEE Fifth International Conference on Signal and Image Processing, Bangalore, India, pp 271–279
Apostolidis E, Mezaris V (2014) Fast Shot Segmentation Combining Global and Local Visual Descriptors. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp 6583–6587
Arvis V, Debain C, Berducat M, Benassi A (2011) Generalization of the coocurrence matrix for colour images: application to colour texture classification. Image Anal Stereol 23(1):63–72
Article MATH Google Scholar
Baber J, Afzulpurkar N, Dailey M, Bakhtyar M (2011) Shot boundary detection from videos using entropy and local descriptor
Benni V, Dinesh R, Punitha P, Rao V (2015) Keyframe extraction and shot boundary detection using eigen values. Int J Inform Electron Eng 5(1):40–45
Article Google Scholar
Birinci M, Kiranyaz S (2014) A perceptual scheme for fully automatic video shot boundary detection. Signal Process: Image Commun 29(3):410–423
Google Scholar
Borth D, Ulges A, Schulze C, Breuel T (2008) Keyframe extraction for video tagging and summarization. In: G. fur Informatik (ed.) Informatiktage 2008. GI, pp 45–48
Boulkenafet Z, Komulainen J, Hadid A (2015) Face anti-spoofing based on color texture analysis. In: IEEE International Conference on Image Processing. IEEE, Québec City, Canada, pp 2636–2640
Cirne MVM, Pedrini H (2013) Video summarization method based on spectral clustering. In: 18th Iberoamerican Congress on Pattern Recognition, vol 8259, Havana, Cuba, pp 479–486
Cirne MVM, Pedrini H (2014) Summarization of videos by image quality assessment. In: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, Lecture Notes in Computer Science, vol 8827. Springer International Publishing, pp 901–908
De Avila SEF, Lopes APB, Da Luz Jr. A., De Albuquerque Araújo A. (2011) VSUMM: A mechanism designed to produce static video summaries and a novel evaluation method. Pattern Recogn Lett 32(1):56–68
Article Google Scholar
De Bruyne S, De Cock J, Poppe C, Hollemeersch CF, Lambert P, Van de Walle R (2011) Compressed-domain shot boundary detection for H.264/AVC using intra partitioning maps. In: Advances in Multimedia Modeling, Lecture Notes in Computer Science, vol 6523. Springer, Berlin Heidelberg, pp 29–39
Ejaz N, Tariq TB, Baik SW (2012) Adaptive key frame extraction for video summarization using an aggregation mechanism. J Vis Commun Image Represent 23 (7):1031–1040
Article Google Scholar
Fang H, Jiang J, Feng Y (2006) A Fuzzy Logic Approach for Detection of Video Shot Boundaries. Pattern Recogn 39(11):2092–2100
Article MATH Google Scholar
Furini M, Geraci F, Montangero M, Pellegrini M (2010) STIMO: STIll and MOving video storyboard for the Web scenario. In: Multimedia Tools and Applications, vol 46. Kluwer Academic Publishers, MA,USA, pp 47–69
Gharbi H, Massaoudi M, Bahroun S, Zagrouba E (2016) Key frames extraction based on local features for efficient video summarization. In: Blanc-Talon J., Distante C., Philips W., Popescu D., Scheunders P. (eds) 17th International Conference Advanced Concepts for Intelligent Vision Systems. Springer International Publishing, pp 275–285
Gonzalez TF (1985) Clustering to minimize the maximum intercluster distance. Theor Comput Sci 38:293–306
Article MathSciNet MATH Google Scholar
Guan G, Wang Z, Lu S, Deng JD, Feng DD (2013) Keypoint-Based Keyframe Selection. IEEE Trans Circ Syst Video Technol 23(4):729–734. doi:10.1109/TCSVT.2012.2214871
Haralick R, Shanmugam K, Dinstein I (1973) Textural features for image classification. IEEE Trans Syst, Man Cybern 3(6):610–621
Article Google Scholar
Hild F, Roux S (2012) Comparison of local and global approaches to digital image correlation. Exper Mech 52(9):1503–1519
Article Google Scholar
Islam MB, Kundu K, Ahmed A (2014) Texture feature based image retrieval algorithms. Int J Eng Tech Res 2:170–173
Google Scholar
Jacob IJ, Srinivasagan K, Jayapriya K (2014) Local oppugnant color texture pattern for image retrieval system. Pattern Recogn Lett 42:72–78
Article Google Scholar
Janwe N, Bhoyar K (2013) Video shot boundary detection based on JND color histogram. In: IEEE Second International Conference on Image Information Processing, pp 476–480
Jiang H, Zhang G, Wang H, Bao H (2015) Spatio-temporal video segmentation of static scenes and its applications. IEEE Trans Multimed 17(1):3–15
Article Google Scholar
Jiang X, Sun T, Liu J, Chao J, Zhang W (2013) An adaptive video shot segmentation scheme based on dual-detection model. Neurocomputing 116:102–111
Article Google Scholar
Katti H, Yadati K, Kankanhalli M, Tat-Seng C (2011) Affective Video Summarization and Story Board Generation using Pupillary Dilation and Eye Gaze. In: IEEE International Symposium on Multimedia. Dana Point, CA, USA, pp 319–326
Lavanya AL, Sreepada R (2012) A generic frame work for image data clustering via weighted clustering ensemble. Int J Comput Sci Inf Technol 3:5429–5433
Google Scholar
Lee K, Kolsch M (2015) Shot boundary detection with graph theory using keypoint features and color histograms
Lee YJ, Ghosh J, Grauman K (2012) Discovering Important People and Objects for Egocentric Video Summarization. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 1346–1353
Lin T, Zhang H (2000) Automatic video scene extraction by shot grouping. Int Conf Pattern Recogn 4:39–42
Article Google Scholar
Lu Z, Grauman K (2013) Story-Driven Summarization for Egocentric Video. In: IEEE Conference on Computer Vision and Pattern Recognition. IEEE Computer Society, DC, USA, pp 2714–2721
Lu ZM, Shi Y (2013) Fast video shot boundary detection based on SVD and pattern matching. IEEE Trans Image Process 22(12):5136–5145
Article MathSciNet Google Scholar
Luan Q, Song M, Liau CY, Bu J, Liu Z, Sun MT (2014) Video summarization based on nonnegative linear reconstruction. In: IEEE International Conference on Multimedia and Expo, Chengdu, China, pp 1–6
Mahmoud KM, Ismail MA, Ghanem NM (2013) VSCAN: An Enhanced Video Summarization Using Density-Based Spatial Clustering. In: Lecture Notes in Computer Science, vol 8156. Springer, pp 733–742
Mei S, Guan G, Wang Z, Wan S, He M, Feng DD (2015) Video summarization via minimum sparse reconstruction. Pattern Recogn 48(2):522–533
Article Google Scholar
Mohanta PP, Chowdhury S, Roy A, Saha SK, Chanda B (2013) Static summarization of video scenes based on minimal spanning tree. In: Pattern Recognition and Machine Intelligence. Springer, pp 437–444
Mohanta PP, Saha SK, Chanda B (2012) A model-based shot boundary detection technique using frame transition parameters. IEEE Trans Multimed 14(1):223–233
Article Google Scholar
Müller A., Lux M, Böszörmenyi L. (2012) The video summary GWAP: Summarization of videos based on a social game. In: 12th International Conference on Knowledge Management and Knowledge Technologies. ACM, Graz, Austria, pp 1–15
Mundur P, Rao Y, Yesha Y (2006) Keyframe-based video summarization using delaunay clustering. Int J Dig Libr 6:219–232
Article Google Scholar
Ngo CW, Ma YF, Zhang H (2005) Video summarization and scene detection by graph modeling. IEEE Trans Circ Syst Video Technol 15:296–305
Article Google Scholar
OpenCV: Open Source Computer Vision (2016). http://www.opencv.org
Pal G, Rudrapaul D, Acharjee S, Ray R, Chakraborty S, Dey N (2015) Video shot boundary detection: a review. In: Satapathy S.C., Govardhan A., Raju K.S., Mandal J.K. (eds) Emerging ICT for Bridging the Future - Proceedings of the 49th Annual Convention of the Computer Society of India CSI Volume 2, Advances in Intelligent Systems and Computing, vol 338. Springer International Publishing, pp 119–127
Pan B, Wang Z (2013) Recent progress in digital image correlation. In: Application of Imaging Techniques to Mechanics of Materials and Structures, Volume 4, Conference Proceedings of the Society for Experimental Mechanics Series. Springer, New York, pp 317–326
Patel NV, Sethi IK (1997) Video shot detection and characterization for video databases. Pattern Recogn 30(4):583–592
Article Google Scholar
Patel U, Shah P, Panchal P (2013) Shot detection using pixel wise difference with adaptive threshold and color histogram method in compressed and uncompressed video. Int J Comput Appl 64(4):38–44
Google Scholar
Pfeiffer S, Lienhart R, Kuhne G, Effelsberg W (1998) The MoCA project – movie content analysis research at the university of mannheim. In: Informatik ’98 : Informatik Zwischen Bild und Sprache, 1. Springer, pp 329–338
Rodriguez MD (2010) CRAM: Compact representation of actions in movies. In: IEEE Computer Vision and Pattern Recognition, pp 3328–3335
Santos A, Pedrini H (2016) Adaptive video shot detection improved by fusion of dissimilarity measures. In: IEEE International Conference on Systems, Man, and Cybernetics, Budapest, Hungary
Santos A, Pedrini H (2016) Adaptive video transition detection based on multiscale structural dissimilarity. In: 12th International Symposium on Visual Computing, vol. Lecture Notes in Computer Science - 10072. Springer-Verlag, NV, USA, pp 181–190
Singh R, Aggarwal N (2015) Novel research in the field of shot boundary detection – a survey. In: Advances in Intelligent Informatics, Advances in Intelligent Systems and Computing, vol 320. Springer International Publishing, pp 457–469
Tian DP (2013) A review on image feature extraction and representation techniques. Int J Multimed Ubiquitous Eng 8(4):385–396
Google Scholar
The Open Video Project (2016). http://www.open-video.org
TREC Video Retrieval Evaluation: TRECVID (2016). http://trecvid.nist.gov
Unser M (1986) Sum and difference histograms for texture classification. IEEE Trans Pattern Anal Mach Intell 8(1):118–125
Article Google Scholar
VidSeg (2016). http://www.site.uottawa.ca/laganier/videoseg/
VSUMM (Video SUMMarization) (2016). https://sites.google.com/site/vsummsite
Wang M, Hong R, Li G, Zha ZJ, Yan S, Chua TS (2012) Event driven web video summarization by tag localization and key-shot identification. IEEE Trans Multimed 14(4):975–985
Article Google Scholar
Whitehead A, Bose P, Laganiere R (2004) Feature based cut detection with automatic threshold selection. In: Third International Conference on Image and Video Retrieval. Springer, Dublin, Ireland, pp 410–418
Xu J, Mukherjee L, Li Y, Warner J, Rehg JM, Singh V (2015) Gaze-enabled egocentric video summarization via constrained submodular maximization. In: IEEE Conf Comput Vis Pattern Recogn, pp 2235–2244
Yi H, Pengzhou Z, Yanfeng W (2012) Adaptive threshold based video shot boundary detection framework. In: International Conference on Image Analysis and Signal Processing, pp 1–5
Yuan Z, Lu T, Wu D, Huang Y, Yu H (2011) Video summarization with semantic concept preservation. In: 10th International Conference on Mobile and Ubiquitous Multimedia, NY, USA, pp 109–112
Zhang H, Kankanhalli A, Smoliar SW (1993) Automatic partitioning of full-motion video. Multimed Syst 1(1):10–28
Article Google Scholar
Zhu X, Loy CC, Gong S (2013) Video synopsis by heterogeneous multi-source correlation. In: IEEE International Conference on Computer Vision. IEEE Computer Society, DC, USA, pp 81–88

Download references

Acknowledgments

The authors are thankful to São Paulo Research Foundation (FAPESP grant #2015/12228-1) and Brazilian National Council for Scientific and Technological Development (CNPq grant #305169/2015-7) for their financial support.

Author information

Authors and Affiliations

Institute of Computing, University of Campinas, Campinas, SP, 13083-852, Brazil
Marcos Vinicius Mussel Cirne & Helio Pedrini

Authors

Marcos Vinicius Mussel Cirne
View author publications
You can also search for this author in PubMed Google Scholar
Helio Pedrini
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Helio Pedrini.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Mussel Cirne, M.V., Pedrini, H. VISCOM: A robust video summarization approach using color co-occurrence matrices. Multimed Tools Appl 77, 857–875 (2018). https://doi.org/10.1007/s11042-016-4300-7

Download citation

Received: 20 March 2016
Revised: 20 December 2016
Accepted: 22 December 2016
Published: 06 January 2017
Issue Date: January 2018
DOI: https://doi.org/10.1007/s11042-016-4300-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

VISCOM: A robust video summarization approach using color co-occurrence matrices

Abstract

Access this article

Similar content being viewed by others

KSUMM: A Compressed Domain Technique for Video Summarization Using Partial Decoding of Videos

Video summarization via exploring the global and local importance

A Domain Independent Approach to Video Summarization

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

VISCOM: A robust video summarization approach using color co-occurrence matrices

Abstract

Access this article

Similar content being viewed by others

KSUMM: A Compressed Domain Technique for Video Summarization Using Partial Decoding of Videos

Video summarization via exploring the global and local importance

A Domain Independent Approach to Video Summarization

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation