Abstract
In this paper we introduce VideoGraph, a novel non-linear representation of the scene structure of a video. Unlike the classical linear, sequential organization, VideoGraph condenses video content along the timeline by organizing it into scenes and renders that structure as a two-dimensional graph, which enables non-linear exploration of the scenes and their transitions. To construct a VideoGraph, we adopt a sub-shot induced method to evaluate the spatio-temporal similarity between the shot segments of a video. The scene structure is then derived by grouping similar shots and identifying the valid transitions between scenes. In the final stage, the scene structure is represented as a graph that follows the scene transition topology. VideoGraph provides a condensed, scene-level representation and supports non-linear browsing of videos. Experimental results demonstrate the effectiveness and efficiency of using VideoGraph to explore and access video content.
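The pipeline outlined in the abstract (shot similarity, grouping of similar shots into scenes, then a graph of scene transitions) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the per-shot feature vectors, the cosine similarity, the threshold of 0.9, and the union-find grouping are stand-ins, not the sub-shot induced similarity measure or the clustering used in the paper, and the sketch counts every scene change rather than filtering for "valid" transitions.

```python
from collections import defaultdict
from math import sqrt


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = sqrt(sum(x * x for x in a))
    nb = sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def group_shots(features, threshold=0.9):
    """Give each shot a scene label by merging shots whose similarity
    reaches the threshold (union-find; a hypothetical stand-in for the
    paper's grouping step)."""
    parent = list(range(len(features)))

    def find(i):
        # Follow parent links to the root, halving the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(features)):
        for j in range(i + 1, len(features)):
            if cosine(features[i], features[j]) >= threshold:
                ri, rj = find(i), find(j)
                if ri != rj:
                    parent[max(ri, rj)] = min(ri, rj)

    # Relabel roots as 0, 1, 2, ... in order of first appearance.
    labels, mapping = [], {}
    for i in range(len(features)):
        root = find(i)
        mapping.setdefault(root, len(mapping))
        labels.append(mapping[root])
    return labels


def scene_transitions(labels):
    """Count directed transitions between different scenes, in shot order."""
    edges = defaultdict(int)
    for a, b in zip(labels, labels[1:]):
        if a != b:
            edges[(a, b)] += 1
    return dict(edges)


# Toy 2-D features for six shots (hypothetical data, not from the paper).
shots = [(1.0, 0.0), (0.0, 1.0), (0.98, 0.05),
         (0.03, 0.99), (0.7, 0.7), (1.0, 0.02)]
labels = group_shots(shots)          # scene label per shot
graph = scene_transitions(labels)    # directed edge -> transition count
```

In this toy example the six shots collapse into three scenes, and the resulting edge dictionary is the kind of scene-level structure that a graph layout tool could then draw in two dimensions.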

Acknowledgements
We thank the anonymous reviewers for their valuable comments. This work was partly supported by grants from the National Natural Science Foundation of China (Nos. 61133008, 61103159) and the Excellent Young Scholars Research Fund of Beijing Institute of Technology (No. 2012YR0709).
Cite this article
Zhang, L., Xu, QK., Nie, LZ. et al. VideoGraph: a non-linear video representation for efficient exploration. Vis Comput 30, 1123–1132 (2014). https://doi.org/10.1007/s00371-013-0882-5
DOI: https://doi.org/10.1007/s00371-013-0882-5