Abstract
This paper shows how cross-media correlation facilitates the summarization of photos and videos captured during journeys. Correlation between photos and videos arises from similar content captured in the same temporal order. We transform photos and videos into sequences of visual word histograms and adopt approximate sequence matching to find the correlation. To summarize both media, we propose that the characteristics of correlated photos be used to select important video segments for video summaries and, conversely, that the characteristics of correlated video segments be used to select important photos. Experimental results demonstrate that the proposed summarization methods take effective advantage of this correlation.
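To make the matching step concrete, the following is a minimal sketch of order-preserving alignment between a photo sequence and a video keyframe sequence, each represented as visual word histograms. The abstract does not specify the matching algorithm, so everything here is an assumption for illustration: the longest-common-subsequence style dynamic program, the histogram intersection similarity, the 0.5 threshold, and the names `histogram_intersection` and `match_sequences` are hypothetical and may differ from the authors' method.

```python
from typing import List, Sequence, Tuple

Histogram = Sequence[float]


def histogram_intersection(a: Histogram, b: Histogram) -> float:
    """Similarity of two L1-normalized histograms; 1.0 means identical."""
    return sum(min(x, y) for x, y in zip(a, b))


def match_sequences(
    photos: Sequence[Histogram],
    keyframes: Sequence[Histogram],
    threshold: float = 0.5,  # assumed value, not taken from the paper
) -> List[Tuple[int, int]]:
    """Return order-preserving (photo index, keyframe index) pairs.

    Assumption: an LCS-style dynamic program in which two items "match"
    when their histogram intersection reaches the threshold.
    """
    n, m = len(photos), len(keyframes)
    # table[i][j] = size of the best alignment of photos[:i] and keyframes[:j]
    table = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            if histogram_intersection(photos[i - 1], keyframes[j - 1]) >= threshold:
                table[i][j] = table[i - 1][j - 1] + 1
            else:
                table[i][j] = max(table[i - 1][j], table[i][j - 1])

    # Walk back through the table to recover the matched pairs.
    pairs: List[Tuple[int, int]] = []
    i, j = n, m
    while i > 0 and j > 0:
        if histogram_intersection(photos[i - 1], keyframes[j - 1]) >= threshold:
            pairs.append((i - 1, j - 1))
            i, j = i - 1, j - 1
        elif table[i - 1][j] >= table[i][j - 1]:
            i -= 1
        else:
            j -= 1
    pairs.reverse()
    return pairs
```

Under these assumptions, the returned pairs identify which photos and video segments are correlated, which is the information the summarization step would then draw on when choosing segments and photos.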
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chu, WT., Lin, CC., Yu, JY. (2010). Travel Photo and Video Summarization with Cross-Media Correlation and Mutual Influence. In: Boll, S., Tian, Q., Zhang, L., Zhang, Z., Chen, YP.P. (eds) Advances in Multimedia Modeling. MMM 2010. Lecture Notes in Computer Science, vol 5916. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11301-7_57
DOI: https://doi.org/10.1007/978-3-642-11301-7_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-11300-0
Online ISBN: 978-3-642-11301-7
eBook Packages: Computer Science (R0)