Sub-event recognition and summarization for structured scenario photos

Zhang, Liyan; Denney, Bradley; Lu, Juwei

doi:10.1007/s11042-016-3346-x

Sub-event recognition and summarization for structured scenario photos

Published: 03 March 2016

Volume 75, pages 9295–9314, (2016)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Liyan Zhang¹,
Bradley Denney² &
Juwei Lu³

251 Accesses
Explore all metrics

Abstract

Structured scenario photos, referring to the images which capture important events that usually follow specific routines/structures (such as wedding ceremonies, graduation ceremonies, etc.), account for a significant proportion in personal photo collections. Conventional image analysis techniques without considering the event routines/structures are not sufficient to handle these photos. In this paper, we explore the appropriate framework to learn and utilize the specific routines for understanding these structure scenario photos. Specifically, we propose a novel framework which can systematically integrate Hidden Markov Model and Gaussian Mixture Model to recognize sub-events from structured scenario photos. Then we present a comprehensive criterion to select representative images to summarize the whole photo collection. Experimental results conducted on the real-world datasets demonstrate the superiority of our framework in both of sub-event recognition and photo summarization tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multi-modal and multi-scale photo collection summarization

Article 24 May 2015

Visualization of Photo Album: Selecting a Representative Photo of a Specific Event

V-LESS: A Video from Linear Event Summaries

References

Cheng W, Chuang Y, Chen B, Wu J, Fang S, Lin Y, Hsieh C, Pan C, Chu W, Tien M (2007) Semantic-event based analysis and segmentation of wedding ceremony videos. In: Proceedings of the 9th ACM SIGMM international workshop on multimedia information retrieval, MIR 2007, Augsburg, Bavaria, Germany, 24–29 September 2007, pp 95–104
Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the em algorithm. J R Stat Soc Ser B 39(1):1–38
MathSciNet MATH Google Scholar
Denney B, Balan AK (2014) Selecting representative images for display. USPTO application US 12/906,107
Gatica-Perez D, Loui A, Sun M-T (2003) Finding structure in home videos by probabilistic hierarchical clustering
Ghahramani Z, Jordan MI (1997) Factorial hidden markov models. Mach Learn 29(2–3):245–273
Article MATH Google Scholar
Hua X, Lu L, Zhang H (2003) AVE: automated home video editing. In: Proceedings of the 11th ACM international conference on multimedia, Berkeley, CA, USA, 2–8 November 2003, pp 490–497
Jiang Y, Dai Q, Mei T, Rui Y, Chang S (2015) Super fast event recognition in internet videos. IEEE Trans Multimedia 17(8):1174–1186
Article Google Scholar
Kennedy LS, Naaman M (2008) Generating diverse and representative image search results for landmarks. In: Proceedings of the 17th international conference on world wide web, WWW 2008, Beijing, China, 21–25 April 2008, pp 297–306
Pearson K (1905) The problem of the random walk. Nature
Shaked D, Tastl I (2005) Sharpness measure: towards automatic image enhancement. In: Proceedings of the 2005 international conference on image processing, ICIP 2005, Genoa, Italy, 11–14 September 2005, pp 937–940
Sinha P, Pirsiavash H, Jain R (2009) Personal photo album summarization. In: Proceedings of the 17th international conference on multimedia 2009, Vancouver, British Columbia, Canada, 19–24 October 2009, pp 1131–1132
Sinha P, Mehrotra S, Jain R (2011) Effective summarization of large collections of personal photos. In: Proceedings of the 20th international conference on world wide web, WWW 2011, Hyderabad, India, 28 March - 1 April 2011 (Companion Volume), pp 127–128
Tang J, Hua X (2014) Typicality ranking: beyond accuracy for video semantic annotation. Multimedia Tools Appl 70(2):647–660
Article Google Scholar
Tang J, Yan S, Hong R, Qi G, Chua T (2009) Inferring semantic concepts from community-contributed images and noisy tags. In: ACM multimedia
Tang J, Wang M, Hua X, Chua T (2012) Social media mining and search. Multimedia Tools Appl 56(1):1–7
Article Google Scholar
Zhang L, Vaisenberg R, Mehrotra S, Kalashnikov DV (2011) Video entity resolution: applying er techniques for smart video surveillance. In: PerCom workshops
Zhang L, Kalashnikov DV, Mehrotra S (2013) A unified framework for context assisted face clustering. In: ACM international conference on multimedia retrieval (ACM ICMR 2013), Dallas, Texas, USA, 16–19 April 2013
Zhang L, Xu J, Li C (2013) A random-walk based recommendation algorithm considering item categories. Neurocomputing 120:391–396
Article Google Scholar
Zhang L, Denney B, Lu J (2014) Systems and methods for image management. USPTO application US 13/639,948

Download references

Acknowledgments

This work was supported in part by the National Science Foundation of China under Grants No. 61572252, and National Science Foundation of Jiangsu Province under Grants No. BK20150755.

Author information

Authors and Affiliations

College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, Jiangsu, China
Liyan Zhang
Canon Information and Imaging Solutions, Inc., Irvine, CA, USA
Bradley Denney
Nymi Inc., Toronto, ON, Canada
Juwei Lu

Authors

Liyan Zhang
View author publications
You can also search for this author inPubMed Google Scholar
Bradley Denney
View author publications
You can also search for this author inPubMed Google Scholar
Juwei Lu
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Liyan Zhang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhang, L., Denney, B. & Lu, J. Sub-event recognition and summarization for structured scenario photos. Multimed Tools Appl 75, 9295–9314 (2016). https://doi.org/10.1007/s11042-016-3346-x

Download citation

Received: 14 October 2015
Revised: 18 January 2016
Accepted: 04 February 2016
Published: 03 March 2016
Issue Date: August 2016
DOI: https://doi.org/10.1007/s11042-016-3346-x

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Sub-event recognition and summarization for structured scenario photos

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Multi-modal and multi-scale photo collection summarization

Visualization of Photo Album: Selecting a Representative Photo of a Specific Event

V-LESS: A Video from Linear Event Summaries

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now