Abstract
In recent years, with the popularity of mobile devices and mobile Internet, more and more social media sites are growing in an explosive way. Therefore, the social hot event will be rapidly fermented by the interaction of a large number of network users, and a large amount of multimedia data (such as texts, images and videos) will be generated. Therefore, it is important and necessary to conduct the research of multimedia social event analysis to know the evolutionary trend of social event over time automatically. This paper provides a survey and summarizes major progresses in multimedia social event analysis. We focus on four areas: (1) multimedia social event representation; (2) multimedia social event detection and tracking; (3) multimedia social event evolutionary analysis; and (4) multimedia social event topic mining.
Similar content being viewed by others
References
Ahmed A, Xing EP (2008) Dynamic non-parametric mixture models and the recurrent chinese restaurant process: with applications to evolutionary clustering. In: Siam International conference on data mining, SDM 2008, April 24-26, 2008, Atlanta, Georgia, USA, pp 219–230
Allan J (2002) Detection as multi-topic tracking. Inf Retr 5(2-3):139–157
Altman NS (1992) An introduction to kernel and nearest-neighbor nonparametric regression. American Statistician 46(3):175–185
Blei D, Jordan MI (2003) Modeling annotated data, 127–134
Blei D, Mcauliffe JD (2010) Supervised topic models. Adv Neur Inf Process Syst 3:327–332
Blei D, Ng AY, Jordan MI (2003) Latent Dirichlet allocation. J Mach Learn Res Arch 3:993–1022
Boiman O, Shechtman E, Irani M (2008) In defense of nearest-neighbor based image classification. In: IEEE Conference on computer vision and pattern recognition, 2008. CVPR 2008, pp 1–8
Caron F, Davy M, Doucet A (2008) Generalized polya urn for time-varying Dirichlet process mixtures
Chen J, Yu J, Shen Y (2013) Towards topic trend prediction on a topic evolution model with social connection. In: Ieee/wic/acm International conferences on web intelligence and intelligent agent technology, pp 153–157
Chen N, Liu Y, Zhang ZJ (2014) A forecasting system of micro-blog public opinion based on artificial neural network. In: 2014 Tenth international conference on intelligent information hiding and multimedia signal processing (IIH-MSP). IEEE, pp 868–871
Chi Y, Song X, Zhou D, Hino K, Tseng BL (2007) Evolutionary spectral clustering by incorporating temporal smoothness pp 153–162
Csurka G (2004) Visual categorization with bags of keypoints. Workshop Statist Learn Comput Vis Eccv 44(247):1–22
Das R, Zaheer M, Dyer C (2015) Gaussian LDA for topic models with word embeddings. In: Meeting of the association for computational linguistics and the international joint conference on natural language processing, pp 795–804
Debole F, Sebastiani F (2004) Supervised term weighting for automated text categorization. Springer, Berlin
Deerwester S (1990) Indexing by latent semantic indexing. Journal of the American Society Ofr Information Science, 41
Diakopoulos N, Naaman M, Kivran-Swaine F (2010) Diamonds in the rough: social media visual analytics for journalistic inquiry. In: Visual analytics science and technology, pp 115–122
Fang Y, Si L, Somasundaram N, Yu Z (2012) Mining contrastive opinions on political texts using cross-perspective topic model, pp 63–72
Fang Q, Xu C, Sang J, Hossain MS, Muhammad G (2015) Word-of-mouth understanding: entity-centric multimodal aspect-opinion mining in social media. IEEE Trans Multimed 17(12):2281– 2296
Firan CS, Georgescu M, Nejdl W, Paiu R (2010) Bringing order to your photos: event-driven classification of flickr images based on social knowledge. In: ACM International conference on information and knowledge management, pp 189–198
Gao ZJ, Song Y, Liu S, Wang H, Wei H, Chen Y, Cui W (2012) Tracking and connecting topics via incremental hierarchical Dirichlet processes. In: IEEE International conference on data mining, pp 1056–1061
Griffiths TL, Steyvers M (2004) Finding scientific topics. Proc Natl Acad Sci USA 101 Suppl 1(1):5228–5235
Guillaumin M, Verbeek J, Schmid C (2010) Multimodal semi-supervised learning for image classification. In: Computer vision and pattern recognition, pp 902–909
Haghighi A, Vanderwende L (2009) Exploring content models for multi-document summarization. In: Human language technologies: the 2009 conference of the North American chapter of the association for computational linguistics, pp 362–370
Hardoon DR, Szedmak SR, Shawe-Taylor JR (2004) Canonical correlation analysis: an overview with application to learning methods. MIT Press
Hofmann T (1999) Probabilistic latent semantic indexing. In: Proc Sigir, pp 50–57
Hong R, Hu Z, Wang R, Wang M, Tao D (2016) Multi-view object retrieval via multi-scale topic models. IEEE Trans Image Process 25(12):5814–5827
Hong R, Yang Y, Wang M, Hua XS (2015) Learning visual semantic relationships for efficient visual retrieval. IEEE Trans Big Data 1(4):152–161
Hong R, Zhang L, Zhang C, Zimmermann R (2016) Flickr circles: aesthetic tendency discovery by multi-view regularized topic modeling. IEEE Trans Multimed 18 (8):1555–1567
Iwata T, Watanabe S, Yamada T, Ueda N (2009) Topic tracking model for analyzing consumer purchase behavior. In: International jont conference on artifical intelligence, pp 1427–1432
Iwata T, Yamada T, Sakurai Y, Ueda N (2010) Online multiscale dynamic topic models. In: Proc. ACM SIGKDD international conference on knowl-edge discovery and data mining, pp 663–672
Jelodar H, Wang Y, Yuan C, Feng X (2017) Latent Dirichlet allocation (LDA) and topic modeling: models, applications a survey
Jin W, Srihari RK (2007) Graph-based text representation and knowledge discovery. In: ACM Symposium on applied computing, pp 807–811
Joachims T (1999) Text categorization with support vector machines. In: Proc. of European conference on machine learning
Kalamaras I, Drosou A, Tzovaras D (2014) Multi-objective optimization for multimodal visualization. IEEE Trans Multimed 16(5):1460–1472
Kasiviswanathan SP, Melville P, Banerjee A, Sindhwani V (2011) Emerging topic detection using dictionary learning, 745–754
Keller KL (1993) Conceptualizing, measuring and managing customer based brand equity. J Mark 57(1):1–22
Kumaran G, Allan J (2004) Text classification and named entities for new event detection. In: Proc. of the ACM Sigir’04 conference, pp 297–304
Lewis DD (1998) Naive (bayes) at forty: the independence assumption in information retrieval. In: European conference on machine learning, pp 4–15
Li P, Yan Y, Wang C, Ren Z, Cong P, Wang H, Feng J (2016) Customer voice sensor: a comprehensive opinion mining system for call center conversation. In: IEEE International conference on cloud computing and big data analysis, pp 324–329
Liu B, Zhang L (2012) A survey of opinion mining and sentiment analysis. Springer, US
Liu J, Zha ZJ, Tian Q, Liu D, Yao T, Ling Q, Mei T (2016) Multi-scale triplet cnn for person re-identification. In: Proceedings of the 2016 ACM on multimedia conference. ACM, pp 192–196
Makkonen J, Ahonen-Myka H, Salmenkivi M (2004) Simple semantics in topic detection and tracking. Inf Retr 7(3-4):347–368
Marcombes P, Dalalyan A (2010) Towards optimal naive bayes nearest neighbor. In: European conference on computer vision, pp 171–184
Maron ME (1961) Automatic indexing: an experimental inquiry. J Acm 8 (3):404–417
Mccallum A (1998) A comparison of event models for naive bayes text classification. In: Proc. AAAI-98 workshop on learning for text categorization, pp 41–48
Merler M, Huang B, Xie L, Hua G, Natsev A (2012) Semantic model vectors for complex video event recognition. IEEE Trans Multimed 14(1):88–101
Moghaddam S, Ester M (2011) ILDA:interdependent LDA model for learning latent aspects and their ratings from online product reviews, pp 665–674
Moghaddam S, Ester M (2012) On the design of LDA models for aspect-based opinion mining, pp 803–812
Ngiam J, Khosla A, Kim M, Nam J, Lee H, Ng AY (2011) Multimodal deep learning. In: Proceedings of the 28th international conference on machine learning (ICML-11), pp 689–696
Pan CC, Mitra P (2011) Event detection with spatial latent Dirichlet allocation. In: International ACM/IEEE joint conference on digital libraries, pp 349–358
Putthividhy D, Attias HT, Nagarajan SS (2010) Topic regression multi-modal latent Dirichlet allocation for image annotation. In: Computer vision and pattern recognition, pp 3408–3415
Qian S, Zhang T, Hong R, Xu C (2015) Cross-domain collaborative learning in social multimedia. In: ACM International conference on multimedia, pp 99–108
Qian S, Zhang T, Xu C (2015) Boosted multi-modal supervised latent Dirichlet allocation for social event classification. Acm Trans Multimed Comput Commun Appl 11(2):27
Qian S, Zhang T, Xu C, Shao J (2016) Multi-modal event topic model for social event analysis. IEEE Trans Multimed 18(2):233–246
Qiu M, Jiang J (2013) A latent variable model for viewpoint discovery from threaded forum posts. In: NAACL
Ramage D, Heymann P, Manning CD, Garcia-Molina H (2009) Clustering the tagged web, 54–63
Rasiwasia N, Pereira JC, Coviello E, Doyle G, Lanckriet GRG, Levy R, Vasconcelos N (2010) A new approach to cross-modal multimedia retrieval. In: International conference on multimedia, pp 251–260
Ren L, Dunson DB, Carin L (2008) The dynamic hierarchical Dirichlet process. In: International conference, pp 824–831
Salton G (1974) A vector space model for automatic indexing. Commun Acm 18 (11):613–620
Sang J, Xu C (2012) Right buddy makes the difference: an early exploration of social relation analysis in multimedia applications. In: ACM International conference on multimedia, pp 19–28
Sang J, Xu C, Jain R (2017) Social multimedia ming: from special to general. In: IEEE International symposium on multimedia, pp 481–485
Sebastiani F (2002) Machine learning in automated text categorization. Acm Comput Surv 34(1):1–47
Srivastava N, Salakhutdinov RR (2012) Multimodal learning with deep Boltzmann machines. In: Advances in neural information processing systems, pp 2222–2230
Theil H, Chung CF (1988) Relations between two sets of variates: the bits of information provided by each variate in each set. Statist Probab Lett 6(3):137–139
Wang X, Mohanty N, Mccallum A (2005) Group and topic discovery from relations and text. In: Conference on statistical network analysis, pp 28–35
Wan L, Zhu L, Fergus R (2012) A hybrid neural network-latent topic model, 1287–1294
Wang M, Li H, Tao D, Lu K, Wu X (2012) Multimodal graph-based reranking for web image search. IEEE Trans Image Process 21(11):4649–4661
Wang C, Blei D, Heckerman D (2012) Continuous time dynamic topic models. Uai, 579–586
Wang H, Zhang C, Yin H, Wang W, Zhang J, Xu F (2016) A unified framework for fine-grained opinion mining from online reviews. In: Hawaii International conference on system sciences, pp 1134–1143
Wang M, Fu W, Hao S, Tao D, Wu X (2016) Scalable semi-supervised learning by efficient anchor graph regularization. IEEE Trans Knowl Data Eng 28 (7):1864–1877
Wang M, Fu W, Hao S, Liu H, Wu X (2017) Learning on big graph: label inference and regularization with anchor hierarchy. IEEE Trans Knowl Data Eng 29(5):1101–1114
Wu X, Ngo CW, Hauptmann AG (2008) Multimodal news story clustering with pairwise visual near-duplicate constraint. IEEE Trans Multimed 10(2):188–199
Xu C, Xu C, Xu C (2016) Multi-modal multi-view topic-opinion mining for social event analysis. In: ACM on multimedia conference, pp 2–11
Yang Y, Zhang J, Carbonell J, Jin C (2002) Topic-conditioned novelty detection. In: Eighth ACM SIGKDD international conference on knowledge discovery and data mining, pp 688–693
Yang X, Zhang T, Xu C (2014) Cross-domain feature learning in multimedia. IEEE Trans Multimed 17(1):64–78
Yu J, Cong Y, Qin Z, Wan T (2012) Cross-modal topic correlations for multimedia retrieval. In: International conference on pattern recognition, pp 246–249
Zha ZJ, Hua XS, Mei T, Wang J, Qi GJ, Wang Z (2008) Joint multi-label multi-instance learning for image classification. In: IEEE Conference on computer vision and pattern recognition, 2008. CVPR 2008. IEEE, pp 1–8
Zhang H, Zhuang Y, Wu F (2007) . Cross-modal correlation learning for clustering on image-audio dataset 40(8):273–276
Zhang J, Song Y, Zhang C, Liu S (2010) Evolutionary hierarchical Dirichlet processes for multiple correlated time-varying corpora. In: ACM SIGKDD International conference on knowledge discovery and data mining, Washington, Dc, Usa, July, pp 1079–1088
Zhu J, Chen N, Perkins H, Zhang B (2013) Gibbs max-margin topic models with data augmentation. J Mach Learn Res 15(1):1073–1110
Acknowledgments
This work is supported by the National Natural Science Foundation of China (No. 61772170), and the National Key Research and Development Program of China (No. 2017YFB0803301).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Liu, T., Xue, F., Sun, J. et al. A survey of event analysis and mining from social multimedia. Multimed Tools Appl 79, 33431–33448 (2020). https://doi.org/10.1007/s11042-019-7567-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-019-7567-7