research-article

Social Event Classification via Boosted Multimodal Supervised Latent Dirichlet Allocation

Authors:

Shengsheng Qian,

M. Shamim HossainAuthors Info & Claims

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 11, Issue 2

Article No.: 27, Pages 1 - 22

https://doi.org/10.1145/2659521

Published: 07 January 2015 Publication History

Abstract

With the rapidly increasing popularity of social media sites (e.g., Flickr, YouTube, and Facebook), it is convenient for users to share their own comments on many social events, which successfully facilitates social event generation, sharing and propagation and results in a large amount of user-contributed media data (e.g., images, videos, and text) for a wide variety of real-world events of different types and scales. As a consequence, it has become more and more difficult to exactly find the interesting events from massive social media data, which is useful to browse, search and monitor social events by users or governments. To deal with these issues, we propose a novel boosted multimodal supervised Latent Dirichlet Allocation (BMM-SLDA) for social event classification by integrating a supervised topic model, denoted as multi-modal supervised Latent Dirichlet Allocation (mm-SLDA), in the boosting framework. Our proposed BMM-SLDA has a number of advantages. (1) Our mm-SLDA can effectively exploit the multimodality and the multiclass property of social events jointly, and make use of the supervised category label information to classify multiclass social event directly. (2) It is suitable for large-scale data analysis by utilizing boosting weighted sampling strategy to iteratively select a small subset of data to efficiently train the corresponding topic models. (3) It effectively exploits social event structure by the document weight distribution with classification error and can iteratively learn new topic model to correct the previously misclassified event documents. We evaluate our BMM-SLDA on a real world dataset and show extensive experimental results, which demonstrate that our model outperforms state-of-the-art methods.

References

[1]

R. Achanta, A. Shaji, K. Smith, A. Lucchi, P. Fua, and S. Susstrunk. 2010. Slic superpixels. Tech. Rep., EPFL.

[2]

James Allan, Ron Papka, and Victor Lavrenko. 1998. On-line new event detection and tracking. In Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 37--45.

Digital Library

[3]

Yang Bao, Nigel Collier, and Anindya Datta. 2013. A partially supervised cross-collection topic model for cross-domain text classification. In Proceedings of the International Conference on Information and Knowledge Management. 239--248.

Digital Library

[4]

Hila Becker, Mor Naaman, and Luis Gravano. 2009. Event identification in social media. In Proceedings of the International Workshop on Web and Databases.

[5]

Hila Becker, Mor Naaman, and Luis Gravano. 2010. Learning similarity metrics for event identification in social media. In Proceedings of the ACM Conference on Web Search and Data Mining. 291--300.

Digital Library

[6]

David Blei and Jon McAuliffe. 2008. Supervised topic models. In NIPS. 77--84.

[7]

David M. Blei, Andrew Y. Ng, and Michael I. Jordan. 2003. Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993--1022.

[8]

Chien Chin Chen, Meng Chang Chen, and Ming-Syan Chen. 2009. An adaptive threshold framework for event detection using HMM-based life profiles. ACM Trans. Inf. Syst. 27, 2 (2009), 9:1--9:35.

Digital Library

[9]

Hai Leong Chieu and Yoong Keok Lee. 2004. Query based event extraction along a timeline. In Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 425--432.

Digital Library

[10]

Nicholas Diakopoulos, Mor Naaman, and Funda Kivran-Swaine. 2010. Diamonds in the rough: Social media visual analytics for journalistic inquiry. In Proceedings of the IEEE Conference on Visual Analytics Science and Technology. 115--122.

[11]

Claudiu S. Firan, Mihai Georgescu, Wolfgang Nejdl, and Raluca Paiu. 2010. Bringing order to your photos: event-driven classification of flickr images based on social knowledge. In Proceedings of the International Conference on Information and Knowledge Management. 189--198.

Digital Library

[12]

Haidong Gao, Siliang Tang, Yin Zhang, Dapeng Jiang, Fei Wu, and Yueting Zhuang. 2012. Supervised cross-collection topic modeling. In Proceedings of the ACM Multimedia Conference. 957--960.

Digital Library

[13]

T. L. Griffiths and M. Steyvers. 2004. Finding scientific topics. Proc. Nat. Acad. Sci. 101, 5228--5235.

[14]

Thomas Hofmann. 1999. Probabilistic latent semantic indexing. In Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 50--57.

Digital Library

[15]

Ravi Kumar, Uma Mahadevan, and D. Sivakumar. 2004. A graph-theoretic approach to extract storylines from search results. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 216--225.

Digital Library

[16]

Giridhar Kumaran and James Allan. 2004. Text classification and named entities for new event detection. In Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 297--304.

Digital Library

[17]

Simon Lacoste-Julien, Fei Sha, and Michael I. Jordan. 2008. DiscLDA: Discriminative learning for dimensionality reduction and classification. In NIPS. 897--904.

[18]

Chenliang Li, Aixin Sun, and Anwitaman Datta. 2012. Twevent: Segment-based event detection from tweets. In Proceedings of the International Conference on Information and Knowledge Management. 155--164.

Digital Library

[19]

Chih-Jen Lin, Ruby C. Weng, and S. Sathiya Keerthi. 2008. Trust region Newton method for logistic regression. J. Mach. Learn. Res. 9, 627--650.

Digital Library

[20]

Fu-ren Lin and Chia-Hao Liang. 2008. Storyline-based Summarization for News Topic Retrospection.Decis. Support Syst. 45, 473--490.

Digital Library

[21]

L. Liu, L. Wang, and X. Liu. 2011. In defense of soft assignment coding. In Proceedings of the IEEE International Conference on Computer Vision. 2486--2493.

Digital Library

[22]

Xueliang Liu and Benoit Huet. 2013. Heterogeneous features and model selection for event-based media classification. In Proceedings of the ACM International Conference on Multimedia Retrieval. 151--158.

Digital Library

[23]

Juha Makkonen, Helena Ahonen-Myka, and Marko Salmenkivi. 2004. Simple semantics in topic detection and tracking. Inf. Retr. 7, 3--4, 347--368.

Digital Library

[24]

Andrew J. McMinn, Yashar Moshfeghi, and Joemon M. Jose. 2013. Building a large-scale corpus for evaluating event detection on Twitter. In Proceedings of the International Conference on Information and Knowledge Management. 409--418.

Digital Library

[25]

Zhenxing Niu, Gang Hua, Xinbo Gao, and Qi Tian. 2011. Spatial-DiscLDA for visual recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1769--1776.

Digital Library

[26]

Paul Over, George Awad, Martial Michel, Jonathan Fiscus, Greg Sanders, Wessel Kraaij, Alan F. Smeaton, and Georges Quenot. 2013. TRECVID 2013: An overview of the goals, tasks, data, evaluation mechanisms and metrics. In Proceedings of TRECVID'13. NIST.

[27]

Dhaval Patel, Wynne Hsu, and Mong Li Lee. 2008. Mining relationships among interval-based events for classification. In Proceedings of the ACM SIGMOD International Conference on Management of Data. ACM, 393--404.

Digital Library

[28]

Xuan-Hieu Phan, Le-Minh Nguyen, and Susumu Horiguchi. 2008. Learning to classify short and sparse text & web with hidden topics from large-scale data collections. In Proceedings of the International World Wide Web Conference. ACM, 91--100.

Digital Library

[29]

D. Putthividhy, H.T. Attias, and S.S. Nagarajan. 2010. Topic regression multi-modal Latent Dirichlet allocation for image annotation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3408--3415.

[30]

Shengsheng Qian, Tianzhu Zhang, and Changsheng Xu. 2014a. Boosted multi-modal supervised latent Dirichlet allocation for social event classification. In Proceedings of the International Conference on Pattern Recognition.

Digital Library

[31]

Shengsheng Qian, Tianzhu Zhang, and Changsheng Xu. 2014b. Multi-modal supervised latent dirichlet allocation for event classification in social media. In Proceedings of the International Conference on Internet Multimedia Computing and Service. 152--157.

Digital Library

[32]

Kira Radinsky and Eric Horvitz. 2013. Mining the web to predict future events. In Proceedings of the ACM Conference on Web Search and Data Mining. 255--264.

Digital Library

[33]

Daniel Ramage, David Hall, Ramesh Nallapati, and Christopher D. Manning. 2009a. Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora. In Proceedings of the Conference on Empirical Methods on Natural Language Processing. 248--256.

Digital Library

[34]

Daniel Ramage, Paul Heymann, Christopher D. Manning, and Hector Garcia-Molina. 2009b. Clustering the tagged web. In Proceedings of the ACM Conference on Web Search and Data Mining. ACM, 54--63.

Digital Library

[35]

Timo Reuter and Philipp Cimiano. 2012. Event-based classification of social media streams. In Proceedings of the ACM International Conference on Multimedia Retrieval. 22:1--22:8.

Digital Library

[36]

Timo Reuter, Symeon Papadopoulos, Georgios Petkos, Vasileios Mezaris, Yiannis Kompatsiaris, Philipp Cimiano, Christopher M. De Vries, and Shlomo Geva. 2013. Social event detection at MediaEval 2013: Challenges, datasets, and evaluation. In Proceedings of the Workshop on Multimedia Evaluation.

[37]

Jitao Sang and Changsheng Xu. 2012. Right buddy makes the difference: An early exploration of social relation analysis in multimedia applications. In Proceedings of the ACM Multimedia Conference. 19--28.

Digital Library

[38]

Yoshihiko Suhara, Hiroyuki Toda, and Akito Sakurai. 2008. Extracting related named entities from blogosphere for event mining. In Proceedings of the International Conference on Ubiquitous Information Management and Communication. ACM, 225--229.

Digital Library

[39]

Chong Wang, David Blei, and Fei-Fei Li. 2009. Simultaneous image classification and annotation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 823--836.

[40]

Xuerui Wang, Natasha Mohanty, and Andrew McCallum. 2005. Group and topic discovery from relations and text. In Proceedings of the 3rd International Workshop on Link Discovery. 28--35.

Digital Library

[41]

Andrew T. Wilson and Peter A. Chew. 2010. Term weighting schemes for latent Dirichlet allocation. In Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics. 465--473.

Digital Library

[42]

Xiao Wu, Chong-Wah Ngo, and Alexander G. Hauptmann. 2008. Multimodal news story clustering with pairwise visual near-duplicate constraint. IEEE Trans. Multimedia 10, 2, 188--199.

Digital Library

[43]

T Zhang, B Ghanem, S Liu, C Xu, and N Ahuja. 2013. Low-rank sparse coding for image classification. In Proceedings of the IEEE International Conference on Computer Vision. 281--288.

Digital Library

[44]

Tianzhu Zhang, Jing Liu, Si Liu, Yi Ouyang, and Hanqing Lu. 2009. Boosted exemplar learning for human action recognition. In Proceedings of the IEEE 12th International Conference on Computer Vision Workshops. IEEE, 538--545.

[45]

Tianzhu Zhang, Jing Liu, Si Liu, Changsheng Xu, and Hanqing Lu. 2011. Boosted exemplar learning for action recognition and annotation. IEEE Trans. Circuits Syst. Video Technol. 21, 7, 853--866.

Digital Library

[46]

Tianzhu Zhang and Changsheng Xu. 2014. Cross-domain multi-event tracking via CO-PMHT. ACM Trans. Multimedia Comput. Commun. Appl. 10, 4, 31:1--31:19.

Digital Library

[47]

Tianzhu Zhang, Changsheng Xu, Guangyu Zhu, Si Liu, and Hanqing Lu. 2012. A generic framework for video annotation via semi-supervised learning. IEEE Trans. Multimedia 14, 4, 1206--1219.

Digital Library

[48]

Qiankun Zhao, Tie-Yan Liu, Sourav S. Bhowmick, and Wei-Ying Ma. 2006. Event detection from evolution of clickthrough data. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 484--493.

Digital Library

[49]

Ji Zhu, Hui Zou, Saharon Rosset, and Trevor Hastie. 2009. Multi-class AdaBoost. In Statistics and Its Interface, Vol. 2, 349--360

Cited By

Selvaraj SThangavel SPrabhakaran MSathish T(2024)Impressive predictive model for Breast Cancer based on Machine LearningEAI Endorsed Transactions on Pervasive Health and Technology10.4108/eetpht.10.524610Online publication date: 29-Feb-2024
https://doi.org/10.4108/eetpht.10.5246
Faheem Nikhat HSait S(2024)Inappropriate YouTube content detection and classification by using proposed novel auto-determined k-means clustering and PDBRNN architectureJournal of Intelligent & Fuzzy Systems10.3233/JIFS-23687146:4(10833-10845)Online publication date: 18-Apr-2024
https://doi.org/10.3233/JIFS-236871
Lyu YQin PXu TZhu CChen E(2024)InteractNet: Social Interaction Recognition for Semantic-rich VideosACM Transactions on Multimedia Computing, Communications, and Applications10.1145/366366820:8(1-21)Online publication date: 3-May-2024
https://dl.acm.org/doi/10.1145/3663668
Show More Cited By

Index Terms

Social Event Classification via Boosted Multimodal Supervised Latent Dirichlet Allocation
1. Information systems
  1. Information retrieval
  2. Information systems applications

Recommendations

Multi-Modal Supervised Latent Dirichlet Allocation for Event Classification in Social Media
ICIMCS '14: Proceedings of International Conference on Internet Multimedia Computing and Service

In social media, many existing websites (e.g., Flickr, YouTube, and Facebook) are for users to share their own interests and opinions of many popular events, and successfully facilitate the event generation, sharing and propagation. As a result, there ...
Boosted Multi-modal Supervised Latent Dirichlet Allocation for Social Event Classification
ICPR '14: Proceedings of the 2014 22nd International Conference on Pattern Recognition

With the rapidly increasing popularity of Social Media sites (e.g., Flickr, YouTube, and Facebook), it is convenient for users to share their own comments on many social events, which successfully facilitates social event generation, sharing and ...
Multi-modal max-margin supervised topic model for social event analysis

In this paper, we proposed a novel multi-modal max-margin supervised topic model (MMSTM) for social event analysis by jointly learning the representation together with the classifier in a unified framework. Compared with existing methods, the proposed ...

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Multimedia Computing, Communications, and Applications

ACM Transactions on Multimedia Computing, Communications, and Applications Volume 11, Issue 2

December 2014

197 pages

ISSN:1551-6857

EISSN:1551-6865

DOI:10.1145/2716635

Editor:
Ralf Steinmetz
Technische Universität Darmstadt, Germany

Issue’s Table of Contents

Copyright © 2015 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 January 2015

Accepted: 01 August 2014

Revised: 01 June 2014

Received: 01 March 2014

Published in TOMM Volume 11, Issue 2

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed

Funding Sources

National Program on Key Basic Research Project (973 Program, Project No. 2012CB316304)
National Natural Science Foundation of China (61225009, 61303173)
Singapore National Research Foundation under International Research Centre @ Singapore Funding Initiative
Deanship of Scientific Research at King Saud University, Riyadh, Saudi Arabia
IDM Programme Office
International research group project No. IRG-14-18

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

60
Total Citations
View Citations
641
Total Downloads

Downloads (Last 12 months)31
Downloads (Last 6 weeks)4

Reflects downloads up to 03 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Selvaraj SThangavel SPrabhakaran MSathish T(2024)Impressive predictive model for Breast Cancer based on Machine LearningEAI Endorsed Transactions on Pervasive Health and Technology10.4108/eetpht.10.524610Online publication date: 29-Feb-2024
https://doi.org/10.4108/eetpht.10.5246
Faheem Nikhat HSait S(2024)Inappropriate YouTube content detection and classification by using proposed novel auto-determined k-means clustering and PDBRNN architectureJournal of Intelligent & Fuzzy Systems10.3233/JIFS-23687146:4(10833-10845)Online publication date: 18-Apr-2024
https://doi.org/10.3233/JIFS-236871
Lyu YQin PXu TZhu CChen E(2024)InteractNet: Social Interaction Recognition for Semantic-rich VideosACM Transactions on Multimedia Computing, Communications, and Applications10.1145/366366820:8(1-21)Online publication date: 3-May-2024
https://dl.acm.org/doi/10.1145/3663668
Zhu BCai YWang J(2024)Graph-Based Multimodal Topic Modeling With Word Relations and Object RelationsIEEE Transactions on Multimedia10.1109/TMM.2024.337817326(8210-8225)Online publication date: 19-Mar-2024
https://dl.acm.org/doi/10.1109/TMM.2024.3378173
Thenmozhi RShridevi SMohanty SGarcía-Díaz VGupta DTiwari PShorfuzzaman M(2024)Attribute-Based Adaptive Homomorphic Encryption for Big Data SecurityBig Data10.1089/big.2021.017612:5(343-356)Online publication date: 1-Oct-2024
https://doi.org/10.1089/big.2021.0176
Ullah Khan IOuaissa MOuaissa MEl Himer S(2023)Internet of Medical Things & Machine IntelligenceMachine Intelligence for Internet of Medical Things: Applications and Future Trends10.2174/9789815080445123020004(1-10)Online publication date: 9-May-2023
https://doi.org/10.2174/9789815080445123020004
Zhang YMa DTiwari PZhang CMasud MShorfuzzaman MSong D(2023)Stance-level Sarcasm Detection with BERT and Stance-centered Graph Attention NetworksACM Transactions on Internet Technology10.1145/353343023:2(1-21)Online publication date: 18-May-2023
https://dl.acm.org/doi/10.1145/3533430
Zhang HCai YRen HLi Q(2023)Multimodal Topic Modeling by Exploring Characteristics of Short Text Social MediaIEEE Transactions on Multimedia10.1109/TMM.2022.314706425(2430-2445)Online publication date: 1-Jan-2023
https://dl.acm.org/doi/10.1109/TMM.2022.3147064
Zhang YTiwari PZheng QSaddik AHossain M(2023)A Multimodal Coupled Graph Attention Network for Joint Traffic Event Detection and Sentiment ClassificationIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2022.320547724:8(8542-8554)Online publication date: 1-Aug-2023
https://dl.acm.org/doi/10.1109/TITS.2022.3205477
Muhammad GHossain MGarg S(2023)Stacked Autoencoder-Based Intrusion Detection System to Combat Financial FraudulentIEEE Internet of Things Journal10.1109/JIOT.2020.304118410:3(2071-2078)Online publication date: 1-Feb-2023
https://doi.org/10.1109/JIOT.2020.3041184
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Issue’s Table of Contents