skip to main content
research-article

Cross-Platform Emerging Topic Detection and Elaboration from Multimedia Streams

Published: 02 June 2015 Publication History

Abstract

With the explosive growth of online media platforms in recent years, it becomes more and more attractive to provide users a solution of emerging topic detection and elaboration. And this posts a real challenge to both industrial and academic researchers because of the overwhelming information available in multiple modalities and with large outlier noises. This article provides a method on emerging topic detection and elaboration using multimedia streams cross different online platforms. Specifically, Twitter, New York Times and Flickr are selected for the work to represent the microblog, news portal and imaging sharing platforms. The emerging keywords of Twitter are firstly extracted using aging theory. Then, to overcome the nature of short length message in microblog, Robust Cross-Platform Multimedia Co-Clustering (RCPMM-CC) is proposed to detect emerging topics with three novelties: 1) The data from different media platforms are in multimodalities; 2) The coclustering is processed based on a pairwise correlated structure, in which the involved three media platforms are pairwise dependent; 3) The noninformative samples are automatically pruned away at the same time of coclustering. In the last step of cross-platform elaboration, we enrich each emerging topic with the samples from New York Times and Flickr by computing the implicit links between social topics and samples from selected news and Flickr image clusters, which are obtained by RCPMM-CC. Qualitative and quantitative evaluation results demonstrate the effectiveness of our method.

Supplementary Material

a54-bao-app.pdf (bao.zip)
Supplemental movie, appendix, image and software files for, A reward-and-punishment-based approach for concept detection using adaptive ontology rules

References

[1]
L. M. Aiello, G. Petkos, C. Martin, D. Corney, S. Papadopoulos, R. Skraba, A. Göker, I. Kompatsiaris, and A. Jaimes. 2013. Sensing trending topics in Twitter. IEEE Trans. Multimedia 15, 6.
[2]
Foteini Alvanaki, Michel Sebastian, Krithi Ramamritham, and Gerhard Weikum. 2011. EnBlogue: Emergent topic detection in Web 2.0 streams. In Proceedings of the ACM International Conference on Management of Data. 1271--1274.
[3]
A. Banerjee, I. Dhillon, J. Ghosh, S. Merugu, and D. S. Modha. 2007. A generalized maximum entropy approach to Bregman co-clustering and matrix approximation. J. Machine Learning Research 8, 1919--1986.
[4]
Romil Bansal, Radhika Kumaran, Diwakar Mahajan, Arpit Khurdiya, Lipika Dey, and Hiranmay Ghosh. 2012. TWIPIX: A web magazine curated from social media. In Proceedings of the ACM International Conference on Multimedia. 1355--1356.
[5]
Bing-Kun Bao, Weiqing Min, Teng Li, and Changsheng Xu. 2015. Joint local and global consistency on interdocument and interword relationships for co-clustering. IEEE Trans. Cybernetics 45, 1, 15--28.
[6]
Bing-Kun Bao, Weiqing Min, Ke Lu, and Changsheng Xu. 2013. Social event detection with robust high-order co-clustering. In Proceedings of the 3rd ACM Conference on International Conference on Multimedia Retrieval. 135--142.
[7]
Bing-Kun Bao, Weiqing Min, Jitao Sang, and Changsheng Xu. 2012. Multimedia news digger on emerging topics from social streams. In Proceedings of the 20th ACM International Conference on Multimedia. 1357--1358.
[8]
R. Cai, L. Lu, and A. Hanjalic. 2008. Co-clustering for auditory scene categorization. IEEE Trans. Multimedia 10, 4, 596--606.
[9]
C. Chen, Y. T. Chen, Y. Sun, and M. Chen. 2003. Life cycle modeling of news events using aging theory. In Proceedings of the International Conference on Machine Learning. 47--59.
[10]
K. Y. Chen, L. Luesukprasert, and S. Chou. 2007. Hot topic extraction based on timeline analysis and multidimensional sentence modeling. IEEE Trans. Knowl. Data Eng. 19, 8, 1016--1025.
[11]
M. Deodhar, H. Cho, G. Gupta, J. Ghosh, and I. Dhillon. 2008. Robust overlapping co-clustering. Tech Rep. IDEAL-TR09, Department of ECE, University of Texas at Austin.
[12]
I.S. Dhillon and D.S. Modha. 2001. Concept decompositions for large sparse text data using clustering. Machine Learning 42, 1, 143--175.
[13]
I. S. Dhillon. 2001. Co-clustering documents and words using bipartite spectral graph partitioning. In Proceedings of the International Conference on Knowledge Discovery and Data Mining. 269--274.
[14]
I. S. Dhillon, S. Mallela, and D. S. Modha. 2003. Information-theoretic co-clustering. In Proceedings of the International Conference on Knowledge Discovery and Data Mining.
[15]
Chris Ding, Tao Li, Wei Peng, and Haesun Park. 2006. Orthogonal nonnegative matrix t-factorizations for clustering. In Proceedings of the ACM International Conference on Knowledge Discovery and Data Mining. 126--135.
[16]
M. Gamon, S. Basu, D. Belenko, D. Fisher, M. Hurst, and A. C. König. 2008. Blews: Using blogs to provide context for news articles. In Proceedings of the AAAI Conference on Weblogs and Social Media. American Association for Artificial Intelligence.
[17]
B. Gao, T. Y. Liu, and W. Y. Ma. 2006. Star-structured high-order heterogeneous data co-clustering based on consistent information theory. In Proceedings of the International Conference on Data Mining. IEEE, 880--884.
[18]
G. Greco, A. Guzzo, and L. Pontieri. 2010. Coclustering multiple heterogeneous domains: Linear combinations and agreements. IEEE Trans. Knowl. Data Eng. 22, 12, 1649--1663.
[19]
K. Jarvelin and J. Kekalainen. 2002. Cumulated gain-based evaluation of IR techniques. ACM Trans. Inf. Syst. 20, 4, 422--446.
[20]
Stefanie Jegelka, Suvrit Sra, and Arindam Banerjee. 2009. Approximation algorithms for tensor clustering. Algorithmic Learning Theory, Lecture Notes in Computer Science, vol. 5809, 368--383.
[21]
Shiva Prasad Kasiviswanathan, Prem Melville, Arindam Banerjee, and Vikas Sindhwani. 2011. Emerging topic detection using dictionary learning. In Proceedings of the ACM International Conference on Information and Knowledge Management. 745--754.
[22]
R. E. Kass and L. Wasserman. 1995. A reference Bayesian test for nested hypotheses and its relationship to the Schwarz criterion. J. Amer. Statist. Assoc., 928--934.
[23]
T. Li, H. Chang, M. Wang, B. Ni, R. Hong, and S. Yan. 2015. Crowded scene analysis: A survey. IEEE Trans. Circuits Syst. Video Technol.
[24]
Bo Long, Zhongfei Mark Zhang, and Philip S. Yu. 2005. Co-clustering by block value decomposition. In Proceedings of the ACM International Conference on Knowledge Discovery in Data Mining. 635--640.
[25]
Miles Osborne, Sasa Petrovic, Richard McCreadie, Craig Macdonald, and Iadh Ounis. 2012. Bieber no more: First story detection using Twitter andWikipedia. In Proceedings of the SIGIR Workshop on Time-Aware Information Access.
[26]
D. Pelleg and A. Moore. 2000. X-means: Extending K-means with efficient estimation of the number of clusters. In Proceedings of the 17th International Conference on Machine Learning. 727--734.
[27]
S. D. Roy, T. Mei, W. Zeng, and S. Li. 2012. Empowering cross-domain internet media with real-time topic learning from social streams. In Proceedings of the IEEE International Conference on Multimedia and Expo.
[28]
T. Sakaki, M. Okazaki, and Y. Matsuo. 2010. Earthquake shakes Twitter users: Real-time event detection by social sensors. In Proceedings of the 19th International Conference on World Wide Web. 851--860.
[29]
Jitao Sang, Changsheng Xu, and Jing Liu. 2012. User-aware image tag refinement via ternary semantic analysis. IEEE Trans. Multimedia 14, 3, 883--895.
[30]
Hassan Sayyadi, Matthew Hurst, and Alexey Maykov. 2009. Event detection and tracking in social streams. In Proceedings of the International Conference on Weblogs and Social Media.
[31]
Giuseppe Serra, Thomas Alisi, Marco Bertini, Lamberto Ballan, Alberto Del Bimbo, Laurent Walter Goix, and Carlo Alberto Licciardi. 2013. Demo paper: Stamat: A framework for social topics and media analysis. In Proceedings of the IEEE International Conference on Multimedia and Expo Workshops. 1--2.
[32]
Y. Takama, A. Matsumura, and T. Kajinami. 2006. Visualization of news distribution in blog space. In Proceedings of the IEEE International Conference on Web Intelligence and Intelligent Agent Technology. 413--416.
[33]
S. Tan, C. W. Ngo, H. K. Tan, and L. Pang. 2011. Cross media hyperlinking for search topic browsing. In Proceedings of the ACM International Conference on Multimedia. 243--252.
[34]
Wei Xu, Xin Liu, and Yihong Gong. 2003. Document clustering based on non-negative matrix factorization. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. 267--273.
[35]
Jianke Zhu, Steven C. H. Hoi, Michael R. Lyu, and Shuicheng Yan. 2008. Near-duplicate keyframe retrieval by nonrigid image matching. In Proceedings of the 16th ACM International Conference on Multimedia. 41--50.

Cited By

View all
  • (2024)Multi-modal topic modeling from social media data using deep transfer learningApplied Soft Computing10.1016/j.asoc.2024.111706160(111706)Online publication date: Jul-2024
  • (2023)Shot Boundary Detection Using Color Clustering and Attention MechanismACM Transactions on Multimedia Computing, Communications, and Applications10.1145/359592319:6(1-23)Online publication date: 12-Jul-2023
  • (2023)Qualitative Modeling to Extract Knowledge for Problem StructuringKnowledge Technology and Systems10.1007/978-981-99-1075-5_5(137-166)Online publication date: 14-Jun-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Multimedia Computing, Communications, and Applications
ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 11, Issue 4
April 2015
231 pages
ISSN:1551-6857
EISSN:1551-6865
DOI:10.1145/2788342
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 June 2015
Accepted: 01 January 2015
Revised: 01 August 2014
Received: 01 January 2014
Published in TOMM Volume 11, Issue 4

Permissions

Request permissions for this article.

Check for updates

Badges

  • Best Paper

Author Tags

  1. Topic detection
  2. coclustering
  3. cross-media
  4. cross-platform

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

  • National Program on Key Basic Research Project 973 Program
  • Project 2012CB316304, in part by the National Natural Science Foundation of China
  • Beijing Natural Science Foundation (4152053 and 4131004)

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Multi-modal topic modeling from social media data using deep transfer learningApplied Soft Computing10.1016/j.asoc.2024.111706160(111706)Online publication date: Jul-2024
  • (2023)Shot Boundary Detection Using Color Clustering and Attention MechanismACM Transactions on Multimedia Computing, Communications, and Applications10.1145/359592319:6(1-23)Online publication date: 12-Jul-2023
  • (2023)Qualitative Modeling to Extract Knowledge for Problem StructuringKnowledge Technology and Systems10.1007/978-981-99-1075-5_5(137-166)Online publication date: 14-Jun-2023
  • (2023)Cross-Media Topic Detection: Approaches, Challenges, and ApplicationsComputer Vision and Machine Intelligence10.1007/978-981-19-7867-8_45(565-576)Online publication date: 6-May-2023
  • (2022)A Personalized Recommendation Method for Short Drama Videos Based on External Index FeaturesAdvances in Meteorology10.1155/2022/36019562022(1-10)Online publication date: 18-Apr-2022
  • (2022)A novel emerging topic detection methodInformation Processing and Management: an International Journal10.1016/j.ipm.2021.10284359:2Online publication date: 1-Mar-2022
  • (2022)Heterogeneous Information Fusion based Topic Detection from Social Media DataInformation Systems Frontiers10.1007/s10796-022-10334-w25:2(513-528)Online publication date: 7-Sep-2022
  • (2021)Methods on Detecting Closely Related Topics and Spatial EventsInternational Journal of Software Engineering and Knowledge Engineering10.1142/S021819402150045531:10(1377-1398)Online publication date: 15-Nov-2021
  • (2021)Metadata Connector: Exploiting Hashtag and Tag for Cross-OSN Event SearchIEEE Transactions on Multimedia10.1109/TMM.2020.298204723(510-523)Online publication date: 2021
  • (2021)Cross-Platform Event Popularity Analysis via Dynamic Time Warping and Neural PredictionIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2021.3090663(1-1)Online publication date: 2021
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media