ABSTRACT
Although lots of work has been done since NIST proposed the problem of Topic Detection and Tracking (TDT), most of them focus on single media data. Topic detection for cross-media data hasn't been fully investigated. In this paper, we propose an effective method for cross-media topic detection. Unlike traditional topic detection methods that are mainly based on clustering, we consider using hot search queries as guidance to detect topics. Besides, we propose an improved co-clustering method which can be well suited for cross-media data clustering. First, we use queries to detect topics directly, and find the data associated with the topic. Second, we apply our co-clustering method to find the topics existing in the rest of data. Finally, the results obtained by the first two steps are threaded together as topics. Experiments show that our method can effectively detect topics for cross-media data.
- LDC, "TDT3 evaluation specification version 2.7." 1999.Google Scholar
- {Q. He, K. Chang, and E. P. Lim, "Analyzing feature trajectories for event detection," in ACM SIGIR Conference, 2007. Google ScholarDigital Library
- Q. Mei and C. Zhai, "Discovering evolutionary theme patterns from text: an exploration of temporal text mining," in ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005. Google ScholarDigital Library
- J. Allan, J. Carbonell, G. Doddington, J. Yamron, and Y. Yang, "Topic detection and tracking pilot study: Final report." In Proceedings of the DARP A Broadcast News Transcription and Understanding Workshop, 1998.Google Scholar
- L. Liu, L. Sun, Y. Rui, Y. Shi, and S. Yang, "Web video topic discovery and tracking via bipartite graph reinforcement model," in International World Wide Web Conference, 2008. Google ScholarDigital Library
- J. Cao, C. W. Ngo, Y. D. Zhang, and J. T. Li, "Tracking web video topics: discovery, visualization and monitoring." IEEE Transactions on Circuits and Systems for Video Technology, 21(12): 1835--1846, 2011.Google ScholarCross Ref
- C. H. Wang, M. Zhang, S. P. Ma, and L. Y. Ru, "Automatic online news issue construction in web environment," in International World Wide Web Conference, 2008. Google ScholarDigital Library
- A. X. Sun, and M. S. Hu, "Query-guided event detection from news and blog streams," IEEE Transactions on Systems, Man and Cybernetics, Part A: Systems and Humans, 41(5): 834--839, 2011. Google ScholarDigital Library
- T. L. Chen, C. X. Liu, Q. M. Huang, "An Effective Multi-Clue Fusion Approach for Web Video Topic Detection," In ACM Multimedia, 2012. Google ScholarDigital Library
- I. S. Dhillon, I. S. et al., "Information theoretic co-lustering," in Proc. 9th ACM SIGKDD'03, pp. 89--98. Google ScholarDigital Library
- J. Shao, S. Ma, W. M. Lu, and Y. T. Zhuang, "A unified framework for web video topic discovery and visualization," Pattern Recognition Letters, 33(4): 410--419, 2012. Google ScholarDigital Library
- I. S. Dhillon, et al., "Co-clustering documents and words using bipartite spectral graph partitioning," in Proc. 7th ACM KDD '01, pp. 269--274. Google ScholarDigital Library
- DOI=http://news.sina.com.cn/Google Scholar
- DOI=http://www.youku.com/Google Scholar
- DOI=http://ictclas.org/ictclas_download.aspxGoogle Scholar
- DOI=http://hot.news.baidu.com/Google Scholar
- D. Carmel, E. Yom-Tov, A. Darlow, and D. Pelleg, "What makes a query difficult?" in Proc. SIGIR, Seattle, WA, 2006. Google ScholarDigital Library
Recommendations
Image-regulated graph topic model for cross-media topic detection
ICIMCS '15: Proceedings of the 7th International Conference on Internet Multimedia Computing and ServiceIn recent years, pictures and videos have become ubiquitous on the Internet, which encourage the development of algorithm that analyze their semantic contents for detecting topics. Among them, topic modeling plays an essential role in discovering topics ...
Semantic-based topic detection using Markov decision processes
In the field of text mining, topic modeling and detection are fundamental problems in public opinion monitoring, information retrieval, social media analysis, and other activities. Document clustering has been used for topic detection at the document ...
A semantic approach for topic-based polarity detection: a case study in the Spanish language
AbstractIn recent years, surprising amounts of news, messages, and reviews of products and services are generated in the online social media. Several efforts are being dedicated to detecting topics, as well as mining opinions in these unstructured texts. ...
Comments