ABSTRACT
Finding the key topics in a large amount of short texts in social networks is a hot research point in data mining. There are a lot of models and algorithms to solve this problem, but they are not designed for social networks where many new words and nonstandard writings, grammars exist. It's more difficult to detect key topics in social networks because of these characteristics. In this paper, we propose a framework for detecting key topics in social networks. First, we get the posts in social networks using a focused crawler. Then we introduce the Word Segment Merging (WSM) method to identify new phrases in short texts and represent a document with the vector space model (VSM). At last, we model the life cycle of topics for clustering and popularity computing. Experiments on three datasets of SINA Weibo show that our method is better than existing state-of-arts models.
- Quercia, D., H. Askham, and J. Crowcroft. TweetLDA: supervised topic classification and link prediction in Twitter. in Proceedings of the 3rd Annual ACM Web Science Conference. 2012. ACM.Google ScholarDigital Library
- Rubin, T.N., et al., Statistical topic models for multi-label document classification. Machine Learning, 2012. 88(1-2): p. 157--208.Google Scholar
- Cha, Y. and J. Cho. Social-network analysis using topic models. in proceedings of the 35th international ACM SIGIR conference on research and development in information retrieval. 2012. ACM.Google ScholarDigital Library
- Ibrahim, R., et al., Tools and approaches for topic detection from Twitter streams: survey. 2018. 54(3): p. 511--539.Google ScholarDigital Library
- Schultz, J.M. and M.Y. Liberman, Towards a "Universal Dictionary" for multi-language information retrieval applications, in Topic detection and tracking. 2002, Springer. p. 225--241.Google ScholarDigital Library
- Kumaran, G. and J. Allan. Using names and topics for new event detection. in Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing. 2005. Association for Computational Linguistics.Google ScholarDigital Library
- Blei, D.M., A.Y. Ng, and M.I. Jordan, Latent dirichlet allocation. the Journal of machine Learning research, 2003. 3: p. 993--1022.Google ScholarDigital Library
- Rosen-Zvi, M., et al. The author-topic model for authors and documents. in Proceedings of the 20th conference on Uncertainty in artificial intelligence. 2004. AUAI Press.Google ScholarDigital Library
- Ramage, D., et al. Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora. in Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1-Volume 1. 2009. Association for Computational Linguistics.Google ScholarDigital Library
- Nikolenko, S.I., S. Koltcov, and O.J.J.o.I.S. Koltsova, Topic modelling for qualitative studies. 2017. 43(1): p. 88--102.Google ScholarDigital Library
- Fuentes-Pineda, G. and I.V.J.E.S.w.A. Meza-Ruiz, Topic Discovery in Massive Text Corpora Based on Min-Hashing. 2019.Google ScholarDigital Library
- Choi, H.-J. and C.H.J.E.S.w.A. Park, Emerging topic detection in twitter stream based on high utility pattern mining. 2019. 115: p. 27--36.Google ScholarCross Ref
- Jaradat, S. and M. Matskin, On Dynamic Topic Models for Mining Social Media, in Emerging Research Challenges and Opportunities in Computational Social Network Analysis and Mining. 2019, Springer. p. 209--230.Google ScholarCross Ref
- Makkonen, J., H. Ahonen-Myka, and M. Salmenkivi. Topic detection and tracking with spatio-temporal evidence. in European Conference on Information Retrieval. 2003. Springer.Google ScholarCross Ref
- Hofmann, T., Unsupervised learning by probabilistic latent semantic analysis. Machine learning, 2001. 42(1-2): p. 177--196.Google Scholar
- Figueiredo, F., F. Benevenuto, and J.M. Almeida. The tube over time: characterizing popularity growth of youtube videos. in Proceedings of the fourth ACM international conference on Web search and data mining. 2011. ACM.Google ScholarDigital Library
- Bao, P., et al., Popularity prediction in microblogging network: a case study on sina weibo, in Proceedings of the 22nd international conference on World Wide Web companion. 2013, International World Wide Web Conferences Steering Committee: Rio de Janeiro, Brazil. p. 177--178.Google Scholar
- Lerman, K. and T. Hogg, Using a model of social dynamics to predict popularity of news, in Proceedings of the 19th international conference on World wide web. 2010, ACM: Raleigh, North Carolina, USA. p. 621--630.Google ScholarDigital Library
- Furini, M., et al. 5 Steps to Make Art Museums Tweet Influentially. in 2018 International Workshop on Social Sensing (SocialSens). 2018. IEEE.Google ScholarCross Ref
- Crane, R. and D. Sornette, Robust dynamic classes revealed by measuring the response function of a social system. Proceedings of the National Academy of Sciences, 2008. 105(41): p. 15649--15653.Google ScholarCross Ref
- Szabo, G. and B.A. Huberman, Predicting the popularity of online content. Communications of the ACM, 2010. 53(8): p. 80--88.Google Scholar
- Pinto, H., J.M. Almeida, and M.A. Gonçalves. Using early view patterns to predict the popularity of youtube videos. in Proceedings of the sixth ACM international conference on Web search and data mining. 2013. ACM.Google ScholarDigital Library
- Wang, X., et al., Predicting the popularity of topics based on user sentiment in microblogging websites. Journal of Intelligent Information Systems, 2017. 49: p. 1--18.Google Scholar
- Dancey, C.P. and J. Reidy, Statistics without maths for psychology. 2007: Pearson Education.Google Scholar
Index Terms
- A Framework for Detecting Key Topics in Social Networks
Recommendations
Detecting bursts in sentiment-aware topics from social media
Nowadays plenty of user-generated posts, e.g., sina weibos, are published on the social media. The posts contain the publics sentiments (i.e., positive or negative) towards various topics. Bursty sentiment-aware topics from these posts reveal sentiment-...
Social Opinion Mining: An Approach for Italian Language
FICLOUD '15: Proceedings of the 2015 3rd International Conference on Future Internet of Things and CloudOpinion Mining or Sentiment Analysis is an important Computer Science topic, that allows the discovery of the web user's attitude on one or more topics. Marketing, Social Engineering and Information Retrieval are achieved through the application of ...
Integrated microblog sentiment analysis from users' social interaction patterns and textual opinions
Traditional post-level opinion classification methods usually fail to capture a person's overall sentiment orientation toward a topic from his/her microblog posts published for a variety of themes related to that topic. One reason for this is that the ...
Comments