ABSTRACT
Identifying important users from social media has recently attracted much attention in information and knowledge management community. Although researchers have focused on users' knowledge levels on certain topics or influence degrees on other users in social networks, previous works have not studied users' prediction ability on future popularity. In this paper, we propose a novel approach to find important bloggers based on their buzzword prediction ability. We conduct a time-series analysis in the blogosphere considering four factors: post earliness, content similarity, entry frequency and buzzword coverage. We perform preparatory work in categorizing a blogger into knowledgeable categories, identifying past buzzwords, analyzing a buzzword's peak time content and growth period, and finally evaluate a blogger's prediction ability on a buzzword and on a category. Experimental results on real-world blog data consisting of 150 million entries from 11 million bloggers demonstrate that the proposed approach can find prophetic bloggers and outperforms others that do not take temporal features into account.
- S. Nakajima, J. Zhang, Y. Inagaki, T. Kusano and R. Nakamoto: Blog Ranking Based on Bloggers' Knowledge Level for Providing Credible Information. In WISE 2009: 227--234.Google Scholar
- K. W. Church, P. Hanks: Word Association Norms, Mutual Information, and Lexicography. Computational Linguistics, 16(1) (1990): 22--29.Google ScholarDigital Library
- A. Kilgarriff, D. Tugwell: Sketching words. Lexicography and Natural Language Processing: A Festschrift in Honour of B. T. S. Atkins., (2002): 125--137.Google Scholar
- Y. Wang, L. Wang, Y. Li, D. He, T. Liu: A Theoretical Analysis of NDCG Type Ranking Measures. In COLT 2013: 25--54.Google Scholar
- K. Balog, Y. Fang, M. Rijke, P. Serdyukov, L. Si: Expertise Retrieval. Foundations and Trends in Information Retrieval, 6(2--3) (2012): 127--256.Google ScholarDigital Library
- S. H. Hashemi, M. Neshati, H. Beigy: Expertise retrieval in bibliographic network: a topic dominance learning approach. In CIKM 2013: 1117--1126.Google Scholar
- K. Balog, M. Rijke, W. Weerkamp: Bloggers as experts: feed distillation using expert retrieval models. In SIGIR 2008: 753--754.Google Scholar
- A. Bozzon, M. Brambilla, S. Ceri, M. Silvestri, G. Vesci: Choosing the right crowd: expert finding in social networks. In EDBT 2013: 637--648.Google Scholar
- I. Guy, U. Avraham, D. Carmel, S. Ur, M. Jacovi, I. Ronen: Mining expertise and interests from social media. In WWW 2013: 515--526.Google Scholar
- N. Agarwal, H. Liu, L. Tang, P. S. Yu: Identifying the influential bloggers in a community. In WSDM 2008: 207--218.Google Scholar
- Y. Cai, Y. Chen: MASS: a multi-facet domain-specific influential blogger mining system. In ICDE 2010: 1109--1112.Google Scholar
- J. Weng, E. Lim, J. Jiang, Q. He: TwitterRank: finding topic-sensitive influential twitterers. In WSDM 2010: 261--270.Google Scholar
- M. Cha, H. Haddadi, F. Benevenuto, P. K. Gummadi: Measuring User Influence in Twitter: The Million Follower Fallacy. In ICWSM 2010.Google Scholar
- E. Bakshy, J. M. Hofman, W. A. Mason, D. J. Watts: Everyone's an influencer: quantifying influence on twitter. In WSDM 2011: 65--74.Google Scholar
- S. Wu, J. M. Hofman, W. A. Mason, D. J. Watts: Who says what to whom on twitter. In WWW 2011: 705--714.Google Scholar
- A. Goyal, F. Bonchi, L. V. S. Lakshmanan: Discovering leaders from community actions. In CIKM 2008: 499--508.Google Scholar
- Y. Singer: How to win friends and influence people, truthfully: influence maximization mechanisms for social networks. In WSDM 2012: 733--742.Google Scholar
- T. Sakaki, M. Okazaki, Y. Matsuo: Earthquake shakes Twitter users: real-time event detection by social sensors. In WWW 2010: 851--860.Google Scholar
- X. Jin, W. S. Spangler, R. Ma, J. Han: Topic initiator detection on the world wide web. In WWW 2010: 481--490.Google Scholar
- S. Asur, B. A. Huberman, G. Szabo, C. Wang: Trends in Social Media: Persistence and Decay. In ICWSM 2011.Google Scholar
- H. Becker, M. Naaman, L. Gravano: Beyond Trending Topics: Real-World Event Identification on Twitter. In ICWSM 2011.Google Scholar
- H. Yin, B. Cui, H. Lu, Y. Huang, J. Yao: A unified model for stable and temporal topic detection from social media data. In ICDE 2013: 661--672.Google Scholar
- D. Spina, J. Gonzalo, E. Amigo: Learning similarity functions for topic detection in online reputation monitoring. In SIGIR 2014: 527--536.Google Scholar
- X. Zhang, X. Chen, Y. Chen, S. Wang, Z. Li, J. Xia: Event detection and popularity prediction in microblogging. Neurocomputing 149(2015): 1469--1480.Google ScholarCross Ref
- F. Figueiredo, F. Benevenuto, J. M. Almeida: The tube over time: characterizing popularity growth of youtube videos. In WSDM 2011: 745--754.Google Scholar
- H. Pinto, J. M. Almeida, M. A. Goncalves: Using early view patterns to predict the popularity of youtube videos. In WSDM 2013: 365--374.Google Scholar
- H. Li, X. Ma, F. Wang, J. Liu, K. Xu: On popularity prediction of videos shared in online social networks. In CIKM 2013: 169--178.Google Scholar
- K. Lerman, T. Hogg: Using a model of social dynamics to predict popularity of news. In WWW 2010: 621--630.Google Scholar
- R. Bandari, S. Asur, B. A. Huberman: The Pulse of News in Social Media: Forecasting Popularity. In ICWSM 2012.Google Scholar
- S. R. Kairam, M. R. Morris, J. Teevan, D. J. Liebling, S. T. Dumais: Towards Supporting Search over Trending Events with Social Media. In ICWSM 2013.Google Scholar
- N. Golbandi, L. Katzir, Y. Koren, R. Lempel: Expediting search trend detection via prediction of query counts. In WSDM 2013: 295--304.Google Scholar
- K. Radinsky, K. M. Svore, S. T. Dumais, M. Shokouhi, J. Teevan, A. Bocharov, E. Horvitz: Behavioral dynamics on the web: Learning, modeling, and prediction. ACM Trans. Inf. Syst. 31(3)(2013): 16.Google ScholarDigital Library
- M. Mathioudakis, N. Koudas: TwitterMonitor: trend detection over the twitter stream. In SIGMOD Conference 2010: 1155--1158.Google Scholar
- L. Hong, O. Dan, B. D. Davison: Predicting popular messages in Twitter. In WWW (Companion Volume) 2011: 57--58.Google Scholar
- J. Bian, Y. Yang, T. Chua: Predicting trending messages and diffusion participants in microblogging network. In SIGIR 2014: 537--546.Google Scholar
- M. Ahmed, S. Spagna, F. Huici, S. Niccolini: A peek into the future: predicting the evolution of popularity in user generated content. In WSDM 2013: 607--616.Google Scholar
Index Terms
- Finding prophets in the blogosphere: bloggers who predicted buzzwords before they become popular
Recommendations
Early detection of buzzwords based on large-scale time-series analysis of blog entries
HT '12: Proceedings of the 23rd ACM conference on Hypertext and social mediaIn this paper, we discuss a method for early detection of "gradual buzzwords" by analyzing time-series data of blog entries. We observe the process in which certain topics grow to become major buzzwords and determine the key indicators that are ...
Identify Emergent Trends Based on the Blogosphere
WI-IAT '13: Proceedings of the 2013 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 03Information about upcoming trends is a valuable knowledge for both, companies and individuals. Detecting trends for a certain topic is of special interest. According to the latest information over 200 million blogs exist in the World Wide Web. Hence, ...
Buzzword detection in the scientific scenario
Buzzword detection through a time-series analysis.Identification of buzzwords in the DBLP database.Use of clustering techniques in trend detection.Evaluation of terms identified as buzzwords. This paper addresses a relatively new concept: the buzzword. ...
Comments