research-article

Finding prophets in the blogosphere: bloggers who predicted buzzwords before they become popular

Authors: Seiya Tomonaga, Shinsuke Nakajima, Yoichi Inagaki, Reyn Nakamoto

iiWAS '15: Proceedings of the 17th International Conference on Information Integration and Web-based Applications & Services

Article No.: 15, Pages 1 - 10

https://doi.org/10.1145/2837185.2837188

Published: 11 December 2015

Abstract

Identifying important users in social media has recently attracted much attention in the information and knowledge management community. Although researchers have focused on users' knowledge levels on certain topics or their degree of influence on other users in social networks, previous work has not studied users' ability to predict future popularity. In this paper, we propose a novel approach to finding important bloggers based on their buzzword prediction ability. We conduct a time-series analysis of the blogosphere that considers four factors: post earliness, content similarity, entry frequency, and buzzword coverage. As preparatory work, we categorize each blogger into knowledgeable categories, identify past buzzwords, and analyze each buzzword's peak-time content and growth period; we then evaluate a blogger's prediction ability on a buzzword and on a category. Experimental results on real-world blog data consisting of 150 million entries from 11 million bloggers demonstrate that the proposed approach can find prophetic bloggers and outperforms approaches that do not take temporal features into account.
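The abstract names the four factors but does not give the scoring formulas, so the following is only a minimal sketch of how such factors might be combined, not the authors' method. Everything here is an assumption made for illustration: the `Entry` type, the integer day timeline, the linear earliness weight, bag-of-words cosine similarity against the peak-time content, summation over entries as a stand-in for entry frequency, and the fraction of buzzwords with a non-zero score as coverage.

```python
from collections import Counter
from dataclasses import dataclass
from math import sqrt


@dataclass
class Entry:
    day: int   # posting day on a hypothetical integer timeline
    text: str  # entry body


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def buzzword_score(entries, buzzword, growth_start, peak_day, peak_terms):
    """Assumed score of one blogger for one (lower-case) buzzword.

    Only entries that mention the buzzword during the growth period
    [growth_start, peak_day) count. Each one contributes
    earliness * similarity, and summing rewards entry frequency.
    """
    span = peak_day - growth_start
    if span <= 0:
        return 0.0
    total = 0.0
    for e in entries:
        text = e.text.lower()
        if buzzword not in text or not (growth_start <= e.day < peak_day):
            continue
        earliness = (peak_day - e.day) / span            # 1.0 at growth start
        similarity = cosine(Counter(text.split()), peak_terms)
        total += earliness * similarity
    return total


def category_score(entries, buzzwords):
    """Assumed category-level score.

    buzzwords maps a buzzword to (growth_start, peak_day, peak_terms).
    The summed per-buzzword scores are scaled by coverage, the fraction
    of the category's buzzwords the blogger scored on at all.
    """
    if not buzzwords:
        return 0.0
    scores = [buzzword_score(entries, w, *info) for w, info in buzzwords.items()]
    coverage = sum(1 for s in scores if s > 0) / len(buzzwords)
    return coverage * sum(scores)


if __name__ == "__main__":
    peak = Counter("selfie stick review".split())
    blogger = [Entry(0, "Selfie stick review"), Entry(9, "my selfie today")]
    print(buzzword_score(blogger, "selfie", 0, 10, peak))
```

Under these assumptions, a blogger who writes peak-like content near the start of the growth period scores higher than one who writes the same content just before the peak, which is the temporal behaviour the abstract contrasts with approaches that ignore time.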

Index Terms

1. Information systems
  1. Information retrieval
    1. Document representation
      1. Thesauri
    2. Retrieval tasks and goals
      1. Document filtering
      2. Information extraction

Published In

iiWAS '15: Proceedings of the 17th International Conference on Information Integration and Web-based Applications & Services

December 2015

704 pages

ISBN:9781450334914

DOI:10.1145/2837185

General Chair: Gabriele Anderst-Kotsis (Johannes Kepler University Linz, Austria)

Program Chair: Maria Indrawan-Santiago (Monash University, Australia)

Copyright © 2015 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Funding Sources

Ministry of Education, Culture, Sports, Science, and Technology

Conference

iiWAS '15

iiWAS '15: The 17th International Conference on Information Integration and Web-based Applications & Services

December 11 - 13, 2015

Brussels, Belgium

Cited By

Gammoudi F, Sendi M, Omri M (2022). A Survey on Social Media Influence Environment and Influencers Identification. Social Network Analysis and Mining 12:1. Online publication date: 3-Oct-2022.
https://doi.org/10.1007/s13278-022-00972-y

Bowen G, Bowen D (2021). Fashion Bloggers: Temperament and Characteristics. The Art of Digital Marketing for Fashion and Luxury Brands, pp. 81-104. Online publication date: 18-Jul-2021.
https://doi.org/10.1007/978-3-030-70324-0_4

Khan H, Daud A, Ishfaq U, Amjad T, Aljohani N, Abbasi R, Alowibdi J (2017). Modelling to identify influential bloggers in the blogosphere. Computers in Human Behavior 68:C, pp. 64-82. Online publication date: 1-Mar-2017.
https://dl.acm.org/doi/10.1016/j.chb.2016.11.012

Imamori D, Tajima K, Mukhopadhyay S, Zhai C, Bertino E, Crestani F, Mostafa J, Tang J, Si L, Zhou X, Chang Y, Li Y, Sondhi P (2016). Predicting Popularity of Twitter Accounts through the Discovery of Link-Propagating Early Adopters. Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, pp. 639-648. Online publication date: 24-Oct-2016.
https://dl.acm.org/doi/10.1145/2983323.2983859

Zhang J, Tomonaga S, Nakajima S, Inagaki Y, Nakamoto R (2016). Prophetic blogger identification based on buzzword prediction ability. International Journal of Web Information Systems 12:3, pp. 267-291. Online publication date: 15-Aug-2016.
https://doi.org/10.1108/IJWIS-03-2016-0013
