skip to main content
10.1145/3038912.3052626acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article
Public Access

GPOP: Scalable Group-level Popularity Prediction for Online Content in Social Networks

Published: 03 April 2017 Publication History

Abstract

Predicting the popularity of online content in social networks is important in many applications, ranging from ad campaign design, web content caching and prefetching, to web-search result ranking. Earlier studies target this problem by learning models that either generalize behaviors of the entire network population or capture behaviors of each individual user. In this paper, we claim that a novel approach based on group-level popularity is necessary and more practical, given that users naturally organize themselves into clusters and that users within a cluster react to online content in a uniform manner. We develop a novel framework by first grouping users into cohesive clusters, and then adopt tensor decomposition to make predictions. In order to minimize the impact of noisy data and be more flexible in capturing changes in users' interests, our framework exploits both the network topology and interaction among users in learning a robust user clustering. The PARAFAC tensor decomposition is adapted to work with hierarchical constraint over user groups, and we show that optimizing this constrained function via gradient descent achieves faster convergence and leads to more stable solutions. Extensive experimental results over two social networks demonstrate that our framework is scalable, finds meaningful user groups, and significantly outperforms eight baseline methods in terms of prediction accuracy.

References

[1]
Behance.net social network. https://www.behance.net/.
[2]
Hierarchical group-level popularity prediction for online contents. http://cs.ucsb.edu/~mhoang/gpop_longversion.pdf.
[3]
E. Acar, D. M. Dunlavy, T. G. Kolda, and M. Mørup. Scalable tensor factorizations for incomplete data. Chemometrics and Intelligent Laboratory Systems, 106(1):41--56, 2011.
[4]
E. Acar, T. G. Kolda, and D. M. Dunlavy. All-at-once optimization for coupled matrix and tensor factorizations. arXiv preprint arXiv:1105.3422, 2011.
[5]
B. W. Bader, T. G. Kolda, et~al. Matlab tensor toolbox version 2.6. http://www.sandia.gov/~tgkolda/TensorToolbox/, 2015.
[6]
R. Bandari, S. Asur, and B. A. Huberman. The pulse of news in social media: Forecasting popularity. arXiv preprint arXiv:1202.0332, 2012.
[7]
J. Cheng, L. Adamic, P. A. Dow, J. M. Kleinberg, and J. Leskovec. Can cascades be predicted? In WWW, 2014.
[8]
D. M. Dunlavy, T. G. Kolda, and E. Acar. Poblano v1.0: A matlab toolbox for gradient-based optimization. Technical Report SAND2010--1422, Sandia National Laboratories, 2010.
[9]
M. Eftekhar, Y. Ganjali, and N. Koudas. Information cascade at group scale. In KDD, 2013.
[10]
G. Gürsun, M. Crovella, and I. Matta. Describing and forecasting video access patterns. In INFOCOM, 2011.
[11]
L. Hong, O. Dan, and B. D. Davison. Predicting popular messages in twitter. In WWW Companion, 2011.
[12]
Z. Hu, J. Yao, B. Cui, and E. Xing. Community level diffusion extraction. In SIGMOD, 2015.
[13]
R. J. Hyndman, R. A. Ahmed, G. Athanasopoulos, and H. L. Shang. Optimal combination forecasts for hierarchical time series. Computational Statistics & Data Analysis, 55(9):2579--2589, 2011.
[14]
S. Jamali and H. Rangwala. Digging digg: Comment mining, popularity prediction, and social network analysis. In International Conference on Web Information Systems and Mining (WISM), 2009.
[15]
A. Kaltenbrunner, V. Gomez, and V. Lopez. Description and prediction of slashdot activity. In Web Conference, 2007. LA-WEB 2007. Latin American, pages 57--66. IEEE, 2007.
[16]
G. Karypis and V. Kumar. Metis: A software package for partitioning unstructured graphs, partitioning meshes, and computing fill-reducing orderings of sparse matrices.
[17]
G. Karypis and V. Kumar. Multilevel k-way partitioning scheme for irregular graphs. Journal of Parallel and Distributed computing, 48(1):96--129, 1998.
[18]
S.-D. Kim, S.-H. Kim, and H.-G. Cho. Predicting the virtual temperature of web-blog articles as a measurement tool for online popularity. In International Conference on Computer and Information Technology (CIT), 2011.
[19]
T. G. Kolda and B. W. Bader. Tensor decompositions and applications. SIAM review, 51(3):455--500, 2009.
[20]
S. Kong, F. Ye, and L. Feng. Predicting future retweet counts in a microblog. Journal of Computational Information Systems, 10(4):1393--1404, 2014.
[21]
H. Kwak, C. Lee, H. Park, and S. Moon. What is twitter, a social network or a news media? In WWW, 2010.
[22]
K. Lerman and T. Hogg. Using a model of social dynamics to predict popularity of news. In WWW, 2010.
[23]
Y. Matsubara, Y. Sakurai, et~al. Fast mining and forecasting of complex time-stamped events. In KDD, 2012.
[24]
Y. Matsubara, Y. Sakurai, W. G. Van~Panhuis, and C. Faloutsos. Funnel: automatic mining of spatially coevolving epidemics. In KDD, 2014.
[25]
G. W. Milligan and M. C. Cooper. An examination of procedures for determining the number of clusters in a data set. Psychometrika, 50(2):159--179, 1985.
[26]
A. Oghina et~al. Predicting IMDB movie ratings using social media. In European Conference on Information Retrieval, pages 503--507. Springer, 2012.
[27]
H. Pinto, J. M. Almeida, and M. A. Gonçalves. Using early view patterns to predict the popularity of youtube videos. In WSDM, 2013.
[28]
S. D. Roy, T. Mei, W. Zeng, and S. Li. Towards cross-domain learning for social video popularity prediction. IEEE Transactions on multimedia, 15(6):1255--1267, 2013.
[29]
G. Szabo and B. A. Huberman. Predicting the popularity of online content. Communications of the ACM, 53(8):80--88, 2010.
[30]
A. Tatar, P. Antoniadis, M. D. De~Amorim, and S. Fdida. Ranking news articles based on popularity prediction. In ASONAM, 2012.
[31]
A. Tatar et~al. Predicting the popularity of online articles based on user comments. In International Conference on Web Intelligence, Mining and Semantics, 2011.
[32]
A. Tatar et~al. From popularity prediction to ranking online news. Social Network Analysis and Mining, 4(1):1--12, 2014.
[33]
A. Tatar et~al. A survey on predicting the popularity of web content. Journal of Internet Services and Applications, 5(1):1--20, 2014.
[34]
M. Tsagkias et~al. Predicting the volume of comments on online news stories. In CIKM, 2009.
[35]
J. Yang and J. Leskovec. Patterns of temporal variation in online media. In WSDM, 2011.
[36]
P. Yin, P. Luo, M. Wang, and W.-C. Lee. A straw shows which way the wind blows: ranking potentially popular items from early votes. In WSDM, 2012.
[37]
L. Yu et~al. From micro to macro: Uncovering and predicting information cascading process with behavioral dynamics. In ICDM, 2015.
[38]
T. Zaman, E. B. Fox, E. T. Bradlow, et~al. A bayesian approach for predicting the popularity of tweets. The Annals of Applied Statistics, 8(3):1583--1611, 2014.
[39]
Y. Zhang et~al. Controlling propagation at group scale on networks. In ICDM, 2015.

Cited By

View all
  • (2024)A Survey of Deep Learning-Based Information Cascade PredictionSymmetry10.3390/sym1611143616:11(1436)Online publication date: 29-Oct-2024
  • (2024)Content-Caching-Oriented Popularity Forecast and User ClusteringIEEE Internet of Things Journal10.1109/JIOT.2024.344659111:23(38425-38440)Online publication date: 1-Dec-2024
  • (2023)Fighting False Information from Propagation Process: A SurveyACM Computing Surveys10.1145/356338855:10(1-38)Online publication date: 2-Feb-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
WWW '17: Proceedings of the 26th International Conference on World Wide Web
April 2017
1678 pages
ISBN:9781450349130

Sponsors

  • IW3C2: International World Wide Web Conference Committee

In-Cooperation

Publisher

International World Wide Web Conferences Steering Committee

Republic and Canton of Geneva, Switzerland

Publication History

Published: 03 April 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. content prediction
  2. graph clustering
  3. tensor decomposition

Qualifiers

  • Research-article

Funding Sources

Conference

WWW '17
Sponsor:
  • IW3C2

Acceptance Rates

WWW '17 Paper Acceptance Rate 164 of 966 submissions, 17%;
Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)203
  • Downloads (Last 6 weeks)23
Reflects downloads up to 28 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)A Survey of Deep Learning-Based Information Cascade PredictionSymmetry10.3390/sym1611143616:11(1436)Online publication date: 29-Oct-2024
  • (2024)Content-Caching-Oriented Popularity Forecast and User ClusteringIEEE Internet of Things Journal10.1109/JIOT.2024.344659111:23(38425-38440)Online publication date: 1-Dec-2024
  • (2023)Fighting False Information from Propagation Process: A SurveyACM Computing Surveys10.1145/356338855:10(1-38)Online publication date: 2-Feb-2023
  • (2023)Persistence Augmented Graph Convolution Network for Information Popularity PredictionIEEE Transactions on Network Science and Engineering10.1109/TNSE.2023.3258931(1-13)Online publication date: 2023
  • (2023)User behavior prediction model based on implicit links and multi-type rumor messagesKnowledge-Based Systems10.1016/j.knosys.2023.110276262(110276)Online publication date: Feb-2023
  • (2022)Interval-censored Hawkes processesThe Journal of Machine Learning Research10.5555/3586589.358692723:1(15236-15319)Online publication date: 1-Jan-2022
  • (2022)Event Popularity Prediction Using Influential Hashtags From Social MediaIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2020.304842834:10(4797-4811)Online publication date: 1-Oct-2022
  • (2021)Tensor Data Imputation by PARAFAC with Updated Chaotic Biases by Adam OptimizerInternational Journal of Recent Technology and Engineering10.35940/ijrte.E5291.0396219:6(30-38)Online publication date: 30-Mar-2021
  • (2021)Capturing Dynamics of Information Diffusion in SNS: A Survey of Methodology and TechniquesACM Computing Surveys10.1145/348527355:1(1-51)Online publication date: 23-Nov-2021
  • (2021)Incremental Group-Level Popularity Prediction in Online Social NetworksACM Transactions on Internet Technology10.1145/346183922:1(1-26)Online publication date: 14-Sep-2021
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media