skip to main content
10.1145/3343031.3356054acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Feature Construction for Posts and Users Combined with LightGBM for Social Media Popularity Prediction

Published: 15 October 2019 Publication History

Abstract

In this paper, we propose to address the Social Media Prediction (SMP) Challenge by using regression model with multiple features extracted from various aspects of posts. More specifically, we extract textual features, numeric features, and construct user-related features to this end. For textual features, the rich texts possessed by the posts are integrated to build a corpus, based on which we train a language model to learn the vector representation of semantic information. For numeric features, we construct several new features, including the length and the word numbers of title. For the user-related features, we design a "user id count" based on the number of times each user posted in the entire dataset to show the activity of the user. Finally, the multiple features are feed into LightGBM to predict popularity scores. Extensive experiments conducted on the Social Media Prediction Dataset show the superiority of our method. Our approach achieves the 3rd place in the SMP Challenge.

References

[1]
Bo Wu, Wen-Huang Cheng, Yongdong Zhang, Huang Qiushi, Li Jintao, and Tao Mei. 2017. Sequential Prediction of Social Media Popularity with Deep Temporal Context Networks. In International Joint Conference on Artificial Intelligence (IJCAI).
[2]
Jiani Hu, Toshihiko Yamasaki, and Kiyoharu Aizawa. 2016. Multimodal learning for image popularity prediction on social media. IEEE International Conference on Consumer Electronics-taiwan. IEEE.
[3]
Masoud Mazloom, Iliana Pappi and Marcel Worring. 2018. Category Specific Post Popularity Prediction. In MultiMedia Modeling Lecture Notes in Computer Science, 2018, pp. 594--607.
[4]
Bielski Adam and Tomasz Trzcinski. 2018. Understanding Multimodal Popularity Prediction of Social Media Videos With Self-Attention. IEEE Access (2018), 6: 74277--74287.
[5]
Mayank Meghawat, et al. 2018. A multimodal approach to predict social media popularity. 2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR). IEEE.
[6]
Alireza Zohourian, Sajedi Hedieh, and Yavary Arefeh. 2018. Popularity prediction of images and videos on Instagram. 2018 4th International Conference on Web Research (ICWR). IEEE.
[7]
Xiangnan He, Ming Gao, Min-Yen Kan, Yiqun Liu, and Kazunari Sugiyama. 2014. Predicting the popularity of web 2.0 items based on user comments. In Proc.of SIGIR, pages 233--242, 2014.
[8]
Cheng Li, Yue Lu, Qiaozhu Mei, Dong Wang, and Sandeep Pandey. 2015. Click-through prediction for advertising in twitter timeline. In Proc. of KDD, 2015.
[9]
Juheng Zhang and Piramuthu Selwyn. 2018. Product recommendation with latent review topics. Information Systems Frontiers 20.3 (2018): 617--625.
[10]
Chuang Gan, Chen Sun, Lixin Duan, and Boqing Gong. 2016. Webly-supervised video recognition by mutually voting for relevant web images and web video frames. In ECCV, 2016.
[11]
Bo Wu, et al. 2016. Unfolding temporal dynamics: Predicting social media popularity using multi-scale temporal decomposition. Thirtieth AAAI Conference on Artificial Intelligence.
[12]
Bo Wu, et al. 2016. Time matters: Multi-scale temporalization of social media popularity. Proceedings of the 24th ACM international conference on Multimedia. ACM, 2016.
[13]
Kaiming He, et al. 2016. Deep residual learning for image recognition. Proceedings of the IEEE conference on computer vision and pattern recognition.Conference Name:ACM Woodstock conference.
[14]
Tianqi Chen and Guestrin Carlos. 2016. Xgboost: A scalable tree boosting system. Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining. ACM, 2016.
[15]
Guolin Ke, et al. 2017. Lightgbm: A highly efficient gradient boosting decision tree. Advances in Neural Information Processing Systems.
[16]
Mikolov, Tomas, et al. 2013. Distributed representations of words and phrases and their compositionality. Advances in neural information processing systems.
[17]
Mikolov, Tomas, et al. 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013).

Cited By

View all
  • (2025)Tri-Modal Transformers With Mixture-of-Modality-Experts for Social Media PredictionIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2024.347410135:2(1897-1909)Online publication date: 1-Feb-2025
  • (2024)Dual-Stream Pre-Training Transformer to Enhance Multimodal Learning for Social Media PredictionProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3688998(11450-11456)Online publication date: 28-Oct-2024
  • (2024)MMF: Winning Solution to Social Media Popularity Prediction Challenge 2024Proceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3688997(11445-11449)Online publication date: 28-Oct-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '19: Proceedings of the 27th ACM International Conference on Multimedia
October 2019
2794 pages
ISBN:9781450368896
DOI:10.1145/3343031
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 October 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. feature engineering
  2. regression
  3. social media popularity prediction

Qualifiers

  • Research-article

Conference

MM '19
Sponsor:

Acceptance Rates

MM '19 Paper Acceptance Rate 252 of 936 submissions, 27%;
Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)30
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Tri-Modal Transformers With Mixture-of-Modality-Experts for Social Media PredictionIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2024.347410135:2(1897-1909)Online publication date: 1-Feb-2025
  • (2024)Dual-Stream Pre-Training Transformer to Enhance Multimodal Learning for Social Media PredictionProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3688998(11450-11456)Online publication date: 28-Oct-2024
  • (2024)MMF: Winning Solution to Social Media Popularity Prediction Challenge 2024Proceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3688997(11445-11449)Online publication date: 28-Oct-2024
  • (2024)SMP Challenge Summary: Social Media Prediction ChallengeProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3688996(11442-11444)Online publication date: 28-Oct-2024
  • (2024)Enhancing social media post popularity prediction with visual contentJournal of the Korean Statistical Society10.1007/s42952-024-00270-753:3(844-882)Online publication date: 21-May-2024
  • (2023)User review analysis of dating apps based on text miningPLOS ONE10.1371/journal.pone.028389618:4(e0283896)Online publication date: 26-Apr-2023
  • (2023)Double-Fine-Tuning Multi-Objective Vision-and-Language Transformer for Social Media Popularity PredictionProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3612845(9462-9466)Online publication date: 26-Oct-2023
  • (2023)Neural Image Popularity Assessment with Retrieval-augmented TransformerProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3611918(2427-2436)Online publication date: 26-Oct-2023
  • (2023)SMPC: boosting social media popularity prediction with captionMultimedia Systems10.1007/s00530-022-01030-529:2(577-586)Online publication date: 19-Jan-2023
  • (2022)Overview of the Multimedia Grand Challenges 2022Proceedings of the 30th ACM International Conference on Multimedia10.1145/3503161.3552365(7220-7222)Online publication date: 10-Oct-2022
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media