Abstract
This paper proposes a hybrid feature selection method for predicting user influence on Twitter. A set of candidate features from Twitter is identified based on the five attributes of influencers defined in sociology. Firstly, less relevant features are filtered out with a feature-weighting algorithm. Then the Sequential Backward Floating Selection is utilized as the search strategy. A Back Propagation Neural Network is employed to evaluate the feature subset at each step of searching. Finally, an optimal feature set is obtained for predicting user influence with a high degree of accuracy. Experimental results are provided based on a real world Twitter dataset including seven million tweets associated with 200 popular users. The proposed method can provide a set of features that could be used as a solid foundation for studying complicated user influence evaluation and prediction.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
References
Raven, B.H.: Social influence and power. Technical report, DTIC Document (1964)
Cha, M., Haddadi, H., Benevenuto, F., Gummadi, P.K.: Measuring user influence in twitter: the million follower fallacy. ICWSM 10, 30 (2010)
Leavitt, A., Burchard, E., Fisher, D., Gilbert, S.: The influentials: new approaches for analyzing influence on twitter. Web Ecol. Proj. 4, 1–18 (2009)
Rosenman, E.T.: Retweetsbut not just retweets: quantifying and predicting influence on twitter. Ph.D. thesis, Bachelors thesis, applied mathematics. Harvard College, Cambridge (2012)
Weng, J., Lim, E.P., Jiang, J., He, Q.: Twitterrank: finding topic-sensitive influential twitterers. In: Proceedings of the third ACM International Conference on Web Search and Data Mining, pp. 261–270. ACM (2010)
Tunkelang, D.: A twitter analog to pagerank. The Noisy Channel (2009)
Yu, A., Hu, C.V., Kilzer, A.: Khyrank: Using retweets and mentions to predict influential users (2011)
Chen, W., Cheng, S., He, X., Jiang, F.: Influencerank: an efficient social influence measurement for millions of users in microblog. In: 2012 Second International Conference on Cloud and Green Computing (CGC), pp. 563–570. IEEE (2012)
Cappelletti, R., Sastry, N.: Iarank: Ranking users on twitter in near real-time, based on their information amplification potential. In: 2012 International Conference on Social Informatics (SocialInformatics), pp. 70–77. IEEE (2012)
Page, L., Brin, S., Motwani, R., Winograd, T.: The pagerank citation ranking: Bringing order to the web (1999)
Kwak, H., Lee, C., Park, H., Moon, S.: What is twitter, a social network or a news media? In: Proceedings of the 19th international conference on World wide web, pp. 591–600. ACM (2010)
John, G.H., Kohavi, R., Pfleger, K., et al.: Irrelevant features and the subset selection problem. In: Machine Learning: Proceedings of the Eleventh International Conference, pp. 121–129 (1994)
Kohavi, R., John, G.H.: Wrappers for feature subset selection. Artif. intell. 97, 273–324 (1997)
Keller, E., Berry, J.: The influentials: one American in ten tells the other nine how to vote, where to eat, and what to buy. Simon and Schuster, New York (2003)
Naaman, M., Boase, J., Lai, C.H.: Is it really about me? Message content in social awareness streams. In: Proceedings of the 2010 ACM Conference on Computer Supported Cooperative Work, pp. 189–192. ACM (2010)
Robnik-Šikonja, M., Kononenko, I.: Theoretical and empirical analysis of relieff and rrelieff. Mach. Learn. 53, 23–69 (2003)
Robnik-Šikonja, M., Kononenko, I.: An adaptation of relief for attribute estimation in regression. In: Machine Learning: Proceedings of the Fourteenth International Conference (ICML 1997), pp. 296–304 (1997)
Jain, A., Zongker, D.: Feature selection: evaluation, application, and small sample performance. IEEE Trans. Pattern Anal. Mach. Intell 19, 153–158 (1997)
Goh, A.: Back-propagation neural networks for modeling complex systems. Artif. Intell. Eng. 9, 143–151 (1995)
Acknowledgments
The work presented in this paper was partially supported by Macquarie University Research Excellence Scholarship (Allocation No.2013115), an Australian Research Council Linkage Project (LP120200231) and the China Scholarship Council. We also thank anonymous reviewers for their valuable comments.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Mei, Y., Zhang, Z., Zhao, W., Yang, J., Nugroho, R. (2015). A Hybrid Feature Selection Method for Predicting User Influence on Twitter. In: Wang, J., et al. Web Information Systems Engineering – WISE 2015. WISE 2015. Lecture Notes in Computer Science(), vol 9418. Springer, Cham. https://doi.org/10.1007/978-3-319-26190-4_32
Download citation
DOI: https://doi.org/10.1007/978-3-319-26190-4_32
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26189-8
Online ISBN: 978-3-319-26190-4
eBook Packages: Computer ScienceComputer Science (R0)