Abstract
User profiling is an important research topic in social media analysis, with considerable value for both research and industry. Existing work on user profiling has mostly relied on handcrafted features for user attribute prediction and has partly overlooked the social relations among users. To address this problem, we propose a multi-granularity convolutional neural network model with feature fusion and refinement. The model uses convolution to automatically extract latent semantic features related to user attributes from social texts. We also combine different machine learning methods through stacking for feature refinement. By combining semantic context with social network information, the proposed model captures the social relations among users and improves attribute classification performance. We evaluate the model on the dataset from the SMP CUP 2016 competition. The experimental results demonstrate that the proposed model is effective for automatic user attribute classification, with a particular focus on fine-grained user information.
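To make the convolutional feature extraction concrete, the following is a minimal sketch, not the authors' implementation. It assumes that "multi-granularity" can be read as applying filters of several window widths over token embeddings and max-pooling each response, which is one common design for text CNNs; the abstract does not specify the architecture. The function names, the toy embeddings, and the hand-set filter weights are all hypothetical.

```python
from typing import Dict, List, Sequence

Vector = List[float]
Kernel = List[Vector]  # one weight row per token position in the window


def conv1d_max(embeddings: Sequence[Vector], kernel: Kernel) -> float:
    """Slide one filter over the token sequence and max-pool the ReLU responses."""
    width = len(kernel)
    best = 0.0  # ReLU floor: negative responses are clipped to zero
    for start in range(len(embeddings) - width + 1):
        window = embeddings[start:start + width]
        response = sum(
            w * x
            for k_row, x_row in zip(kernel, window)
            for w, x in zip(k_row, x_row)
        )
        best = max(best, response)
    return best


def multi_granularity_features(
    embeddings: Sequence[Vector],
    kernels_by_width: Dict[int, List[Kernel]],
) -> List[float]:
    """Concatenate pooled responses from filters of several window widths."""
    features = []
    for width in sorted(kernels_by_width):
        for kernel in kernels_by_width[width]:
            features.append(conv1d_max(embeddings, kernel))
    return features


# Toy 2-dimensional embeddings for a four-token post (hypothetical values).
post = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]

# One hand-set filter per width; a real model would learn these weights.
kernels = {
    1: [[[1.0, -1.0]]],
    2: [[[1.0, 0.0], [0.0, 1.0]]],
    3: [[[1.0, 1.0], [1.0, 1.0], [1.0, 1.0]]],
}

features = multi_granularity_features(post, kernels)
print(features)
```

In a full pipeline, a feature vector of this kind would be passed to a classifier, and the abstract indicates that several such learners are combined by stacking; that step is omitted here.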
Acknowledgements
This work is partially supported by a grant from the Foundation of State Key Laboratory of Cognitive Intelligence, iFLYTEK, P.R. China (COGOS-20190001, Intelligent Medical Question Answering based on User Profiling and Knowledge Graph), the Natural Science Foundation of China (No. 61632011, 61572102, 61702080), the Fundamental Research Funds for the Central Universities (No. DUT18ZD102), the Postdoctoral Science Foundation of China (2018M641691), and the Ministry of Education Humanities and Social Science Project (No. 19YJCZH199).
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Xu, B., Tadesse, M.M., Fei, P., Lin, H. (2019). Multi-granularity Convolutional Neural Network with Feature Fusion and Refinement for User Profiling. In: Zhang, Q., Liao, X., Ren, Z. (eds) Information Retrieval. CCIR 2019. Lecture Notes in Computer Science, vol 11772. Springer, Cham. https://doi.org/10.1007/978-3-030-31624-2_13
DOI: https://doi.org/10.1007/978-3-030-31624-2_13
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-31623-5
Online ISBN: 978-3-030-31624-2
eBook Packages: Computer Science (R0)