Abstract
Data quality, completeness, and consistency are crucial for simulation modeling and predictive tasks in socioeconomic systems. Such systems involve heterogeneous entities and their interrelations, which are becoming available only by combining various data sources. In this study, three different sources were combined in a single knowledge graph (KG). It includes an online social network, an online recruitment system, and a financial bank. The constructed knowledge graph is evaluated on link prediction tasks to obtain complete and consistent data. We try to reconstruct links between users and a) socioeconomic statuses, b) organizations they work for, c) job positions. Knowledge graph embedding models and a graph neural network based on Transformer architecture were applied. We get promising results for reconstruction of User-Employer relations (\(MRR=0.42\), \(Hits@10=0.74\)), as well as for reconstruction of User-Position relations (\(MRR=0.59\), \(Hits@10=0.88\)).
The reported study was funded by RFBR according to the research project â„–Â 20-37-90126. We are grateful to Artem Petrov for his assistance with text processing tasks and Valentina Guleva for her valuable scientific advice.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
We use \(community\_detection\) method from https://github.com/UKPLab/sentence-transformers.
- 4.
- 5.
References
Abitbol, J., Karsai, M., Fleury, E.: Location, occupation, and semantics based socioeconomic status inference on Twitter, pp. 1192–1199, November 2018
Aletras, N., Chamberlain, B.P.: Predicting Twitter user socioeconomic attributes with network and language information. In: Proceedings of the 29th on Hypertext and Social Media, pp. 20–24. ACM (2018)
Bordes, A., Usunier, N., Garcia-Duran, A., Weston, J., Yakhnenko, O.: Translating embeddings for modeling multi-relational data. In: Burges, C.J.C., Bottou, L., Welling, M., Ghahramani, Z., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems, vol. 26, pp. 2787–2795. Curran Associates, Inc. (2013)
Buchgeher, G., Gabauer, D., Martinez-Gil, J., Ehrlinger, L.: Knowledge graphs in manufacturing and production: a systematic literature review. IEEE Access 9, 55537–55554 (2021). https://doi.org/10.1109/ACCESS.2021.3070395
Ding, S., Huang, H., Zhao, T., Fu, X.: Estimating socioeconomic status via temporal-spatial mobility analysis - a case study of smart card data. In: 2019 28th International Conference on Computer Communication and Networks (ICCCN), pp. 1–9 (2019)
Fixman, M., Berenstein, A., Brea, J., Minnoni, M., Travizano, M., Sarraute, C.: A Bayesian approach to income inference in a communication network. In: Proceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2016, pp. 579–582. IEEE Press (2016)
Han, X., Wang, L., Liu, G., Zhao, D., Xu, S.: Occupation profiling with user-generated geolocation data. In: 2017 2nd International Conference on Knowledge Engineering and Applications (ICKEA), pp. 93–97 (2017)
Hu, Z., Dong, Y., Wang, K., Sun, Y.: Heterogeneous graph transformer. In: Proceedings of The Web Conference 2020, WWW 2020, pp. 2704–2710. Association for Computing Machinery, New York (2020)
Huang, Y., Yu, L., Wang, X., Cui, B.: A multi-source integration framework for user occupation inference in social media systems. World Wide Web 18(5), 1247–1267 (2014). https://doi.org/10.1007/s11280-014-0300-6
Kalinin, A., Vaganov, D., Bochenina, K.: Discovering patterns of customer financial behavior using social media data. Soc. Netw. Anal. Min. 10(1), 1–14 (2020). https://doi.org/10.1007/s13278-020-00690-3
Lampos, V., Aletras, N., Geyti, J.K., Zou, B., Cox, I.J.: Inferring the socioeconomic status of social media users based on behaviour and language. In: Ferro, N., et al. (eds.) ECIR 2016. LNCS, vol. 9626, pp. 689–695. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-30671-1_54
Li, Z., Xu, W., Zhang, L., Lau, R.Y.: An ontology-based web mining method for unemployment rate prediction. Decis. Support Syst. 66, 114–122 (2014)
Matz, S.C., Menges, J.I., Stillwell, D.J., Schwartz, H.A.: Predicting individual-level income from Facebook profiles. PLoS ONE 14(3), 1–13 (2019)
Preoţiuc-Pietro, D., Lampos, V., Aletras, N.: An analysis of the user occupational class through Twitter content. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 1754–1764 (2015)
Preoţiuc-Pietro, D., Volkova, S., Lampos, V., Bachrach, Y., Aletras, N.: Studying user income through language, behaviour and affect in social media. PLoS ONE 10(9), 1–17 (2015). https://doi.org/10.1371/journal.pone.0138717
Robert, C.P., Casella, G.: The metropolis-hastings algorithm. In: Robert, C.P., Casella, G. (eds.) Monte Carlo Statistical Methods. STS, pp. 231–283. Springer, New York (1999). https://doi.org/10.1007/978-1-4757-3071-5_6
Shi, B., Yang, J., Weninger, T., How, J., He, Q.: Representation learning in heterogeneous professional social networks with ambiguous social connections. In: 2019 IEEE International Conference on Big Data (Big Data), pp. 1928–1937 (2019)
Sun, Z., Deng, Z.H., Nie, J.Y., Tang, J.: RotatE: knowledge graph embedding by relational rotation in complex space. In: International Conference on Learning Representations (2019). https://openreview.net/forum?id=HkgEQnRqYQ
Trouillon, T., Welbl, J., Riedel, S., Ciaussier, E., Bouchard, G.: Complex embeddings for simple link prediction. In: 33rd International Conference on Machine Learning, ICML 2016, vol. 5, pp. 3021–3032 (2016)
Vaganov, D., Kalinin, A., Bochenina, K.: On inferring monthly expenses of social media users: towards data and approaches. In: Cherifi, H., Gaito, S., Mendes, J.F., Moro, E., Rocha, L.M. (eds.) COMPLEX NETWORKS 2019. SCI, vol. 881, pp. 854–865. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-36687-2_71
Vaswani, A., et al.: Attention is all you need. In: Guyon, I., et al. (eds.) Advances in Neural Information Processing Systems, vol. 30. Curran Associates, Inc. (2017)
Wu, Z., Pan, S., Chen, F., Long, G., Zhang, C., Yu, P.S.: A comprehensive survey on graph neural networks. IEEE Trans. Neural Netw. Learn. Syst. 32(1), 4–24 (2021). https://doi.org/10.1109/TNNLS.2020.2978386
Yang, Y., et al.: Multilingual universal sentence encoder for semantic retrieval. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pp. 87–94. Association for Computational Linguistics, July 2020. Online
Yang, Y., Pang, Y., Huang, G., et al.: The knowledge graph for macroeconomic analysis with alternative big data. arXiv preprint arXiv:2010.05172 (2020)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Kalinin, A., Vaganov, D., Shikov, E. (2022). Relations Reconstruction in a Knowledge Graph of a Socioeconomic System. In: Li, B., et al. Advanced Data Mining and Applications. ADMA 2022. Lecture Notes in Computer Science(), vol 13088. Springer, Cham. https://doi.org/10.1007/978-3-030-95408-6_12
Download citation
DOI: https://doi.org/10.1007/978-3-030-95408-6_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-95407-9
Online ISBN: 978-3-030-95408-6
eBook Packages: Computer ScienceComputer Science (R0)