Skip to main content

Relations Reconstruction in a Knowledge Graph of a Socioeconomic System

  • Conference paper
  • First Online:
Advanced Data Mining and Applications (ADMA 2022)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13088))

Included in the following conference series:

  • 889 Accesses

Abstract

Data quality, completeness, and consistency are crucial for simulation modeling and predictive tasks in socioeconomic systems. Such systems involve heterogeneous entities and their interrelations, which are becoming available only by combining various data sources. In this study, three different sources were combined in a single knowledge graph (KG). It includes an online social network, an online recruitment system, and a financial bank. The constructed knowledge graph is evaluated on link prediction tasks to obtain complete and consistent data. We try to reconstruct links between users and a) socioeconomic statuses, b) organizations they work for, c) job positions. Knowledge graph embedding models and a graph neural network based on Transformer architecture were applied. We get promising results for reconstruction of User-Employer relations (\(MRR=0.42\), \(Hits@10=0.74\)), as well as for reconstruction of User-Position relations (\(MRR=0.59\), \(Hits@10=0.88\)).

The reported study was funded by RFBR according to the research project № 20-37-90126. We are grateful to Artem Petrov for his assistance with text processing tasks and Valentina Guleva for her valuable scientific advice.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 64.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 84.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://vk.com.

  2. 2.

    https://hh.ru.

  3. 3.

    We use \(community\_detection\) method from https://github.com/UKPLab/sentence-transformers.

  4. 4.

    https://github.com/DeepGraphLearning/KnowledgeGraphEmbedding.

  5. 5.

    https://github.com/acbull/pyHGT.

References

  1. Abitbol, J., Karsai, M., Fleury, E.: Location, occupation, and semantics based socioeconomic status inference on Twitter, pp. 1192–1199, November 2018

    Google Scholar 

  2. Aletras, N., Chamberlain, B.P.: Predicting Twitter user socioeconomic attributes with network and language information. In: Proceedings of the 29th on Hypertext and Social Media, pp. 20–24. ACM (2018)

    Google Scholar 

  3. Bordes, A., Usunier, N., Garcia-Duran, A., Weston, J., Yakhnenko, O.: Translating embeddings for modeling multi-relational data. In: Burges, C.J.C., Bottou, L., Welling, M., Ghahramani, Z., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems, vol. 26, pp. 2787–2795. Curran Associates, Inc. (2013)

    Google Scholar 

  4. Buchgeher, G., Gabauer, D., Martinez-Gil, J., Ehrlinger, L.: Knowledge graphs in manufacturing and production: a systematic literature review. IEEE Access 9, 55537–55554 (2021). https://doi.org/10.1109/ACCESS.2021.3070395

    Article  Google Scholar 

  5. Ding, S., Huang, H., Zhao, T., Fu, X.: Estimating socioeconomic status via temporal-spatial mobility analysis - a case study of smart card data. In: 2019 28th International Conference on Computer Communication and Networks (ICCCN), pp. 1–9 (2019)

    Google Scholar 

  6. Fixman, M., Berenstein, A., Brea, J., Minnoni, M., Travizano, M., Sarraute, C.: A Bayesian approach to income inference in a communication network. In: Proceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2016, pp. 579–582. IEEE Press (2016)

    Google Scholar 

  7. Han, X., Wang, L., Liu, G., Zhao, D., Xu, S.: Occupation profiling with user-generated geolocation data. In: 2017 2nd International Conference on Knowledge Engineering and Applications (ICKEA), pp. 93–97 (2017)

    Google Scholar 

  8. Hu, Z., Dong, Y., Wang, K., Sun, Y.: Heterogeneous graph transformer. In: Proceedings of The Web Conference 2020, WWW 2020, pp. 2704–2710. Association for Computing Machinery, New York (2020)

    Google Scholar 

  9. Huang, Y., Yu, L., Wang, X., Cui, B.: A multi-source integration framework for user occupation inference in social media systems. World Wide Web 18(5), 1247–1267 (2014). https://doi.org/10.1007/s11280-014-0300-6

    Article  Google Scholar 

  10. Kalinin, A., Vaganov, D., Bochenina, K.: Discovering patterns of customer financial behavior using social media data. Soc. Netw. Anal. Min. 10(1), 1–14 (2020). https://doi.org/10.1007/s13278-020-00690-3

    Article  Google Scholar 

  11. Lampos, V., Aletras, N., Geyti, J.K., Zou, B., Cox, I.J.: Inferring the socioeconomic status of social media users based on behaviour and language. In: Ferro, N., et al. (eds.) ECIR 2016. LNCS, vol. 9626, pp. 689–695. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-30671-1_54

    Chapter  Google Scholar 

  12. Li, Z., Xu, W., Zhang, L., Lau, R.Y.: An ontology-based web mining method for unemployment rate prediction. Decis. Support Syst. 66, 114–122 (2014)

    Article  Google Scholar 

  13. Matz, S.C., Menges, J.I., Stillwell, D.J., Schwartz, H.A.: Predicting individual-level income from Facebook profiles. PLoS ONE 14(3), 1–13 (2019)

    Article  Google Scholar 

  14. Preoţiuc-Pietro, D., Lampos, V., Aletras, N.: An analysis of the user occupational class through Twitter content. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 1754–1764 (2015)

    Google Scholar 

  15. Preoţiuc-Pietro, D., Volkova, S., Lampos, V., Bachrach, Y., Aletras, N.: Studying user income through language, behaviour and affect in social media. PLoS ONE 10(9), 1–17 (2015). https://doi.org/10.1371/journal.pone.0138717

    Article  Google Scholar 

  16. Robert, C.P., Casella, G.: The metropolis-hastings algorithm. In: Robert, C.P., Casella, G. (eds.) Monte Carlo Statistical Methods. STS, pp. 231–283. Springer, New York (1999). https://doi.org/10.1007/978-1-4757-3071-5_6

    Chapter  MATH  Google Scholar 

  17. Shi, B., Yang, J., Weninger, T., How, J., He, Q.: Representation learning in heterogeneous professional social networks with ambiguous social connections. In: 2019 IEEE International Conference on Big Data (Big Data), pp. 1928–1937 (2019)

    Google Scholar 

  18. Sun, Z., Deng, Z.H., Nie, J.Y., Tang, J.: RotatE: knowledge graph embedding by relational rotation in complex space. In: International Conference on Learning Representations (2019). https://openreview.net/forum?id=HkgEQnRqYQ

  19. Trouillon, T., Welbl, J., Riedel, S., Ciaussier, E., Bouchard, G.: Complex embeddings for simple link prediction. In: 33rd International Conference on Machine Learning, ICML 2016, vol. 5, pp. 3021–3032 (2016)

    Google Scholar 

  20. Vaganov, D., Kalinin, A., Bochenina, K.: On inferring monthly expenses of social media users: towards data and approaches. In: Cherifi, H., Gaito, S., Mendes, J.F., Moro, E., Rocha, L.M. (eds.) COMPLEX NETWORKS 2019. SCI, vol. 881, pp. 854–865. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-36687-2_71

    Chapter  Google Scholar 

  21. Vaswani, A., et al.: Attention is all you need. In: Guyon, I., et al. (eds.) Advances in Neural Information Processing Systems, vol. 30. Curran Associates, Inc. (2017)

    Google Scholar 

  22. Wu, Z., Pan, S., Chen, F., Long, G., Zhang, C., Yu, P.S.: A comprehensive survey on graph neural networks. IEEE Trans. Neural Netw. Learn. Syst. 32(1), 4–24 (2021). https://doi.org/10.1109/TNNLS.2020.2978386

    Article  MathSciNet  Google Scholar 

  23. Yang, Y., et al.: Multilingual universal sentence encoder for semantic retrieval. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pp. 87–94. Association for Computational Linguistics, July 2020. Online

    Google Scholar 

  24. Yang, Y., Pang, Y., Huang, G., et al.: The knowledge graph for macroeconomic analysis with alternative big data. arXiv preprint arXiv:2010.05172 (2020)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alexander Kalinin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kalinin, A., Vaganov, D., Shikov, E. (2022). Relations Reconstruction in a Knowledge Graph of a Socioeconomic System. In: Li, B., et al. Advanced Data Mining and Applications. ADMA 2022. Lecture Notes in Computer Science(), vol 13088. Springer, Cham. https://doi.org/10.1007/978-3-030-95408-6_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-95408-6_12

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-95407-9

  • Online ISBN: 978-3-030-95408-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics