Skip to main content

Modelling and Predicting Individual Salaries in United Kingdom with Graph Convolutional Network

  • Conference paper
  • First Online:

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 923))

Abstract

Job Posting Sites, such as Indeed and Monster, are specifically designed to help users obtain information from the market. However, at the moment, only approximately half of the UK job postings have a salary publicly displayed. Therefore, the aim of this research is to model and predict the salary of a new job, so as to improve the performance of job search and help a vast amount of job seekers better understand the market worth of their desirable positions. In order to effectively estimate the salary of a given job, we construct a graph database based on job profiles of each posting and build a predictive model through machine learning based on both metadata features and relational features. Our results reveal that these two types of features are conditionally independent and each of them is sufficient for prediction. Therefore they can be exploited as two views in graph convolutional network (GCN), a semi-supervised learning framework, to make use of a large amount of unlabelled data, in addition to the set of labelled ones, for enhanced salary classification. The preliminary experimental results show that GCN outperforms the existing ones that simply pool these two types of features together.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    www.indeed.co.uk/.

  2. 2.

    www.monster.co.uk/.

  3. 3.

    https://www.kaggle.com/c/job-salary-prediction/data.

  4. 4.

    https://www.kaggle.com/c/job-salary-prediction/discussion/4208.

  5. 5.

    https://github.com/matt-gardner/pra.

References

  1. Asiler, M., Yazıcı, A.: BB-graph: a new subgraph isomorphism algorithm for efficiently querying big graph databases. arXiv preprint arXiv:1706.06654 (2017)

  2. Bar-Hillel, A., Hertz, T., Shental, N., Weinshall, D.: Learning a mahalanobis metric from equivalence constraints. J. Mach. Learn. Res. 6(Jun), 937–965 (2005)

    MathSciNet  MATH  Google Scholar 

  3. Borgelt, C., Berthold,M.: Finding relevant substructures of molecules: mining molecular fragments. In: Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM 2002), pp. 51–58 (2002)

    Google Scholar 

  4. Bruna, J., Zaremba, W., Szlam, A., LeCun,Y.: Spectral networks and locally connected networks on graphs. arXiv preprint arXiv:1312.6203 (2013)

  5. Defferrard, M., Bresson, X., Vandergheynst, P.: Convolutional neural networks on graphs with fast localized spectral filtering. In: Advances in Neural Information Processing Systems, pp. 3844–3852 (2016)

    Google Scholar 

  6. Foggia, P., Sansone, C.,, Vento, M.: A database of graphs for isomorphism and sub-graph isomorphism benchmarking. In: Proceedings of the 3rd IAPR TC-15 International Workshop on Graph-based Representations, pp. 176–187 (2001)

    Google Scholar 

  7. Huan, J., Wang, W., Prins, J.: Efficient mining of frequent subgraphs in the presence of isomorphism. In: 2003 Third IEEE International Conference on Data Mining, ICDM 2003, pp. 549–552. IEEE (2003)

    Google Scholar 

  8. Inokuchi, A., Washio, T., Motoda, H.: An apriori-based algorithm for mining frequent substructures from graph data. In: Principles of Data Mining and Knowledge Discovery, pp. 13–23 (2000)

    Google Scholar 

  9. Kingma, D., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)

  10. Kipf, T.N., Welling,M.: Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016)

  11. Kuramochi, M., Karypis, G.: Frequent subgraph discovery. In: 2001 Proceedings IEEE International Conference on Data Mining, ICDM 2001, pp. 313–320. IEEE (2001)

    Google Scholar 

  12. Li, Y., Tarlow, D., Brockschmidt, M., Zemel,R.: Gated graph sequence neural networks. arXiv preprint arXiv:1511.05493 (2015)

  13. Manning, C.D., Raghavan, P., Schütze, H., et al.: Introduction to Information Retrieval, vol. 1. Cambridge University Press, Cambridge (2008)

    Book  Google Scholar 

  14. Neville, J., Jensen, D.: Iterative classification in relational data. In: Proceedings of AAAI-2000 Workshop on Learning Statistical Models from Relational Data, pp. 13–20 (2000)

    Google Scholar 

  15. Niepert, M., Ahmed, M., Kutzkov, K.: Learning convolutional neural networks for graphs. In: International Conference on Machine Learning, pp. 2014–2023 (2016)

    Google Scholar 

  16. Nijssen, S., Kok, J.N.: A quickstart in frequent structure mining can make a difference. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 647–652. ACM (2004)

    Google Scholar 

  17. Perozzi, B., Al-Rfou, R., Skiena, S.: Deepwalk: online learning of social representations. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 701–710. ACM (2014)

    Google Scholar 

  18. Scarselli, F., Gori, M., Tsoi, A.C., Hagenbuchner, M., Monfardini, G.: The graph neural network model. IEEE Trans. Neural Netw. 20(1), 61–80 (2009)

    Article  Google Scholar 

  19. Sen, P., Namata, G., Bilgic, M., Getoor, L., Galligher, B., Eliassi-Rad, T.: Collective classification in network data. AI Mag. 29(3), 93 (2008)

    Article  Google Scholar 

  20. Tang, J., Qu, M., Wang, M., Zhang, M., Yan, J., Mei, Q.: Line: large-scale information network embedding. In: Proceedings of the 24th International Conference on World Wide Web, pp. 1067–1077. International World Wide Web Conferences Steering Committee (2015)

    Google Scholar 

  21. Tang, W., Zhong, S.: Pairwise constraints-guided dimensionality reduction. In: Computational Methods of Feature Selection, pp. 295–312. Chapman and Hall, CRC (2007)

    Google Scholar 

  22. Thoma, M., Cheng, H., Gretton, A., Han, J., Kriegel, H.-P., Smola, A., Song, L., Yu, P.S., Yan, X., Borgwardt, K.: Near-optimal supervised feature selection among frequent subgraphs. In: Proceedings of the 2009 SIAM International Conference on Data Mining, pp. 1076–1087. SIAM (2009)

    Google Scholar 

  23. Yan, X., Cheng, H., Han, J., Yu, P.S.: Mining significant graph patterns by leap search. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, pp. 433–444. ACM (2008)

    Google Scholar 

  24. Yan, X., Han, J.: gSpan: graph-based substructure pattern mining. In: 2002 IEEE International Conference on Data Mining, ICDM 2003, pp. 721–724. IEEE (2002)

    Google Scholar 

  25. Yang, Z., Cohen, W.W., Salakhutdinov, R.: Revisiting semi-supervised learning with graph embeddings. In: International Conference of Machine Learning (2016)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Long Chen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chen, L., Sun, Y., Thakuriah, P. (2020). Modelling and Predicting Individual Salaries in United Kingdom with Graph Convolutional Network. In: Madureira, A., Abraham, A., Gandhi, N., Varela, M. (eds) Hybrid Intelligent Systems. HIS 2018. Advances in Intelligent Systems and Computing, vol 923. Springer, Cham. https://doi.org/10.1007/978-3-030-14347-3_7

Download citation

Publish with us

Policies and ethics