Skip to main content

N-Gram Feature Based Resume Classification Using Machine Learning

  • Conference paper
  • First Online:
Computational Intelligence in Communications and Business Analytics (CICBA 2022)

Abstract

Shortlisting the right resume for the job is a tedious task for a job recruiter. For each post, thousands of applicants send their resumes; among all, very few resumes fit for the published job recruitment. Finding suitable resumes from a huge pool of resumes manually is a time-consuming process. This research addresses the above issue by suggesting a machine learning-based automated resume classification model. The automated resume classification model helps to classify the resume into different categories. Many classification models are used during the model development and found that the random forest classifier provides the most promising outcomes. The random forest classifier achieved the highest micro precision, recall, F1-score, and an accuracy value of 0.99 for the best case with the combination of uni- and bi-gram features.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Howard, J.L., Ferris, G.R.: The employment interview context: social and situational influences on interviewer decisions 1. J. Appl. Soc. Psychol. 26(2), 112–136 (1996)

    Article  Google Scholar 

  2. Kmail, A.B., Maree, M., Belkhatir, M.: MatchingSem: online recruitment system based on multiple semantic resources. In: 2015 12th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD), pp. 2654–2659. IEEE (2015)

    Google Scholar 

  3. Lin, Y., Lei, H., Addo, P.C., Li, X.: Machine learned resume-job matching solution. arXiv preprint arXiv:1607.07657 (2016)

  4. Roy, P.K., Singh, J.P., Nag, A.: Finding active expert users for question routing in community question answering sites. In: Perner, P. (ed.) MLDM 2018. LNCS (LNAI), vol. 10935, pp. 440–451. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-96133-0_33

    Chapter  Google Scholar 

  5. Yi, X., Allan, J., Croft, W.B.: Matching resumes and jobs based on relevance models. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 809–810 (2007)

    Google Scholar 

  6. Javed, F., Luo, Q., McNair, M., Jacob, F., Zhao, M., Kang, T.S.: Carotene: a job title classification system for the online recruitment domain. In: 2015 IEEE First International Conference on Big Data Computing Service and Applications, pp. 286–293. IEEE (2015)

    Google Scholar 

  7. Kessler, R., Béchet, N., Roche, M., Torres-Moreno, J.-M., El-Bèze, M.: A hybrid approach to managing job offers and candidates. Inf. Process. Manag. 48(6), 1124–1135 (2012)

    Article  Google Scholar 

  8. Zhang, L., Fei, W., Wang, L.: PJ matching model of knowledge workers. Procedia Comput. Sci. 60, 1128–1137 (2015)

    Article  Google Scholar 

  9. Breaugh, J.A.: The use of biodata for employee selection: past research and future directions. Hum. Resour. Manag. Rev. 19(3), 219–231 (2009)

    Google Scholar 

  10. Roy, P.K., Singh, J.P., Baabdullah, A.M., Kizgin, H., Rana, N.P.: Identifying reputation collectors in community question answering (CQA) sites: exploring the dark side of social media. Int. J. Inf. Manag. 42, 25–35 (2018)

    Article  Google Scholar 

  11. Kessler, R., Torres-Moreno, J.M., El-Bèze, M.: E-Gen: automatic job offer processing system for human resources. In: Gelbukh, A., Kuri Morales, Á.F. (eds.) MICAI 2007. LNCS (LNAI), vol. 4827, pp. 985–995. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-76631-5_94

    Chapter  Google Scholar 

  12. Yu, K., Guan, G., Zhou, M.: Resume information extraction with cascaded hybrid model. In: Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL 2005), pp. 499–506 (2005)

    Google Scholar 

  13. Martinez-Gil, J., Paoletti, A.L., Schewe, K.-D.: A smart approach for matching, learning and querying information from the human resources domain. In: Ivanović, M., et al. (eds.) ADBIS 2016. CCIS, vol. 637, pp. 157–167. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-44066-8_17

    Chapter  Google Scholar 

  14. Clyde, S., Zhang, J., Yao, C.-C.: An object-oriented implementation of an adaptive classification of job openings. In: Proceedings the 11th Conference on Artificial Intelligence for Applications, pp. 9–16. IEEE (1995)

    Google Scholar 

  15. Roy, P.K., Chowdhary, S.S., Bhatia, R.: A machine learning approach for automation of resume recommendation system. Procedia Comput. Sci. 167, 2318–2327 (2020)

    Article  Google Scholar 

  16. Lu, J., Wu, D., Mao, M., Wang, W., Zhang, G.: Recommender system application developments: a survey. Decis. Support Syst. 74, 12–32 (2015)

    Article  Google Scholar 

  17. Wei, K., Huang, J., Fu, S.: A survey of e-commerce recommender systems. In: 2007 International Conference on Service Systems and Service Management, pp. 1–5. IEEE (2007)

    Google Scholar 

  18. Fazel-Zarandi, M., Fox, M.S.: Semantic matchmaking for job recruitment: an ontology-based hybrid approach. In: Proceedings of the 8th International Semantic Web Conference, vol. 525, no. 01, p. 2009 (2009)

    Google Scholar 

  19. Malinowski, J., Keim, T., Wendt, O., Weitzel, T.: Matching people and jobs: a bilateral recommendation approach. In: Proceedings of the 39th Annual Hawaii International Conference on System Sciences (HICSS 2006), vol. 6, pp. 137c–137c. IEEE (2006)

    Google Scholar 

  20. Golec, A., Kahya, E.: A fuzzy model for competency-based employee evaluation and selection. Comput. Ind. Eng. 52(1), 143–161 (2007)

    Article  Google Scholar 

  21. Paparrizos, I., Cambazoglu, B.B., Gionis, A.: Machine learned job recommendation. In: Proceedings of the Fifth ACM Conference on Recommender Systems, pp. 325–328 (2011)

    Google Scholar 

  22. Horton, P., Nakai, K.: Better prediction of protein cellular localization sites with the it k nearest neighbors classifier. In: Ismb, vol. 5, pp. 147–152 (1997)

    Google Scholar 

  23. Rish, I., et al.: An empirical study of the Naive Bayes classifier. In: IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence, vol. 3, no. 22, pp. 41–46 (2001)

    Google Scholar 

  24. Belgiu, M., Drăguţ, L.: Random forest in remote sensing: a review of applications and future directions. ISPRS J. Photogramm. Remote Sens. 114, 24–31 (2016)

    Article  Google Scholar 

  25. Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)

    Article  MATH  Google Scholar 

  26. Friedman, J.H.: Stochastic gradient boosting. Comput. Stat. Data Anal. 38(4), 367–378 (2002)

    Article  MathSciNet  MATH  Google Scholar 

  27. Kowsari, K., Jafari Meimandi, K., Heidarysafa, M., Mendu, S., Barnes, L., Brown, D.: Text classification algorithms: a survey. Information 10(4), 150 (2019)

    Article  Google Scholar 

  28. Menard, S.: Applied Logistic Regression Analysis, vol. 106. Sage (2002). https://books.google.co.in/books?hl=en &lr= &id=EAI1QmUUsbUC &oi=fnd &pg=PP7 &dq=Applied+Logistic+Regression+Analysis &ots=4VGQI-qVJP &sig=4m7ea9FOnr2GKmpEje65WwfQ1PI &redir_esc=y#v=onepage &q=Applied%20Logistic%20Regression%20Analysis &f=false

  29. Kotsiantis, S.B., Zaharakis, I.D., Pintelas, P.E.: Machine learning: a review of classification and combining techniques. Artif. Intell. Rev. 26(3), 159–190 (2006)

    Article  Google Scholar 

  30. Murthy, S.K.: Automatic construction of decision trees from data: a multi-disciplinary survey. Data Min. Knowl. Disc. 2(4), 345–389 (1998)

    Article  Google Scholar 

  31. Roy, P.K., Singh, J.P., Banerjee, S.: Deep learning to filter SMS spam. Future Gener. Comput. Syst. 102, 524–533 (2020)

    Article  Google Scholar 

  32. Roy, P.K., Jain, A., Ahmad, Z., Singh, J.P.: Identifying expert users on question answering sites. In: Goyal, D., Bălaş, V.E., Mukherjee, A., Hugo C. de Albuquerque, V., Gupta, A.K. (eds.) ICIMMI 2019. AIS, pp. 285–291. Springer, Singapore (2021). https://doi.org/10.1007/978-981-15-4936-6_32

    Chapter  Google Scholar 

  33. Tripathi, D., Reddy, B.R., Shukla, A.K.: CFR: collaborative feature ranking for improving the performance of credit scoring data classification. Computing, 104, 893–923 (2021)

    Google Scholar 

  34. Tripathi, D., Edla, D.R., Bablani, A., Shukla, A.K., Reddy, B.R.: Experimental analysis of machine learning methods for credit score classification. Prog. Artif. Intell. 10(3), 217–243 (2021). https://doi.org/10.1007/s13748-021-00238-2

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pradeep Kumar Roy .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Roy, P.K., Chahar, S. (2022). N-Gram Feature Based Resume Classification Using Machine Learning. In: Mukhopadhyay, S., Sarkar, S., Dutta, P., Mandal, J.K., Roy, S. (eds) Computational Intelligence in Communications and Business Analytics. CICBA 2022. Communications in Computer and Information Science, vol 1579. Springer, Cham. https://doi.org/10.1007/978-3-031-10766-5_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-10766-5_18

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-10765-8

  • Online ISBN: 978-3-031-10766-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics