Skip to main content

Supervised Machine Learning for Automatic Assessment of Free-Text Answers

  • Conference paper
  • First Online:
Advances in Soft Computing (MICAI 2021)

Abstract

The learning assessment seeks to collect data that allows for identifying learning gaps for teacher decision-making. Hence, teachers need to plan and select various assessment instruments that enable the verification of learning evolution. Considering that a more significant number of evaluation instruments and modalities increase the teachers’ workload, the adoption of machine learning might support the assessing actions and amplify the potential of students’ observation and follow-up. This article aims to analyze machine learning algorithms for automatic classification of free-text answers, i.e., evaluating descriptive questions written in Portuguese. We utilized a dataset of 9981 free-text answers for 17 questions. After pre-processing the data, we used eight classification algorithms. In conclusion, we highlight that the Logistic Regression, ExtraTrees, Random Forest, and Multi-layer Perceptron algorithms obtained results above 0.9 of F-score for both multi-class and binary classification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    https://www.kaggle.com/lucasbgalhardi/pt-asag-2018,ch1galhardi2018portuguese.

References

  1. dos Santos, J.C.A., Ribeiro, T., Favero, E., Queiroz, J.: Aplicação de um método lsa na avaliação automática de respostas discursivas. In: Anais do Workshop de Desafios da Computação Aplicada à Educação, pp. 10–19 (2012)

    Google Scholar 

  2. Passero, G., Haendchen Filho, A., Dazzi, R.: Avaliaçao do uso de métodos baseados em lsa e wordnet para correçao de questoes discursivas. In: Brazilian Symposium on Computers in Education (Simpósio Brasileiro de Informática na Educação-SBIE), vol. 27, p. 1136 (2016)

    Google Scholar 

  3. Çınar, A., Ince, E., Gezer, M., Yılmaz, Ö.: Machine learning algorithm for grading open-ended physics questions in Turkish. Educ. Inf. Technol. 25(5), 3821–3844 (2020)

    Article  Google Scholar 

  4. Basu, S., Jacobs, C., Vanderwende, L.: Powergrading: a clustering approach to amplify human effort for short answer grading. Trans. Assoc. Comput. Ling. 1, 391–402 (2013)

    Google Scholar 

  5. Zehner, F., Sälzer, C., Goldhammer, F.: Automatic coding of short text responses via clustering in educational assessment. Educ. Psychol. Measur. 76(2), 280–303 (2016)

    Article  Google Scholar 

  6. Horbach, A., Pinkal, M.: Semi-supervised clustering for short answer scoring. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) (2018)

    Google Scholar 

  7. Leacock, C., Chodorow, M.: C-rater: automated scoring of short-answer questions. Comput. Humanit. 37(4), 389–405 (2003)

    Article  Google Scholar 

  8. Willis, A.: Using NLP to support scalable assessment of short free text responses. In: Proceedings of the Tenth Workshop on Innovative Use of NLP for Building Educational Applications, pp. 243–253 (2015)

    Google Scholar 

  9. Marvaniya, S., Saha, S., Dhamecha, T.I., Foltz, P., Sindhgatta, R., Sengupta, B.: Creating scoring rubric from representative student answers for improved short answer grading. In: Proceedings of the 27th ACM International Conference on Information and Knowledge Management, pp. 993–1002 (2018)

    Google Scholar 

  10. Sahu, A., Bhowmick, P.K.: Feature engineering and ensemble-based approach for improving automatic short-answer grading performance. IEEE Trans. Learn. Technol. 13(1), 77–90 (2019)

    Article  Google Scholar 

  11. Vij, S., Tayal, D., Jain, A.: A machine learning approach for automated evaluation of short answers using text similarity based on wordnet graphs. Wireless Pers. Commun. 111(2), 1271–1282 (2020)

    Article  Google Scholar 

  12. Filighera, A., Steuer, T., Rensing, C.: Fooling automatic short answer grading systems. In: Bittencourt, I.I., Cukurova, M., Muldner, K., Luckin, R., Millán, E. (eds.) AIED 2020. LNCS (LNAI), vol. 12163, pp. 177–190. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-52237-7_15

    Chapter  Google Scholar 

  13. Galhardi, L., Barbosa, C.R., de Souza, R.C.T., Brancher, J.D.: Portuguese automatic short answer grading. In: Brazilian Symposium on Computers in Education (Simpósio Brasileiro de Informática na Educação-SBIE), vol. 29, p. 1373 (2018)

    Google Scholar 

  14. Liu, T., Ding, W., Wang, Z., Tang, J., Huang, G.Y., Liu, Z.: Automatic short answer grading via multiway attention networks. In: Isotani, S., Millán, E., Ogan, A., Hastings, P., McLaren, B., Luckin, R. (eds.) AIED 2019. LNCS (LNAI), vol. 11626, pp. 169–173. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-23207-8_32

    Chapter  Google Scholar 

  15. Lun, J., Zhu, J., Tang, Y., Yang, M.: Multiple data augmentation strategies for improving performance on automatic short answer scoring. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 13389–13396 (2020)

    Google Scholar 

  16. Tan, H., Wang, C., Duan, Q., Lu, Y., Zhang, H., Li, R.: Automatic short answer grading by encoding student responses via a graph convolutional network. In: Interactive Learning Environments, pp. 1–15 (2020)

    Google Scholar 

  17. Zhang, Y., Lin, C., Chi, M.: Going deeper: automatic short-answer grading by combining student and question models. User Model. User-Adap. Inter. 30(1), 51–80 (2020)

    Article  Google Scholar 

  18. Kapoor, B.S.J., Nagpure, S.M., Kolhatkar, S.S., Chanore, P.G., Vishwakarma, M.M., Kokate, R.B.: An analysis of automated answer evaluation systems based on machine learning. In: 2020 International Conference on Inventive Computation Technologies (ICICT), pp. 439–443. IEEE (2020)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Guillermo Rodriguez .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Gomes Rocha, F., Rodriguez, G., Andrade, E.E.F., Guimarães, A., Gonçalves, V., Sabino, R.F. (2021). Supervised Machine Learning for Automatic Assessment of Free-Text Answers. In: Batyrshin, I., Gelbukh, A., Sidorov, G. (eds) Advances in Soft Computing. MICAI 2021. Lecture Notes in Computer Science(), vol 13068. Springer, Cham. https://doi.org/10.1007/978-3-030-89820-5_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-89820-5_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-89819-9

  • Online ISBN: 978-3-030-89820-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics