A BERT-Based Scoring System for Workplace Safety Courses in Italian

Arici, Nicola; Gerevini, Alfonso E.; Putelli, Luca; Serina, Ivan; Sigalini, Luca

doi:10.1007/978-3-031-27181-6_32

Nicola Arici¹⁰,
Alfonso E. Gerevini¹⁰,
Luca Putelli¹⁰,
Ivan Serina¹⁰ &
…
Luca Sigalini¹¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13796))

Included in the following conference series:

International Conference of the Italian Association for Artificial Intelligence

457 Accesses

Abstract

Knowing the fundamentals of workplace safety is not only an important right for all categories of workers, but also a legal duty in Italy. Workers have to attend workplace safety courses and, in order to obtain a legally valid certification of the training received, they have to pass a written exam. This exam includes open-ended questions whose answers (provided by the students) are evaluated by human teachers. In the last few years, workplace safety courses have often been attended online via e-learning platforms. This allows the companies offering this kind of service to collect thousands of questions and answers regarding workplace safety that are written in Italian. In this paper, we propose an automatic scoring system for open-ended questions to assist a human teacher in the task of evaluating the student answers. The system is based on deep learning techniques exploiting the available textual data about questions and answers. In particular, we put forward three different approaches based on BERT, and we evaluate the necessary operations in order to create an effective tool.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 54.99; Price excludes VAT (USA)

Softcover Book: USD 69.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
https://huggingface.co/dbmdz/bert-base-italian-uncased.

References

Akiba, T., Sano, S., Yanase, T., Ohta, T., Koyama, M.: Optuna: a next-generation hyperparameter optimization framework. In: Teredesai, A., Kumar, V., Li, Y., Rosales, R., Terzi, E., Karypis, G. (eds.) Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, KDD 2019, Anchorage, AK, USA, 4–8 August 2019, pp. 2623–2631. ACM (2019)
Google Scholar
Basile, V., Novielli, N., Croce, D., Barbieri, F., Nissim, M., Patti, V.: Sentiment polarity classification at EVALITA: lessons learned and open challenges. IEEE Trans. Affect. Comput. 12(2), 466–478 (2021)
Article Google Scholar
Bird, S., Klein, E., Loper, E.: Natural Language Processing with Python: Analyzing Text with the Natural Language Toolkit. O’Reilly Media Inc., Sebastopol (2009)
MATH Google Scholar
Croce, D., Zelenanska, A., Basili, R.: Neural learning for question answering in Italian. In: Ghidini, C., Magnini, B., Passerini, A., Traverso, P. (eds.) AI*IA 2018 - Advances in Artificial Intelligence, pp. 389–402. Springer International Publishing, Cham (2018). https://doi.org/10.1007/978-3-030-03840-3_29
Chapter Google Scholar
Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Burstein, J., Doran, C., Solorio, T. (eds.) Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, Minneapolis, MN, USA, 2–7 June 2019, vol. 1 (Long and Short Papers), pp. 4171–4186. Association for Computational Linguistics (2019)
Google Scholar
Ethayarajh, K.: How contextual are contextualized word representations? comparing the geometry of BERT, ELMo, and GPT-2 embeddings. In: Inui, K., Jiang, J., Ng, V., Wan, X. (eds.) Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019, Hong Kong, China, 3–7 November 2019, pp. 55–65. Association for Computational Linguistics (2019)
Google Scholar
Haller, S., Aldea, A., Seifert, C., Strisciuglio, N.: Survey on automated short answer grading with deep learning: from word embeddings to transformers. CoRR abs /2204.03503 (2022)
Google Scholar
Hassan, S., Fahmy, A.A., El-Ramly, M.: Automatic short answer scoring based on paragraph embeddings. Int. J. Adv. Comput. Sci. Appl. 9(10), 397–402 (2018). https://doi.org/10.14569/IJACSA.2018.091048
Article Google Scholar
Ke, Z., Ng, V.: Automated essay scoring: a survey of the state of the art. In: Kraus, S. (ed.) Proceedings of the 28th International Joint Conference on Artificial Intelligence, IJCAI 2019, Macao, China, 10–16 August 2019, pp. 6300–6308. ijcai.org (2019). https://doi.org/10.24963/ijcai.2019/879
Mohler, M., Bunescu, R.C., Mihalcea, R.: Learning to grade short answer questions using semantic similarity measures and dependency graph alignments. In: Lin, D., Matsumoto, Y., Mihalcea, R. (eds.) The 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19–24 June 2011, Portland, Oregon, USA, pp. 752–762. The Association for Computer Linguistics (2011)
Google Scholar
Peters, M.E., et al.: Deep contextualized word representations. In: Walker, M.A., Ji, H., Stent, A. (eds.) Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2018, New Orleans, Louisiana, USA, June 1–6 2018, vol.1 (Long Papers), pp. 2227–2237. Association for Computational Linguistics (2018)
Google Scholar
Polignano, M., Basile, P., De Gemmis, M., Semeraro, G., Basile, V.: Alberto: Italian BERT language understanding model for NLP challenging tasks based on tweets. In: 6th Italian Conference on Computational Linguistics, CLiC-it 2019, vol. 2481, pp. 1–6. CEUR (2019)
Google Scholar
Prabhudesai, A., Duong, T.N.B.: Automatic short answer grading using siamese bidirectional LSTM based regression. In: IEEE International Conference on Engineering, Technology and Education, TALE 2019, Yogyakarta, Indonesia, 10–13 December 2019, pp. 1–6. IEEE (2019)
Google Scholar
Pribadi, F.S., Adji, T.B., Permanasari, A.E., Mulwinda, A., Utomo, A.B.: Automatic short answer scoring using words overlapping methods. In: AIP Conference Proceedings, vol. 1818, no. 1 (2017)
Google Scholar
Putelli, L., Gerevini, A.E., Lavelli, A., Maroldi, R., Serina, I.: Attention-based explanation in a deep learning model for classifying radiology reports. In: Tucker, A., Henriques Abreu, P., Cardoso, J., Pereira Rodrigues, P., Riaño, D. (eds.) AIME 2021. LNCS (LNAI), vol. 12721, pp. 367–372. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-77211-6_42
Chapter Google Scholar
Putelli, L., Gerevini, A.E., Lavelli, A., Olivato, M., Serina, I.: Deep learning for classification of radiology reports with a hierarchical schema. In: Cristani, M., Toro, C., Zanni-Merk, C., Howlett, R.J., Jain, L.C. (eds.) Knowledge-Based and Intelligent Information & Engineering Systems: Proceedings of the 24th International Conference KES-2020, Virtual Event, 16–18 September 2020. Procedia Computer Science, vol. 176, pp. 349–359. Elsevier (2020)
Google Scholar
Reimers, N., Gurevych, I.: Sentence-bert: sentence embeddings using siamese BERT-Networks. In: Inui, K., Jiang, J., Ng, V., Wan, X. (eds.) Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019, Hong Kong, China, 3–7 November 2019, pp. 3980–3990. Association for Computational Linguistics (2019)
Google Scholar
Sung, C., Dhamecha, T.I., Saha, S., Ma, T., Reddy, V., Arora, R.: Pre-training BERT on domain resources for short answer grading. In: Inui, K., Jiang, J., Ng, V., Wan, X. (eds.) Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019, Hong Kong, China, 3–7 November 2019, pp. 6070–6074. Association for Computational Linguistics (2019)
Google Scholar
Suzen, N., Gorban, A.N., Levesley, J., Mirkes, E.M.: Automatic short answer grading and feedback using text mining methods. CoRR abs/1807.10543 (2018)
Google Scholar
Tai, W., Kung, H.T., Dong, X., Comiter, M., Kuo, C.F.: exBERT: extending pre-trained models with domain-specific vocabulary under constrained training resources. In: Findings of the Association for Computational Linguistics: EMNLP 2020, pp. 1433–1439. Association for Computational Linguistics, Online (2020). https://doi.org/10.18653/v1/2020.findings-emnlp.129
Vaswani, A., et al.: Attention is all you need. In: Guyon, I., et al., (eds.) Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 4–9 December 2017, Long Beach, CA, USA, pp. 5998–6008 (2017)
Google Scholar
Zhang, Z., Wu, Y., Zhao, H., Li, Z., Zhang, S., Zhou, X., Zhou, X.: Semantics-aware BERT for language understanding. In: The 34th AAAI Conference on Artificial Intelligence, AAAI 2020, The 32nd Innovative Applications of Artificial Intelligence Conference, IAAI 2020, The 10th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2020, New York, NY, USA, 7–12 February 2020, pp. 9628–9635. AAAI Press (2020)
Google Scholar
Zubani, M., Sigalini, L., Serina, I., Gerevini, A.E.: Evaluating different natural language understanding services in a real business case for the italian language. Procedia Computer Science 176, 995–1004 (2020), knowledge-Based and Intelligent Information & Engineering Systems: Proceedings of the 24th International Conference KES2020
Google Scholar
Zubani, M., Sigalini, L., Serina, I., Putelli, L., Gerevini, A.E., Chiari, M.: A performance comparison of different cloud-based natural language understanding services for an Italian e-learning platform. Future Internet 14(2), 62 (2022)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Università degli Studi di Brescia, Via Branze 38, Brescia, Italy
Nicola Arici, Alfonso E. Gerevini, Luca Putelli & Ivan Serina
Mega Italia Media, Via Roncadelle 70A, Castel Mella, Italy
Luca Sigalini

Authors

Nicola Arici
View author publications
You can also search for this author in PubMed Google Scholar
Alfonso E. Gerevini
View author publications
You can also search for this author in PubMed Google Scholar
Luca Putelli
View author publications
You can also search for this author in PubMed Google Scholar
Ivan Serina
View author publications
You can also search for this author in PubMed Google Scholar
Luca Sigalini
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nicola Arici .

Editor information

Editors and Affiliations

University of Udine, Udine, Italy
Agostino Dovier
University of Udine, Udine, Italy
Angelo Montanari
National Research Council (CNR-ISTC), Rome, Italy
Andrea Orlandini

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Arici, N., Gerevini, A.E., Putelli, L., Serina, I., Sigalini, L. (2023). A BERT-Based Scoring System for Workplace Safety Courses in Italian. In: Dovier, A., Montanari, A., Orlandini, A. (eds) AIxIA 2022 – Advances in Artificial Intelligence. AIxIA 2022. Lecture Notes in Computer Science(), vol 13796. Springer, Cham. https://doi.org/10.1007/978-3-031-27181-6_32

Download citation

DOI: https://doi.org/10.1007/978-3-031-27181-6_32
Published: 11 March 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-27180-9
Online ISBN: 978-3-031-27181-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

A BERT-Based Scoring System for Workplace Safety Courses in Italian