Extracting and Clustering Main Ideas from Student Feedback Using Language Models

Masala, Mihai; Ruseti, Stefan; Dascalu, Mihai; Dobre, Ciprian

doi:10.1007/978-3-030-78292-4_23

Mihai Masala^13,14,
Stefan Ruseti¹³,
Mihai Dascalu^13,15 &
…
Ciprian Dobre¹³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12748))

Included in the following conference series:

International Conference on Artificial Intelligence in Education

3320 Accesses
6 Citations

Abstract

Feedback mechanisms for academic courses have been widely used to measure students opinions and satisfaction towards different components of a course; concurrently, open-text detailed impressions enable professors to continually improve their course. However, the process of reading through hundreds of student feedback responses across multiple subjects, followed by the extraction of important ideas is very time consuming. In this work, we propose an automated feedback summarizer to extract the main ideas expressed by all students on various components for each course, based on a pipeline integrating state-of-the-art Natural Language Processing techniques. Our method involves the usage of BERT language models to extract keywords for each course, identify relevant contexts for recurring keywords, and cluster similar contexts. We validate our tool on 8,201 feedback responses for 168 distinct courses from the Computer Science Department of University Politehnica of Bucharest for the 2019–2020 academic year. Our approach achieves a size reduction of 59% on the overall volume of text, while only increasing the mean average error when predicting course ratings from student open-text feedback by an absolute value of 0.06.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
https://spacy.io/models/ro#ro_core_news_lg. Retrieved April 15, 2021.
2.
https://spacy.io/. Retrieved April 15, 2021.
3.
https://www.tensorflow.org/tensorboard. Retrieved April 15, 2021.

References

Seldin, P.: Using student feedback to improve teaching. Improve Acad. 16(1), 335–345 (1997)
Article Google Scholar
Flodén, J.: The impact of student feedback on teaching in higher education. Assess. Eval. High. Educ. 42(7), 1054–1068 (2017)
Article Google Scholar
Leckey, J., Neill, N.: Quantifying quality: the importance of student feedback. Qual. High. Educ. 7(1), 19–32 (2001)
Article Google Scholar
Moore, S., Kuol, N.: Students evaluating teachers: exploring the importance of faculty reaction to feedback on teaching. Teach. High. Educ. 10(1), 57–73 (2005)
Article Google Scholar
Perera, J., Lee, N., Win, K., Perera, J., Wijesuriya, L.: Formative feedback to students: the mismatch between faculty perceptions and student expectations. Med. Teach. 30(4), 395–399 (2008)
Article Google Scholar
Luo, W., Liu, F., Litman, D.: An improved phrase-based approach to annotating and summarizing student course responses. In: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pp. 53–63. The COLING 2016 Organizing Committee, Osaka (2016)
Google Scholar
Unankard, S., Nadee, W.: Topic detection for online course feedback using LDA. In: Popescu, E., Hao, T., Hsu, T.-C., Xie, H., Temperini, M., Chen, W. (eds.) SETE 2019. LNCS, vol. 11984, pp. 133–142. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-38778-5_16
Chapter Google Scholar
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
MATH Google Scholar
Luo, W., Litman, D.: Summarizing student responses to reflection prompts. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 1955–1960 (2015)
Google Scholar
Luo, W., Liu, F., Liu, Z., Litman, D.: Automatic summarization of student course feedback. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 80–85 (2016)
Google Scholar
Miller, D.: Leveraging BERT for extractive text summarization on lectures. arXiv preprint arXiv:1906.04165 (2019)
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, pp. 4171–4186 (2019)
Google Scholar
Lin, C.Y.: ROUGE: a package for automatic evaluation of summaries. In: Text Summarization Branches Out, pp. 74–81. Association for Computational Linguistics, Barcelona (2004)
Google Scholar
Masala, M., Ruseti, S., Dascalu, M.: RoBERT - a Romanian BERT model. In: Proceedings of the 28th International Conference on Computational Linguistics, pp. 6626–6637 (2020)
Google Scholar
Grootendorst, M.: KeyBERT: minimal keyword extraction with BERT (2020). https://github.com/MaartenGr/KeyBERT
Carbonell, J., Goldstein, J.: Use of MMR, diversity-based reranking for reordering documents and producing summaries. In: SIGIR Forum (ACM Special Interest Group on Information Retrieval), pp. 335–336 (1998)
Google Scholar
Van der Maaten, L., Hinton, G.: Visualizing data using t-SNE. J. Mach. Learn. Res. 9(11), 2579–2605 (2008)
MATH Google Scholar
Grootendorst, M.: BERTopic: leveraging BERT and c-TF-IDF to create easily interpretable topics (2020). https://github.com/MaartenGr/BERTopic

Download references

Acknowledgments

This research was supported by a grant of the Romanian National Authority for Scientific Research and Innovation, CNCS – UEFISCDI, project number TE 70 PN-III-P1-1.1-TE-2019-2209, ATES – “Automated Text Evaluation and Simplification”.

Author information

Authors and Affiliations

University Politehnica of Bucharest, 313 Splaiul Independentei, 060042, Bucharest, Romania
Mihai Masala, Stefan Ruseti, Mihai Dascalu & Ciprian Dobre
Institute of Mathematics of the Romanian Academy, 21 Calea Grivitei, 010702, Bucharest, Romania
Mihai Masala
Academy of Romanian Scientists, Str. Ilfov, Nr. 3, 050044, Bucharest, Romania
Mihai Dascalu

Authors

Mihai Masala
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Ruseti
View author publications
You can also search for this author in PubMed Google Scholar
Mihai Dascalu
View author publications
You can also search for this author in PubMed Google Scholar
Ciprian Dobre
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mihai Dascalu .

Editor information

Editors and Affiliations

Technion – Israel Institute of Technology, Haifa, Israel
Ido Roll
Arizona State University, Tempe, AZ, USA
Danielle McNamara
Utrecht University, Utrecht, The Netherlands
Sergey Sosnovsky
London Knowledge Lab, London, UK
Rose Luckin
University of Leeds, Leeds, UK
Vania Dimitrova

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Masala, M., Ruseti, S., Dascalu, M., Dobre, C. (2021). Extracting and Clustering Main Ideas from Student Feedback Using Language Models. In: Roll, I., McNamara, D., Sosnovsky, S., Luckin, R., Dimitrova, V. (eds) Artificial Intelligence in Education. AIED 2021. Lecture Notes in Computer Science(), vol 12748. Springer, Cham. https://doi.org/10.1007/978-3-030-78292-4_23

Download citation

DOI: https://doi.org/10.1007/978-3-030-78292-4_23
Published: 11 June 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-78291-7
Online ISBN: 978-3-030-78292-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics