Abstract
Teachers often conduct surveys in their classes to gain insights into topics of interest. When analyzing surveys with open-ended responses, a teacher traditionally has to read the responses one by one, a labor-intensive and time-consuming process. We present a novel end-to-end context-aware framework that extracts, aggregates, and abbreviates the semantic patterns embedded in open-response survey data. Our framework uses a pre-trained natural language model to encode the textual data into semantic vectors. The encoded vectors are then clustered either into an optimally tuned number of groups or into a set of groups with pre-specified titles. We provide context-aware word clouds that highlight the semantically prominent keywords within each group. Honoring user privacy, we have built an on-device implementation of our framework suitable for real-time analysis on mobile devices and have tested it on a synthetic dataset. Our framework reduces costs at scale by automating the extraction of the most insightful pieces of information from survey data.
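To make the encode-cluster-summarize flow concrete, the following is a minimal sketch of the kind of pipeline the abstract describes. It assumes the sentence-transformers library with the all-MiniLM-L6-v2 encoder and scikit-learn's KMeans with silhouette-based tuning of the number of groups; the paper's exact encoder, clustering algorithm, and keyword-ranking method are not specified here, so these choices are illustrative stand-ins rather than the authors' implementation.

# Sketch of an encode-cluster-summarize pipeline for open-ended survey
# responses. Assumes sentence-transformers and scikit-learn; the authors'
# actual models and methods may differ.
from collections import Counter

from sentence_transformers import SentenceTransformer
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score

responses = [
    "More hands-on lab sessions would help me learn.",
    "The lectures move too fast to take notes.",
    "I enjoyed the group projects and peer feedback.",
    "Please slow down when covering new formulas.",
    "Group work was the best part of the course.",
    "Labs made the abstract concepts concrete.",
]

# 1. Encode open-ended responses into semantic vectors.
model = SentenceTransformer("all-MiniLM-L6-v2")
embeddings = model.encode(responses)

# 2. Tune the number of clusters via silhouette score (a stand-in for the
#    paper's "optimally tuned number of groups").
best_k, best_score = 2, -1.0
for k in range(2, min(6, len(responses))):
    labels = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(embeddings)
    score = silhouette_score(embeddings, labels)
    if score > best_score:
        best_k, best_score = k, score

labels = KMeans(n_clusters=best_k, n_init=10, random_state=0).fit_predict(embeddings)

# 3. Surface prominent keywords per cluster as word-cloud input. Simple
#    frequency counting is used here in place of the paper's context-aware
#    keyword ranking.
stopwords = {"the", "and", "to", "of", "a", "i", "was", "me", "my", "would", "when", "too"}
for cluster in range(best_k):
    words = Counter(
        w.strip(".,").lower()
        for idx, text in enumerate(responses) if labels[idx] == cluster
        for w in text.split()
        if w.strip(".,").lower() not in stopwords
    )
    print(f"Cluster {cluster}: {[w for w, _ in words.most_common(5)]}")

Running the sketch prints the top keywords per discovered group (e.g., a "pace of lectures" group versus a "group work" group), which is the kind of aggregated insight a teacher would otherwise have to extract by reading every response.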