Automating the Conducting of Surveys Using Large Language Models

Tewari, Trevon; Hosein, Patrick

doi:10.1007/978-3-031-66705-3_9

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 2172))

Included in the following conference series:

International Conference on Deep Learning Theory and Applications

327 Accesses
1 Altmetric

Abstract

Conducting electronic surveys in remote areas can be challenging due to the lack of the required Internet infrastructure. Traditional phone surveys are typically used in such cases since mobile phones have become pervasive. However, this process can be time-consuming since a human is required to conduct the session and they must then upload responses to a database. We propose using Large Language Models (LLMs) to process an audio recording of the phone session to extract the responses and store them in a database. In our proof of concept humans were used to conduct the survey but further automation is planned whereby the phone session itself is carried out by a robot. We use OpenAI’s Whisper for the speech-to-text process and we then pass the text to a Large Language Model (GPT-4) which is prompted to extract the responses. The responses are then uploaded to a database. Finally we use an LLM to provide answers to questions about the survey responses. For multiple choice questions we obtained an accuracy score of 97%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 119.99; Price excludes VAT (USA)

Softcover Book: USD 84.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Jira: a Central Kurdish speech recognition system, designing and building speech corpus and pronunciation lexicon

Article 14 June 2022

Assembling Chinese-Mongolian Speech Corpus via Crowdsourcing

PlattForm: Parallel Spoken Corpus of Middle West German Dialects with Web-Based Interface

References

Achiam, J., et al.: GPT-4 technical report. arXiv preprint arXiv:2303.08774 (2023)
Baevski, A., Zhou, Y., Mohamed, A., Auli, M.: wav2vec 2.0: a framework for self-supervised learning of speech representations. In: Advances in Neural Information Processing Systems, vol. 33, 12449–12460 (2020)
Google Scholar
Chang, Y., et al.: A survey on evaluation of large language models. arXiv preprint arXiv:2307.03109 (2023)
Cheng, C., Lau, Y., Chan, L., Luk, J.W.: Prevalence of social media addiction across 32 nations: meta-analysis with subgroup analysis of classification schemes and cultural values. Addict. Behav. 117, 106845 (2021)
Article Google Scholar
De Koning, R., et al.: Survey fatigue during the Covid-19 pandemic: an analysis of neurosurgery survey response rates. Front. Surg. 8, 690680 (2021)
Article Google Scholar
Demombynes, G., Gubbins, P., Romeo, A.: Challenges and opportunities of mobile phone-based data collection: evidence from South Sudan. World Bank Policy Research Working Paper (2013)
Google Scholar
Fei, J., et al.: Automated chat application surveys using WhatsApp: evidence from panel surveys and a mode experiment. Econstor (2022)
Google Scholar
Feng, Y., Vanam, S., Cherukupally, M., Zheng, W., Qiu, M., Chen, H.: Investigating code generation performance of chat-GPT with crowdsourcing social data. In: Proceedings of the 47th IEEE Computer Software and Applications Conference, pp. 1–10 (2023)
Google Scholar
Floridi, L., Chiriatti, M.: GPT-3: its nature, scope, limits, and consequences. Mind. Mach. 30, 681–694 (2020)
Article Google Scholar
Gourlay, S., Kilic, T., Martuscelli, A., Wollburg, P., Zezza, A.: High-frequency phone surveys on Covid-19: good practices, open questions. Food Policy 105, 102153 (2021)
Article Google Scholar
Jansen, B.J., Jung, S., Salminen, J.: Employing large language models in survey research. Nat. Lang. Process. J. 4, 100020 (2023)
Article Google Scholar
Kasirye, F.: Errors in survey research and their threat to validity and reliability. International Islamic University, Selangor, Malaysia (2021)
Google Scholar
Kasneci, E., et al.: Chatgpt for good? On opportunities and challenges of large language models for education. Learn. Individ. Differ. 103, 102274 (2023)
Article Google Scholar
Kastelic, K.H., Eckman, S., Kastelic, J.G., Mcgee, K.R., et al.: High frequency mobile phone surveys of households to assess the impacts of Covid-19 (vol. 2): guidelines on sampling design. Policy Commons (2020)
Google Scholar
Kelfve, S., Kivi, M., Johansson, B., Lindwall, M.: Going web or staying paper? The use of web-surveys among older people. BMC Med. Res. Methodol. 20, 1–12 (2020)
Article Google Scholar
Kocoń, J., et al.: ChatGPT: jack of all trades, master of none. Inf. Fusion 101861 (2023)
Google Scholar
Olson, K., et al.: Transitions from telephone surveys to self-administered and mixed-mode surveys: AAPOR task force report. J. Surv. Stat. Methodol. 9(3), 381–411 (2021)
Article Google Scholar
Radford, A., Kim, J.W., Xu, T., Brockman, G., McLeavey, C., Sutskever, I.: Robust speech recognition via large-scale weak supervision. In: International Conference on Machine Learning, pp. 28492–28518. PMLR (2023)
Google Scholar
Schaeffer, N.C., Dykema, J.: Advances in the science of asking questions. Ann. Rev. Sociol. 46, 37–60 (2020)
Article Google Scholar
Sinkowitz-Cochran, R.L.: Survey design: to ask or not to ask? That is the question. Clin. Infect. Dis. 56(8), 1159–1164 (2013)
Article Google Scholar
Wang, J., Dong, Y.: Measurement of text similarity: a survey. Information 11(9), 421 (2020)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, The University of the West Indies, St. Augustine, Trinidad
Trevon Tewari & Patrick Hosein

Authors

Trevon Tewari
View author publications
You can also search for this author in PubMed Google Scholar
Patrick Hosein
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Trevon Tewari .

Editor information

Editors and Affiliations

Instituto de Telecomunicações and University of Lisbon, Lisbon, Portugal
Ana Fred
LIAS, LIAS/ENSMA, Poitiers, France
Allel Hadjali
Ford Motor Company, Dearborn, MI, USA
Oleg Gusikhin
University of Naples Federico II, Naples, Italy
Carlo Sansone

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tewari, T., Hosein, P. (2024). Automating the Conducting of Surveys Using Large Language Models. In: Fred, A., Hadjali, A., Gusikhin, O., Sansone, C. (eds) Deep Learning Theory and Applications. DeLTA 2024. Communications in Computer and Information Science, vol 2172. Springer, Cham. https://doi.org/10.1007/978-3-031-66705-3_9

Download citation

DOI: https://doi.org/10.1007/978-3-031-66705-3_9
Published: 21 August 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-66704-6
Online ISBN: 978-3-031-66705-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Automating the Conducting of Surveys Using Large Language Models