Abstract
This study explores the implementation of Retrieval Augmented Large Language Models (LLMs) toward enhancing prospective student engagement during University open days. This work proposes the use of local, 4-bit quantised LLMs such as Microsoft Phi3, Meta’s LLaMa3, and Mistral AI’s Mistral to facilitate interactive dialogue about the Department of Computer Science at Nottingham Trent University. The proposed approaches are validated through the use of synthetic data generation via the RAGAS framework with additional expert human-in-the-loop oversight. We argue that the current state of the art which often involves ChatGPT as a sole validator is problematic, and we propose the use of an ensemble of multiple local validators that operate when a quorum is present to increase robustness. The results indicate that, while the chatbots are successful in providing the correct information, refining data relevance remains an open issue. Mistral demonstrated the highest performance in terms of information accuracy and coherence of responses, however, it was also the slowest at generating responses.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
https://www.ntu.ac.uk/course/computer-science, accessed: 11/06/2024.
- 2.
- 3.
Further details on RAGAS prompts are described at:
https://github.com/explodinggradients/ragas/blob/main/src/ragas/testset/prompts.py, accessed: 11/06/2024.
- 4.
https://github.com/langchain-ai/langchain, accessed: 17/06/2024.
References
Liu, F., Yang, X., Wang, Y.: An interactive chatbot for university open day. In: 2022 IEEE 13th International Conference on Software Engineering and Service Science (ICSESS), pp. 214–217. IEEE (2022)
Es, S., James, J., Espinosa-Anke, L., Schockaert, S.: Ragas: automated evaluation of retrieval augmented generation. arXiv preprint arXiv:2309.15217 (2023)
Suresh, K., Kackar, N., Schleck, L., Fanelli, C.: Towards a rag-based summarization agent for the electron-ion collider. arXiv preprint arXiv:2403.15729 (2024)
Foster, K.: Medical school choice: what influences applicants? Clin. Teach. 11(4), 307–310 (2014)
Sanson, B.: Spacevr: outreach out of this world! In: Wellington Faculty of Engineering Symposium (2023)
Bellotto, N., Rowland, S., Hu, H.: Lux-an interactive receptionist robot for university open days. In: 2008 IEEE/ASME International Conference on Advanced Intelligent Mechatronics, pp. 1355–1360. IEEE (2008)
Nguyen, T.T., Le, A.D., Hoang, H.T., Nguyen, T.: Neu-chatbot: chatbot for admission of national economics university. Comput. Educ.: Artif. Intell. 2, 100036 (2021)
Naik, I., Naik, D., Naik, N.: Demystifying the working, types, benefits and limitations of chatbots. In: UK Workshop on Computational Intelligence, pp. 558–566. Springer, Heidelberg (2023)
Bird, J.J., Ekárt, A., Faria, D.R.: Chatbot interaction with artificial intelligence: human data augmentation with t5 and language transformer ensemble for text classification. J. Ambient. Intell. Humaniz. Comput. 14(4), 3129–3144 (2023)
Dean, M., Bond, R.R., McTear, M.F., Mulvenna, M.D.: Chatpapers: an ai chatbot for interacting with academic research. In: 2023 31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS), pp. 1–7. IEEE (2023)
Kumar, V., et al.: Large-language-models (llm)-based ai chatbots: architecture, in-depth analysis and their performance evaluation. In: International Conference on Recent Trends in Image Processing and Pattern Recognition, pp. 237–249. Springer, Heidelberg (2023)
Abdin, M., et al.: Phi-3 technical report: a highly capable language model locally on your phone. arXiv preprint arXiv:2404.14219 (2024)
Meta. meta-llama/llama3. Meta (2024). https://github.com/meta-llama/llama3
Jiang, A.Q., et al.: Mistral 7b. arXiv preprint arXiv:2310.06825 (2023)
Team, G., et al.: Gemma: open models based on gemini research and technology. arXiv preprint arXiv:2403.08295 (2024)
Bai, J., et al.: Qwen technical report. arXiv preprint arXiv:2309.16609 (2023)
Wolf, T., et al.: Huggingface’s transformers: state-of-the-art natural language processing. arXiv preprint arXiv:1910.03771 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Shanthakumar, A.K., Fassihi-Tash, F., Lotfi, A., Bird, J.J. (2024). Retrieval Augmented Large Language Model Chatbots in Higher Education: A Study on University Open Days. In: Zheng, H., Glass, D., Mulvenna, M., Liu, J., Wang, H. (eds) Advances in Computational Intelligence Systems. UKCI 2024. Advances in Intelligent Systems and Computing, vol 1462. Springer, Cham. https://doi.org/10.1007/978-3-031-78857-4_3
Download citation
DOI: https://doi.org/10.1007/978-3-031-78857-4_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-78856-7
Online ISBN: 978-3-031-78857-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)