Abstract
This paper introduces a new method for developing an Arabic chatbot, utilizing the encoder-decoder architecture enriched with BERT embeddings. Our distinct dataset, constructed manually, aids the model in comprehending intricate questions and prompts, thereby generating coherent and contextually accurate responses in Arabic. The dataset comprises 81,659 manually created conversation pairs. Our model successfully delivered the anticipated answers. We employed a model with a warm-start using the BERT2BERT encoder and decoder. It achieved a BLEU score of 3.52 and a PPL of 36.3.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Luger, E., Sellen, A.: “Like having a really bad PA”: the gulf between user expectation and experience of conversational agents. In: Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (2016)
Habash, N.: Introduction to Arabic natural language processing. In: Introduction to Arabic Natural Language Processing (2010)
Hasanuzzaman, H.: Arabic language: characteristics and importance. Echo. J. Humanit. Soc. Sci. 1(3), 11–16 (2013)
Ahmed, A., Ali, N., Alzubaidi, M.S., Zaghouani, W., Abd-alrazaq, A.A., Househ, M.J.: Arabic chatbot technologies: a scoping review. Comput. Methods Programs Biomed. Update (2022)
Reimers, N., Gurevych, I.: Sentence-BERT: sentence embeddings using siamese BERT-networks. In: Conference on Empirical Methods in Natural Language Processing (2019)
Yu, S.Y., Chen, Y., Zaidi, H.: AVA: a financial service chatbot based on deep bidirectional transformers. Front. Appl. Math. Stat. 7, 604842 (2020)
Ezen-Can, A.: A Comparison of LSTM and BERT for Small Corpus. arXiv abs/2009.05451 (2020)
Amer, E., Hazem, A., Farouk, O., Louca, A., Mohamed, Y., Ashraf, M.: A proposed chatbot framework for COVID-19. In: 2021 International Mobile, Intelligent, and Ubiquitous Computing Conference (MIUCC), pp. 263–268 (2021)
Kanodia, N., Ahmed, K., Miao, Y.: Question answering model based conversational chatbot using BERT model and google dialogflow. In: 2021 31st International Telecommunication Networks and Applications Conference (ITNAC), pp. 19–22 (2021)
Dong, L., et al.: Unified language model pre-training for natural language understanding and generation. In: Neural Information Processing Systems (2019)
Park, H., Moon, G., Kim, K.S.: Classification of covid-19 symptom for chatbot using bert (2021)
Yoo, S., Jeong, O.: An intelligent chatbot utilizing BERT model and knowledge graph. J. Soc. e-Bus. Stud. 24(3) (2020)
Varitimiadis, S., Kotis, K., Pittou, D., Konstantakis, G.: Graph-based conversational AI: towards a distributed and collaborative multi-chatbot approach for museums. Appl. Sci. (2021)
Fuad, A., Al-Yahya, M.M.: Recent developments in arabic conversational AI: a literature review. IEEE Access 10, 23842–23859 (2022)
El-Ramly, M., et al.: CairoDep: detecting depression in Arabic posts using BERT transformers. In: 2021 Tenth International Conference on Intelligent Computing and Information Systems (ICICIS), pp. 207–212 (2021)
Helwe, C., Dib, G., Shamas, M., Elbassuoni, S.: A semi-supervised BERT approach for Arabic named entity recognition. In: Workshop on Arabic Natural Language Processing (2020)
Gaanoun, K., Benelallam, I.: Arabic dialect identification: an Arabic-BERT model with data augmentation and ensembling strategy. In: Workshop on Arabic Natural Language Processing (2020)
Jarrar, M., Khalilia, M., Ghanem, S.: Wojood: nested Arabic named entity corpus and recognition using BERT. In: International Conference on Language Resources and Evaluation (2022)
Ghaddar, A., et al.: JABER: junior Arabic BERt. arXiv abs/2112.04329 (2021)
Almaliki, M., Almars, A.M., Gad, I., Atlam, E.: ABMM: Arabic BERT-mini model for hate-speech detection on social media. Electronics 12(4), 1048 (2023)
Chouikhi, H., Chniter, H., Jarray, F.: Arabic sentiment analysis using BERT model. In: International Conference on Computational Collective Intelligence (2021)
Talafha, B., et al.: Multi-dialect Arabic BERT for Country-level Dialect Identification. arXiv abs/2007.05612 (2020)
Boussakssou, M., Ezzikouri, H., Erritali, M.: Chatbot in Arabic language using seq to seq model. Multimedia Tools Appl. 81, 2859–2871 (2021)
Antoun, W., Baly, F., Hajj, H.M.: AraELECTRA: Pre-Training Text Discriminators for Arabic Language Understanding. arXiv abs/2012.15516 (2020)
Naous, T., Antoun, W., Mahmoud, R.A., Hajj, H.M.: Empathetic BERT2BERT conversational model: learning Arabic language generation with little data. In: Workshop on Arabic Natural Language Processing (2021)
Rothe, S., Narayan, S., Severyn, A.: Leveraging pre-trained checkpoints for sequence generation tasks. Trans. Assoc. Comput. Linguist. 8, 264–280 (2019)
Alruqi, T.N., Alzahrani, S.M.: Evaluation of an Arabic Chatbot Based on Extractive Question-Answering Transfer Learning and Language Transformers. AI (2023)
Roller, S., et al.: Recipes for building an open-domain chatbot. In: Conference of the European Chapter of the Association for Computational Linguistics (2020)
Alammary, A.S.: BERT models for Arabic text classification: a systematic review. Appl. Sci. 12(11), 5720 (2022)
Abuzayed, A., Al-Khalifa, H.S.: BERT for Arabic topic modeling: an experimental study on BERTopic technique. In: International Conference on Arabic Computational Linguistics (2021)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Boussakssou, M., Erritali, M. (2024). An Arabic Chatbot Leveraging Encoder-Decoder Architecture Enhanced with BERT. In: Santosh, K., et al. Recent Trends in Image Processing and Pattern Recognition. RTIP2R 2023. Communications in Computer and Information Science, vol 2026. Springer, Cham. https://doi.org/10.1007/978-3-031-53082-1_21
Download citation
DOI: https://doi.org/10.1007/978-3-031-53082-1_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-53081-4
Online ISBN: 978-3-031-53082-1
eBook Packages: Computer ScienceComputer Science (R0)