ABSTRACT
Potentialized by Natural Language Processing (NLP) technology, we can build a chatbot or an AI Agent to automatically address the need to automatically get credible and timely information, especially in the fight against epidemics. However, Vietnamese understanding is still a big challenge for NLP. This paper introduces an AI Agent using the Attention algorithm and Albert model to implement the question/answering task in the Covid-19 field for the Vietnamese language. In the end, we also built two other modules, one for Vietnamese diacritic auto-correction and another for updating Covid-19 statistics (using RASA framework), to deploy a Covid-19 chatbot application on mobile devices.
- Sumo. 2020. 5 Ecommerce Chatbots (Plus How To Build Your Own In 15 Minutes). https://sumo.com/stories/ecommerce-chatbot-marketing. (2020). (Accessed on 01/08/2020). [60] TechCrunch. 20Google Scholar
- WHO Health Alert brings COVID-19 facts to billions via WhatsApp. (2020, March 20). Retrieved November 30, 2020, from https://web.archive.org/web/20200323042822/https://www.who.int/news-room/feature-stories/detail/who-health-alert-brings-covid-19-facts-to-billions-via-whatsappGoogle Scholar
- Halpern, J., 2020. IS VIETNAMESE A HARD LANGUAGE?. [online] Kanji.org. Available at: <http://www.kanji.org/kanji/jack/vietnamese/is_VN_hard_sum_EN_VN.pdf>[Accessed 30 November 2020].Google Scholar
- Mikolov, Tomas & Sutskever, Ilya & Chen, Kai & Corrado, G.s & Dean, Jeffrey. (2013). Distributed Representations of Words and Phrases and their Compositionality. arXiv:1310.4546.Google Scholar
- Pennington, C. (2014). GloVe: Global Vectors for Word Representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) (pp. 1532–1543). Association for Computational Linguistics.Google ScholarCross Ref
- Joulin, A.; Grave, E.; Bojanowski, P.; Douze, M.; Jegou, H.; and Mikolov, T. 2016. Fasttext.zip: 'Compressing text classification models. arXiv preprint. arXiv:1612.03651.Google Scholar
- Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long Short-Term Memory. Neural Comput. 9, 8 (November 15, 1997), 1735–1780. DOI:https://doi.org/10.1162/neco.1997.9.8.1735.Google ScholarDigital Library
- Mandic, Danilo & Chambers, Jonathon. (2001). Recurrent Neural Networks for Prediction: Learning Algorithms,Architectures and Stability. 10.1002/047084535X.Google ScholarCross Ref
- Peters, L. (2018). Deep Contextualized Word Representations. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers) (pp. 2227–2237). Association for Computational Linguistics.Google ScholarCross Ref
- Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems, pages 6000–6010.Google Scholar
- Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805.Google Scholar
- Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei, and Ilya Sutskever. 2019. Language models are unsupervised multitask learners. Technical report, OpenAI.Google Scholar
- Howard, S. (2018). Universal Language Model Fine-tuning for Text Classification. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp. 328–339). Association for Computational Linguistics.Google ScholarCross Ref
- Ponnusamy, Pragaash; Roshan Ghias, Alireza; Guo, Chenlei; Sarikaya, Ruhi (2020). Feedback-Based Self-Learning in Large-Scale Conversational AI Agents. Proceedings of the AAAI Conference on Artificial Intelligence, 34(08), 13180–13187. doi:10.1609/aaai.v34i08.7022.Google ScholarCross Ref
- Vakili Tahami, Amir & Shakery, Azadeh. (2019). Enriching Conversation Context in Retrieval-based AI agents. arXiv preprint arXiv:1911.02290.Google Scholar
- Sugisaki, Kyoko. (2019). Chat-Bot-Kit: A web-based tool to simulate text-based interactions between humans and with computers. arXiv preprint arXiv:1911.00665Google Scholar
- Zhou, K., Zhang, K., Wu, Y., Liu, S., Yu, J.: Unsupervised context rewriting for open domain conversation. In: EMNLP-IJCNLP 2019, Hong Kong, China, 3–7 November 2019, pp. 1834–1844 (2019).Google Scholar
- B. Ruf, M. Sammarco and M. Detyniecki, "Contract Statements Knowledge Service for Chatbots," 2019 IEEE International Conference on Systems, Man and Cybernetics (SMC), Bari, Italy, 2019, pp. 3180-3185, doi: 10.1109/SMC.2019.8913955.Google ScholarDigital Library
- Keyner, Sophia & Savenkov, Vadim & Vakulenko, Svitlana. (2019). Open Data AI agent.Google Scholar
- https://github.com/undertheseanlp/AI agent [Access 25 May 2020].Google Scholar
- Pham, Phuong. "NNVLP: A Neural Network-Based Vietnamese Language Processing Toolkit." . In Proceedings of the IJCNLP 2017, System Demonstrations (pp. 37–40). Association for Computational Linguistics, 2017.Google Scholar
- Nguyen, T., & Shcherbakov, M. (2018). A Neural Network based Vietnamese Chatbot. 2018 International Conference on System Modeling & Advancement in Research Trends (SMART). doi: 10.1109/sysmart.2018.8746962.Google ScholarCross Ref
- Nhu, T. V., & Sawada, H. (2018). Development of Vietnamese Voice Chatbot with Emotion Expression. 2018 International Symposium on Micro-NanoMechatronics and Human Science (MHS). doi: 10.1109/mhs.2018.8886954.Google ScholarDigital Library
- Nguyen, D. Q., Nguyen, D. Q., & Pham, S. B. (2012). A Vietnamese Text-Based Conversational Agent. Lecture Notes in Computer Science, 699–708. doi:10.1007/978-3-642-31087-4_71.Google ScholarDigital Library
- https://github.com/binhvq/news-corpus?fbclid=IwAR2RiYaYlpyo_TcBwKmeFGRs5ywAG46lbLzjR7E2p3pbWvBpzFq0IliJr74#full-txttitle–description–body-v1[Accessed 25 May 2020]Google Scholar
- https://github.com/ngoanpv/albert_vi [Accessed 30 November 2020].Google Scholar
Recommendations
An embedding method for unseen words considering contextual information and morphological information
SAC '21: Proceedings of the 36th Annual ACM Symposium on Applied ComputingThe performance1 of natural language processing has been greatly improved through the pre-trained language models, which are trained with a large amount of corpus. But the performance of natural language processing can be reduced by the OOV (Out of ...
Annotating the world wide web using natural language
RIAO '97: Computer-Assisted Information Searching on InternetThis paper describes the START Information Server built at the MIT Artificial Intelligence Laboratory. Available on the World Wide Web since December 1993, the START Server provides users with access to multi-media information in response to questions ...
Understanding Human Language: Can NLP and Deep Learning Help?
SIGIR '16: Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information RetrievalThere is a lot of overlap between the core problems of information retrieval (IR) and natural language processing (NLP). An IR system gains from understanding a user need and from understanding documents, and hence being able to determine whether a ...
Comments