research-article

FU Covid-19 AI Agent built on Attention algorithm using a combination of Transformer, ALBERT model, and RASA framework

Authors:
Ban Quy Tran (FPT University, Vietnam)
Thai Van Nguyen (FPT University, Vietnam)
Thang Duc Phung (FPT University, Vietnam)
Viet Tan Nguyen (FPT University, Vietnam)
Dat Duy Tran (FPT University, Vietnam)
Son Tung Ngo (FPT University, Vietnam)

ICSCA '21: Proceedings of the 2021 10th International Conference on Software and Computer Applications
February 2021
Pages 22–31
https://doi.org/10.1145/3457784.3457788

Published: 30 July 2021

ABSTRACT

Natural Language Processing (NLP) technology makes it possible to build a chatbot, or AI Agent, that automatically delivers credible and timely information, which is especially valuable in the fight against epidemics. However, understanding Vietnamese remains a major challenge for NLP. This paper introduces an AI Agent that uses the Attention algorithm and the ALBERT model to perform question answering about Covid-19 in Vietnamese. We also built two additional modules, one for automatic correction of Vietnamese diacritics and another for updating Covid-19 statistics (using the RASA framework), and used them to deploy a Covid-19 chatbot application on mobile devices.
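As a rough illustration of the Attention algorithm named in the abstract, the following is a minimal sketch of scaled dot-product attention, softmax(QK^T / sqrt(d_k)) V, as defined for the Transformer. It is not the authors' implementation: the function names and the toy inputs are hypothetical, and a practical system would rely on a tensor library and a pre-trained ALBERT checkpoint rather than plain Python lists.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(queries, keys, values):
    """Compute softmax(Q K^T / sqrt(d_k)) V for small list-based inputs.

    queries: list of vectors of length d_k
    keys:    list of vectors of length d_k
    values:  list of vectors of length d_v (one per key)
    Returns (outputs, weights), one row per query.
    """
    d_k = len(keys[0])
    scale = math.sqrt(d_k)
    d_v = len(values[0])
    outputs, weights = [], []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / scale for k in keys]
        row = softmax(scores)
        weights.append(row)
        # Weighted average of the value vectors.
        outputs.append(
            [sum(w * v[j] for w, v in zip(row, values)) for j in range(d_v)]
        )
    return outputs, weights


# Toy inputs, assumed purely for illustration.
Q = [[1.0, 0.0]]
K = [[1.0, 0.0], [0.0, 1.0]]
V = [[10.0, 0.0], [0.0, 10.0]]
out, w = scaled_dot_product_attention(Q, K, V)
```

In this toy case the query matches the first key more closely, so the first value vector receives the larger weight and dominates the output.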

Published in

ICSCA '21: Proceedings of the 2021 10th International Conference on Software and Computer Applications
February 2021
325 pages
ISBN:9781450388825
DOI:10.1145/3457784

Copyright © 2021 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
AI Agent
Albert
Application
Attention
Covid-19
Natural Language Processing
Question Answering
Rasa
Transformer
Qualifiers
- research-article
- Research
- Refereed limited