skip to main content
10.1145/3603287.3656160acmconferencesArticle/Chapter ViewAbstractPublication Pagesacm-seConference Proceedingsconference-collections
poster

An Open-Source Chatbot by Using ParlAI

Published:27 April 2024Publication History

ABSTRACT

While existing work shows a great success, the accuracy, the response time, the interoperability, the extendibility, etc. remain to be improved. We proposed "ParlAIDialogTeacher" using the open-source ParlAI to improve the accuracy, the response time, the interoperability, and the extendibility. Training is based on Maximum Likelihood Estimation (MLE) approach for the generative model. This platform eases sharing, training, and evaluating dialog models with multiple datasets available. Our results indicate the proposed model outperforms x, y, z in terms of response time and accuracy. Future work include extending the model with other genitive models such as GAN.

References

  1. Daniel Adiwardana, Minh-Thang Luong, David R So, Jamie Hall, Noah Fiedel, Romal Thoppilan, Zi Yang, Apoorv Kulshreshtha, Gaurav Nemade, Yifeng Lu, et al. 2020. "Towards a Human-like Open-domain Chatbot". arXiv preprint arXiv:2001.09977 (2020).Google ScholarGoogle Scholar
  2. Tianxing He, Jun Liu, Kyunghyun Cho, Myle Ott, Bing Liu, James Glass, and Fuchun Peng. 2019. "Analyzing the Forgetting Problem in The Pretrain-Finetuning of Dialogue Response Models". arXiv preprint arXiv:1910.07117 (2019).Google ScholarGoogle Scholar
  3. Margaret Li, Jason Weston, and Stephen Roller. 2019. "Acute-eval: Improved Dialogue Evaluation with Optimized Questions and Multi-turn Comparisons"'. arXiv preprint arXiv:1909.03087 (2019).Google ScholarGoogle Scholar
  4. Nitika Mathur, Timothy Baldwin, and Trevor Cohn. 2017. "Sequence Effects in Crowdsourced Annotations". In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. 2860--2865.Google ScholarGoogle ScholarCross RefCross Ref
  5. Alexander H Miller, Will Feng, Adam Fisch, Jiasen Lu, Dhruv Batra, Antoine Bordes, Devi Parikh, and Jason Weston. 2017. "ParlAI: A Dialog Research Software Platform". arXiv preprint arXiv:1705.06476 (2017).Google ScholarGoogle Scholar
  6. Myle Ott, Sergey Edunov, Alexei Baevski, and Angela Fan. 2019. "Fairseq: A Fast, Extensible Toolkit for Sequence Modeling". arXiv preprint arXiv:1904.01038 (2019).Google ScholarGoogle Scholar
  7. Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei, Ilya Sutskever, et al. 2019. "Language Models are Unsupervised Multitask Learners". OpenAI blog 1, 8 (2019), 9.Google ScholarGoogle Scholar
  8. Noam Shazeer and Mitchell Stern. 2018. "Adafactor: Adaptive Learning Rates with Sublinear Memory Cost". (2018), 4596--4604.Google ScholarGoogle Scholar
  9. Mohammad Shoeybi, Mostofa Patwary, Raul Puri, Patrick LeGresley, Jared Casper, and Bryan Catanzaro. 2019. "Megatron-lm: Training Multi-billion Parameter Language Models Using Model Parallelism". arXiv preprint arXiv:1909.08053 (2019).Google ScholarGoogle Scholar
  10. Heung-Yeung Shum, Xiao-dong He, and Di Li. 2018. "From Eliza to XiaoIce: Challenges and Opportunities with Social Chatbots". Frontiers of IT 19, 1 (2018).Google ScholarGoogle Scholar

Index Terms

  1. An Open-Source Chatbot by Using ParlAI

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        ACM SE '24: Proceedings of the 2024 ACM Southeast Conference
        April 2024
        337 pages
        ISBN:9798400702372
        DOI:10.1145/3603287

        Copyright © 2024 Owner/Author

        Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 27 April 2024

        Check for updates

        Qualifiers

        • poster
        • Research
        • Refereed limited

        Acceptance Rates

        ACM SE '24 Paper Acceptance Rate44of137submissions,32%Overall Acceptance Rate178of377submissions,47%
      • Article Metrics

        • Downloads (Last 12 months)10
        • Downloads (Last 6 weeks)10

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader