skip to main content
10.1145/3581754.3584159acmconferencesArticle/Chapter ViewAbstractPublication PagesiuiConference Proceedingsconference-collections
poster

A Vietnamese Spelling Correction System

Published:27 March 2023Publication History

ABSTRACT

This paper presents a new Vietnamese spelling correction system that allows users to correct spelling errors in their text. Our system is an interactive writing assistant that integrates advanced technologies in natural language processing to (i) identify spelling errors and (ii) replace those errors with their corrected version. To the best of our knowledge, our system is the first Vietnamese spelling correction tool that interacts with the users via Web Interface, Microsoft Word and Chrome extensions to provide the best user experience. We also perform automatic and human evaluations to demonstrate the effectiveness of our system. Our system is publicly available at https://grammar.vinai.io/.

References

  1. Christopher Bryant, Mariano Felice, and Ted Briscoe. 2017. Automatic Annotation and Evaluation of Error Types for Grammatical Error Correction. In Proceedings of ACL. 793–805.Google ScholarGoogle ScholarCross RefCross Ref
  2. Kenneth W Church and William A Gale. 1991. Probability scoring for spelling correction. Statistics and Computing 1, 2 (1991), 93–103.Google ScholarGoogle ScholarCross RefCross Ref
  3. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of NAACL-HLT. 4171–4186.Google ScholarGoogle Scholar
  4. Dinh-Truong Do, Ha Thanh Nguyen, Thang Ngoc Bui, and Hieu Dinh Vo. 2021. VSEC: Transformer-Based Model for Vietnamese Spelling Correction. In Proceedings of PRICAI. 259–272.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Mariano Felice, Christopher Bryant, and Ted Briscoe. 2016. Automatic Extraction of Learner Errors in ESL Sentences Using Linguistically Enhanced Alignments. In Proceedings of COLING. 825–835.Google ScholarGoogle Scholar
  6. Claudia Leacock, Martin Chodorow, Michael Gamon, and Joel Tetreault. 2010. Automated grammatical error detection for language learners. Synthesis lectures on human language technologies 3, 1(2010), 1–134.Google ScholarGoogle Scholar
  7. Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. 2019. RoBERTa: A Robustly Optimized BERT Pretraining Approach. arXiv preprint arXiv:1907.11692 (2019).Google ScholarGoogle Scholar
  8. Ritika Mishra and Navjot Kaur. 2013. A survey of spelling error detection and correction techniques. International Journal of Computer Trends and Technology 4, 3 (2013), 372–374.Google ScholarGoogle Scholar
  9. Trung Hieu Ngo, Ham Duong Tran, Tin Huynh, and Kiem Hoang. 2022. A Combination of BERT and Transformer for Vietnamese Spelling Correction. In Proceedings of ACIIDS, Part I. 545–558.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Dat Quoc Nguyen and Anh Tuan Nguyen. 2020. PhoBERT: Pre-trained language models for Vietnamese. In Findings of EMNLP. 1037–1042.Google ScholarGoogle Scholar
  11. Dat Quoc Nguyen, Thanh Vu, Dai Quoc Nguyen, Mark Dras, and Mark Johnson. 2017. From Word Segmentation to POS Tagging for Vietnamese. In Proceedings of ALTA. 108–113.Google ScholarGoogle Scholar
  12. Ha Thanh Nguyen, Tran Binh Dang, and Le Minh Nguyen. 2019. Deep Learning Approach for Vietnamese Consonant Misspell Correction. In Proceedings of PACLING. 497–504.Google ScholarGoogle Scholar
  13. Linh Thuy Nguyen, Ban Phuoc Dao, Duc-Vu Nguyen, and Ngan Luu-Thuy Nguyen. 2020. Vietnamese Context-Sensitive Malicious Spelling Error Correction. In Proceedings of NICS. 48–53.Google ScholarGoogle ScholarCross RefCross Ref
  14. Phuong H. Nguyen, Thuan D. Ngo, Dung A. Phan, Thu P. T. Dinh, and Thang Q. Huynh. 2008. Vietnamese spelling detection and correction using Bi-gram, Minimum Edit Distance, SoundEx algorithms with some additional heuristics. In Proceedings of RIVF. 96–102.Google ScholarGoogle ScholarCross RefCross Ref
  15. Thien Hai Nguyen, Tuan-Duy H Nguyen, Duy Phung, Duy Tran-Cong Nguyen, Hieu Minh Tran, Manh Luong, Tin Duy Vo, Hung Hai Bui, Dinh Phung, and Dat Quoc Nguyen. 2022. A Vietnamese-English Neural Machine Translation System. In Proceedings INTERSPEECH. 5543–5544.Google ScholarGoogle Scholar
  16. Kostiantyn Omelianchuk, Vitaliy Atrasevych, Artem Chernodub, and Oleksandr Skurzhanskyi. 2020. GECToR – Grammatical Error Correction: Tag, Not Rewrite. In Proceedings of BEA. 163–170.Google ScholarGoogle Scholar
  17. Rico Sennrich, Barry Haddow, and Alexandra Birch. 2016. Neural Machine Translation of Rare Words with Subword Units. In Proceedings of ACL. 1715–1725.Google ScholarGoogle ScholarCross RefCross Ref
  18. Dong Nguyen Tien, Tuoi Tran Thi Minh, Loi Le Vu, and Tuan Dang Minh. 2022. Vietnamese Spelling Error Detection and Correction Using BERT and N-gram Language Model. In Proceedings of ICISN. 427–436.Google ScholarGoogle ScholarCross RefCross Ref
  19. Hieu Tran, Cuong V. Dinh, Long Phan, and Son T. Nguyen. 2021. Hierarchical Transformer Encoders for Vietnamese Spelling Correction. In Proceedings of IEA/AIE. 547–556.Google ScholarGoogle Scholar
  20. Nguyen Luong Tran, Duong Minh Le, and Dat Quoc Nguyen. 2022. BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese. In Proceedings of INTERSPEECH. 1751–1755.Google ScholarGoogle ScholarCross RefCross Ref
  21. Tin Duy Vo, Manh Luong, Duong Minh Le, Hieu Tran, Nhan Do, Tuan-Duy H. Nguyen, Thien Nguyen, Hung Hai Bui, Dat Quoc Nguyen, and Dinh Phung. 2022. Vietnamese Speech-Based Question Answering over Car Manuals. In Companion Proceedings of IUI. 117–119.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Yu Wang, Yuelin Wang, Kai Dang, Jie Liu, and Zhuo Liu. 2021. A Comprehensive Survey of Grammatical Error Correction. ACM Trans. Intell. Syst. Technol. 12, 5, Article 65(2021).Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Shaohua Zhang, Haoran Huang, Jicong Liu, and Hang Li. 2020. Spelling Error Correction with Soft-Masked BERT. In Proceedings of ACL. 882–890.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. A Vietnamese Spelling Correction System

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          IUI '23 Companion: Companion Proceedings of the 28th International Conference on Intelligent User Interfaces
          March 2023
          266 pages
          ISBN:9798400701078
          DOI:10.1145/3581754

          Copyright © 2023 Owner/Author

          Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 27 March 2023

          Check for updates

          Qualifiers

          • poster
          • Research
          • Refereed limited

          Acceptance Rates

          Overall Acceptance Rate746of2,811submissions,27%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format .

        View HTML Format