Comparative Analysis of Deep Learning Models for Myanmar Text Classification

Phyu, Myat Sapal; Nwet, Khin Thandar

doi:10.1007/978-3-030-41964-6_7

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12033))

Included in the following conference series:

Asian Conference on Intelligent Information and Database Systems

1353 Accesses
2 Citations

Abstract

Text classification is one of the major research areas for Natural Language Processing (NLP). Long Short Term Memory (LSTM), Convolutional Neural Networks (CNN), and their combination models have been applied in many NLP tasks. This paper presents a joint CNN with no max-polling layer and Bidirectional LSTM to fulfill the requirements of each model. The proposed model takes advantage of CNN to extract features and Bi-LSTM to capture long term contextual information from past and future contexts. The proposed model is compared with CNN, Bi-LSTM, RNN, and CNN-LSTM models with pre-trained word embedding on five article datasets in Myanmar language.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Aye, Y.M., Aung, S.S.: Enhanced sentiment classification for informal Myanmar text of restaurant reviews. In: 16th International Conference on Software Engineering Research, Management and Applications (SERA), pp. 31–36. IEEE (2018). https://doi.org/10.1109/SERA.2018.8477231
Conneau, A., Schwenk, H., Barrault, L., Lecun, Y.: Very deep convolutional networks for text classification. In: The European Chapter of the Association for Computational Linguistics, EACL 2017 (2017). https://doi.org/10.18653/v1/e17-1104
Grave, E., Bojanowski, P., Gupta, P., Joulin, A., Mikolov, T.: Learning word vectors for 157 languages. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation, LREC-2018 (2018)
Google Scholar
Hassan, A., Mahmood, A.: Convolutional recurrent deep learning model for sentence classification. IEEE Access 6, 13949–13957 (2018). https://doi.org/10.1109/ACCESS.2018.2814818
Article Google Scholar
Heinzerling, B., Michael, S.: BPEmb: tokenization-free pre-trained sub-word embeddings in 275 languages. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation, LREC-2018, pp. 31–36 (2018). https://doi.org/10.11588/data/V9CXPR
Joulin, A., Grave, E., Bojanowski, P., Mikolov, T.: Bag of tricks for efficient text classification. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, pp. 427–431 (2017). https://doi.org/10.18653/v1/e17-2068
Khine, A.H., Nwet, K.T., Soe, K.M.: Automatic Myanmar news classification. In: 15th International Conference on Computer Applications, ICCA 2017, pp. 401–408 (2017)
Google Scholar
Kim, Y.: Convolutional neural networks for sentence classification. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1746–1751 (2014). https://doi.org/10.3115/v1/D14-1181
Kyaw, T.N., Nyo, N.N.: Myanmar spam filtering based on Naïve Bayesian learning algorithm (MSFNBLA). In: 14th International Conference on Computer Applications, ICCA 2016 (2016)
Google Scholar
Lai, S., Liheng, X., Kang, L., Jun, Z.: Recurrent convolutional neural networks for text classification. In: The Twenty-Ninth AAAI Conference on Artificial Intelligence (2015)
Google Scholar
Mon, A.N., Pa, W.P., Thu, Y.K.: Exploring the effect of tones for Myanmar language speech recognition using convolutional neural network (CNN). In: Hasida, K., Pa, W.P. (eds.) PACLING 2017. CCIS, vol. 781, pp. 314–326. Springer, Singapore (2018). https://doi.org/10.1007/978-981-10-8438-6_25
Chapter Google Scholar
Phyu, S.P., Nwet, K.T.: Article classification in Myanmar language. In: The Proceeding of 2019 International Conference on Advanced Information Technologies (ICAIT), pp. 188–193. IEEE (2019). https://doi.org/10.1109/AITC.2019.8920927
Song, X., Petrak, J., Roberts, A.: A deep neural network sentence level classification method with context information. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 900–904 (2018). https://doi.org/10.18653/v1/D18-1107
Yu, T., Nwet, K.T.: Annotation and sentiment analysis for Myanmar news. In: 16th International Conferences on Computer Applications, ICCA 2018
Google Scholar
Zhang, X., Zhao, J., LeCun, Y.: Character-level convolutional networks for text classification. In: Advances in Neural Information Processing Systems, pp. 649–657 (2015)
Google Scholar

Download references

Acknowledgement

We deeply thank the anonymous reviewers for sharing their precious time to check our manuscript. We greatly thank the researchers who released pre-trained vectors publicly and these resources helpful for low resources languages. We greatly thank the friends who assist to collect and annotate Myanmar text datasets.

Author information

Authors and Affiliations

Faculty of Computer Science, University of Information Technology, Yangon, Myanmar
Myat Sapal Phyu & Khin Thandar Nwet

Authors

Myat Sapal Phyu
View author publications
You can also search for this author in PubMed Google Scholar
Khin Thandar Nwet
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Myat Sapal Phyu or Khin Thandar Nwet .

Editor information

Editors and Affiliations

Department of Applied Informatics, Wrocław University of Science and Technology, Wrocław, Poland
Ngoc Thanh Nguyen
King Mongkut's Institute of Technology Ladkrabang, Bangkok, Thailand
Kietikul Jearanaitanakij
Faculty of Computer Science and Information, University Teknologi Malaysia, Kuala Lumpur, Malaysia
Ali Selamat
Department of Applied Informatics, Wrocław University of Science and Technology, Wrocław, Poland
Bogdan Trawiński
King Mongkut's Institute of Technology Ladkrabang, Bangkok, Thailand
Suphamit Chittayasothorn

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Phyu, M.S., Nwet, K.T. (2020). Comparative Analysis of Deep Learning Models for Myanmar Text Classification. In: Nguyen, N., Jearanaitanakij, K., Selamat, A., Trawiński, B., Chittayasothorn, S. (eds) Intelligent Information and Database Systems. ACIIDS 2020. Lecture Notes in Computer Science(), vol 12033. Springer, Cham. https://doi.org/10.1007/978-3-030-41964-6_7

Download citation

DOI: https://doi.org/10.1007/978-3-030-41964-6_7
Published: 04 March 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-41963-9
Online ISBN: 978-3-030-41964-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics