skip to main content
research-article

Subword Attentive Model for Arabic Sentiment Analysis: A Deep Learning Approach

Published:13 February 2020Publication History
Skip Abstract Section

Abstract

Social media data is unstructured data where these big data are exponentially increasing day to day in many different disciplines. Analysis and understanding the semantics of these data are a big challenge due to its variety and huge volume. To address this gap, unstructured Arabic texts have been studied in this work owing to their abundant appearance in social media Web sites. This work addresses the difficulty of handling unstructured social media texts, particularly when the data at hand is very limited. This intelligent data augmentation technique that handles the problem of less availability of data are used. This article has proposed a novel architecture for hand Arabic words classification and understands based on convolutional neural networks (CNNs) and recurrent neural networks. Moreover, the CNN technique is the most powerful for the analysis of Arabic tweets and social network analysis. The main technique used in this work is character-level CNN and a recurrent neural network stacked on top of one another as the classification architecture. These two techniques give 95% accuracy in the Arabic texts dataset.

References

  1. [n.d]. Twitter sentiment analysis. Retrieved June 22, 2019 from https://github.com/topics/twitter-sentiment-analysis.Google ScholarGoogle Scholar
  2. Abu Bakr Soliman, Kareem Eissa, and Samhaa R. El-Beltagy. 2017. Aravec: A set of arabic word embedding models for use in arabic nlp. Proc. Comput. Sci. 117 (2017), 256--265.Google ScholarGoogle ScholarCross RefCross Ref
  3. Anwar Alnawas and Nursal Arici. 2019. Sentiment analysis of Iraqi Arabic Dialect on Facebook based on distributed representations of documents. ACM Trans. Asian Low-Resour. Lang. Inf. Process. 18, 3, Article 20 (January 2019), 17 pages. DOI:https://doi.org/10.1145/3278605Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. T. Andrija, J. Predrag, and K. Vlado. 2006. n-Gram-based classification and unsupervised hierarchical clustering of genome sequences. Computer Methods and Programs in Biomedicine 81 (2006), 37--153. DOI:http://dx.doi.org/10.1016/j.cmpb.2005.11.007Google ScholarGoogle Scholar
  5. Albert Bifet and Eibe Frank. 2010. Sentiment knowledge discovery in twitter streaming data. In Proceedings of the 13th International Conference on Discovery Science (DS'10). Springer-Verlag, Berlin, Heidelberg, 1--15.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Changhua Yang, Kevin Hsin-Yih Lin, and Hsin-Hsi Chen. 2007. Emotion classification using Web blog corpora. In Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence (WI'07). IEEE Computer Society, USA, 275--278. DOI:https://doi.org/10.1109/WI.2007.50Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. F. Chollet. 2015. Keras. https://github.com/fchollet/keras.Google ScholarGoogle Scholar
  8. Dmitry Davidov, Oren Tsur, and Ari Rappoport. 2010. Enhanced sentiment learning using Twitter hashtags and smileys. In Proceedings of the 23rd International Conference on Computational Linguistics: Posters (COLING'10). Association for Computational Linguistics, USA, 241--249.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Changliang Li, Bo Xu, Gaowei Wu, Saike He, Guanhua Tian, and Hongwei Hao. 2014. Recursive deep learning for sentiment analysis over social data. In Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Vol. 02 (WI-IAT'14). IEEE Computer Society, USA, 180--185. DOI:https://doi.org/10.1109/WI-IAT.2014.96Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Junyoung Chung, Caglar Gulcehre, Kyunghyun Cho, and Yoshua Bengio. 2015. Gated feedback recurrent neural networks. In Proceedings of the 32nd International Conference on International Conference on Machine Learning - Volume 37 (ICML'15). JMLR.org, 2067--2075.Google ScholarGoogle Scholar
  11. Yoon Kim. 2014. Convolutional neural networks for sentence classification. Eprint Arxiv, 1746--1751.Google ScholarGoogle Scholar
  12. Kun-Lin Liu, Wu-Jun Li, and Minyi Guo. 2012. Emoticon smoothed language models for twitter sentiment analysis. In Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence (AAAI'12). AAAI Press, 1678--1684.Google ScholarGoogle Scholar
  13. Loai Alnemer, Bayan Alammouri, Jamal Alsakran, and Omar El Ariss. 2019. Enhanced classification of sentiment analysis of Arabic reviews. EIDWT 2019, LNDECT 29 (2019), 210--220.Google ScholarGoogle Scholar
  14. K. P. Murphy. 2012. Machine Learning: A Probabilistic Perspective. The MIT Press.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Naaima Boudad, Rdouan Faizi, Rachid Oulad Haj Thami, and Raddouane Chiheb. 2017. Sentiment analysis in arabic: A review of the literature. Ain Shams Engineering Journal 9, 4 (2017), 2479--2490.Google ScholarGoogle ScholarCross RefCross Ref
  16. Alexander Pak and Patrick Paroubek. 2010. Twitter as a Corpus for Sentiment Analysis and Opinion Mining. In Proceedings of the Language Resources and Evaluation Confrence (LREC'10). 1320--1326.Google ScholarGoogle Scholar
  17. J. R. Quinlan. 1986. Induction of decision trees. Machine Learning 1 (1986), 81--106. DOI:https://doi.org/10.1023/A:1022643204877Google ScholarGoogle ScholarCross RefCross Ref
  18. J. Ross Quinlan. 1993. C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. E. Refaee and V. Rieser. 2014. An Arabic Twitter corpus for subjectivity and sentiment analysis. In Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC'14). European Language Resources Association, 2268--2273.Google ScholarGoogle Scholar
  20. Giovanni Seni and John Elder. 2010. Ensemble Methods in Data Mining: Improving Accuracy Through Combining Predictions. Morgan and Claypool Publishers.Google ScholarGoogle Scholar
  21. S. A. Rupal Bhargava and Y. Sharma. 2018. Neural network based architecture for sentiment analysis in Indian languages. Journal of Intelligent Systems 7 (2018), 313--318.Google ScholarGoogle Scholar
  22. Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural Comput. 9, 8 (November 1997), 1735--1780.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Shubhi Mittal, Ashna Goel, and Rachna Jain. 2016. Sentiment analysis of ecommerce and social networking sites. In 3rd International Conference on Computing for Sustainable Global Development (INDIACom'16). 2300--2305.Google ScholarGoogle Scholar
  24. G. Sidorov. 2013. Syntactic dependency-based n-grams in rule based automatic English as second language grammar correction. International Journal of Computational Linguistics and Applications 4, 2 (2013), 169--188.Google ScholarGoogle Scholar
  25. Theresa Wilson, Janyce Wiebe, and Paul Hoffmann. 2005. Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing (HLT'05). Association for Computational Linguistics, USA, 347--354. DOI:https://doi.org/10.3115/1220575.1220619Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Rinalds Vīksna and Gints Jēkabsons. 2018. Sentiment analysis in Latvian and Russian: A survey. Applied Computer Systems 23, 1 (2018), 45--51.Google ScholarGoogle ScholarCross RefCross Ref
  27. Xiang Zhang, Junbo Zhao, and Yann LeCun. 2015. Character-level convolutional networks for text classification. In Proceedings of the 28th International Conference on Neural Information Processing Systems - Vol. 1 (NIPS'15). MIT Press, Cambridge, MA, USA, 649--657.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Zhun Zhong, Liang Zheng, Zhedong Zheng, Shaozi Li, and Yi Yang. 2019. CamStyle: A novel data augmentation method for person re-identification. IEEE Transactions on Image Processing 28, 3 (2019), 1176--1190.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Mourad Gridach and Hatem Haddad. 2017. Arabic named entity recognition: A bidirectional GRU-CRF approach. In Proceedings of the International Conference on Computational Linguistics and Intelligent Text Processing. 264--275.Google ScholarGoogle Scholar
  30. Majdi Beseiso. 2019. Word and character information aware neural model for emotional analysis. Recent Patents on Computer Science 12, 2, 142.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Subword Attentive Model for Arabic Sentiment Analysis: A Deep Learning Approach

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Asian and Low-Resource Language Information Processing
      ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 19, Issue 2
      March 2020
      301 pages
      ISSN:2375-4699
      EISSN:2375-4702
      DOI:10.1145/3358605
      Issue’s Table of Contents

      Copyright © 2020 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 13 February 2020
      • Accepted: 1 August 2019
      • Revised: 1 June 2019
      • Received: 1 April 2019
      Published in tallip Volume 19, Issue 2

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format