DOI: 10.1145/3328833.3328874
research-article

Building a Sentiment Analysis system using automatically generated training Dataset

Published: 09 April 2019

ABSTRACT

In this paper, we describe a procedure for extracting annotated positive and negative Arabic tweets. We use the extracted tweets to build a sentiment analysis system based on Naive Bayes with TF-IDF enhancement. A large training set is necessary for a highly inflected language, because it compensates for the data sparseness that such languages exhibit. We present our techniques and explain our experimental system. We automatically collect 200 thousand annotated tweets. The evaluation shows that our sentiment analysis system achieves high precision and accuracy compared to existing systems.
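The abstract does not give implementation details, so the following is only a minimal sketch of what "Naive Bayes with TF-IDF enhancement" could mean: a multinomial Naive Bayes classifier whose per-class term counts are weighted by TF-IDF. The function names, the whitespace tokenizer, the smoothed IDF formula, the Laplace smoothing constant, and the toy English examples are all assumptions made for illustration; they are not the authors' method or data.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Assumption: plain whitespace tokenization. Real Arabic tweets would
    # need normalization (diacritics, elongation, hashtags, and so on).
    return text.lower().split()


def train(docs, labels, alpha=1.0):
    """Fit a multinomial Naive Bayes model on TF-IDF-weighted term counts."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)

    # Document frequency: number of documents containing each term.
    df = Counter(term for toks in tokenized for term in set(toks))
    # Assumed smoothed IDF variant: log((1 + N) / (1 + df)) + 1.
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1.0 for t, c in df.items()}
    vocab = set(idf)

    # Accumulate TF-IDF mass per (class, term) instead of raw counts.
    weights = defaultdict(lambda: defaultdict(float))
    for toks, y in zip(tokenized, labels):
        for term, tf in Counter(toks).items():
            weights[y][term] += tf * idf[term]

    model = {"idf": idf, "log_prior": {}, "log_like": {}}
    for y, n in Counter(labels).items():
        model["log_prior"][y] = math.log(n / n_docs)
        # Laplace (add-alpha) smoothing over the whole vocabulary.
        total = sum(weights[y].values()) + alpha * len(vocab)
        model["log_like"][y] = {
            t: math.log((weights[y].get(t, 0.0) + alpha) / total) for t in vocab
        }
    return model


def predict(model, text):
    """Return the class with the highest log posterior score."""
    term_counts = Counter(tokenize(text))
    scores = {}
    for y, log_prior in model["log_prior"].items():
        score = log_prior
        for term, tf in term_counts.items():
            if term in model["idf"]:  # terms unseen in training are ignored
                score += tf * model["idf"][term] * model["log_like"][y][term]
        scores[y] = score
    return max(scores, key=scores.get)


# Toy, hypothetical training data (English stand-ins for annotated tweets).
docs = ["good great happy", "love great day", "bad sad awful", "hate bad day"]
labels = ["pos", "pos", "neg", "neg"]
model = train(docs, labels)
```

With this toy model, `predict(model, "great happy")` selects the positive class and `predict(model, "bad awful")` selects the negative class. In a setting like the one the abstract describes, the training documents would instead be the automatically collected annotated tweets.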

Published in

ICSIE '19: Proceedings of the 8th International Conference on Software and Information Engineering
April 2019
276 pages
ISBN: 9781450361057
DOI: 10.1145/3328833

Copyright © 2019 ACM

Publisher: Association for Computing Machinery, New York, NY, United States

Qualifiers: research-article, Research, Refereed limited