skip to main content
10.1145/2838706.2838718acmotherconferencesArticle/Chapter ViewAbstractPublication PagesfireConference Proceedingsconference-collections
short-paper

HBE: Hashtag-Based Emotion Lexicons for Twitter Sentiment Analysis

Authors Info & Claims
Published:04 December 2015Publication History

ABSTRACT

In this paper we report the first effort of constructing emotion lexicon by utilizing Twitter as source of data. Specifically we used hashtag feature to obtain tweets with certain emotion label in English. There are eight emotion classes used in our work, comprising of angry, disgust, fear, joy, sad, surprise, trust and anticipation that refer to the Plutchik's wheel. To obtain the lexicon, we first ranked the words according to its term frequency. After that, we reduced some irrelevant words by removing words with low frequency. We also enriched the lexicon with the synonym and conducted filtering by utilizing sentiment lexicon (40,288 words). As result, we successfully constructed 4 Hashtag-Based Emotion (HBE) Lexicons through different procedures and called them as HBE-A1 (50,613 words), HBE-B1 (23,400 words), HBE-A2 (26,909 words) and HBE-B2 (14,905 words). In our experiment, we used the lexicons in investigating Twitter Sentiment Analysis and the result reveals that our proposed emotion lexicons can boost the accuracy and even improve over than NRC-Emotion lexicon. It is also worth noting that our construction idea is simple, automatic, inexpensive and suitable for Social Media analysis.

References

  1. S. M. Mohammad, and P. D. Turney. Crowdsourcing a word-emotion association lexicon. In Computational Intelligence, 29(3): 436--465, 2013.Google ScholarGoogle ScholarCross RefCross Ref
  2. F. Bravo-Marquez, M. Mendoza, and B. Poblete. Combining strengths, emotions and polarities for boosting Twitter sentiment analysis. In Proceedings of the Second International Workshop on Issues of Sentiment Discovery and Opinion Mining, 2, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. W. J. Trybula. Data Mining and Knowledge Discovery. In Annual review of information science and technology (ARIST), 32: 197--229, 1997.Google ScholarGoogle Scholar
  4. S. Raaijmakers and W. Kraaij. A Shallow Approach to Subjectivity Classification. In ICWSM, 2008.Google ScholarGoogle Scholar
  5. W. Zhang and S. Skiena. Trading Strategies to Exploit Blog and News Sentiment. In ICWSM, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  6. V. K. Singh, R. Piryani, A. Uddin, and P. Waila. Sentiment analysis of Movie reviews and Blog posts. In Advance Computing Conference (IACC): 893--898, 2013.Google ScholarGoogle ScholarCross RefCross Ref
  7. L. Shi, B. Sun, L. Kong, and Y. Zhang. Web forum Sentiment analysis based on topics. In Computer and Information Technology, 2: 148--153, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. M. Mohammad. #Emotional tweets. In Proceedings of the Sixth International Workshop on Semantic Evaluation, Association for Computational Linguistics, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. A. Go, R. Bhayani, and L. Huang. Twitter sentiment classification using distant supervision. In CS224N Project Report, Stanford, 2009.Google ScholarGoogle Scholar
  10. A. Agarwal, B. Xie, I. Vovsha, O. Rambow, and R. Passonneau. Sentiment analysis of twitter data. In Proceedings of the Workshop on Languages in Social Media: 30--38, 2011. 30--38. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. P. Ekman. An argument for basic emotions. Cognition and Emotion, 6(3-4): 169--200, 1992.Google ScholarGoogle ScholarCross RefCross Ref
  12. R. Plutchik. The psychology and biology of emotion. HarperCollins College Publishers, 1994.Google ScholarGoogle Scholar
  13. The Macquarie Thesaurus. In Macquarie Library, 1986Google ScholarGoogle Scholar
  14. S. Bird. NLTK: the natural language toolkit. In Proceedings of the COLING/ACL on Interactive presentation sessions: 69--72, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. M. Speriosu, N. Sudan, S. Upadhyay, and J. Baldridge. Twitter polarity classification with label propagation over lexical links and the follower graph. In Proceedings of the First workshop on Unsupervised Learning in NLP: 53--63, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. F. A. Nielsen. A new ANEW: Evaluation of a word list for sentiment analysis in microblogs. In arXiv preprint arXiv: 1103.2903., 2011.Google ScholarGoogle Scholar
  17. M. M. Bradley and P. J. Lang. Affective norms for English words (ANEW): Instruction manual and affective ratings. In Technical Report C-1, The Center for Research in Psychophysiology, University of Florida, 1999.Google ScholarGoogle Scholar
  18. F. Koto and M. Adriani. The Use of POS Sequence for Analyzing Sentence Pattern in Twitter Sentiment Analysis. In Proceedings of the 29th WAINA (the Eight International Symposium on Mining and Web), Gwangju, Korea: 547--551, 2015.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. F. Koto and M. Adriani. A Comparative Study on Twitter Sentiment Analysis: Which Features are Good?. In Natural Language Processing and Information System, Springer International Publishing, 9103: 453--457, 2015.Google ScholarGoogle Scholar
  20. N. Lubis, D. Lestari, A. Purwarianti, S. Sakti, and S. Nakamura. Emotion recognition on Indonesian television talk shows. In Proceedings of Spoken Language Technology Workshop (SLT): 466--471, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  21. B. Liu, M. Hu and J. Cheng. Opinion observer: analyzing and comparing opinions on the web. In Proceedings of the 14th international conference on World Wide Web: 342--351, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. T. Wilson, J. Wiebe, and P. Hoffmann. Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of the conference on human language technology and empirical methods in natural language processing: 347--354 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. S. Baccianella, A. Esuli, and F. Sebastiani. SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining. In LREC, 10: 2200--2204, 2010.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    FIRE '15: Proceedings of the 7th Annual Meeting of the Forum for Information Retrieval Evaluation
    December 2015
    57 pages
    ISBN:9781450340045
    DOI:10.1145/2838706

    Copyright © 2015 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 4 December 2015

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • short-paper
    • Research
    • Refereed limited

    Acceptance Rates

    FIRE '15 Paper Acceptance Rate12of42submissions,29%Overall Acceptance Rate19of64submissions,30%

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader