short-paper

HBE: Hashtag-Based Emotion Lexicons for Twitter Sentiment Analysis

Authors:
Fajri Koto

Faculty of Computer Science, University of Indonesia, Depok, Jawa Barat, Indonesia

Faculty of Computer Science, University of Indonesia, Depok, Jawa Barat, Indonesia
View Profile

,
Mirna Adriani

Faculty of Computer Science, University of Indonesia, Depok, Jawa Barat, Indonesia

Faculty of Computer Science, University of Indonesia, Depok, Jawa Barat, Indonesia
View Profile

FIRE '15: Proceedings of the 7th Annual Meeting of the Forum for Information Retrieval EvaluationDecember 2015Pages 31–34https://doi.org/10.1145/2838706.2838718

Published:04 December 2015Publication History

FIRE '15: Proceedings of the 7th Annual Meeting of the Forum for Information Retrieval Evaluation

Pages 31–34

ABSTRACT

In this paper we report the first effort of constructing emotion lexicon by utilizing Twitter as source of data. Specifically we used hashtag feature to obtain tweets with certain emotion label in English. There are eight emotion classes used in our work, comprising of angry, disgust, fear, joy, sad, surprise, trust and anticipation that refer to the Plutchik's wheel. To obtain the lexicon, we first ranked the words according to its term frequency. After that, we reduced some irrelevant words by removing words with low frequency. We also enriched the lexicon with the synonym and conducted filtering by utilizing sentiment lexicon (40,288 words). As result, we successfully constructed 4 Hashtag-Based Emotion (HBE) Lexicons through different procedures and called them as HBE-A1 (50,613 words), HBE-B1 (23,400 words), HBE-A2 (26,909 words) and HBE-B2 (14,905 words). In our experiment, we used the lexicons in investigating Twitter Sentiment Analysis and the result reveals that our proposed emotion lexicons can boost the accuracy and even improve over than NRC-Emotion lexicon. It is also worth noting that our construction idea is simple, automatic, inexpensive and suitable for Social Media analysis.

References

S. M. Mohammad, and P. D. Turney. Crowdsourcing a word-emotion association lexicon. In Computational Intelligence, 29(3): 436--465, 2013.Google ScholarCross Ref
F. Bravo-Marquez, M. Mendoza, and B. Poblete. Combining strengths, emotions and polarities for boosting Twitter sentiment analysis. In Proceedings of the Second International Workshop on Issues of Sentiment Discovery and Opinion Mining, 2, 2013. Google ScholarDigital Library
W. J. Trybula. Data Mining and Knowledge Discovery. In Annual review of information science and technology (ARIST), 32: 197--229, 1997.Google Scholar
S. Raaijmakers and W. Kraaij. A Shallow Approach to Subjectivity Classification. In ICWSM, 2008.Google Scholar
W. Zhang and S. Skiena. Trading Strategies to Exploit Blog and News Sentiment. In ICWSM, 2010.Google ScholarCross Ref
V. K. Singh, R. Piryani, A. Uddin, and P. Waila. Sentiment analysis of Movie reviews and Blog posts. In Advance Computing Conference (IACC): 893--898, 2013.Google ScholarCross Ref
L. Shi, B. Sun, L. Kong, and Y. Zhang. Web forum Sentiment analysis based on topics. In Computer and Information Technology, 2: 148--153, 2009. Google ScholarDigital Library
S. M. Mohammad. #Emotional tweets. In Proceedings of the Sixth International Workshop on Semantic Evaluation, Association for Computational Linguistics, 2012. Google ScholarDigital Library
A. Go, R. Bhayani, and L. Huang. Twitter sentiment classification using distant supervision. In CS224N Project Report, Stanford, 2009.Google Scholar
A. Agarwal, B. Xie, I. Vovsha, O. Rambow, and R. Passonneau. Sentiment analysis of twitter data. In Proceedings of the Workshop on Languages in Social Media: 30--38, 2011. 30--38. Google ScholarDigital Library
P. Ekman. An argument for basic emotions. Cognition and Emotion, 6(3-4): 169--200, 1992.Google ScholarCross Ref
R. Plutchik. The psychology and biology of emotion. HarperCollins College Publishers, 1994.Google Scholar
The Macquarie Thesaurus. In Macquarie Library, 1986Google Scholar
S. Bird. NLTK: the natural language toolkit. In Proceedings of the COLING/ACL on Interactive presentation sessions: 69--72, 2006. Google ScholarDigital Library
M. Speriosu, N. Sudan, S. Upadhyay, and J. Baldridge. Twitter polarity classification with label propagation over lexical links and the follower graph. In Proceedings of the First workshop on Unsupervised Learning in NLP: 53--63, 2011. Google ScholarDigital Library
F. A. Nielsen. A new ANEW: Evaluation of a word list for sentiment analysis in microblogs. In arXiv preprint arXiv: 1103.2903., 2011.Google Scholar
M. M. Bradley and P. J. Lang. Affective norms for English words (ANEW): Instruction manual and affective ratings. In Technical Report C-1, The Center for Research in Psychophysiology, University of Florida, 1999.Google Scholar
F. Koto and M. Adriani. The Use of POS Sequence for Analyzing Sentence Pattern in Twitter Sentiment Analysis. In Proceedings of the 29th WAINA (the Eight International Symposium on Mining and Web), Gwangju, Korea: 547--551, 2015.Google ScholarDigital Library
F. Koto and M. Adriani. A Comparative Study on Twitter Sentiment Analysis: Which Features are Good?. In Natural Language Processing and Information System, Springer International Publishing, 9103: 453--457, 2015.Google Scholar
N. Lubis, D. Lestari, A. Purwarianti, S. Sakti, and S. Nakamura. Emotion recognition on Indonesian television talk shows. In Proceedings of Spoken Language Technology Workshop (SLT): 466--471, 2014.Google ScholarCross Ref
B. Liu, M. Hu and J. Cheng. Opinion observer: analyzing and comparing opinions on the web. In Proceedings of the 14th international conference on World Wide Web: 342--351, 2005. Google ScholarDigital Library
T. Wilson, J. Wiebe, and P. Hoffmann. Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of the conference on human language technology and empirical methods in natural language processing: 347--354 2005. Google ScholarDigital Library
S. Baccianella, A. Esuli, and F. Sebastiani. SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining. In LREC, 10: 2200--2204, 2010.Google Scholar

Recommendations

Topic sentiment analysis in twitter: a graph-based hashtag sentiment classification approach
CIKM '11: Proceedings of the 20th ACM international conference on Information and knowledge management

Twitter is one of the biggest platforms where massive instant messages (i.e. tweets) are published every day. Users tend to express their real feelings freely in Twitter, which makes it an ideal source for capturing the opinions towards various ...
Read More
The Use of POS Sequence for Analyzing Sentence Pattern in Twitter Sentiment Analysis
WAINA '15: Proceedings of the 2015 IEEE 29th International Conference on Advanced Information Networking and Applications Workshops

As one of the largest Social Media in providing public data every day, Twitter has attracted the attention of researcher to investigate, in order to mine public opinion, which is known as Sentiment Analysis. Consequently, many techniques and studies ...
Read More
Finding news-topic oriented influential twitter users based on topic related hashtag community detection

Recently, more and more users would like to collect and provide information about news topics in Twitter, which is one of the most popular microblogging services. Virtual communities defined by hashtags in Twitter are created for exchanging information ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
FIRE '15: Proceedings of the 7th Annual Meeting of the Forum for Information Retrieval Evaluation
December 2015
57 pages
ISBN:9781450340045
DOI:10.1145/2838706
Editors:
Prasenjit Majumder,
Mandar Mitra,
Madhulika Agrawal,
Parth Mehta
Copyright © 2015 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 4 December 2015
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
emotion lexicon
hashtag
polarity
sentiment analysis
subjectivity
twitter
Qualifiers
- short-paper
- Research
- Refereed limited
Conference

Acceptance Rates
FIRE '15 Paper Acceptance Rate12of42submissions,29%Overall Acceptance Rate19of64submissions,30%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 10
  Total Citations
  View Citations
- 403
  Total Downloads
- Downloads (Last 12 months)14
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HBE: Hashtag-Based Emotion Lexicons for Twitter Sentiment Analysis

FIRE '15: Proceedings of the 7th Annual Meeting of the Forum for Information Retrieval Evaluation

ABSTRACT

References

Cited By

Recommendations

Topic sentiment analysis in twitter: a graph-based hashtag sentiment classification approach

The Use of POS Sequence for Analyzing Sentence Pattern in Twitter Sentiment Analysis

Finding news-topic oriented influential twitter users based on topic related hashtag community detection

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

HBE: Hashtag-Based Emotion Lexicons for Twitter Sentiment Analysis

FIRE '15: Proceedings of the 7th Annual Meeting of the Forum for Information Retrieval Evaluation

ABSTRACT

References

Cited By

Recommendations

Topic sentiment analysis in twitter: a graph-based hashtag sentiment classification approach

The Use of POS Sequence for Analyzing Sentence Pattern in Twitter Sentiment Analysis

Finding news-topic oriented influential twitter users based on topic related hashtag community detection

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media