ABSTRACT
In this paper we report the first effort of constructing emotion lexicon by utilizing Twitter as source of data. Specifically we used hashtag feature to obtain tweets with certain emotion label in English. There are eight emotion classes used in our work, comprising of angry, disgust, fear, joy, sad, surprise, trust and anticipation that refer to the Plutchik's wheel. To obtain the lexicon, we first ranked the words according to its term frequency. After that, we reduced some irrelevant words by removing words with low frequency. We also enriched the lexicon with the synonym and conducted filtering by utilizing sentiment lexicon (40,288 words). As result, we successfully constructed 4 Hashtag-Based Emotion (HBE) Lexicons through different procedures and called them as HBE-A1 (50,613 words), HBE-B1 (23,400 words), HBE-A2 (26,909 words) and HBE-B2 (14,905 words). In our experiment, we used the lexicons in investigating Twitter Sentiment Analysis and the result reveals that our proposed emotion lexicons can boost the accuracy and even improve over than NRC-Emotion lexicon. It is also worth noting that our construction idea is simple, automatic, inexpensive and suitable for Social Media analysis.
- S. M. Mohammad, and P. D. Turney. Crowdsourcing a word-emotion association lexicon. In Computational Intelligence, 29(3): 436--465, 2013.Google ScholarCross Ref
- F. Bravo-Marquez, M. Mendoza, and B. Poblete. Combining strengths, emotions and polarities for boosting Twitter sentiment analysis. In Proceedings of the Second International Workshop on Issues of Sentiment Discovery and Opinion Mining, 2, 2013. Google ScholarDigital Library
- W. J. Trybula. Data Mining and Knowledge Discovery. In Annual review of information science and technology (ARIST), 32: 197--229, 1997.Google Scholar
- S. Raaijmakers and W. Kraaij. A Shallow Approach to Subjectivity Classification. In ICWSM, 2008.Google Scholar
- W. Zhang and S. Skiena. Trading Strategies to Exploit Blog and News Sentiment. In ICWSM, 2010.Google ScholarCross Ref
- V. K. Singh, R. Piryani, A. Uddin, and P. Waila. Sentiment analysis of Movie reviews and Blog posts. In Advance Computing Conference (IACC): 893--898, 2013.Google ScholarCross Ref
- L. Shi, B. Sun, L. Kong, and Y. Zhang. Web forum Sentiment analysis based on topics. In Computer and Information Technology, 2: 148--153, 2009. Google ScholarDigital Library
- S. M. Mohammad. #Emotional tweets. In Proceedings of the Sixth International Workshop on Semantic Evaluation, Association for Computational Linguistics, 2012. Google ScholarDigital Library
- A. Go, R. Bhayani, and L. Huang. Twitter sentiment classification using distant supervision. In CS224N Project Report, Stanford, 2009.Google Scholar
- A. Agarwal, B. Xie, I. Vovsha, O. Rambow, and R. Passonneau. Sentiment analysis of twitter data. In Proceedings of the Workshop on Languages in Social Media: 30--38, 2011. 30--38. Google ScholarDigital Library
- P. Ekman. An argument for basic emotions. Cognition and Emotion, 6(3-4): 169--200, 1992.Google ScholarCross Ref
- R. Plutchik. The psychology and biology of emotion. HarperCollins College Publishers, 1994.Google Scholar
- The Macquarie Thesaurus. In Macquarie Library, 1986Google Scholar
- S. Bird. NLTK: the natural language toolkit. In Proceedings of the COLING/ACL on Interactive presentation sessions: 69--72, 2006. Google ScholarDigital Library
- M. Speriosu, N. Sudan, S. Upadhyay, and J. Baldridge. Twitter polarity classification with label propagation over lexical links and the follower graph. In Proceedings of the First workshop on Unsupervised Learning in NLP: 53--63, 2011. Google ScholarDigital Library
- F. A. Nielsen. A new ANEW: Evaluation of a word list for sentiment analysis in microblogs. In arXiv preprint arXiv: 1103.2903., 2011.Google Scholar
- M. M. Bradley and P. J. Lang. Affective norms for English words (ANEW): Instruction manual and affective ratings. In Technical Report C-1, The Center for Research in Psychophysiology, University of Florida, 1999.Google Scholar
- F. Koto and M. Adriani. The Use of POS Sequence for Analyzing Sentence Pattern in Twitter Sentiment Analysis. In Proceedings of the 29th WAINA (the Eight International Symposium on Mining and Web), Gwangju, Korea: 547--551, 2015.Google ScholarDigital Library
- F. Koto and M. Adriani. A Comparative Study on Twitter Sentiment Analysis: Which Features are Good?. In Natural Language Processing and Information System, Springer International Publishing, 9103: 453--457, 2015.Google Scholar
- N. Lubis, D. Lestari, A. Purwarianti, S. Sakti, and S. Nakamura. Emotion recognition on Indonesian television talk shows. In Proceedings of Spoken Language Technology Workshop (SLT): 466--471, 2014.Google ScholarCross Ref
- B. Liu, M. Hu and J. Cheng. Opinion observer: analyzing and comparing opinions on the web. In Proceedings of the 14th international conference on World Wide Web: 342--351, 2005. Google ScholarDigital Library
- T. Wilson, J. Wiebe, and P. Hoffmann. Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of the conference on human language technology and empirical methods in natural language processing: 347--354 2005. Google ScholarDigital Library
- S. Baccianella, A. Esuli, and F. Sebastiani. SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining. In LREC, 10: 2200--2204, 2010.Google Scholar
Recommendations
Topic sentiment analysis in twitter: a graph-based hashtag sentiment classification approach
CIKM '11: Proceedings of the 20th ACM international conference on Information and knowledge managementTwitter is one of the biggest platforms where massive instant messages (i.e. tweets) are published every day. Users tend to express their real feelings freely in Twitter, which makes it an ideal source for capturing the opinions towards various ...
The Use of POS Sequence for Analyzing Sentence Pattern in Twitter Sentiment Analysis
WAINA '15: Proceedings of the 2015 IEEE 29th International Conference on Advanced Information Networking and Applications WorkshopsAs one of the largest Social Media in providing public data every day, Twitter has attracted the attention of researcher to investigate, in order to mine public opinion, which is known as Sentiment Analysis. Consequently, many techniques and studies ...
Finding news-topic oriented influential twitter users based on topic related hashtag community detection
Recently, more and more users would like to collect and provide information about news topics in Twitter, which is one of the most popular microblogging services. Virtual communities defined by hashtags in Twitter are created for exchanging information ...
Comments