skip to main content
10.1145/2905055.2905276acmotherconferencesArticle/Chapter ViewAbstractPublication PagesictcsConference Proceedingsconference-collections
research-article

Towards Sentiment Orientation Data Set Enrichment

Published: 04 March 2016 Publication History

Abstract

Sentiment orientation data sets referred to variously as affective word lists, opinion lexicons, sentiment lexicons, emotion lexicons or sentiment dictionaries contain a list of words scored for the degree of positive and negative emotion they exhibit. Although these lists have been used extensively for the sentiment analysis of text data, they contain a limited number of words that are often inadequate for data obtained from modern text sources dominated by the influence of social media that has resulted in the creation and coining of new words on a regular basis. In an effort to enrich these data sets with new words, we propose two methods. The first method involves the sentiment analysis of portmanteau words. We have hypothesized that the sentiment score of a portmanteau word; which is a combination of two (or more) words and their meanings into a single new word; can be determined as a function of the sentiment scores of its component words. Regression analysis has been used to determine this functional relationship and several cases arising from the above have been evaluated on a data set constructed from SentiWordNet. The second method is an in situ approach for sentiment discovery for unknown words that uses labeled tweets and words from the sentiment orientation data set as inputs to discover the sentiment score of the unknown word. In order to validate the resultant score, we have also used a novel validation-feedback mechanism akin to cross-validation. Both these methods produce acceptable levels of accuracy proving that they can be implemented in practice.

References

[1]
List of portmanteau words. http://en.wikipedia.org/wiki/List_of_portmanteaus, 2014. Accessed: 2014-11-16.
[2]
K. Ahmad and Y. Almas. Visualising sentiments in financial texts? In Information Visualisation, 2005. Proceedings. Ninth International Conference on, pages 363--368, July 2005.
[3]
S. Baccianella, A. Esuli, and F. Sebastiani. Sentiwordnet 3.0: An enhanced lexical resource for sentiment analysis and opinion mining. In LREC, volume 10, pages 2200--2204, 2010.
[4]
H. Bozdogan. Model selection and akaike's information criterion (aic): The general theory and its analytical extensions. Psychometrika, 52(3):345--370, 1987.
[5]
M. M. Bradley and P. J. Lang. Affective norms for english words (anew): Instruction manual and affective ratings. Technical report, Technical Report C-1, The Center for Research in Psychophysiology, University of Florida, 1999.
[6]
H. Cho, J.-S. Lee, and S. Kim. Enhancing lexicon-based review classification by merging and revising sentiment dictionaries. page 463--470.
[7]
D. Davidov, O. Tsur, and A. Rappoport. Enhanced sentiment learning using twitter hashtags and smileys. In Proceedings of the 23rd International Conference on Computational Linguistics: Posters, pages 241--249. Association for Computational Linguistics, 2010.
[8]
J. C. de Albornoz, L. Plaza, and P. GervÃąs. Sentisense: An easily scalable concept-based affective lexicon for sentiment analysis. In N. C. C. Chair), K. Choukri, T. Declerck, M. U. Doħan, B. Maegaard, J. Mariani, A. Moreno, J. Odijk, and S. Piperidis, editors, Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12), Istanbul, Turkey, may 2012. European Language Resources Association (ELRA).
[9]
P. S. Dodds, K. D. Harris, I. M. Kloumann, C. A. Bliss, and C. M. Danforth. Temporal patterns of happiness and information in a global social network: Hedonometrics and twitter. PloS one, 6(12):e26752, 2011.
[10]
P. Domingos. The role of occam's razor in knowledge discovery. Data mining and knowledge discovery, 3(4):409--425, 1999.
[11]
A. Esuli and F. Sebastiani. Sentiwordnet: A publicly available lexical resource for opinion mining. In Proceedings of LREC, volume 6, pages 417--422, 2006.
[12]
A. Go, R. Bhayani, and L. Huang. Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford, pages 1--12, 2009.
[13]
M. Hu and B. Liu. Mining and summarizing customer reviews. In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 168--177. ACM, 2004.
[14]
C. H. John. Emotionality ratings and free-association norms of 240 emotional and non-emotional words. Cognition & Emotion, 2(1):49--70, 1988.
[15]
J. Liu, J. Yao, and G. Wu. Super parsing:sentiment classification with review extraction. In Computer and Information Technology, 2005. CIT 2005. The Fifth International Conference on, pages 216--222, Sept 2005.
[16]
T. Loughran and B. McDonald. When is a liability not a liability? textual analysis, dictionaries, and 10-ks. The Journal of Finance, 66(1):35--65, 2011.
[17]
S. M. Mohammad and P. D. Turney. Emotions evoked by common words and phrases: Using mechanical turk to create an emotion lexicon. In Proceedings of the NAACL HLT 2010 Workshop on Computational Approaches to Analysis and Generation of Emotion in Text, pages 26--34. Association for Computational Linguistics, 2010.
[18]
F. Å. Nielsen. Afinn, mar 2011.
[19]
F. Å. Nielsen. A new anew: Evaluation of a word list for sentiment analysis in microblogs. arXiv preprint arXiv:1103.2903, 2011.
[20]
B. Pang and L. Lee. Opinion mining and sentiment analysis. Foundations and trends in information retrieval, 2(1-2):1--135, 2008.
[21]
B. Pang, L. Lee, and S. Vaithyanathan. Thumbs up?: sentiment classification using machine learning techniques. In Proceedings of the ACL-02 conference on Empirical methods in natural language processing-Volume 10, pages 79--86. Association for Computational Linguistics, 2002.
[22]
J. Read. Using emoticons to reduce dependency in machine learning techniques for sentiment classification. In Proceedings of the ACL Student Research Workshop, pages 43--48. Association for Computational Linguistics, 2005.
[23]
P. J. Rousseeuw and A. M. Leroy. Robust regression and outlier detection, volume 589. John Wiley & Sons, 2005.
[24]
S. K. Shevade, S. S. Keerthi, C. Bhattacharyya, and K. R. K. Murthy. Improvements to the smo algorithm for svm regression. Neural Networks, IEEE Transactions on, 11(5):1188--1193, 2000.
[25]
V. Singh, R. Piryani, A. Uddin, and P. Waila. Sentiment analysis of movie reviews and blog posts. In Advance Computing Conference (IACC), 2013 IEEE 3rd International, pages 893--898, Feb 2013.
[26]
D. Song, H. Lin, and Z. Yang. Opinion mining in e-learning system. In Network and Parallel Computing Workshops, 2007. NPC Workshops. IFIP International Conference on, pages 788--792, Sept 2007.
[27]
P. J. Stone, R. F. Bales, J. Z. Namenwirth, and D. M. Ogilvie. The general inquirer: A computer system for content analysis and retrieval based on the sentence as a unit of information. Behavioral Science, 7(4):484--498, 1962.
[28]
C. Whissell. Using the revised dictionary of affect in language to quantify the emotional undertones of samples of natural language 1, 2. Psychological reports, 105(2):509--521, 2009.
[29]
C. Whissell, M. Fournier, R. Pelland, D. Weir, and K. Makarec. A dictionary of affect in language: Iv. reliability, validity, and applications. Perceptual and Motor Skills, 62(3):875--888, 1986.
[30]
T. Wilson, P. Hoffmann, S. Somasundaran, J. Kessler, J. Wiebe, Y. Choi, C. Cardie, E. Riloff, and S. Patwardhan. Opinionfinder: A system for subjectivity analysis. In Proceedings of hlt/emnlp on interactive demonstrations, pages 34--35. Association for Computational Linguistics, 2005.
[31]
T. Wilson, J. Wiebe, and P. Hoffmann. Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of the conference on human language technology and empirical methods in natural language processing, pages 347--354. Association for Computational Linguistics, 2005.
[32]
Q. Ye, W. Shi, and Y. jun Li. Sentiment classification for movie reviews in chinese by improved semantic oriented approach. In System Sciences, 2006. HICSS '06. Proceedings of the 39th Annual Hawaii International Conference on, volume 3, pages 53b--53b, Jan 2006.
[33]
J. Yi, T. Nasukawa, R. Bunescu, and W. Niblack. Sentiment analyzer: extracting sentiments about a given topic using natural language processing techniques. In Data Mining, 2003. ICDM 2003. Third IEEE International Conference on, pages 427--434, Nov 2003.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICTCS '16: Proceedings of the Second International Conference on Information and Communication Technology for Competitive Strategies
March 2016
843 pages
ISBN:9781450339629
DOI:10.1145/2905055
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 March 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. In Situ Sentiment Discovery
  2. Portmanteau Words
  3. SentiWordNet
  4. Sentiment Analysis
  5. Social Media Sentiment Analysis

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICTCS '16

Acceptance Rates

Overall Acceptance Rate 97 of 270 submissions, 36%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media