Skip to main content
Log in

A valences-totaling model for English sentiment classification

  • Survey Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Sentiment classification plays an important role in everyday life, in political activities, activities of commodity production and commercial activities. Finding a time-effective and highly accurate solution to the classification of emotions is challenging. Today, there are many models (or methods) to classify the sentiment of documents. Sentiment classification has been studied for many years and is used widely in many different fields. We propose a new model, which is called the valences-totaling model (VTM), by using cosine measure (CM) to classify the sentiment of English documents. VTM is a new model for English sentiment classification. In this study, CM is a measure of similarity between two words and is used to calculate the valence (and polarity) of English semantic lexicons. We prove that CM is able to identify the sentiment valence and the sentiment polarity of the English sentiment lexicons online in combination with the Google search engine with AND operator and OR operator. VTM uses many English semantic lexicons. These English sentiment lexicons are calculated online and are based on the Internet. We present a full range of English sentences; thus, the emotion expressed in the English text is classified with more precision. Our new model is not dependent on a special domain and training data set—it is a domain-independent classifier. We test our new model on the Internet data in English. The calculated valence (and polarity) of English semantic words in this model is based on many documents on millions of English Web sites and English social networks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  1. Large Movie Review Dataset (2016). http://ai.stanford.edu/~amaas/data/sentiment/

  2. Efron M (2004) Cultural orientation: classifying subjective documents by cociation sic analysis. In: Proceedings of the AAAI fall symposium on style and meaning in language, art, music, and design, pp 41–48

  3. Yuen RWM, Chan TYW, Lai Tom BY, Kwong OY, T’sou Benjamin KY (2004) Morpheme-based derivation of bipolar semantic orientation of Chinese words. In: Proceedings of the 20th international conference on computational linguistics. Stroudsburg, PA, USA

  4. Chen L-S, Chiu H-J (2009) Developing a neural network based index for sentiment classification. In: Proceedings of the international multiconference of engineers and computer scientists. Hong Kong, March

  5. Wang G, Araki K (2007) Modifying SO-PMI for Japanese weblog opinion mining by using a balancing factor and detecting neutral expressions. In: Proceedings of NAACL HLT 2007, Companion Volume, pp 189–192

  6. Taboada M, Anthony C, Voll K (2006) Methods for creating semantic orientation dictionaries. In: Proceedings of fifth international conference on language resources and evaluation (LREC 2006). Genoa, Italy, pp 427–432

  7. Cimiano P, Wenderoth J (2007) Automatic acquisition of ranked qualia structures from the web. In: Proceedings of the 45th annual meeting of the association of computational linguistics. Prague, Czech Republic, pp 888–895

  8. Lu G, Huang P, He L, Cu C, Li X (2010) A new semantic similarity measuring method based on web search engines. J WSEAS Trans Comput 9(1):1–10

    Google Scholar 

  9. Voll K, Taboada M (2007) Not all words are created equal: extracting semantic orientation as a function of adjective relevance. In: Proceedings of the 20th Australian joint conference on artificial intelligence. Gold Coast, Australia, pp 337–346

  10. Kundi FM, Khan A, Asghar MZ, Ahamd S (2015) Context-aware spelling corrector for sentiment analysis. MAGNT Res Rep 2(6):1–11

    Google Scholar 

  11. Mao H, Gao P, Wang Y, Bollen J (2014) Automatic construction of financial semantic orientation lexicon from large-scale Chinese news corpus. 7th financial risks international forum

  12. Wikipedia (2016). https://en.wikipedia.org/wiki/

  13. Lin D (1998) Automatic retrieval and clustering of similar words. In: Proceedings of the 17th international conference on computational linguistics, vol 2. Stroudsburg, PA, USA, pp 768–774

  14. Turney PD, Littman ML (2002) unsupervised learning of semantic orientation from a hundred-billion-word corpus. Technical report NRC technical report ERB-1094. Institute for Information Technology, National Research Council Canada

  15. Lin WH, Wu YL, Yu LC (2012) Online computation of mutual information and word context entropy. Int J Future Comput Commun 1(2):167

    Article  Google Scholar 

  16. Omar N, Albared M, Al-Shabi AQ, Al-Moslmi T (2013) Ensemble of classification algorithms for subjectivity and sentiment analysis of Arabic customers’ reviews. Int J Adv Comput Technol (IJACT) 5:77

    Google Scholar 

  17. English Grammar of British Council (2015). https://learnenglish.britishcouncil.org/en/english-grammar

  18. English Grammar of Wikipedia (2015). https://en.wikipedia.org/wiki/English_grammar

  19. English Grammar of Cambridge (2015). http://www.cambridge.org/us/cambridgeenglish/

  20. English Grammar of Oxford (2015). http://www.oxfordonlineenglish.com/free-english-grammar-lessons

  21. Turney PD, Littman ML (2003) Measuring praise and criticism: inference of semantic orientation from association. ACM Trans Inf Syst (TOIS) 21(4):315–346

    Article  Google Scholar 

  22. Turney P (2002) Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In: Proceedings of 40th ACL, pp 417–424

  23. Saloot MA, Idris N, Mahmud R, Ja’afar S, Thorleuchter D, Gani A (2016) Hadith data mining and classification: a comparative analysis. Artif Intell Rev 1–16. doi:10.1007/s10462-016-9458-x. Print ISSN 0269-2821

  24. Ofoghi B, Mann M, Verspoor K (2016) Towards early discovery of salient health threats: a social media emotion classification technique. Pacific symposium on biocomputing, Hawaii, US

  25. Zarra T, Chiheb R, Faizi R, El Afia A (2016) Using textual similarity and sentiment analysis in discussions forums to enhance learning. Int J Softw Eng Appl 10(1):191–200

    Google Scholar 

  26. Korayem M, Aljadda K, Crandall D (2016) Sentiment/subjectivity analysis survey for languages other than English. Soc Netw Anal Min 6:75. doi:10.1007/s13278-016-0381-6

  27. Pappas N, Popescu-Belisa A (2016) Adaptive sentiment-aware one-class collaborative filtering. Expert Syst Appl 43:23–41

    Article  Google Scholar 

  28. Fast E, Chen B, Bernstein M (2016) Empath: understanding topic signals in large-scale text. In: ACM conference on human factors in computing systems

  29. Tang D, Wei F, Yang N, Zhou M, Liu T, Qin B (2014) Learning sentiment-specific word embedding for Twitter sentiment classification. In: Proceeding of the 52th annual meeting of the association for computational linguistics (ACL 2014)

  30. Oswin Rahadiyan H, Gloria Virginia, Antonius Rachmat C (2016) Sentiment classification of film reviews using IB1. In: The 7th international conference on intelligent systems, modelling and simulation. doi:10.1109/ISMS.2016.38

  31. Manek AS, Shenoy PD, Mohan MC, Venugopal KR (2016) Aspect term extraction for sentiment analysis in large movie reviews using Gini Index feature selection method and SVM classifier. World Wide Web 1–20. doi:10.1007/s11280-015-0381-x. Print ISSN1386-145X

  32. Agarwal B, Mittal N (2016) Machine learning approach for sentiment analysis. Promin Feature Extr Sentim Anal 21–45. doi:10.1007/978-3-319-25343-5_3. Print ISBN 978-3-319-25341-1

  33. Agarwal B, Mittal N (2016) Semantic orientation-based approach for sentiment analysis. Promin Feature Extr Sentim Anal 77–88. doi:10.1007/978-3-319-25343-5_6. Print ISBN 978-3-319-25341-1

  34. Canuto S, Gonçalves MA, Benevenuto F (2016) Exploiting new sentiment-based meta-level features for effective sentiment analysis. In: Proceedings of the ninth ACM international conference on web search and data mining (WSDM ’16). New York, USA, pp 53–62

  35. Ahmed S, Danti A (2016) Effective sentimental analysis and opinion mining of web reviews using rule based classifiers. Comput Intell Data Mining; 1:171–179. doi:10.1007/978-81-322-2734-2_18. ISBN 978-81-322-2732-8

  36. Phu VN, Tuoi PT (2014) Sentiment classification using enhanced contextual valence shifters. In: International conference on Asian language processing (IALP), pp 224–229

  37. Tran VTN, Phu VN, Tuoi PT (2014) Learning more chi square feature selection to improve the fastest and most accurate sentiment classification. In: The third Asian conference on information systems (ACIS 2014)

  38. Cambria E, Schuller B, Xia Y, White B (2016) New avenues in knowledge bases for natural language processing. Knowl-Based Syst 108(C):1–4

    Article  Google Scholar 

  39. Erik C (2016) Affective computing and sentiment analysis. IEEE Intell Syst 31(2):102–107

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vo Ngoc Phu.

Ethics declarations

Conflict of interest

The author declares that there is no conflict of interest.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Phu, V.N., Chau, V.T.N., Dat, N.D. et al. A valences-totaling model for English sentiment classification. Knowl Inf Syst 53, 579–636 (2017). https://doi.org/10.1007/s10115-017-1054-0

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-017-1054-0

Keywords

Navigation