A valences-totaling model for English sentiment classification

Phu, Vo Ngoc; Chau, Vo Thi Ngoc; Dat, Nguyen Duy; Tran, Vo Thi Ngoc; Nguyen, Tuan A.

doi:10.1007/s10115-017-1054-0

A valences-totaling model for English sentiment classification

Survey Paper
Published: 27 April 2017

Volume 53, pages 579–636, (2017)
Cite this article

Knowledge and Information Systems Aims and scope Submit manuscript

Vo Ngoc Phu ORCID: orcid.org/0000-0001-6047-9066¹,
Vo Thi Ngoc Chau²,
Nguyen Duy Dat³,
Vo Thi Ngoc Tran⁴ &
…
Tuan A. Nguyen⁵

591 Accesses
8 Citations
Explore all metrics

Abstract

Sentiment classification plays an important role in everyday life, in political activities, activities of commodity production and commercial activities. Finding a time-effective and highly accurate solution to the classification of emotions is challenging. Today, there are many models (or methods) to classify the sentiment of documents. Sentiment classification has been studied for many years and is used widely in many different fields. We propose a new model, which is called the valences-totaling model (VTM), by using cosine measure (CM) to classify the sentiment of English documents. VTM is a new model for English sentiment classification. In this study, CM is a measure of similarity between two words and is used to calculate the valence (and polarity) of English semantic lexicons. We prove that CM is able to identify the sentiment valence and the sentiment polarity of the English sentiment lexicons online in combination with the Google search engine with AND operator and OR operator. VTM uses many English semantic lexicons. These English sentiment lexicons are calculated online and are based on the Internet. We present a full range of English sentences; thus, the emotion expressed in the English text is classified with more precision. Our new model is not dependent on a special domain and training data set—it is a domain-independent classifier. We test our new model on the Internet data in English. The calculated valence (and polarity) of English semantic words in this model is based on many documents on millions of English Web sites and English social networks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Large Movie Review Dataset (2016). http://ai.stanford.edu/~amaas/data/sentiment/
Efron M (2004) Cultural orientation: classifying subjective documents by cociation sic analysis. In: Proceedings of the AAAI fall symposium on style and meaning in language, art, music, and design, pp 41–48
Yuen RWM, Chan TYW, Lai Tom BY, Kwong OY, T’sou Benjamin KY (2004) Morpheme-based derivation of bipolar semantic orientation of Chinese words. In: Proceedings of the 20th international conference on computational linguistics. Stroudsburg, PA, USA
Chen L-S, Chiu H-J (2009) Developing a neural network based index for sentiment classification. In: Proceedings of the international multiconference of engineers and computer scientists. Hong Kong, March
Wang G, Araki K (2007) Modifying SO-PMI for Japanese weblog opinion mining by using a balancing factor and detecting neutral expressions. In: Proceedings of NAACL HLT 2007, Companion Volume, pp 189–192
Taboada M, Anthony C, Voll K (2006) Methods for creating semantic orientation dictionaries. In: Proceedings of fifth international conference on language resources and evaluation (LREC 2006). Genoa, Italy, pp 427–432
Cimiano P, Wenderoth J (2007) Automatic acquisition of ranked qualia structures from the web. In: Proceedings of the 45th annual meeting of the association of computational linguistics. Prague, Czech Republic, pp 888–895
Lu G, Huang P, He L, Cu C, Li X (2010) A new semantic similarity measuring method based on web search engines. J WSEAS Trans Comput 9(1):1–10
Google Scholar
Voll K, Taboada M (2007) Not all words are created equal: extracting semantic orientation as a function of adjective relevance. In: Proceedings of the 20th Australian joint conference on artificial intelligence. Gold Coast, Australia, pp 337–346
Kundi FM, Khan A, Asghar MZ, Ahamd S (2015) Context-aware spelling corrector for sentiment analysis. MAGNT Res Rep 2(6):1–11
Google Scholar
Mao H, Gao P, Wang Y, Bollen J (2014) Automatic construction of financial semantic orientation lexicon from large-scale Chinese news corpus. 7th financial risks international forum
Wikipedia (2016). https://en.wikipedia.org/wiki/
Lin D (1998) Automatic retrieval and clustering of similar words. In: Proceedings of the 17th international conference on computational linguistics, vol 2. Stroudsburg, PA, USA, pp 768–774
Turney PD, Littman ML (2002) unsupervised learning of semantic orientation from a hundred-billion-word corpus. Technical report NRC technical report ERB-1094. Institute for Information Technology, National Research Council Canada
Lin WH, Wu YL, Yu LC (2012) Online computation of mutual information and word context entropy. Int J Future Comput Commun 1(2):167
Article Google Scholar
Omar N, Albared M, Al-Shabi AQ, Al-Moslmi T (2013) Ensemble of classification algorithms for subjectivity and sentiment analysis of Arabic customers’ reviews. Int J Adv Comput Technol (IJACT) 5:77
Google Scholar
English Grammar of British Council (2015). https://learnenglish.britishcouncil.org/en/english-grammar
English Grammar of Wikipedia (2015). https://en.wikipedia.org/wiki/English_grammar
English Grammar of Cambridge (2015). http://www.cambridge.org/us/cambridgeenglish/
English Grammar of Oxford (2015). http://www.oxfordonlineenglish.com/free-english-grammar-lessons
Turney PD, Littman ML (2003) Measuring praise and criticism: inference of semantic orientation from association. ACM Trans Inf Syst (TOIS) 21(4):315–346
Article Google Scholar
Turney P (2002) Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In: Proceedings of 40th ACL, pp 417–424
Saloot MA, Idris N, Mahmud R, Ja’afar S, Thorleuchter D, Gani A (2016) Hadith data mining and classification: a comparative analysis. Artif Intell Rev 1–16. doi:10.1007/s10462-016-9458-x. Print ISSN 0269-2821
Ofoghi B, Mann M, Verspoor K (2016) Towards early discovery of salient health threats: a social media emotion classification technique. Pacific symposium on biocomputing, Hawaii, US
Zarra T, Chiheb R, Faizi R, El Afia A (2016) Using textual similarity and sentiment analysis in discussions forums to enhance learning. Int J Softw Eng Appl 10(1):191–200
Google Scholar
Korayem M, Aljadda K, Crandall D (2016) Sentiment/subjectivity analysis survey for languages other than English. Soc Netw Anal Min 6:75. doi:10.1007/s13278-016-0381-6
Pappas N, Popescu-Belisa A (2016) Adaptive sentiment-aware one-class collaborative filtering. Expert Syst Appl 43:23–41
Article Google Scholar
Fast E, Chen B, Bernstein M (2016) Empath: understanding topic signals in large-scale text. In: ACM conference on human factors in computing systems
Tang D, Wei F, Yang N, Zhou M, Liu T, Qin B (2014) Learning sentiment-specific word embedding for Twitter sentiment classification. In: Proceeding of the 52th annual meeting of the association for computational linguistics (ACL 2014)
Oswin Rahadiyan H, Gloria Virginia, Antonius Rachmat C (2016) Sentiment classification of film reviews using IB1. In: The 7th international conference on intelligent systems, modelling and simulation. doi:10.1109/ISMS.2016.38
Manek AS, Shenoy PD, Mohan MC, Venugopal KR (2016) Aspect term extraction for sentiment analysis in large movie reviews using Gini Index feature selection method and SVM classifier. World Wide Web 1–20. doi:10.1007/s11280-015-0381-x. Print ISSN1386-145X
Agarwal B, Mittal N (2016) Machine learning approach for sentiment analysis. Promin Feature Extr Sentim Anal 21–45. doi:10.1007/978-3-319-25343-5_3. Print ISBN 978-3-319-25341-1
Agarwal B, Mittal N (2016) Semantic orientation-based approach for sentiment analysis. Promin Feature Extr Sentim Anal 77–88. doi:10.1007/978-3-319-25343-5_6. Print ISBN 978-3-319-25341-1
Canuto S, Gonçalves MA, Benevenuto F (2016) Exploiting new sentiment-based meta-level features for effective sentiment analysis. In: Proceedings of the ninth ACM international conference on web search and data mining (WSDM ’16). New York, USA, pp 53–62
Ahmed S, Danti A (2016) Effective sentimental analysis and opinion mining of web reviews using rule based classifiers. Comput Intell Data Mining; 1:171–179. doi:10.1007/978-81-322-2734-2_18. ISBN 978-81-322-2732-8
Phu VN, Tuoi PT (2014) Sentiment classification using enhanced contextual valence shifters. In: International conference on Asian language processing (IALP), pp 224–229
Tran VTN, Phu VN, Tuoi PT (2014) Learning more chi square feature selection to improve the fastest and most accurate sentiment classification. In: The third Asian conference on information systems (ACIS 2014)
Cambria E, Schuller B, Xia Y, White B (2016) New avenues in knowledge bases for natural language processing. Knowl-Based Syst 108(C):1–4
Article Google Scholar
Erik C (2016) Affective computing and sentiment analysis. IEEE Intell Syst 31(2):102–107
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Research and Development, Duy Tan University, Da Nang, Vietnam
Vo Ngoc Phu
Computer Science and Engineering (CSE), Ho Chi Minh City University of Technology (HCMUT), Vietnam National University, Ho Chi Minh City, Vietnam
Vo Thi Ngoc Chau
Faculty of Information Technology, Ly Tu Trong Technical College, Ho Chi Minh City, Vietnam
Nguyen Duy Dat
School of Industrial Management (SIM), Ho Chi Minh City University of Technology (HCMUT), Vietnam National University, Ho Chi Minh City, Vietnam
Vo Thi Ngoc Tran
Faculty of Computer Networks and Communications, University of Information Technology (UIT), Vietnam National University of Hochiminh City, Linh Trung Ward, Thu Duc District, Ho Chi Minh City, Vietnam
Tuan A. Nguyen

Authors

Vo Ngoc Phu
View author publications
You can also search for this author in PubMed Google Scholar
Vo Thi Ngoc Chau
View author publications
You can also search for this author in PubMed Google Scholar
Nguyen Duy Dat
View author publications
You can also search for this author in PubMed Google Scholar
Vo Thi Ngoc Tran
View author publications
You can also search for this author in PubMed Google Scholar
Tuan A. Nguyen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vo Ngoc Phu.

Ethics declarations

Conflict of interest

The author declares that there is no conflict of interest.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Phu, V.N., Chau, V.T.N., Dat, N.D. et al. A valences-totaling model for English sentiment classification. Knowl Inf Syst 53, 579–636 (2017). https://doi.org/10.1007/s10115-017-1054-0

Download citation

Received: 15 June 2016
Revised: 07 April 2017
Accepted: 10 April 2017
Published: 27 April 2017
Issue Date: December 2017
DOI: https://doi.org/10.1007/s10115-017-1054-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A valences-totaling model for English sentiment classification

Abstract

Access this article

Similar content being viewed by others

A Valence-Totaling Model for Vietnamese sentiment classification

An improved algorithm for sentiment analysis based on maximum entropy

Sentiment Analysis Based on Psychological and Linguistic Features for Spanish Language

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A valences-totaling model for English sentiment classification

Abstract

Access this article

Similar content being viewed by others

A Valence-Totaling Model for Vietnamese sentiment classification

An improved algorithm for sentiment analysis based on maximum entropy

Sentiment Analysis Based on Psychological and Linguistic Features for Spanish Language

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation