Abstract
Sentiment classification plays an important role in everyday life, in political activities, activities of commodity production and commercial activities. Finding a time-effective and highly accurate solution to the classification of emotions is challenging. Today, there are many models (or methods) to classify the sentiment of documents. Sentiment classification has been studied for many years and is used widely in many different fields. We propose a new model, which is called the valences-totaling model (VTM), by using cosine measure (CM) to classify the sentiment of English documents. VTM is a new model for English sentiment classification. In this study, CM is a measure of similarity between two words and is used to calculate the valence (and polarity) of English semantic lexicons. We prove that CM is able to identify the sentiment valence and the sentiment polarity of the English sentiment lexicons online in combination with the Google search engine with AND operator and OR operator. VTM uses many English semantic lexicons. These English sentiment lexicons are calculated online and are based on the Internet. We present a full range of English sentences; thus, the emotion expressed in the English text is classified with more precision. Our new model is not dependent on a special domain and training data set—it is a domain-independent classifier. We test our new model on the Internet data in English. The calculated valence (and polarity) of English semantic words in this model is based on many documents on millions of English Web sites and English social networks.
Similar content being viewed by others
References
Large Movie Review Dataset (2016). http://ai.stanford.edu/~amaas/data/sentiment/
Efron M (2004) Cultural orientation: classifying subjective documents by cociation sic analysis. In: Proceedings of the AAAI fall symposium on style and meaning in language, art, music, and design, pp 41–48
Yuen RWM, Chan TYW, Lai Tom BY, Kwong OY, T’sou Benjamin KY (2004) Morpheme-based derivation of bipolar semantic orientation of Chinese words. In: Proceedings of the 20th international conference on computational linguistics. Stroudsburg, PA, USA
Chen L-S, Chiu H-J (2009) Developing a neural network based index for sentiment classification. In: Proceedings of the international multiconference of engineers and computer scientists. Hong Kong, March
Wang G, Araki K (2007) Modifying SO-PMI for Japanese weblog opinion mining by using a balancing factor and detecting neutral expressions. In: Proceedings of NAACL HLT 2007, Companion Volume, pp 189–192
Taboada M, Anthony C, Voll K (2006) Methods for creating semantic orientation dictionaries. In: Proceedings of fifth international conference on language resources and evaluation (LREC 2006). Genoa, Italy, pp 427–432
Cimiano P, Wenderoth J (2007) Automatic acquisition of ranked qualia structures from the web. In: Proceedings of the 45th annual meeting of the association of computational linguistics. Prague, Czech Republic, pp 888–895
Lu G, Huang P, He L, Cu C, Li X (2010) A new semantic similarity measuring method based on web search engines. J WSEAS Trans Comput 9(1):1–10
Voll K, Taboada M (2007) Not all words are created equal: extracting semantic orientation as a function of adjective relevance. In: Proceedings of the 20th Australian joint conference on artificial intelligence. Gold Coast, Australia, pp 337–346
Kundi FM, Khan A, Asghar MZ, Ahamd S (2015) Context-aware spelling corrector for sentiment analysis. MAGNT Res Rep 2(6):1–11
Mao H, Gao P, Wang Y, Bollen J (2014) Automatic construction of financial semantic orientation lexicon from large-scale Chinese news corpus. 7th financial risks international forum
Wikipedia (2016). https://en.wikipedia.org/wiki/
Lin D (1998) Automatic retrieval and clustering of similar words. In: Proceedings of the 17th international conference on computational linguistics, vol 2. Stroudsburg, PA, USA, pp 768–774
Turney PD, Littman ML (2002) unsupervised learning of semantic orientation from a hundred-billion-word corpus. Technical report NRC technical report ERB-1094. Institute for Information Technology, National Research Council Canada
Lin WH, Wu YL, Yu LC (2012) Online computation of mutual information and word context entropy. Int J Future Comput Commun 1(2):167
Omar N, Albared M, Al-Shabi AQ, Al-Moslmi T (2013) Ensemble of classification algorithms for subjectivity and sentiment analysis of Arabic customers’ reviews. Int J Adv Comput Technol (IJACT) 5:77
English Grammar of British Council (2015). https://learnenglish.britishcouncil.org/en/english-grammar
English Grammar of Wikipedia (2015). https://en.wikipedia.org/wiki/English_grammar
English Grammar of Cambridge (2015). http://www.cambridge.org/us/cambridgeenglish/
English Grammar of Oxford (2015). http://www.oxfordonlineenglish.com/free-english-grammar-lessons
Turney PD, Littman ML (2003) Measuring praise and criticism: inference of semantic orientation from association. ACM Trans Inf Syst (TOIS) 21(4):315–346
Turney P (2002) Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In: Proceedings of 40th ACL, pp 417–424
Saloot MA, Idris N, Mahmud R, Ja’afar S, Thorleuchter D, Gani A (2016) Hadith data mining and classification: a comparative analysis. Artif Intell Rev 1–16. doi:10.1007/s10462-016-9458-x. Print ISSN 0269-2821
Ofoghi B, Mann M, Verspoor K (2016) Towards early discovery of salient health threats: a social media emotion classification technique. Pacific symposium on biocomputing, Hawaii, US
Zarra T, Chiheb R, Faizi R, El Afia A (2016) Using textual similarity and sentiment analysis in discussions forums to enhance learning. Int J Softw Eng Appl 10(1):191–200
Korayem M, Aljadda K, Crandall D (2016) Sentiment/subjectivity analysis survey for languages other than English. Soc Netw Anal Min 6:75. doi:10.1007/s13278-016-0381-6
Pappas N, Popescu-Belisa A (2016) Adaptive sentiment-aware one-class collaborative filtering. Expert Syst Appl 43:23–41
Fast E, Chen B, Bernstein M (2016) Empath: understanding topic signals in large-scale text. In: ACM conference on human factors in computing systems
Tang D, Wei F, Yang N, Zhou M, Liu T, Qin B (2014) Learning sentiment-specific word embedding for Twitter sentiment classification. In: Proceeding of the 52th annual meeting of the association for computational linguistics (ACL 2014)
Oswin Rahadiyan H, Gloria Virginia, Antonius Rachmat C (2016) Sentiment classification of film reviews using IB1. In: The 7th international conference on intelligent systems, modelling and simulation. doi:10.1109/ISMS.2016.38
Manek AS, Shenoy PD, Mohan MC, Venugopal KR (2016) Aspect term extraction for sentiment analysis in large movie reviews using Gini Index feature selection method and SVM classifier. World Wide Web 1–20. doi:10.1007/s11280-015-0381-x. Print ISSN1386-145X
Agarwal B, Mittal N (2016) Machine learning approach for sentiment analysis. Promin Feature Extr Sentim Anal 21–45. doi:10.1007/978-3-319-25343-5_3. Print ISBN 978-3-319-25341-1
Agarwal B, Mittal N (2016) Semantic orientation-based approach for sentiment analysis. Promin Feature Extr Sentim Anal 77–88. doi:10.1007/978-3-319-25343-5_6. Print ISBN 978-3-319-25341-1
Canuto S, Gonçalves MA, Benevenuto F (2016) Exploiting new sentiment-based meta-level features for effective sentiment analysis. In: Proceedings of the ninth ACM international conference on web search and data mining (WSDM ’16). New York, USA, pp 53–62
Ahmed S, Danti A (2016) Effective sentimental analysis and opinion mining of web reviews using rule based classifiers. Comput Intell Data Mining; 1:171–179. doi:10.1007/978-81-322-2734-2_18. ISBN 978-81-322-2732-8
Phu VN, Tuoi PT (2014) Sentiment classification using enhanced contextual valence shifters. In: International conference on Asian language processing (IALP), pp 224–229
Tran VTN, Phu VN, Tuoi PT (2014) Learning more chi square feature selection to improve the fastest and most accurate sentiment classification. In: The third Asian conference on information systems (ACIS 2014)
Cambria E, Schuller B, Xia Y, White B (2016) New avenues in knowledge bases for natural language processing. Knowl-Based Syst 108(C):1–4
Erik C (2016) Affective computing and sentiment analysis. IEEE Intell Syst 31(2):102–107
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The author declares that there is no conflict of interest.
Rights and permissions
About this article
Cite this article
Phu, V.N., Chau, V.T.N., Dat, N.D. et al. A valences-totaling model for English sentiment classification. Knowl Inf Syst 53, 579–636 (2017). https://doi.org/10.1007/s10115-017-1054-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-017-1054-0