Abstract
Sentiment analysis is a new challenging task related to Text Mining and Natural Language Processing. Although there are some current works, most of them only focus on English texts. Web pages, information and opinions on the Internet are increasing every day, and English is not the only language used to write them. Other languages like Spanish are increasingly present so we have carried out some experiments over a Spanish film reviews corpus. In this paper we present several experiments using five classification algorithms (SVM, Nave Bayes, BBR, KNN, C4.5). The results obtained are very promising and encourage us to continue investigating in this line.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Ahmad, K., Cheng, D., Almas, Y.: Multi-lingual Sentiment Analysis of Financial News Streams. In: Proceedings of Science, GRID 2006 (2006)
Boldrini, E., Balahur, A., Martínez-Barco, P., Montoyo, A.: EmotiBlog: an annotation scheme for emotion detection and analysis in non-traditional textual genres. In: DMIN, pp. 491–497. CSREA Press
Carletta, J.: Assessing agreement on classification tasks: the kappa statistic. In: Computational Linguistics, vol. 22(2). MIT Press, Cambridge (1996)
Chang, C.C., Lin, C.J.: LIBSVM: a Library for Support Vector Machines (2001)
Cruz, F.L., Troyano, J.A., Enriquez, F., Ortega, J.: Clasificación de documentos basada en la opinión: experimentos con un corpus de críticas de cine en español. Procesamiento de Lenguaje Natural 41 (2008)
Denecke, K.: Using SentiWordNet for multilingual sentiment analysis. In: ICDE Workshops, pp. 507–512. IEEE Computer Society, Los Alamitos (2008)
Genkin, A., Lewis, D., Madigan, D.: Large-Scale Bayesian Logistic Regression for Text Categorization (2004)
Mitchell, T.: Machine Learning. McGraw-Hill, New York (1997)
Ortiz-Martos, A., Martín-Valdivia, M.T., Ureña-Lopez, L.A., Cumbreras-García, M.A.: Detección automática de Spam utilizando Regresión Logística Bayesiana. Procesamiento del Lenguaje Natural 35, 127–133 (2005)
Pang, B., Lee, L.: Opinion mining and sentiment analysis. Foundation and Trends in Information Retrieval 2(1-2), 1–135 (2008)
Quinlan, J.R.: Induction of Decision Trees. Machine Learning 1, 81–106 (1986)
Quinlan, J.R.: Programs for Machine Learning. Morgan Kaurfman, San Francisco (1993)
Sebastiani, F.: Machine Learning in automated text categorization. ACM Computing Surveys (CSUR) 34(1), 1–47 (2002)
Esuli, A., Sebastiani, F.: SentiWordNet: A publicly Available Lexical Resource for Opinion Mining. In: Proceedings of Language Resources and Evaluation, LREC (2006)
Stone, P.J.: The General Inquierer: A Computer Approach to Content Analysis. The MIT Press, Cambridge (1996)
Tan, S., Zhang, J.: An empirical study of sentiment analysis for Chinese documents. Expert System with Applications 34, 2622–2629 (2008)
Vapnik, V.: The Nature of Statistical Learning Theory. Springer-Verlag, New York (1995)
Zhang, C., Zeng, D., Li, J., Wang, F.-Y., Zuo, W.: Sentiment analysis of Chinese documents: From sentence to document level. JASIST 60, 2474–2487 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Martínez-Cámara, E., Martín-Valdivia, M.T., Ureña-López, L.A. (2011). Opinion Classification Techniques Applied to a Spanish Corpus. In: Muñoz, R., Montoyo, A., Métais, E. (eds) Natural Language Processing and Information Systems. NLDB 2011. Lecture Notes in Computer Science, vol 6716. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22327-3_17
Download citation
DOI: https://doi.org/10.1007/978-3-642-22327-3_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22326-6
Online ISBN: 978-3-642-22327-3
eBook Packages: Computer ScienceComputer Science (R0)