Abstract
In text mining, the feature selection process can potentially improve classification accuracy by reducing the high-dimensional feature space to a low-dimensional feature space resulting in an optimal subset of available features. In this paper, a hybrid method and two meta-heuristic algorithms are employed to find an optimal feature subset. The feature selection task is performed in two steps: first, different feature subsets (called local-solutions) are obtained using a hybrid filter and wrapper approaches to reduce high-dimensional feature space; second, local-solutions are integrated using two meta-heuristic algorithms (namely, the harmony search algorithm and the genetic algorithm) in order to find an optimal feature subset. The results of a wide range of comparative experiments on three widely-used datasets in sentiment analysis show that the proposed method for feature selection outperforms other baseline methods in terms of accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Ekbal, A., Saha, S.: Combining feature selection and classifier ensemble using a multiobjective simulated annealing approach: application to named entity recognition. Soft. Comput. 17, 1–16 (2013)
Saeys, Y., Inza, I., Larrañaga, P.: A review of feature selection techniques in bioinformatics. Bioinformatics 23, 2507–2517 (2007)
Yousefpour, A., Ibrahim, R., Abdull Hamed, H.N., Hajmohammadi, M.S.: A comparative study on sentiment analysis. Adv. Environ. Biol. 8, 53–68 (2014)
Diao, R., Shen, Q.: Feature selection with harmony search. Syst., Man, Cybern., Part B: Cybern., IEEE Trans. 42, 1509–1523 (2012)
Oreski, S., Oreski, G.: Genetic algorithm-based heuristic for feature selection in credit risk assessment. Expert Syst. Appl. 41, 2052–2064 (2014)
Rogati, M., Yang, Y.: High-performing feature selection for text classification. In: Proceedings of the Eleventh International Conference on Information and knowledge management, pp. 659–661 (2002)
Uğuz, H.: A two-stage feature selection method for text categorization by using information gain, principal component analysis and genetic algorithm. Knowl.-Based Syst. 24, 1024–1032 (2011)
Yousefpour, A., Ibrahim, R., Abdull Hamed, H.N., Hajmohammadi, M.S.: Feature reduction using standard deviation with different subsets selection in sentiment analysis. In: Nguyen, N.T., Attachoo, B., Trawiński, B., Somboonviwat, K. (eds.) ACIIDS 2014, Part II. LNCS, vol. 8398, pp. 33–41. Springer, Heidelberg (2014)
Warren Liao, T.: T.: Feature extraction and selection from acoustic emission signals with an application in grinding wheel condition monitoring. Eng. Appl. Artif. Intell. 23, 74–84 (2010)
Wang, Y., Liu, Y., Feng, L., Zhu, X.: Novel feature selection method based on harmony search for email classification. Knowl.-Based Syst. 73, 311–323 (2015)
Blitzer, J., Dredze, M., Pereira, F.: Biographies, bollywood, boom-boxes and blenders: Domain adaptation for sentiment classification. In: ACL, pp. 440–447 (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yousefpour, A., Ibrahim, R., Hamed, H.N.A., Yokoi, T. (2016). Integrated Feature Selection Methods Using Metaheuristic Algorithms for Sentiment Analysis. In: Nguyen, N.T., Trawiński, B., Fujita, H., Hong, TP. (eds) Intelligent Information and Database Systems. ACIIDS 2016. Lecture Notes in Computer Science(), vol 9621. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-49381-6_13
Download citation
DOI: https://doi.org/10.1007/978-3-662-49381-6_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-49380-9
Online ISBN: 978-3-662-49381-6
eBook Packages: Computer ScienceComputer Science (R0)