Abstract
To help individuals or companies make a systematic and more accurate decisions, sentiment analysis (SA) is used to evaluate the polarity of reviews. In SA, feature selection phase is an important phase for machine learning classifiers specifically when the datasets used in training is huge. Whale Optimization Algorithm (WOA) is one of the recent metaheuristic optimization algorithm that mimics the whale hunting mechanism. However, WOA suffers from the same problem faced by many other optimization algorithms and tend to fall in local optima. To overcome these problems, two improvements for WOA algorithm are proposed in this paper. The first improvement includes using Elite Opposition-Based Learning (EOBL) at initialization phase of WOA. The second improvement involves the incorporation of evolutionary operators from Differential Evolution algorithm at the end of each WOA iteration including mutation, crossover, and selection operators. In addition, we also used Information Gain (IG) as a filter features selection technique with WOA using Support Vector Machine (SVM) classifier to reduce the search space explored by WOA. To verify our proposed approach, four Arabic benchmark datasets for sentiment analysis are used since there are only a few studies in sentiment analysis conducted for Arabic language as compared to English. The proposed algorithm is compared with six well-known optimization algorithms and two deep learning algorithms. The comprehensive experiments results show that the proposed algorithm outperforms all other algorithms in terms of sentiment analysis classification accuracy through finding the best solutions, while its also minimizes the number of selected features.
Similar content being viewed by others
References
Schouten K, Frasincar F (2016) Survey on aspect-level sentiment analysis. IEEE Trans Knowl Data Eng 28(3):813–830
Rana TA, Cheah YN (2016) Aspect extraction in sentiment analysis: comparative analysis and survey. Artif Intell Rev 46(4):459–483
Ahmad SR, Bakar AA, Yaakub MR (2015) Metaheuristic algorithms for feature selection in sentiment analysis. In Science and Information Conference (SAI), 2015 (pp. 222–226). IEEE
Liu H, Yu L (2005) Toward integrating feature selection algorithms for classification and clustering. IEEE Trans Knowl Data Eng 17(4):491–502
Mirjalili S, Lewis A (2016) The whale optimization algorithm. Adv Eng Softw 95:51–67
Aljarah I, Faris H, Mirjalili S (2018) Optimizing connection weights in neural networks using the whale optimization algorithm. Soft Comput 22(1):1–15
Wang J, Du P, Niu T, Yang W (2017) A novel hybrid system based on a new proposed algorithm—multi-objective whale optimization algorithm for wind speed forecasting. Appl Energy 208:344–360
Prakash DB, Lakshminarayana C (2017) Optimal siting of capacitors in radial distribution network using whale optimization algorithm. Alexandria Eng J 56(4):499–509
Reddy PDP, Reddy VV, Manohar TG (2017) Whale optimization algorithm for optimal sizing of renewable resources for loss reduction in distribution systems. Renewables: Wind Water Solar 4(1):3
Mostafa A, Hassanien AE, Houseni M, Hefny H (2017) Liver segmentation in MRI images based on whale optimization algorithm. Multimed Tools Appl 76(23):24931–24954
Zhou X, Wu Z, Wang H (2012) Elite opposition-based differential evolution for solving large-scale optimization problems and its implementation on GPU. In Parallel and Distrib Comput, Applications and Technologies (PDCAT), 2012 13th International Conference on (pp. 727–732). IEEE
Sarkhel R, Das N, Saha AK, Nasipuri M (2018) An improved harmony search algorithm embedded with a novel piecewise opposition based learning algorithm. Eng Appl Artif Intell 67:317–330
Shan X, Liu K, Sun PL (2016) Modified Bat Algorithm Based on Lévy Flight and Opposition Based Learning Scientific Programming, 2016
Mahdavi S, Rahnamayan S, Deb K (2017) Opposition based learning: a literature review. Swarm and Evolutionary Computation
Wang H, Wu Z, Liu Y, Wang J, Jiang D, Chen L (2009) Space transformation search: a new evolutionary technique. Proceedings of the first ACM/SIGEVO Summit on Genetic and Evol Comput (pp. 537–544). ACM
Wang H, Wu Z, Rahnamayan S, Liu Y, Ventresca M (2011) Enhancing particle swarm optimization using generalized opposition-based learning. Inf Sci 181(20):4699–4714
Guo Z, Wang S, Yue X, Yang H (2017) Global harmony search with generalized opposition-based learning. Soft Comput 21(8):2129–2137
Wang H, Wang W, Zhu H, Sun H (2013) Generalised opposition-based differential evolution for frequency modulation parameter optimisation. Int J Model Identif Control 18(4):372–379
Si T, De A, Bhattacharjee AK (2014) Particle swarm optimization with generalized opposition based learning in particle's pbest position. In Circuit, Power and Computing Technologies (ICCPCT), 2014 International Conference on (pp. 1662–1667). IEEE
Wang H, Wang W, Sun H (2015) Firefly algorithm with generalised opposition-based learning. Int J Wirel Mob Comput 9(4):370–376
Zhou X, Wu Z, Deng C, Peng H (2015) Enhancing artificial bee colony algorithm with generalised opposition-based learning. Int J Comput Sci Math 6(3):297–309
Chen X, Yu K, Du W, Zhao W, Liu G (2016) Parameters identification of solar cell models using generalized oppositional teaching learning-based optimization. Energy 99:170–180
Zhou Y, Wang R, Luo Q (2016) Elite opposition-based flower pollination algorithm. Neurocomputing 188:294–310
Zhang S, Luo Q, Zhou Y (2017) Hybrid grey wolf optimizer using elite opposition-based learning strategy and simplex method. Int J Comput Intell Appl 16(02):1750012
Huang K, Zhou Y, Wu X, Luo Q (2016) A cuckoo search algorithm with elite opposition-based strategy. J Intell Syst 25(4):567–593
Storn R, Price K (1997) Differential evolution–a simple and efficient heuristic for global optimization over continuous spaces. J Glob Optim 11(4):341–359
Jadon SS, Tiwari R, Sharma H, Bansal JC (2017) Hybrid artificial bee colony algorithm with differential evolution. Appl Soft Comput 58:11–24
Zheng YJ, Xu XL, Ling HF, Chen SY (2015) A hybrid fireworks optimization method with differential evolution operators. Neurocomputing 148:75–82
Awad NH, Ali MZ, Suganthan PN, Reynolds RG (2017) CADE: a hybridization of cultural algorithm and differential evolution for numerical optimization. Inf Sci 378:215–241
Wang G, Sun J, Ma J, Xu K, Gu J (2014) Sentiment classification: the contribution of ensemble learning. Decis Support Syst 57:77–93
Moraes R, Valiati JF, Neto WPG (2013) Document-level sentiment classification: an empirical comparison between SVM and ANN. Expert Syst Appl 40(2):621–633
Ye Q, Zhang Z, Law R (2009) Sentiment classification of online reviews to travel destinations by supervised machine learning approaches. Expert Syst Appl 36(3):6527–6535
Saleh MR, Martín-Valdivia MT, Montejo-Ráez A, Ureña-López LA (2011) Experiments with SVM to classify opinions in different domains. Expert Syst Appl 38(12):14799–14804
Aliane AA, Aliane H, Ziane M, Bensaou N (2016) A genetic algorithm feature selection based approach for Arabic sentiment classification, IEEE
Duwairi RM, Qarqaz I (2016) A framework for Arabic sentiment analysis using supervised classification. Int J Data Mining Model Manag 8(4):369–381
Al-Radaideh QA, Al-Qudah GY (2017) Application of rough set-based feature selection for Arabic sentiment analysis. Cogn Comput, 1–10
Sabri B, Saad S (2016) Arabic sentiment analysis with optimal combination of features selection and machine learning approaches. Res J Appl Sci Eng Technol 13(5):386–393
Rushdi-Saleh M, Teresa M-VM, Urea-Lopez AL, Perea-Ortega JM (2011) OCA: opinion corpus for Arabic. J Am Soc Inf Sci 62:2045–2054
Bahassine S, Madani A, Kissi M (2016) An improved Chi-sqaure feature selection for Arabic text classification using decision Trees In Intelligent Systems: Theories and Applications (SITA), 2016 11th International Conference on (pp. 1–5). IEEE
Al-Kabi MN, Abdulla NA, Al-Ayyoub M (2013) An analytical study of arabic sentiments: Maktoob case study. In Internet Technology and Secured Transactions (ICITST), 2013 8th International Conference for (pp. 89–94). IEEE
Abdulla NA, Ahmed NA, Shehab MA, Al-Ayyoub M (2013) Arabic sentiment analysis: Lexicon-based and corpus-based. In Applied Electr Eng and Computing Technologies (AEECT), 2013 IEEE Jordan Conference on (pp. 1–6). IEEE
Alomari KM, ElSherif HM, Shaalan K (2017) Arabic Tweets Sentimental Analysis Using Mach Learn In International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems (pp. 602–610). Springer, Cham
Abdulla NA, Al-Ayyoub M, Al-Kabi MN (2014) An extended analytical study of Arabic sentiments. Int J Big Data Intell 1 1(1–2):103–113
Omar N, Albared M, Al-Moslmi T, Al-Shabi A (2014) A comparative study of feature selection and Mach Learn algorithms for Arabic sentiment classification. In Asia Inf Retr symposium (pp. 429–443). Springer, Cham
Bostani H, Sheikhan M (2017) Hybrid of binary gravitational search algorithm and mutual information for feature selection in intrusion detection systems. Soft Comput 21(9):2307–2324
Zhang L, Shan L, Wang J (2017) Optimal feature selection using distance-based discrete firefly algorithm with mutual information criterion. Neural Comput & Applic 28(9):2795–2808
Wang H, Niu B (2017) A novel bacterial algorithm with randomness control for feature selection in classification. Neurocomputing 228:176–186
Barani F, Mirhosseini M, Nezamabadi-Pour H (2017) Application of binary quantum-inspired gravitational search algorithm in feature subset selection. Appl Intell 47(2):304–318
Hancer E, Xue B, Zhang M, Karaboga D, Akay B (2018) Pareto front feature selection based on artificial bee colony optimization. Inf Sci 422:462–479
Wolpert DH, Macready WG (1997) No free lunch theorems for optimization. IEEE Trans Evol Comput 1(1):67–82
Tizhoosh HR (2005) Opposition-based learning: a new scheme for machine intelligence. In Computational intelligence for modelling, control and automation, 2005 and international conference on intelligent agents, web technologies and internet commerce, international conference on (Vol. 1, pp. 695–701). IEEE
Song Q, Ni J, Wang G (2013) A fast clustering-based feature subset selection algorithm for high-dimensional data. IEEE Trans Knowl Data Eng 25(1):1–14
Abdulla N, Mahyoub N, Shehab M, Al-Ayyoub M (2013) Arabic sentiment analysis: Corpus-based and lexicon-based. In Proceedings of The IEEE conference on Applied Electrical Engineering and Computing Technologies (AEECT)
Al-Moslmi T, Albared M, Al-Shabi A, Omar N, Abdullah S (2017) Arabic senti-lexicon: constructing publicly available language resources for arabic sentiment analysis. J Inf Sci, 0165551516683908
Saremi S, Mirjalili S, Lewis A (2017) Grasshopper optimisation algorithm: theory and application. Adv Eng Softw 105:30–47
Mirjalili S (2015) The ant lion optimizer. Adv Eng Softw 83:80–98
Bahassine S, Madani A, Al-Sarem M, Kissi M (2018) Feature selection using an improved chi-square for Arabic text classification. Journal of King Saud University-Computer and Information Sciences
Al-Smadi M, Al-Ayyoub M, Jararweh Y, Qawasmeh O (2018) Enhancing aspect-based sentiment analysis of Arabic hotels’ reviews using morphological, syntactic and semantic features. Inf Process Manag
Abdul-Mageed M, Diab M, Kübler S (2014) SAMAR: subjectivity and sentiment analysis for Arabic social media. Comput Speech Lang 28(1):20–37
Soliman TH, Elmasry MA, Hedar A, Doss MM (2014) Sentiment analysis of Arabic slang comments on facebook. Int J Comput Technol 12(5):3470–3478
Nasser A, Dinçer K, Sever H (2016) Investigation of the feature selection problem for sentiment analysis in Arabic language. Res Comput Sci 110:41–54
Al-Rubaiee H, Qiu R, Li D (2016). Identifying Mubasher software products through sentiment analysis of Arabic tweets. In Industrial Informatics and Computer Systems (CIICS), 2016 International Conference on (pp. 1–6). IEEE
Cherif W, Madani A, Kissi M (2015) A new modeling approach for Arabic opinion mining recognition. In 2015 Intelligent Systems and Computer Vision (ISCV) (pp. 1–6). IEEE
Ibrahim HS, Abdou SM, Gheith M (2015) Sentiment analysis for modern standard Arabic and colloquial. arXiv preprint arXiv:1505.03105
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Tubishat, M., Abushariah, M.A.M., Idris, N. et al. Improved whale optimization algorithm for feature selection in Arabic sentiment analysis. Appl Intell 49, 1688–1707 (2019). https://doi.org/10.1007/s10489-018-1334-8
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-018-1334-8