Skip to main content
Log in

Improved whale optimization algorithm for feature selection in Arabic sentiment analysis

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

To help individuals or companies make a systematic and more accurate decisions, sentiment analysis (SA) is used to evaluate the polarity of reviews. In SA, feature selection phase is an important phase for machine learning classifiers specifically when the datasets used in training is huge. Whale Optimization Algorithm (WOA) is one of the recent metaheuristic optimization algorithm that mimics the whale hunting mechanism. However, WOA suffers from the same problem faced by many other optimization algorithms and tend to fall in local optima. To overcome these problems, two improvements for WOA algorithm are proposed in this paper. The first improvement includes using Elite Opposition-Based Learning (EOBL) at initialization phase of WOA. The second improvement involves the incorporation of evolutionary operators from Differential Evolution algorithm at the end of each WOA iteration including mutation, crossover, and selection operators. In addition, we also used Information Gain (IG) as a filter features selection technique with WOA using Support Vector Machine (SVM) classifier to reduce the search space explored by WOA. To verify our proposed approach, four Arabic benchmark datasets for sentiment analysis are used since there are only a few studies in sentiment analysis conducted for Arabic language as compared to English. The proposed algorithm is compared with six well-known optimization algorithms and two deep learning algorithms. The comprehensive experiments results show that the proposed algorithm outperforms all other algorithms in terms of sentiment analysis classification accuracy through finding the best solutions, while its also minimizes the number of selected features.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  1. Schouten K, Frasincar F (2016) Survey on aspect-level sentiment analysis. IEEE Trans Knowl Data Eng 28(3):813–830

    Article  Google Scholar 

  2. Rana TA, Cheah YN (2016) Aspect extraction in sentiment analysis: comparative analysis and survey. Artif Intell Rev 46(4):459–483

    Article  Google Scholar 

  3. Ahmad SR, Bakar AA, Yaakub MR (2015) Metaheuristic algorithms for feature selection in sentiment analysis. In Science and Information Conference (SAI), 2015 (pp. 222–226). IEEE

  4. Liu H, Yu L (2005) Toward integrating feature selection algorithms for classification and clustering. IEEE Trans Knowl Data Eng 17(4):491–502

    Article  Google Scholar 

  5. Mirjalili S, Lewis A (2016) The whale optimization algorithm. Adv Eng Softw 95:51–67

    Article  Google Scholar 

  6. Aljarah I, Faris H, Mirjalili S (2018) Optimizing connection weights in neural networks using the whale optimization algorithm. Soft Comput 22(1):1–15

    Article  Google Scholar 

  7. Wang J, Du P, Niu T, Yang W (2017) A novel hybrid system based on a new proposed algorithm—multi-objective whale optimization algorithm for wind speed forecasting. Appl Energy 208:344–360

    Article  Google Scholar 

  8. Prakash DB, Lakshminarayana C (2017) Optimal siting of capacitors in radial distribution network using whale optimization algorithm. Alexandria Eng J 56(4):499–509

    Article  Google Scholar 

  9. Reddy PDP, Reddy VV, Manohar TG (2017) Whale optimization algorithm for optimal sizing of renewable resources for loss reduction in distribution systems. Renewables: Wind Water Solar 4(1):3

    Article  Google Scholar 

  10. Mostafa A, Hassanien AE, Houseni M, Hefny H (2017) Liver segmentation in MRI images based on whale optimization algorithm. Multimed Tools Appl 76(23):24931–24954

    Article  Google Scholar 

  11. Zhou X, Wu Z, Wang H (2012) Elite opposition-based differential evolution for solving large-scale optimization problems and its implementation on GPU. In Parallel and Distrib Comput, Applications and Technologies (PDCAT), 2012 13th International Conference on (pp. 727–732). IEEE

  12. Sarkhel R, Das N, Saha AK, Nasipuri M (2018) An improved harmony search algorithm embedded with a novel piecewise opposition based learning algorithm. Eng Appl Artif Intell 67:317–330

    Article  Google Scholar 

  13. Shan X, Liu K, Sun PL (2016) Modified Bat Algorithm Based on Lévy Flight and Opposition Based Learning Scientific Programming, 2016

  14. Mahdavi S, Rahnamayan S, Deb K (2017) Opposition based learning: a literature review. Swarm and Evolutionary Computation

  15. Wang H, Wu Z, Liu Y, Wang J, Jiang D, Chen L (2009) Space transformation search: a new evolutionary technique. Proceedings of the first ACM/SIGEVO Summit on Genetic and Evol Comput (pp. 537–544). ACM

  16. Wang H, Wu Z, Rahnamayan S, Liu Y, Ventresca M (2011) Enhancing particle swarm optimization using generalized opposition-based learning. Inf Sci 181(20):4699–4714

    Article  MathSciNet  Google Scholar 

  17. Guo Z, Wang S, Yue X, Yang H (2017) Global harmony search with generalized opposition-based learning. Soft Comput 21(8):2129–2137

    Article  Google Scholar 

  18. Wang H, Wang W, Zhu H, Sun H (2013) Generalised opposition-based differential evolution for frequency modulation parameter optimisation. Int J Model Identif Control 18(4):372–379

    Article  Google Scholar 

  19. Si T, De A, Bhattacharjee AK (2014) Particle swarm optimization with generalized opposition based learning in particle's pbest position. In Circuit, Power and Computing Technologies (ICCPCT), 2014 International Conference on (pp. 1662–1667). IEEE

  20. Wang H, Wang W, Sun H (2015) Firefly algorithm with generalised opposition-based learning. Int J Wirel Mob Comput 9(4):370–376

    Article  Google Scholar 

  21. Zhou X, Wu Z, Deng C, Peng H (2015) Enhancing artificial bee colony algorithm with generalised opposition-based learning. Int J Comput Sci Math 6(3):297–309

    Article  MathSciNet  Google Scholar 

  22. Chen X, Yu K, Du W, Zhao W, Liu G (2016) Parameters identification of solar cell models using generalized oppositional teaching learning-based optimization. Energy 99:170–180

    Article  Google Scholar 

  23. Zhou Y, Wang R, Luo Q (2016) Elite opposition-based flower pollination algorithm. Neurocomputing 188:294–310

    Article  Google Scholar 

  24. Zhang S, Luo Q, Zhou Y (2017) Hybrid grey wolf optimizer using elite opposition-based learning strategy and simplex method. Int J Comput Intell Appl 16(02):1750012

    Article  Google Scholar 

  25. Huang K, Zhou Y, Wu X, Luo Q (2016) A cuckoo search algorithm with elite opposition-based strategy. J Intell Syst 25(4):567–593

    Google Scholar 

  26. Storn R, Price K (1997) Differential evolution–a simple and efficient heuristic for global optimization over continuous spaces. J Glob Optim 11(4):341–359

    Article  MathSciNet  MATH  Google Scholar 

  27. Jadon SS, Tiwari R, Sharma H, Bansal JC (2017) Hybrid artificial bee colony algorithm with differential evolution. Appl Soft Comput 58:11–24

    Article  Google Scholar 

  28. Zheng YJ, Xu XL, Ling HF, Chen SY (2015) A hybrid fireworks optimization method with differential evolution operators. Neurocomputing 148:75–82

    Article  Google Scholar 

  29. Awad NH, Ali MZ, Suganthan PN, Reynolds RG (2017) CADE: a hybridization of cultural algorithm and differential evolution for numerical optimization. Inf Sci 378:215–241

    Article  Google Scholar 

  30. Wang G, Sun J, Ma J, Xu K, Gu J (2014) Sentiment classification: the contribution of ensemble learning. Decis Support Syst 57:77–93

    Article  Google Scholar 

  31. Moraes R, Valiati JF, Neto WPG (2013) Document-level sentiment classification: an empirical comparison between SVM and ANN. Expert Syst Appl 40(2):621–633

    Article  Google Scholar 

  32. Ye Q, Zhang Z, Law R (2009) Sentiment classification of online reviews to travel destinations by supervised machine learning approaches. Expert Syst Appl 36(3):6527–6535

    Article  Google Scholar 

  33. Saleh MR, Martín-Valdivia MT, Montejo-Ráez A, Ureña-López LA (2011) Experiments with SVM to classify opinions in different domains. Expert Syst Appl 38(12):14799–14804

    Article  Google Scholar 

  34. Aliane AA, Aliane H, Ziane M, Bensaou N (2016) A genetic algorithm feature selection based approach for Arabic sentiment classification, IEEE

  35. Duwairi RM, Qarqaz I (2016) A framework for Arabic sentiment analysis using supervised classification. Int J Data Mining Model Manag 8(4):369–381

    Google Scholar 

  36. Al-Radaideh QA, Al-Qudah GY (2017) Application of rough set-based feature selection for Arabic sentiment analysis. Cogn Comput, 1–10

  37. Sabri B, Saad S (2016) Arabic sentiment analysis with optimal combination of features selection and machine learning approaches. Res J Appl Sci Eng Technol 13(5):386–393

    Article  Google Scholar 

  38. Rushdi-Saleh M, Teresa M-VM, Urea-Lopez AL, Perea-Ortega JM (2011) OCA: opinion corpus for Arabic. J Am Soc Inf Sci 62:2045–2054

    Article  Google Scholar 

  39. Bahassine S, Madani A, Kissi M (2016) An improved Chi-sqaure feature selection for Arabic text classification using decision Trees In Intelligent Systems: Theories and Applications (SITA), 2016 11th International Conference on (pp. 1–5). IEEE

  40. Al-Kabi MN, Abdulla NA, Al-Ayyoub M (2013) An analytical study of arabic sentiments: Maktoob case study. In Internet Technology and Secured Transactions (ICITST), 2013 8th International Conference for (pp. 89–94). IEEE

  41. Abdulla NA, Ahmed NA, Shehab MA, Al-Ayyoub M (2013) Arabic sentiment analysis: Lexicon-based and corpus-based. In Applied Electr Eng and Computing Technologies (AEECT), 2013 IEEE Jordan Conference on (pp. 1–6). IEEE

  42. Alomari KM, ElSherif HM, Shaalan K (2017) Arabic Tweets Sentimental Analysis Using Mach Learn In International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems (pp. 602–610). Springer, Cham

  43. Abdulla NA, Al-Ayyoub M, Al-Kabi MN (2014) An extended analytical study of Arabic sentiments. Int J Big Data Intell 1 1(1–2):103–113

    Article  Google Scholar 

  44. Omar N, Albared M, Al-Moslmi T, Al-Shabi A (2014) A comparative study of feature selection and Mach Learn algorithms for Arabic sentiment classification. In Asia Inf Retr symposium (pp. 429–443). Springer, Cham

  45. Bostani H, Sheikhan M (2017) Hybrid of binary gravitational search algorithm and mutual information for feature selection in intrusion detection systems. Soft Comput 21(9):2307–2324

    Article  Google Scholar 

  46. Zhang L, Shan L, Wang J (2017) Optimal feature selection using distance-based discrete firefly algorithm with mutual information criterion. Neural Comput & Applic 28(9):2795–2808

    Article  Google Scholar 

  47. Wang H, Niu B (2017) A novel bacterial algorithm with randomness control for feature selection in classification. Neurocomputing 228:176–186

    Article  Google Scholar 

  48. Barani F, Mirhosseini M, Nezamabadi-Pour H (2017) Application of binary quantum-inspired gravitational search algorithm in feature subset selection. Appl Intell 47(2):304–318

    Article  Google Scholar 

  49. Hancer E, Xue B, Zhang M, Karaboga D, Akay B (2018) Pareto front feature selection based on artificial bee colony optimization. Inf Sci 422:462–479

    Article  Google Scholar 

  50. Wolpert DH, Macready WG (1997) No free lunch theorems for optimization. IEEE Trans Evol Comput 1(1):67–82

    Article  Google Scholar 

  51. Tizhoosh HR (2005) Opposition-based learning: a new scheme for machine intelligence. In Computational intelligence for modelling, control and automation, 2005 and international conference on intelligent agents, web technologies and internet commerce, international conference on (Vol. 1, pp. 695–701). IEEE

  52. Song Q, Ni J, Wang G (2013) A fast clustering-based feature subset selection algorithm for high-dimensional data. IEEE Trans Knowl Data Eng 25(1):1–14

    Article  Google Scholar 

  53. Abdulla N, Mahyoub N, Shehab M, Al-Ayyoub M (2013) Arabic sentiment analysis: Corpus-based and lexicon-based. In Proceedings of The IEEE conference on Applied Electrical Engineering and Computing Technologies (AEECT)

  54. Al-Moslmi T, Albared M, Al-Shabi A, Omar N, Abdullah S (2017) Arabic senti-lexicon: constructing publicly available language resources for arabic sentiment analysis. J Inf Sci, 0165551516683908

  55. Saremi S, Mirjalili S, Lewis A (2017) Grasshopper optimisation algorithm: theory and application. Adv Eng Softw 105:30–47

    Article  Google Scholar 

  56. Mirjalili S (2015) The ant lion optimizer. Adv Eng Softw 83:80–98

    Article  Google Scholar 

  57. Bahassine S, Madani A, Al-Sarem M, Kissi M (2018) Feature selection using an improved chi-square for Arabic text classification. Journal of King Saud University-Computer and Information Sciences

  58. Al-Smadi M, Al-Ayyoub M, Jararweh Y, Qawasmeh O (2018) Enhancing aspect-based sentiment analysis of Arabic hotels’ reviews using morphological, syntactic and semantic features. Inf Process Manag

  59. Abdul-Mageed M, Diab M, Kübler S (2014) SAMAR: subjectivity and sentiment analysis for Arabic social media. Comput Speech Lang 28(1):20–37

    Article  Google Scholar 

  60. Soliman TH, Elmasry MA, Hedar A, Doss MM (2014) Sentiment analysis of Arabic slang comments on facebook. Int J Comput Technol 12(5):3470–3478

    Article  Google Scholar 

  61. Nasser A, Dinçer K, Sever H (2016) Investigation of the feature selection problem for sentiment analysis in Arabic language. Res Comput Sci 110:41–54

    Google Scholar 

  62. Al-Rubaiee H, Qiu R, Li D (2016). Identifying Mubasher software products through sentiment analysis of Arabic tweets. In Industrial Informatics and Computer Systems (CIICS), 2016 International Conference on (pp. 1–6). IEEE

  63. Cherif W, Madani A, Kissi M (2015) A new modeling approach for Arabic opinion mining recognition. In 2015 Intelligent Systems and Computer Vision (ISCV) (pp. 1–6). IEEE

  64. Ibrahim HS, Abdou SM, Gheith M (2015) Sentiment analysis for modern standard Arabic and colloquial. arXiv preprint arXiv:1505.03105

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Norisma Idris.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Tubishat, M., Abushariah, M.A.M., Idris, N. et al. Improved whale optimization algorithm for feature selection in Arabic sentiment analysis. Appl Intell 49, 1688–1707 (2019). https://doi.org/10.1007/s10489-018-1334-8

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-018-1334-8

Keywords

Navigation