Abstract
The curse of dimensionality is a well-known problem in data classification. In this paper, the Center-Based Sampling (CBS) theory is utilized to develop a new variant of Particle Swarm Optimization (PSO), dubbed CBS-PSO, capable of dealing with the curse of dimensionality problem in text classification. More specifically, the CBS is exploited to equip PSO with two specialized mechanisms to attract the search toward the center region of the search space. The first mechanism estimates the coordinates of the center point of the search space using Rocchio Algorithm (RA), whereas the second mechanism uses the RA-based estimation to generate informed particles, located at the center region, and incorporate them in the swarm to gradually attract the search for the optimal classifiers toward this promising region. The performance of the CBS-PSO is evaluated against three Machine Learning (ML) approaches on three classification tasks of textual datasets from UC Irvine ML repository. The results indicate that the CBS-PSO can be regarded as a very competitive and promising text classifier with much space for improvement.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kennedy, J., Eberhart, R.: Particle swarm optimization. In: IEEE International Conference on Neural Networks (1995)
del Valle, Y., Venayagamoorthy, G.K., Mohagheghi, S., Hernandez, J.C., Harley, R.G.: Particle swarm optimization: basic concepts, variants and applications in power systems. IEEE Trans. Evol. Comput. 12(2), 171–195 (2008)
Abraham, A., Grosan, C., Ramos, V.: Swarm intelligence in data mining. Stud. Comput. Intell. 34, 1–20 (2006)
Punitha, S., Jeyakarthic, M.: Particle swarm optimization based classification algorithm for expert prediction systems. In: The International Conference on Inventive Computation Technologies (ICICT), Coimbatore, India (2020)
Santana, P.J., Lanzarini, L., Barivier, A.F.: Variations of particle swarm optimization for obtaining classification rules applied to credit risk in financial institutions of Ecuador. Risks 8(1), 1 (2019)
Nouaouria, N., Boukadoum, M.: Particle swarm classification for high dimensional data sets. In: 22th International IEEE Conference on Tools with Artificial Intelligence (2010)
Nouaouria, N., Boukadoum, M., Proulx, R.: Particle swarm classification: a survey and positioning. Pattern Recogn. 46(7), 2028–2044 (2013)
Rahnamayan, S., Wang, G.G.: Center-based sampling for population-based algorithms. In: 2009 IEEE Congress on Evolutionary Computation, Trondheim, Norway (2009)
Esmailzadeh, A., Rahnamayan, S.: Enhanced differential evolution using center-based sampling. In: IEEE Congress on Evolutionary Computation (2011)
Sousa, T., Silva, A., Neves, A.: Particle swarm based data mining algorithms for classification tasks. Parallel Comput. 30(5–6), 767–783 (2004)
De Falco, I., Della Cioppa, A., Tarantino, E.: Evaluation of particle swarm optimization effectiveness in classification. In: Bloch, I., Petrosino, A., Tettamanzi, A.G.B. (eds.) WILF 2005. LNCS (LNAI), vol. 3849, pp. 164–171. Springer, Heidelberg (2006). https://doi.org/10.1007/11676935_20
Nouaouria, N., Boukadoum, M.: A particle swarm optimization approach for substance identification. In: GECCO 2009 (2009)
Mahdavi, S., Rahnamayan, S., Deb, K.: Center-based initialization of cooperative co-evolutionary algorithm for large-scale optimization. In: IEEE Congress on Evolutionary Computation (CEC), 25–29 July 2016, Vancouver, Canada (2016)
Yahya, A.A., Osman, A., El-Bashir, M.S.: Rocchio algorithm-based particle initialization mechanism for effective PSO classification of high dimensional data. Swarm Evol. Comput. 34, 18–32 (2018)
Liu, Y., Qin, Z., Shi, Z.W., Lu, J.: Center particle swarm optimization. Neurocomputing 70(4–6), 672–679 (2007)
Rocchio, J.J.: Relevance feedback in information retrieval. SMART Retrieval System: Exp. Autom. Doc. Process. 313–323 (1971)
Hall, M., Frank, E., Holmes, G., Pfahringer, B.R.: The WEKA data mining software: an update. ACM SIGKDD Explor. Newsl. 19(2), 10–18 (2009)
Kazimipour, B., Qin, A.K.: Why advanced population initialization techniques perform poorly in high dimension? In: Dick, G., et al. (eds.) SEAL 2014. LNCS, vol. 8886, pp. 479–490. Springer, Heidelberg (2014). https://doi.org/10.1007/978-3-319-13563-2_41
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Yahya, A.A., Asiri, Y., Alattab, A.A. (2021). Utilizing Center-Based Sampling Theory to Enhance Particle Swarm Classification of Textual Data. In: Fujita, H., Selamat, A., Lin, J.CW., Ali, M. (eds) Advances and Trends in Artificial Intelligence. Artificial Intelligence Practices. IEA/AIE 2021. Lecture Notes in Computer Science(), vol 12798. Springer, Cham. https://doi.org/10.1007/978-3-030-79457-6_37
Download citation
DOI: https://doi.org/10.1007/978-3-030-79457-6_37
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-79456-9
Online ISBN: 978-3-030-79457-6
eBook Packages: Computer ScienceComputer Science (R0)