Abstract
The World Wide Web (WWW) holds a wide range of information, and retrieval over it relies mainly on keyword matching, which often reduces retrieval accuracy. Automatic query expansion is one of the primary methods in information retrieval; it addresses the vocabulary mismatch problem that retrieval systems often face when trying to retrieve an appropriate document from keywords alone. This paper proposes a novel hybrid COOT-based Cat and Mouse Optimization (CMO) algorithm, named hybrid COOT-CMO, for selecting the optimal candidate terms in the automatic query expansion process. To improve the accuracy of the CMO algorithm, its parameters are tuned with the Coot algorithm. The most suitable expanded query is identified from the available sets of expanded queries, also known as the candidate query pool. All feasible combinations in this candidate query pool are obtained from the top retrieved documents. Benchmark datasets, namely the GOV2 Test Collection, the Cranfield Collections, and the NTCIR Test Collection, are used to assess the performance of the proposed hybrid COOT-CMO method for automatic query expansion. The proposed method surpasses existing state-of-the-art techniques on performance measures such as F-score, precision, and mean average precision (MAP).
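As a rough illustration of the candidate query pool and the evaluation measures named above, the following Python sketch builds a pool from the top retrieved documents and computes precision, F-score, and average precision. It is not the paper's hybrid COOT-CMO method: the raw-frequency term ranking, the whitespace tokenization, and the function names (`candidate_terms`, `candidate_query_pool`, and so on) are assumptions made only for this example. In the proposed approach an optimizer would search the pool rather than enumerate it exhaustively.

```python
from collections import Counter
from itertools import combinations


def candidate_terms(query, top_docs, max_terms=3):
    """Pick expansion terms from the top retrieved documents.

    Assumption: terms are ranked by raw frequency; the paper does not
    specify this scoring rule in its abstract.
    """
    query_terms = set(query.lower().split())
    counts = Counter(
        token
        for doc in top_docs
        for token in doc.lower().split()
        if token not in query_terms
    )
    # Sort by descending count, then alphabetically, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [term for term, _ in ranked[:max_terms]]


def candidate_query_pool(query, terms, max_added=2):
    """Enumerate expanded queries: the query plus 1..max_added candidate terms."""
    pool = []
    for size in range(1, max_added + 1):
        for combo in combinations(terms, size):
            pool.append(query + " " + " ".join(combo))
    return pool


def precision(retrieved, relevant):
    """Fraction of retrieved documents that are relevant."""
    if not retrieved:
        return 0.0
    return sum(1 for doc in retrieved if doc in relevant) / len(retrieved)


def recall(retrieved, relevant):
    """Fraction of relevant documents that were retrieved."""
    if not relevant:
        return 0.0
    return sum(1 for doc in retrieved if doc in relevant) / len(relevant)


def f_score(p, r):
    """Harmonic mean of precision and recall (F1)."""
    return 2 * p * r / (p + r) if (p + r) > 0 else 0.0


def average_precision(ranked, relevant):
    """Mean of precision values at the ranks where relevant documents appear."""
    hits, total = 0, 0.0
    for rank, doc in enumerate(ranked, start=1):
        if doc in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant) if relevant else 0.0


def mean_average_precision(runs):
    """MAP over (ranked_list, relevant_set) pairs, one pair per query."""
    if not runs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)
```

Each expanded query in the pool would be run against the collection and scored with these measures, and the highest-scoring query kept as the final expansion.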
Cite this article
Alqahtani, A.S., Saravanan, P., Maheswari, M. et al. An automatic query expansion based on hybrid CMO-COOT algorithm for optimized information retrieval. J Supercomput 78, 8625–8643 (2022). https://doi.org/10.1007/s11227-021-04171-y
DOI: https://doi.org/10.1007/s11227-021-04171-y