Skip to main content
Log in

An automatic query expansion based on hybrid CMO-COOT algorithm for optimized information retrieval

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

The World Wide Web(WWW) comprises a wide range of information, and it is mainly operated on the principles of keyword matching which often reduces accurate information retrieval. Automatic query expansion is one of the primary methods for information retrieval, and it handles the vocabulary mismatch problem often faced by the information retrieval systems to retrieve an appropriate document using the keywords. This paper proposed a novel approach of hybrid COOT-based Cat and Mouse Optimization (CMO) algorithm named as hybrid COOT-CMO for the appropriate selection of optimal candidate terms in the automatic query expansion process. To improve the accuracy of the Cat and Mouse Optimization (CMO) algorithm, the parameters are tuned with the help of the Coot algorithm. The best suitable expanded query is identified from the available expanded query sets also known as candidate query pools. All feasible combinations in this candidate query pool should be obtained from the top retrieved documents. Benchmark datasets such as the GOV2 Test Collection, the Cranfield Collections, and the NTCIR Test Collection are utilized to assess the performance of the proposed hybrid COOT-CMO method for automatic query expansion. This proposed method surpasses the existing state-of-the-art techniques using many performance measures such as F-score, precision, and mean average precision (MAP).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

References

  1. Carpineto C, Romano G (2012) A survey of automatic query expansion in information retrieval. Acm Comput Surv (CSUR) 44(1):1–50

    Article  Google Scholar 

  2. Fu G, Jones CB, Abdelmoty AI (2005) Ontology-based spatial query expansion in information retrieval. In: OTM Confederated International Conferences" On the Move to Meaningful Internet Systems", Springer, Berlin, Heidelberg, pp 1466–1482 2005

  3. Azad HK, Deepak A (2019) Query expansion techniques for information retrieval: a survey. Inf Process Manage 56(5):1698–1735

    Article  Google Scholar 

  4. Bai J, Song D, Bruza P, Nie JY, Cao G (2005) Query expansion using term relationships in language models for information retrieval. In: Proceedings of the 14th ACM international conference on Information and knowledge management, pp 688–695 2005

  5. Sundararaj V, Muthukumar S, Kumar RS (2018) An optimal cluster formation based energy efficient dynamic scheduling hybrid MAC protocol for heavy traffic load in wireless sensor networks. Comput Secur 77:277–288

    Article  Google Scholar 

  6. Sundararaj V (2016) An efficient threshold prediction scheme for wavelet based ECG signal noise reduction using variable step size firefly algorithm. Int J Intell Eng Syst 9(3):117–126

    Google Scholar 

  7. Sundararaj V (2019) Optimised denoising scheme via opposition-based self-adaptive learning PSO algorithm for wavelet-based ECG signal noise reduction. Int J Biomed Eng Technol 31(4):325

    Article  Google Scholar 

  8. Sundararaj V, Anoop V, Dixit P, Arjaria A, Chourasia U, Bhambri P, Rejeesh MR, Regu S (2020) CCGPA-MPPT: cauchy preferential crossover-based global pollination algorithm for MPPT in photovoltaic system. Prog Photovolt Res Appl 28(11):1128–1145

    Article  Google Scholar 

  9. Ravikumar S, Kavitha D (2021) CNN-OHGS: CNN-oppositional-based henry gas solubility optimization model for autonomous vehicle control system. J Field Robotics 38(7):967–979

    Article  Google Scholar 

  10. Ravikumar S, Kavitha D (2020) IoT based home monitoring system with secure data storage by Keccak-Chaotic sequence in cloud server. J Ambient Intell Human Comput 12(7):7475–7487

    Article  Google Scholar 

  11. Rejeesh MR, Thejaswini P (2020) MOTF: multi-objective optimal trilateral filtering based partial moving frame algorithm for image denoising. Multimed Tools Appl 79(37):28411–28430

    Article  Google Scholar 

  12. Kavitha D, Ravikumar S (2021) IOT and context-aware learning-based optimal neural network model for real-time health monitoring. Trans Emerging Telecommun Technol 32(1):e4132

    Google Scholar 

  13. Xie Q, Sundararaj V, Mr R (2021) Analyzing the factors affecting the attitude of public toward lockdown, institutional trust, and civic engagement activities. J Commun Psychol

  14. Sundararaj V, Rejeesh MR (2021) A detailed behavioral analysis on consumer and customer changing behavior with respect to social networking sites. J Retail Consum Services 58:102190

    Article  Google Scholar 

  15. Alex JA, Anees S, Madheswari N (2013) User authentication based on persuasive cued click points with sound signature. J Comput Sci Inf Technol Secur 3:353–358

    Google Scholar 

  16. Haseena KS, Anees S, Madheswari N (2014) Power optimization using EPAR protocol in MANET. Int J Innov Sci Eng Technol 6:430–436

    Google Scholar 

  17. Nirmal Kumar SJ, Ravimaran S, Alam MM (2020) An effective non-commutative encryption approach with optimized genetic algorithm for ensuring data protection in cloud computing. Comput Model Eng Sci 125(2):671–697

    Google Scholar 

  18. Gowthul Alam MM, Baulkani S (2017) Reformulated query-based document retrieval using optimised kernel fuzzy clustering algorithm. Int J Bus Intell Data Min 12(3):299

    Google Scholar 

  19. Alam MG, Baulkani S (2016) A hybrid approach for web document clustering using K-means and artificial bee colony algorithm. Int J Intell Eng Syst 9(4):11–20

    Google Scholar 

  20. Azath M, Banu RW, Madheswari AN (2011) Improving fairness in network traffic by controlling congestion and unresponsive flows. In: International conference on network security and applications. Springer, Berlin, Heidelberg, pp 356–363

    Google Scholar 

  21. Gupta P, Bali K, Banchs RE, Choudhury M, Rosso P (2014) Query expansion for mixed-script information retrieval. In: Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval, pp 677–686. 2014

  22. Gupta P, Bali K, Banchs RE, Choudhury M, Rosso P (2014) Query expansion for mixed-script information retrieval. In: Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval, pp 677–686

  23. Sankhavara J (2020) Feature weighting in finding feedback documents for query expansion in biomedical document retrieval. SN Comput Sci 1(2):1–7

    Article  Google Scholar 

  24. Klink S, Hust A, Junker M, Dengel A (2002) Improving document retrieval by automatic query expansion using collaborative learning of term-based concepts. In: Lopresti Daniel, Jianying Hu, Kashi Ramanujan (eds) International workshop on document analysis systems. Springer, Berlin, pp 376–387

    Chapter  Google Scholar 

  25. Yusuf NUHU, Yunus MAM, Wahid NORFARADILLA, Nawi NM, Samsudin NA, Arbaiy NUREIZE (2020) Query expansion method for quran search using semantic search and lucene ranking. J Eng Sci Technol 15(1):675–692

    Google Scholar 

  26. Sharma DK, Pamula R, Chauhan DS (2019) A hybrid evolutionary algorithm based automatic query expansion for enhancing document retrieval system. J Ambient Intell Human Comput. https://doi.org/10.1007/s12652-019-01247-9

    Article  Google Scholar 

  27. Singh J, Sharan A (2015) Context window based co-occurrence approach for improving feedback based query expansion in information retrieval. Int J Inf Retriev Res (IJIRR) 5(4):31–45

    Google Scholar 

  28. Gupta Y, Saini A (2017) A novel Fuzzy-PSO term weighting automatic query expansion approach using combined semantic filtering. Knowl-Based Syst 136:97–120

    Article  Google Scholar 

  29. Dehghani M, Hubálovský Š, Trojovský P (2021) Cat and mouse based optimizer: a new nature-inspired optimization algorithm. Sensors 21(15):5214

    Article  Google Scholar 

  30. Gao Y, Zhang G, Ma J, Lu J (2009) A λ-cut and goal-programming-based algorithm for fuzzy-linear multiple-objective bilevel optimization. IEEE Trans Fuzzy Syst 18(1):1–13

    Article  Google Scholar 

  31. Naruei I, Keynia F (2021) A new optimization method based on coot bird natural life model. Expert Syst Appl 183:115352

    Article  Google Scholar 

  32. Mirjalili S, Gandomi AH, Mirjalili SZ, Saremi S, Faris H, Mirjalili SM (2017) Salp swarm algorithm: a bio-inspired optimizer for engineering design problems. Adv Eng Softw 114:163–191

    Article  Google Scholar 

  33. Dirani KM, Abadi M, Alizadeh A, Barhate B, Garza RC, Gunasekara N, Ibrahim G, Majzun Z (2020) Leadership competencies and the essential role of human resource development in times of crisis: a response to Covid-19 pandemic. Human Resour Develop Int 23(4):380–394

    Article  Google Scholar 

  34. Lorenzetti CM, Maguitman AG (2009) A semi-supervised incremental algorithm to automatically formulate topical queries. Inf Sci 179(12):1881–1892

    Article  Google Scholar 

  35. http://ir.dcs.gla.ac.uk/test_collections/gov2-summary.htm.

  36. http://ir.dcs.gla.ac.uk/resources/test_collections/cran/

  37. https://www.nii.ac.jp/dsc/idr/en/ntcir/ntcir-taskdata.html

  38. Liu Q, Huang H, Xuan J, Zhang G, Gao Y, Jie L (2020) A fuzzy word similarity measure for selecting top-k similar words in query expansion. IEEE Trans Fuzzy Syst. https://doi.org/10.1109/TFUZZ.2020.2993702

    Article  Google Scholar 

  39. Zheng Z, Kai H, Ben H, Xianpei H, Le S, Andrew Y (2020) BERT-QE: contextualized query expansion for document re-ranking.

  40. Jiang Y (2020) Semantically-enhanced information retrieval using multiple knowledge sources. Cluster Comput 23(4):2925–2944

    Article  Google Scholar 

  41. Bhopale AP, Tiwari A (2020) Swarm optimized cluster based framework for information retrieval. Expert Syst Appl 154:113441

    Article  Google Scholar 

  42. Kane M (1996) The precision of measurements. Appl Measur Educ 9(4):355–379

    Article  Google Scholar 

  43. Kishida K (2005) Property of average precision and its generalization: An examination of evaluation indicator for information retrieval experiments. National Institute of Informatics, Tokyo, Japan, p 19p

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Abdullah Saleh Alqahtani.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Alqahtani, A.S., Saravanan, P., Maheswari, M. et al. An automatic query expansion based on hybrid CMO-COOT algorithm for optimized information retrieval. J Supercomput 78, 8625–8643 (2022). https://doi.org/10.1007/s11227-021-04171-y

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-021-04171-y

Keywords

Navigation