Skip to main content
Log in

Federated search techniques: an overview of the trends and state of the art

  • Review
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Conventional search engines, such as Bing, Baidu, and Google, offer a convenient way for users to seek information on the web. However, with all the benefits they provide, one major limitation is that a sizable portion of the information sources on the web may not be available due to commercial or proprietary reasons. Federated search solves this problem by providing a single user interface through which multiple independent resources can be searched and their results are combined for end users. Up to now, federated search has become a well-established research area, with many systems developed and algorithms proposed to deal with three major issues: resource description, resource selection, and results merging. This paper reviews state-of-the-art federated search techniques developed over the past three decades, with more attention to recent achievement. Both resource selection and result merging methods are categorized into three types, heuristic, machine learning-based, and other methods. Apart from the three major issues above-mentioned, we also discuss systems and prototypes developed, and datasets used for federated search experiments. Some other related issues including retrieval evaluation, aggregated search, metasearch, supporting personalization in federated search, are also covered. Finally, we conclude by discussing some directions for future research.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

Notes

  1. http://www.bing.com/.

  2. http://www.baidu.com/.

  3. http://www.google.com/.

  4. https://www.priceline.com/.

  5. https://searchengineland.com/google-search-press-129925.

  6. https://efpf-portal.ascora.eu/.

  7. https://www.nimble-project.org.

  8. https://www.composition-project.eu.

  9. https://www.vf-os.eu.

  10. https://www.digicor-project.eu.

  11. CERN and OpenAIREplus launch new European research repository (sciencenode.org)

  12. https://www.pinterest.com.

  13. https://www.flickr.com.

References

  1. Sreeja SR, Chaudhari S (2014) Review of web crawlers. Int J Knowl Web Intell 5(1):49–61

    Article  Google Scholar 

  2. Nguyen D, Demeester T, Trieschnigg D, Hiemstra D (2012) Federated search in the wild: the combined power of over a hundred search engines. In Chen X, Lebanon G, Wang H, Zaki MJ (eds) 21st ACM international conference on information and knowledge management, CIKM’12, Maui, HI, USA, October 29–November 02, 2012, pp. 1874–1878. https://doi.org/10.1145/2396761.2398535

  3. Li X (2022) Federated search to merge the results of the extracted functional requirements. PhD thesis, University of Cincinnati

  4. Damas J, Devezas J, Nunes S (2022) Federated search using query log evidence. In: Progress in artificial intelligence: Proceedings of 21st EPIA conference on artificial intelligence, EPIA 2022, Lisbon, Portugal, August 31–September 2, 2022, pp 794–805. Springer. https://doi.org/10.1007/978-3-031-16474-3_64.

  5. Gravano L, Chang C-CK, Garcia-Molina H, Paepcke A (1997) STARTS: stanford proposal for internet meta-searching. In: Proceedings of the 1997 ACM SIGMOD international conference on management of data, pp 207–218. https://doi.org/10.1145/253262.253299

  6. Gravano L, Garcia-Molina H, Tomasic A (1994) The effectiveness of GlOSS for the text database discovery problem. In: Proceedings of the 1994 ACM SIGMOD international conference on management of data, pp 126–137

  7. Callan J, Connell M (2001) Query-based sampling of text databases. ACM Trans Inf Syst 19(2):97–130. https://doi.org/10.1145/382979.383040

    Article  Google Scholar 

  8. Baillie M, Azzopardi L, Crestani F (2006) Adaptive query-based sampling of distributed collections. In: International symposium on string processing and information retrieval, pp 316–328. Springer

  9. Shokouhi M, Zobel J, Scholer F, Tahaghoghi SM (2006) Capturing collection size for distributed non-cooperative retrieval. In: Proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval, pp 316–323

  10. Shokouhi M, Si L (2011) Federated search. Found Trends Inf Retriev 5(1):1–102

    Article  Google Scholar 

  11. Van den Bosch A, Bogers T, De Kunder M (2016) Estimating search engine index size variability: a 9-year longitudinal study. Scientometrics 107(2):839–856

    Article  Google Scholar 

  12. Khelghati M, Hiemstra D, Van Keulen M (2013) Deep web entity monitoring. In: Proceedings of the 22Nd international conference on world wide web, pp 377–382

  13. Bergman MK (2001) White paper: the deep web: surfacing hidden value. J Electron 7(1)

  14. Craswell N (2000) Methods for distributed information retrieval

  15. Yuwono B, Lee DL (1997) Server ranking for distributed text retrieval systems on the internet. In: 5th International conference on database systems for advanced applications database systems for advanced applications’ 97 (Melbourne, Australia), pp 41–49

  16. Arguello J, Diaz F, Callan J, Crespo J-F (2009) Sources of evidence for vertical selection. In: Proceedings of the 32nd international ACM SIGIR conference on research and development in information retrieval, pp 315–322

  17. Zhao H, Hu X (2014) Drexel at trec 2014 federated web search track. Technical report, Drexel univ Philadelphia pa coll of computing and informatics

  18. Wang Y, Liang J, Lu J (2014) Estimating the size of hidden data sources by queries. In: 2014 IEEE/ACM international conference on advances in social networks analysis and mining (ASONAM 2014), pp 712–719. IEEE

  19. Lu J, Li D (2010) Estimating deep web data source size by capture-recapture method. Inf Retriev 13:70–95

    Article  Google Scholar 

  20. Lu J (2008) Efficient estimation of the size of text deep web data source. In: Proceedings of the 17th ACM conference on information and knowledge management, pp 1485–1486

  21. Broder A, Fontura M, Josifovski V, Kumar R, Motwani R, Nabar S, Panigrahy R, Tomkins A, Xu Y (2006) Estimating corpus size via queries. In: Proceedings of the 15th ACM international conference on information and knowledge management, pp 594–603

  22. Dasgupta A, Jin X, Jewell B, Zhang N, Das G (2010) Unbiased estimation of size and other aggregates over hidden web databases. In: Proceedings of the 2010 ACM SIGMOD international conference on management of data, pp 855–866

  23. Shokouhi M (2007) Central-rank-based collection selection in uncooperative distributed information retrieval. In: European conference on information retrieval, pp 160–172. Springer

  24. Si L, Callan J (2003) Relevant document distribution estimation method for resource selection. In: Proceedings of the 26th annual international ACM SIGIR conference on research and development in informaion retrieval, pp 298–305

  25. Nguyen D, Demeester T, Trieschnigg D, Hiemstra D (2016) Resource selection for federated search on the web. arXiv preprint arXiv:1609.04556

  26. Shokouhi M, Zobel J (2007) Federated text retrieval from uncooperative overlapped collections. In: Proceedings of the 30th annual international acm sigir conference on research and development in information retrieval, pp 495–502

  27. Bernstein Y, Shokouhi M, Zobel J (2006) Compact features for detection of near-duplicates in distributed retrieval. In: Proceedings of string processing and information retrieval: 13th international conference, SPIRE 2006, Glasgow, UK, October 11-13, 2006, pp 110–121. Springer

  28. Callan J (2000) Distributed information retrieval. Adv Inf Retriev, pp 127–150

  29. Arguello J, Callan J, Diaz F (2009) Classification-based resource selection. In: Proceedings of the 18th ACM conference on information and knowledge management, pp 1277–1286

  30. Hong D, Si L, Bracke P, Witt M, Juchcinski T (2010) A joint probabilistic classification model for resource selection. In: Proceedings of the 33rd international ACM SIGIR conference on research and development in information retrieval, pp 98–105

  31. Di Buccio E, Melucci M (2014) University of padua at TREC 2014: Federated web search track. Technical report, Padua Univ (Italy)

  32. Hiemstra D, Trieschnigg D, Demeester T (2013) Mirex and taily at trec 2013

  33. Balog K (2013) The university of stavanger at the trec 2013 federated web search track

  34. Jin S, Lan M (2014) Simple may be best-a simple and effective method for federated web search via search engine impact factor estimation. In: TREC

  35. Wang Q, Shi S, Cao W (2014) Ruc at TREC 2014: select resources using topic models. Technical report, Renmin Univ Beijing (China)

  36. Ghansah B, Wu S (2016) A mean-variance analysis based approach for search result diversification in federated search. Int J Uncert Fuzziness Knowl-Based Syst 24(02):195–211

    Article  Google Scholar 

  37. Hamid B, Samir K (2016) Contextual source selection for federated search in mobile environment. In: 2016 30th international conference on advanced information networking and applications workshops (WAINA), pp 883–888. https://ieeexplore.ieee.org/document/7471315/. IEEE

  38. Dai Z, Kim Y, Callan J (2017) Learning to rank resources. In: Proceedings of the 40th International ACM SIGIR conference on research and development in information retrieval, pp 837–840

  39. Li L, Zhang Z, Wu S (2018) LDA-based resource selection for results diversification in federated search. In: Proceedings of web information systems and applications: 15th international conference, WISA 2018, Taiyuan, China, September 14–15, pp 147–156. Springer

  40. Han B, Chen L, Tian X (2018) Knowledge based collection selection for distributed information retrieval. Inf Process Manage 54(1):116–128

    Article  Google Scholar 

  41. Urak G, Ziak H, Kern R (2018) Source selection of long tail sources for federated search in an uncooperative setting. In: Proceedings of the 33rd annual ACM symposium on applied computing, pp 720–727

  42. Wu T, Liu X, Dong S (2019) Ltrrs: A learning to rank based algorithm for resource selection in distributed information retrieval. In: China conference on information retrieval, pp 52–63. Springer

  43. Garba A, Khalid S, Ullah I, Khusro S, Mumin D (2020) Embedding based learning for collection selection in federated search. Data Technologies and Applications

  44. Hong D, Si L (2012) Mixture model with multiple centralized retrieval algorithms for result merging in federated search. In: Proceedings of the 35th international ACM SIGIR conference on research and development in information retrieval, pp 821–830

  45. Hong D, Si L (2013) Search result diversification in resource selection for federated search. In: Proceedings of the 36th international ACM SIGIR Conference on research and development in information retrieval, pp 613–622

  46. Cetintas S, Si L, Yuan H (2009) Learning from past queries for resource selection. In: Proceedings of the 18th ACM conference on information and knowledge management, pp 1867–1870

  47. Shokouhi M, Zobel J (2009) Robust result merging using sample-based score estimates. ACM Trans Inf Syst 27(3):1–29

    Article  Google Scholar 

  48. Demeester T, Trieschnigg D, Nguyen D, Zhou K, Hiemstra D (2014) Overview of the TREC 2014 federated web search track. Technical report, Ghent Univ (Belgium)

  49. Demeester T, Trieschnigg D, Nguyen D, Hiemstra D, Zhou K (2015) Fedweb greatest hits: presenting the new test collection for federated web search. In: Proceedings of the 24th international conference on world wide web, pp 27–28

  50. Bellogín A, Gebremeskel GG, He J, Said A, Samar T, de Vries AP, Lin J, Vuurens JB (2013) Cwi and tu delft notebook TREC 2013: contextual suggestion, federated web search, kba, and web tracks. In: TREC. Citeseer

  51. Guan F, Xue Y, Yu X, Liu Y, Cheng X (2014) Ictnet at federated web search track 2013. In: TREC

  52. Aly R, Hiemstra D, Demeester T (2013) Taily: shard selection using the tail of score distributions. In: Proceedings of the 36th international ACM SIGIR conference on research and development in information retrieval, pp 673–682

  53. Xu J, Li X (2007) Learning to rank collections. In: Proceedings of the 30th annual international ACM SIGIR conference on research and development in information retrieval, pp 765–766

  54. Joachims T (2006) Training linear SVMS in linear time. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining, pp 217–226

  55. Wu Q, Burges CJ, Svore KM, Gao J (2010) Adapting boosting for information retrieval measures. Inf Retrieval 13(3):254–270

    Article  Google Scholar 

  56. Zhu Q, Li D, Lee DL (2018) C-dlsi: an extended lsi tailored for federated text retrieval. arXiv preprint arXiv:1810.02579

  57. Calì A, Straccia U (2017) Integration of deep web sources: A distributed information retrieval approach. In: Proceedings of the 7th international conference on web intelligence, mining and semantics, pp 1–4

  58. Benbelgacem S, Guezouli L, Seghir R (2020) A distributed information retrieval approach for copyright protection. In: Proceedings of the 3rd international conference on networking, information systems and security, pp 1–6

  59. Xia L, Xu J, Lan Y, Guo J, Zeng W, Cheng X (2017) Adapting markov decision process for search result diversification. In: Proceedings of the 40th international ACM SIGIR conference on research and development in information retrieval, pp 535–544

  60. Yigit-Sert S, Altingovde IS, Macdonald C, Ounis I, Ulusoy Ö (2020) Supervised approaches for explicit search result diversification. Inf Process Manage 57(6):102356

    Article  Google Scholar 

  61. Wang J, Zhu J (2009) Portfolio theory of information retrieval. In: Proceedings of the 32nd International ACM SIGIR conference on research and development in information retrieval, pp 115–122

  62. Cleverley PH, Burnett S (2019) Enterprise search: a state of the art. Bus Inf Rev 36(2):60–69

    Google Scholar 

  63. Wauer M, Schuster D, Schill A (2011) Advanced resource selection for federated enterprise search. In: Business information systems workshops: BIS 2011 international workshops and BPSC international conference, Poznań, Poland, June 15-17, 2011. Revised Papers 14, pp. 154–159. Springer

  64. Rasolofo Y, Hawking D, Savoy J (2003) Result merging strategies for a current news metasearcher. Inf Process Manage 39(4):581–609

    Article  MATH  Google Scholar 

  65. Si L, Callan J (2003) A semisupervised learning method to merge search engine results. ACM Trans Inf Syst 21(4):457–491

    Article  Google Scholar 

  66. He C, Hong D, Si L (2011) A weighted curve fitting method for result merging in federated search. In: Proceedings of the 34th international ACM SIGIR conference on research and development in information retrieval, pp 1177–1178

  67. Mourao A, Martins F, Magalhaes J (2013) Novasearch at trec 2013 federated web search track: experiments with rank fusion. In: TREC

  68. Cormack GV, Clarke CL, Buettcher S (2009) Reciprocal rank fusion outperforms condorcet and individual rank learning methods. In: Proceedings of the 32nd international ACM SIGIR conference on research and development in information retrieval, pp. 758–759

  69. Pal D, Mitra M (2013) Isi at the trec 2013 federated task. In: TREC

  70. Giachanou A, Markov I, Crestani F (2014) Opinions in federated search: University of lugano at trec 2014 federated web search track. Technical report, Lugano Univ (Switzerland)

  71. Esuli A, Sebastiani F (2006) Sentiwordnet: a publicly available lexical resource for opinion mining. In: Proceedings of the fifth international conference on language resources and evaluation (LREC’06)

  72. Garba A, Wu S (2023) Snippet-based result merging in federated search. J Inf Sci

  73. Tjin-Kam-Jet K, Hiemstra D (2010) Learning to merge search results for efficient distributed information retrieval

  74. Ghansah B, Wu S, Ghansah N (2015) Rankboost-based result merging. In: 2015 IEEE international conference on computer and information technology; ubiquitous computing and communications; dependable, autonomic and secure computing; pervasive intelligence and computing, pp 907–914. IEEE

  75. Freund Y, Iyer R, Schapire RE, Singer Y (2003) An efficient boosting algorithm for combining preferences. J Mach Learn Res 4(Nov):933–969

    MathSciNet  MATH  Google Scholar 

  76. Ponnuswami AK, Pattabiraman K, Wu Q, Gilad-Bachrach R, Kanungo T (2011) On composition of a federated web search result page: using online users to provide pairwise preference for heterogeneous verticals. In: Proceedings of the fourth ACM international conference on web search and data mining, pp 715–724

  77. Vo HT (2019) New re-ranking approach in merging search results. Informatic 43(2)

  78. Almeida TS, Laitz T, Seródio J, Bonifacio LH, Lotufo R, Nogueira R (2022) Neuralsearchx: serving a multi-billion-parameter reranker for multilingual metasearch at a low cost. arXiv preprint arXiv:2210.14837

  79. Palakodety S, Callan J (2014) Query transformations for result merging. Technical report, Carnegie-Mellon Univ Pittsburgh, PA School of Computer Science

  80. Ceppi S, Gatti N, Gerding E (2011) Mechanism design for federated sponsored search auctions. Proc AAAI Confer Artific Intell 25:608–613

    Google Scholar 

  81. Bonetti LE, Ceppi S, Gatti N, et al (2011) Designing a revenue mechanism for federated search engines. In: VLDS, pp 46–51. Citeseer

  82. Trieschnigg D, Tjin-Kam-Jet K, Hiemstra D (2013) Searchresultfinder: Federated search made easy. In: Proceedings of the 36th international ACM SIGIR conference on research and development in information retrieval, pp 1113–1114

  83. Demeester T, Nguyen D, Trieschnigg D, Develder C, Hiemstra D (2013) Snippet-based relevance predictions for federated web search. In: Advances in information retrieval: 35th European conference on IR research, ECIR 2013, Moscow, Russia, March 24-27. Proceedings 35, pp 697–700. Springer

  84. Arya D, Ha-Thuc V, Sinha S (2015) Personalized federated search at linkedin. In: Proceedings of the 24th ACM international on conference on information and knowledge management, pp 1699–1702

  85. Paepcke A, Brandriff R, Janee G, Larson R, Ludaescher B, Melnik S, Raghavan S (2000) Search middleware and the simple digital library interoperability protocol. DLIB Magazine 6(3)

  86. Green N, Ipeirotis PG, Gravano L (2001) SDLIP+ STARTS= SDARTS a protocol and toolkit for metasearching. In: Proceedings of the 1st ACM/IEEE-CS joint conference on digital libraries, pp 207–214

  87. Avrahami TT, Yau L, Si L, Callan J (2006) The fedlemur project: Federated search in the real world. J Am Soc Inform Sci Technol 57(3):347–358

    Article  Google Scholar 

  88. Jayakody D, Selvanathan N, Damjanovic-Behrendt V (2020) Federated search and recommendation. In: I-ESA Workshops

  89. Dragoni M, Rexha A, Ziak H, Kern R (2017) A semantic federated search engine for domain-specific document retrieval. In: Proceedings of the symposium on applied computing, pp 303–308

  90. Stoddard J, Mustafa A, Goela N (2021) Tanium reveal: a federated search engine for querying unstructured file data on large enterprise networks. Proc VLDB Endow 14(12):3096–3109

    Article  Google Scholar 

  91. Collarana D, Galkin M, Lange C, Grangel-González I, Vidal M-E, Auer S (2016) Fuhsen: A federated hybrid search engine for building a knowledge graph on-demand (short paper). In: OTM confederated international conferences on the move to meaningful internet systems, pp 752–761. Springer

  92. Rasolofo Y, Abbaci F, Savoy J (2001) Approaches to collection selection and results merging for distributed information retrieval. In: Proceedings of the tenth international conference on information and knowledge management, pp. 91–198

  93. Xu J, Croft WB (1999) Cluster-based language models for distributed retrieval. In: Proceedings of the 22nd annual international ACM SIGIR conference on research and development in information retrieval, pp 54–261

  94. Powell AL, French JC (2003) Comparing the performance of collection selection algorithms. ACM Trans Inf Syst 21(4):412–456

    Article  Google Scholar 

  95. D’Souza DJ, Zobel J, Thom JA (2004) Is cori effective for collection selection? An exploration of parameters, queries, and data. In: ADCS, pp 41–46

  96. Nguyen D, Demeester T, Trieschnigg D, Hiemstra D (2012) Federated search in the wild: the combined power of over a hundred search engines. In: Proceedings of the 21st ACM international conference on information and knowledge management, pp 1874–1878

  97. Cahoon B, McKinley KS (1996) Performance evaluation of a distributed architecture for information retrieval. In: Proceedings of the 19th annual international ACM SIGIR conference on research and development in information retrieval, SIGIR’96, August 18-22, 1996, Zurich, Switzerland (Special Issue of the SIGIR Forum), pp 110–118. ACM

  98. Witschel HF, Holz F, Heinrich G, Teresniak S (2008) An evaluation measure for distributed information retrieval systems. In: Proceedings 30th European conference on IR research, advances in information retrieval, ECIR 2008, Glasgow, UK, March 30-April 3, 2008. Lecture Notes in Computer Science, vol vol 4956, pp 607–611. https://doi.org/10.1007/978-3-540-78646-7_64

  99. Losee RM LC Jr (2004) Information retrieval with distributed databases: analytic models of performance. IEEE Tran. Parall Distribut Syst 15(1):18–27

    Article  Google Scholar 

  100. Jung JJ (2009) Consensus-based evaluation framework for distributed information retrieval systems. Knowl Inf Syst 18(2):199–211

    Article  Google Scholar 

  101. Williams J, Kochendorfer KM (2012) Evaluation of a federated medical search engine during third-year medical clerkship. In: AMIA 2012, American medical informatics association annual symposium, Chicago, Illinois, USA, November 3-7, 2012

  102. Buccio ED, Masiero I, Melucci M (2014) Evaluation of a recursive weighting scheme for federated web search. In: Basili R, Crestani F, Pennacchiotti M (eds) Proceedings of the 5th Italian information retrieval workshop, Roma, Italy, January 20-21, 2014. CEUR workshop, vol 1127, pp 1–10

  103. Pergantis M, Varlamis I, Giannakoulopoulos A (2022) User evaluation and metrics analysis of a prototype web-based federated search engine for art and cultural heritage. Information 13(6):285

    Article  Google Scholar 

  104. Arguello J (2017) Aggregated search. Found Trends Inf Retriev 10(5):365–502

    Article  Google Scholar 

  105. Arguello J, Diaz F, Callan J (2011) Learning to aggregate vertical results into web search results. In: Proceedings of the 20th ACM international conference on information and knowledge management, pp 201–210

  106. Ma X (2020) A new aggregated search method. J Intell Fuzzy Syst 38(1):55–63

    Article  Google Scholar 

  107. Rashid U, Saleem K, Ahmed A (2021) Mirre approach: nonlinear and multimodal exploration of mir aggregated search results. Multimed Tools Appl 80(13):20217–20253

    Article  Google Scholar 

  108. Meng W, Yu CT (2010) Advanced metasearch engine technology. Synth Lect Data Manage 2(1):1–129

    Article  MathSciNet  MATH  Google Scholar 

  109. Wu S (2012) Data fusion in information retrieval. Adapt Learn Optim 13:1–228. https://doi.org/10.1007/978-3-642-28866-1

    Article  MATH  Google Scholar 

  110. Aslam JA, Montague MH (2001) Models for metasearch. In: Croft WB, Harper DJ, Kraft DH, Zobel J (eds) SIGIR 2001: Proceedings of the 24th annual international ACM SIGIR conference on research and development in information retrieval, September 9-13, 2001, New Orleans, Louisiana, USA, pp 275–284

  111. Montague MH, Aslam JA (2002) Condorcet fusion for improved retrieval. In: Proceedings of the 2002 ACM CIKM international conference on information and knowledge management, McLean, VA, USA, November 4-9, 2002, pp 538–548

  112. Wu S (2013) The weighted condorcet fusion in information retrieval. Inf Process Manage 49(1):108–122

    Article  Google Scholar 

  113. Wu S (2012) Linear combination of component results in information retrieval. Data Knowl Eng 71(1):114–126

    Article  Google Scholar 

  114. Amin GR, Emrouznejad A, Sadeghi H (2012) Metasearch information fusion using linear programming. RAIRO Oper Res 46(4):289–303

    Article  MathSciNet  MATH  Google Scholar 

  115. Tayal DK, Jain A, Dimri N, Gupta S (2015) Metasurfer: a new metasearch engine based on FAHP and modified EOWA operator. Int J Syst Assur Eng Manag 6(4):487–499

    Article  Google Scholar 

  116. Kaur P, Singh M, Josan GS, Dhillon SS (2018) Rank aggregation using ant colony approach for metasearch. Soft Comput 22(13):4477–4492

    Article  Google Scholar 

  117. Vijaya P, Chander S (2018) Lionrank: lion algorithm-based metasearch engines for re-ranking of webpages. Sci China Inf Sci 61(12):122102–112210216

    Article  Google Scholar 

  118. Liu W, Han C, Lian F (2009) An alternative derivation of a bayes tracking filter based on finite mixture models. In: 12th international conference on information fusion, FUSION ’09, Seattle, Washington, USA, July 6-9, pp 842–849

  119. Smalheiser NR, Lin C, Jia L, Jiang Y, Cohen AM, Yu CT, Davis JM, Adams CE, McDonagh MS, Meng W (2014) Design and implementation of metta, a metasearch engine for biomedical literature retrieval intended for systematic reviewers. Health Inf Sci Syst 2(1):1

    Article  Google Scholar 

  120. Saito K, Kimura M, Ohara K, Motoda H (2010) Selecting information diffusion models over social networks for behavioral analysis. In: Joint European conference on machine learning and knowledge discovery in databases, pp 180–195. Springer

  121. Chelmis C, Prasanna VK (2013) Social link prediction in online social tagging systems. ACM Trans Inf Syst 31(4):1–27

    Article  Google Scholar 

  122. Saoud Z, Kechid S (2016) Integrating social profile to improve the source selection and the result merging process in distributed information retrieval. Inf Sci 336:115–128

    Article  Google Scholar 

  123. Kechid S, Drias H (2009) Personalizing the source selection and the result merging process. Int J Artif Intell Tools 18(02):331–354

    Article  Google Scholar 

Download references

Funding

This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.

Author information

Authors and Affiliations

Authors

Contributions

AG conceptualized the idea for the article, performed the literature search and data analysis, prepared the figures, and wrote the first draft. SW manually read and selected the papers to be included in the manuscript, restructured the manuscript, critically revised the work for important intellectual content, and proofread it. SK performed the literature search, proofread the manuscript, prepared the tables, and formatted it in Latex for journal submission.

Corresponding author

Correspondence to Adamu Garba.

Ethics declarations

Conflict of interest

The authors of this manuscript have no potential conflicts of interest to disclose.

Ethical approval

Not applicable.

Consent to participate

Not applicable.

Consent to publish

The manuscript was read and approved for submission by all authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Garba, A., Wu, S. & Khalid, S. Federated search techniques: an overview of the trends and state of the art. Knowl Inf Syst 65, 5065–5095 (2023). https://doi.org/10.1007/s10115-023-01922-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-023-01922-6

Keywords

Navigation