research-article

Open access

Large Language Models and Future of Information Retrieval: Opportunities and Challenges

Author:

ChengXiang ZhaiAuthors Info & Claims

SIGIR '24: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval

Pages 481 - 490

https://doi.org/10.1145/3626772.3657848

Published: 11 July 2024 Publication History

Abstract

Recent years have seen great success of large language models (LLMs) in performing many natural language processing tasks with impressive performance, including tasks that directly serve users such as question answering and text summarization. They open up unprecedented opportunities for transforming information retrieval (IR) research and applications. However, concerns such as halluciation undermine their trustworthiness, limiting their actual utility when deployed in real-world applications, especially high-stake applications where trust is vital. How can we both exploit the strengths of LLMs and mitigate any risk caused by their weaknesses when applying LLMs to IR? What are the best opportunities for us to apply LLMs to IR? What are the major challenges that we will need to address in the future to fully exploit such opportunities? Given the anticipated growth of LLMs, what will future information retrieval systems look like? Will LLMs eventually replace an IR system? In this perspective paper, we examine these questions and provide provisional answers to them. We argue that LLMs will not be able to replace search engines, and future LLMs would need to learn how to use a search engine so that they can interact with a search engine on behalf of users. We conclude with a set of promising future research directions in applying LLMs to IR.

References

[1]

Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, et al. 2023. Gpt-4 technical report. arXiv preprint arXiv:2303.08774 (2023).

[2]

Qingyao Ai, Ting Bai, Zhao Cao, Yi Chang, ..., Weinan Zhang, Min Zhang, and Xiaofei Zhu. 2023. Information Retrieval meets Large Language Models: A strategic report from Chinese IR community. AI Open, Vol. 4 (2023), 80--90. https://doi.org/10.1016/j.aiopen.2023.08.001

[3]

Krisztian Balog and ChengXiang Zhai. 2024. User Simulation for Evaluating Information Access Systems. Foundations and Trends in Information Retrieval (2024). forthcoming.

[4]

Nicholas J Belkin. 1996. Intelligent information retrieval: Whose intelligence? ISI, Vol. 96 (1996), 25--31.

[5]

David M Blei. 2012. Probabilistic topic models. Commun. ACM, Vol. 55, 4 (2012), 77--84.

Digital Library

[6]

Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, et al. 2020. Language models are few-shot learners. Advances in neural information processing systems, Vol. 33 (2020), 1877--1901.

[7]

Paul F Christiano, Jan Leike, Tom Brown, Miljan Martic, Shane Legg, and Dario Amodei. 2017. Deep reinforcement learning from human preferences. Advances in neural information processing systems, Vol. 30 (2017).

[8]

Kevyn Collins-Thompson, Preben Hansen, and Claudia Hauff. 2017. Search as learning (dagstuhl seminar 17092). In Dagstuhl reports, Vol. 7. Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik.

[9]

Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, Minneapolis, MN, USA, June 2--7, 2019, Volume 1 (Long and Short Papers), Jill Burstein, Christy Doran, and Thamar Solorio (Eds.). Association for Computational Linguistics, 4171--4186. https://doi.org/10.18653/V1/N19--1423

[10]

Guglielmo Faggioli, Laura Dietz, Charles LA Clarke, Gianluca Demartini, Matthias Hagen, Claudia Hauff, Noriko Kando, Evangelos Kanoulas, Martin Potthast, Benno Stein, et al. 2023. Perspectives on large language models for relevance judgment. In Proceedings of the 2023 ACM SIGIR International Conference on Theory of Information Retrieval. 39--50.

Digital Library

[11]

Artur d'Avila Garcez, Sebastian Bader, Howard Bowman, Luis C Lamb, Leo de Penning, BV Illuminoo, Hoifung Poon, and COPPE Gerson Zaverucha. 2022. Neural-symbolic learning and reasoning: A survey and interpretation. Neuro-Symbolic Artificial Intelligence: The State of the Art, Vol. 342, 1 (2022), 327.

[12]

Jiafeng Guo, Yixing Fan, Liang Pang, Liu Yang, Qingyao Ai, Hamed Zamani, Chen Wu, W Bruce Croft, and Xueqi Cheng. 2020. A deep look into neural ranking models for information retrieval. Information Processing & Management, Vol. 57, 6 (2020), 102067.

[13]

Thomas Hofmann. 1999. Probabilistic latent semantic indexing. In Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval. 50--57.

Digital Library

[14]

Saar Kuzi, Abhishek Narwekar, Anusri Pampari, and ChengXiang Zhai. 2019. Help me search: Leveraging user-system collaboration for query construction to improve accuracy for difficult queries. In Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval. 1221--1224.

Digital Library

[15]

David Dolan Lewis. 1992. Representation and learning in information retrieval. University of Massachusetts Amherst.

[16]

Kelong Mao, Zhicheng Dou, Haonan Chen, Fengran Mo, and Hongjin Qian. 2023. Large Language Models Know Your Contextual Search Intent: A Prompting Framework for Conversational Search. arXiv preprint arXiv:2303.06573 (2023).

[17]

Adam Pauls and Dan Klein. 2011. Faster and smaller n-gram language models. In Proceedings of the 49th annual meeting of the Association for Computational Linguistics: Human Language Technologies. 258--267.

Digital Library

[18]

Baolin Peng, Chunyuan Li, Pengcheng He, Michel Galley, and Jianfeng Gao. 2023. Instruction tuning with gpt-4. arXiv preprint arXiv:2304.03277 (2023).

[19]

Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei, Ilya Sutskever, et al. 2019. Language models are unsupervised multitask learners. OpenAI blog, Vol. 1, 8 (2019), 9.

[20]

Alireza Salemi, Sheshera Mysore, Michael Bendersky, and Hamed Zamani. 2023. LaMP: When Large Language Models Meet Personalization. arXiv preprint arXiv:2304.11406 (2023).

[21]

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Proceedings of the 31st International Conference on Neural Information Processing Systems (Long Beach, California, USA) (NIPS'17). Curran Associates Inc., Red Hook, NY, USA, 6000--6010.

Digital Library

[22]

Ryen W White. 2024. Tasks, Copilots, and the Future of Search: A Keynote at SIGIR 2023. In ACM SIGIR Forum, Vol. 57. ACM New York, NY, USA, 1--8.

[23]

Ke Yang, Jiateng Liu, John Wu, Chaoqi Yang, Yi R Fung, Sha Li, Zixuan Huang, Xu Cao, Xingyao Wang, Yiquan Wang, et al. 2024. If llm is the wizard, then code is the wand: A survey on how code empowers large language models to serve as intelligent agents. arXiv preprint arXiv:2401.00812 (2024).

[24]

Zhilin Yang, Zihang Dai, Yiming Yang, Jaime Carbonell, Russ R Salakhutdinov, and Quoc V Le. 2019. Xlnet: Generalized autoregressive pretraining for language understanding. Advances in neural information processing systems, Vol. 32 (2019).

[25]

ChengXiang Zhai et al. 2008. Statistical language models for information retrieval a critical review. Foundations and Trends® in Information Retrieval, Vol. 2, 3 (2008), 137--213.

[26]

Yutao Zhu, Huaying Yuan, Shuting Wang, Jiongnan Liu, Wenhan Liu, Chenlong Deng, Zhicheng Dou, and Ji-Rong Wen. 2023. Large language models for information retrieval: A survey. arXiv preprint arXiv:2308.07107 (2023).

[27]

Daniel M Ziegler, Nisan Stiennon, Jeffrey Wu, Tom B Brown, Alec Radford, Dario Amodei, Paul Christiano, and Geoffrey Irving. 2019. Fine-tuning language models from human preferences. arXiv preprint arXiv:1909.08593 (2019).

Cited By

Hoppe AYu RLiu JBhattacharya NNejdl WAuer SKarras OCha MMoens MNajork M(2025)IWILDS'25: The 5th International Workshop on Investigating Learning During Web SearchProceedings of the Eighteenth ACM International Conference on Web Search and Data Mining10.1145/3701551.3705709(1116-1117)Online publication date: 10-Mar-2025
https://dl.acm.org/doi/10.1145/3701551.3705709
Yamamoto SKando N(2025)Temporal Closeness for Enhanced Cross-Modal Retrieval of Sensor and Image DataMultiMedia Modeling10.1007/978-981-96-2071-5_13(170-183)Online publication date: 2-Jan-2025
https://doi.org/10.1007/978-981-96-2071-5_13
Cigliano AFallucchi F(2025)The Convergence of Open Data, Linked Data, Ontologies, and Large Language Models: Enabling Next-Generation Knowledge SystemsMetadata and Semantic Research10.1007/978-3-031-81974-2_17(197-213)Online publication date: 26-Feb-2025
https://doi.org/10.1007/978-3-031-81974-2_17
Show More Cited By

Index Terms

Large Language Models and Future of Information Retrieval: Opportunities and Challenges
1. Information systems
  1. Information retrieval
    1. Retrieval models and ranking
      1. Language models

Recommendations

An adaptable search engine for multimodal information retrieval

This article describes an information retrieval approach according to the two different search modes that exist: browsing an ontology (via categories) or defining a query in free language (via keywords). Various proposals offer approaches adapted to one ...
Search Engines Information Retrieval in Practice
Sentence-based relevance flow analysis for high accuracy retrieval

Traditional ranking models for information retrieval lack the ability to make a clear distinction between relevant and nonrelevant documents at top ranks if both have similar bag-of-words representations with regard to a user query. We aim to go beyond ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

SIGIR '24: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval

July 2024

3164 pages

ISBN:9798400704314

DOI:10.1145/3626772

General Chairs:
Grace Hui Yang
Georgetown University, USA
,
Hongning Wang
Tsinghua University, China
,
Sam Han
The Washington Post, USA
,
Program Chairs:
Claudia Hauff
Spotify, Netherlands
,
Guido Zuccon
The University of Queensland, Australia
,
Yi Zhang
University of California Santa Cruz, USA

Copyright © 2024 Owner/Author.

This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

SIGIR: ACM Special Interest Group on Information Retrieval

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 July 2024

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Science Foundation

Conference

SIGIR 2024

Sponsor:

SIGIR

SIGIR 2024: The 47th International ACM SIGIR Conference on Research and Development in Information Retrieval

July 14 - 18, 2024

Washington DC, USA

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

5
Total Citations
View Citations
3,036
Total Downloads

Downloads (Last 12 months)3,036
Downloads (Last 6 weeks)708

Reflects downloads up to 02 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Hoppe AYu RLiu JBhattacharya NNejdl WAuer SKarras OCha MMoens MNajork M(2025)IWILDS'25: The 5th International Workshop on Investigating Learning During Web SearchProceedings of the Eighteenth ACM International Conference on Web Search and Data Mining10.1145/3701551.3705709(1116-1117)Online publication date: 10-Mar-2025
https://dl.acm.org/doi/10.1145/3701551.3705709
Yamamoto SKando N(2025)Temporal Closeness for Enhanced Cross-Modal Retrieval of Sensor and Image DataMultiMedia Modeling10.1007/978-981-96-2071-5_13(170-183)Online publication date: 2-Jan-2025
https://doi.org/10.1007/978-981-96-2071-5_13
Cigliano AFallucchi F(2025)The Convergence of Open Data, Linked Data, Ontologies, and Large Language Models: Enabling Next-Generation Knowledge SystemsMetadata and Semantic Research10.1007/978-3-031-81974-2_17(197-213)Online publication date: 26-Feb-2025
https://doi.org/10.1007/978-3-031-81974-2_17
Khalifeh FTaheri MFakhrahmad MMansoori E(2024)Efficient Top-k Keyword Search in Relational Databases Considering Integrated Candidate NetworkIEEE Access10.1109/ACCESS.2024.343346612(173775-173791)Online publication date: 2024
https://doi.org/10.1109/ACCESS.2024.3433466
Heikkilä J(2024)Human intelligence versus artificial intelligence in classifying economics research articles: exploratory evidenceJournal of Documentation10.1108/JD-05-2024-010481:7(18-30)Online publication date: 16-Dec-2024
https://doi.org/10.1108/JD-05-2024-0104

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Figures

Tables

Media

View Table of Conten