skip to main content
10.1145/3477495.3531672acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper

QFinder: A Framework for Quantity-centric Ranking

Published: 07 July 2022 Publication History

Abstract

Quantities shape our understanding of measures and values, and they are an important means to communicate the properties of objects. Often, search queries contain numbers as retrieval units, e.g., "iPhone that costs less than 800 Euros''. Yet, modern search engines lack a proper understanding of numbers and units. In queries and documents, search engines handle them as normal keywords and therefore are ignorant of relative conditions between numbers, such as greater than or less than, or, more generally, the numerical proximity of quantities. In this work, we demonstrate QFinder, our quantity-centric framework for ranking search results for queries with quantity constraints. We also open-source our new ranking method as an Elasticsearch plug-in for future use. Our demo is available at: https://qfinder.ifi.uni-heidelberg.de/

References

[1]
Rakesh Agrawal and Ramakrishnan Srikant. 2003. Searching with Numbers. IEEE Trans. Knowl. Data Eng., Vol. 15, 4 (2003), 855--870. https://doi.org/10.1109/TKDE.2003.1209004
[2]
Hannah Bast, Björn Buchhold, and Elmar Haussmann. 2016. Semantic Search on Text and Knowledge Bases. Found. Trends Inf. Retr., Vol. 10, 2--3 (2016), 119--271. https://doi.org/10.1561/1500000032
[3]
Marcus Fontoura, Ronny Lempel, Runping Qi, and Jason Y. Zien. 2007. Inverted Index Support for Numeric Search. Internet Math., Vol. 3, 2 (2007), 153--185. https://doi.org/10.1080/15427951.2006.10129119
[4]
Vinh Thinh Ho, Yusra Ibrahim, Koninika Pal, Klaus Berberich, and Gerhard Weikum. 2019. Qsearch: Answering Quantity Queries from Text. In The Semantic Web - ISWC - 18th International Semantic Web Conference, Proceedings, Part I (Lecture Notes in Computer Science, Vol. 11778). Springer, 237--257. https://doi.org/10.1007/978-3-030-30793-6_14
[5]
Vinh Thinh Ho, Koninika Pal, Niko Kleer, Klaus Berberich, and Gerhard Weikum. 2020. Entities with Quantities: Extraction, Search, and Ranking. In WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining. ACM, 833--836. https://doi.org/10.1145/3336191.3371860
[6]
Vinh Thinh Ho, Koninika Pal, Simon Razniewski, Klaus Berberich, and Gerhard Weikum. 2021 b. Extracting Contextualized Quantity Facts from Web Tables. In WWW '21: The Web Conference 2021, Virtual Event / Ljubljana, Slovenia, April 19-23, 2021. ACM / IW3C2, 4033--4042. https://doi.org/10.1145/3442381.3450072
[7]
Vinh Thinh Ho, Koninika Pal, and Gerhard Weikum. 2021 a. QuTE: Answering Quantity Queries from Web Tables. In SIGMOD '21: International Conference on Management of Data, Virtual Event, China, June 20-25, 2021. ACM, 2740--2744. https://doi.org/10.1145/3448016.3452763
[8]
Yusra Ibrahim, Mirek Riedewald, Gerhard Weikum, and Demetrios Zeinalipour-Yazti. 2019. Bridging Quantities in Tables and Text. In 35th IEEE International Conference on Data Engineering, ICDE. IEEE, 1010--1021. https://doi.org/10.1109/ICDE.2019.00094
[9]
Yusra Ibrahim and Gerhard Weikum. 2019. ExQuisiTe: Explaining Quantities in Text. In The World Wide Web Conference, WWW. ACM, 3541--3544. https://doi.org/10.1145/3308558.3314134
[10]
Vladimir Karpukhin, Barlas Oguz, Sewon Min, Patrick S. H. Lewis, Ledell Wu, Sergey Edunov, Danqi Chen, and Wen-tau Yih. 2020. Dense Passage Retrieval for Open-Domain Question Answering. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, EMNLP. ACL, 6769--6781. https://doi.org/10.18653/v1/2020.emnlp-main.550
[11]
Tongliang Li, Lei Fang, Jian-Guang Lou, Zhoujun Li, and Dongmei Zhang. 2021. AnaSearch: Extract, Retrieve and Visualize Structured Results from Unstructured Text for Analytical Queries. In WSDM '21, The Fourteenth ACM International Conference on Web Search and Data Mining, Virtual Event, Israel, March 8-12, 2021. ACM, 906--909. https://doi.org/10.1145/3437963.3441694
[12]
Aman Madaan, Ashish Mittal, Mausam, Ganesh Ramakrishnan, and Sunita Sarawagi. 2016. Numerical Relation Extraction with Minimal Supervision. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence. AAAI Press, 2764--2771. http://www.aaai.org/ocs/index.php/AAAI/AAAI16/paper/view/12486
[13]
Arun S. Maiya, Dale Visser, and Andrew Wan. 2015. Mining Measured Information from Text. In Proceedings of the 38th International SIGIR Conference on Research and Development in Information Retrieval. ACM, 899--902. https://doi.org/10.1145/2766462.2767789
[14]
Natalya Fridman Noy, Yuqing Gao, Anshu Jain, Anant Narayanan, Alan Patterson, and Jamie Taylor. 2019. Industry-Scale Knowledge Graphs: Lessons and Challenges. ACM, Vol. 62, 8 (2019), 36--43. https://doi.org/10.1145/3331166
[15]
Qiu Ran, Yankai Lin, Peng Li, Jie Zhou, and Zhiyuan Liu. 2019. NumNet: Machine Reading Comprehension with Numerical Reasoning. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing EMNLP-IJCNLP. Association for Computational Linguistics, 2474--2484. https://doi.org/10.18653/v1/D19-1251
[16]
Stephen Robertson and Hugo Zaragoza. 2009. The probabilistic relevance framework: BM25 and beyond. Now Publishers Inc.
[17]
Subhro Roy, Tim Vieira, and Dan Roth. 2015. Reasoning about Quantities in Natural Language. Trans. Assoc. Comput. Linguistics, Vol. 3 (2015), 1--13. https://tacl2013.cs.columbia.edu/ojs/index.php/tacl/article/view/452
[18]
Swarnadeep Saha, Harinder Pal, and Mausam. 2017. Bootstrapping for Numerical Open IE. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, ACL 2017, Volume 2: Short Papers. Association for Computational Linguistics, 317--323. https://doi.org/10.18653/v1/P17-2050
[19]
Gerhard Weikum. 2020. Entities with Quantities. IEEE Data Eng. Bull., Vol. 43, 1 (2020), 4--8. http://sites.computer.org/debull/A20mar/p4.pdf

Cited By

View all
  • (2024)ProSwats: A Proxy-based Scientific Workflow Retrieval Approach by Bridging the Gap between Textual and Structural Semantics2024 IEEE International Conference on Web Services (ICWS)10.1109/ICWS62655.2024.00111(932-943)Online publication date: 7-Jul-2024
  • (2023)SciHarvester: Searching Scientific Documents for Numerical ValuesProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3591808(3135-3139)Online publication date: 19-Jul-2023

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '22: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
July 2022
3569 pages
ISBN:9781450387323
DOI:10.1145/3477495
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 July 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. information extraction
  2. quantities
  3. ranking functions

Qualifiers

  • Short-paper

Conference

SIGIR '22
Sponsor:

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)48
  • Downloads (Last 6 weeks)4
Reflects downloads up to 28 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)ProSwats: A Proxy-based Scientific Workflow Retrieval Approach by Bridging the Gap between Textual and Structural Semantics2024 IEEE International Conference on Web Services (ICWS)10.1109/ICWS62655.2024.00111(932-943)Online publication date: 7-Jul-2024
  • (2023)SciHarvester: Searching Scientific Documents for Numerical ValuesProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3591808(3135-3139)Online publication date: 19-Jul-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media