short-paper

An ALBERT-based Similarity Measure for Mathematical Answer Retrieval

Authors:

Wolfgang LehnerAuthors Info & Claims

SIGIR '21: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval

Pages 1593 - 1597

https://doi.org/10.1145/3404835.3463023

Published: 11 July 2021 Publication History

Abstract

Mathematical Language Processing (MLP) deals with the automated processing and analysis of mathematical documents and relies heavily on good representations of mathematical symbols and texts. The aim of this work is to explore the modeling capabilities of state-of-the-art unsupervised deep learning methods to create such representations. Therefore, we pre-trained different instances of an ALBERT model on Mathematics StackExchange data and fine-tuned it on the task of Mathematical Answer Retrieval. Our evaluation shows that ALBERT outperforms all previous systems and is on par with current state-of-the-art systems for math retrieval indicating strong capabilities of modeling mathematical posts. This implies that our approach can also be beneficial to various other tasks in MLP such as automatic proof checking or summarization of scientific texts.

References

[1]

Iz Beltagy, Kyle Lo, and Arman Cohan. 2019. SciBERT: A Pretrained Language Model for Scientific Text. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 3606--3611.

[2]

Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018).

[3]

Zhangyin Feng, Daya Guo, Duyu Tang, Nan Duan, Xiaocheng Feng, Ming Gong, Linjun Shou, Bing Qin, Ting Liu, Daxin Jiang, et al. 2020. Codebert: A pre-trained model for programming and natural languages. arXiv preprint arXiv:2002.08155 (2020).

[4]

Aditya Kanade, Petros Maniatis, Gogul Balakrishnan, and Kensen Shi. 2020. Learning and Evaluating Contextual Embedding of Source Code. In International Conference on Machine Learning. PMLR, 5110--5121.

[5]

Omar Khattab and Matei Zaharia. 2020. Colbert: Efficient and effective passage search via contextualized late interaction over bert. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 39--48.

Digital Library

[6]

Zhenzhong Lan, Mingda Chen, Sebastian Goodman, Kevin Gimpel, Piyush Sharma, and Radu Soricut. 2019. Albert: A lite bert for self-supervised learning of language representations. arXiv preprint arXiv:1909.11942 (2019).

[7]

Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. 2019. Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692 (2019).

[8]

Ilya Loshchilov and Frank Hutter. 2018. Decoupled Weight Decay Regularization. In International Conference on Learning Representations.

[9]

Yuanhua Lv and ChengXiang Zhai. 2011. Lower-bounding term frequency normalization. In Proceedings of the 20th ACM international conference on Information and knowledge management. 7--16.

Digital Library

[10]

Behrooz Mansouri, Anurag Agarwal, Douglas Oard, and Richard Zanibbi. 2020. Finding Old Answers to New Math Questions: The ARQMath Lab at CLEF 2020. In European Conference on Information Retrieval. Springer, 564--571.

Digital Library

[11]

V'it Novotnỳ, Petr Sojka, Michal Stefánik, and Dávid Lupták. 2020. Three is better than one. In CEUR Workshop Proceedings. Thessaloniki, Greece .

[12]

Alec Radford, Karthik Narasimhan, Tim Salimans, and Ilya Sutskever. 2018. Improving language understanding with unsupervised learning. Technical report, OpenAI (2018).

[13]

Shaurya Rohatgi, Jian Wu, and C Lee Giles. 2020. PSU at CLEF-2020 ARQMath Track: Unsupervised Re-ranking using Pretraining. In CEUR Workshop Proceedings. Thessaloniki, Greece .

[14]

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is All you Need. Advances in Neural Information Processing Systems, Vol. 30 (2017), 5998--6008.

[15]

NG Yin Ki, Dallas J Fraser, Besat Kassaie, George Labahn, Mirette S Marzouk, Frank Wm Tompa, and Kevin Wang. 2020. Dowsing for Math Answers with Tangent-L. In CEUR Workshop Proceedings. Thessaloniki, Greece.

[16]

Yang You, Jing Li, Sashank Reddi, Jonathan Hseu, Sanjiv Kumar, Srinadh Bhojanapalli, Xiaodan Song, James Demmel, Kurt Keutzer, and Cho-Jui Hsieh. 2019. Large Batch Optimization for Deep Learning: Training BERT in 76 minutes. In International Conference on Learning Representations.

Cited By

Mansouri BMaarefdoust RHui Yang GWang HHan SHauff CZuccon GZhang Y(2024)Using Large Language Models for Math Information RetrievalProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657907(2693-2697)Online publication date: 10-Jul-2024
https://dl.acm.org/doi/10.1145/3626772.3657907
Zeng BTian X(2024)Retrieval and Sorting of Scientific Documents Based on Stacked Embedding and Hybrid Attention Model2024 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN60899.2024.10650167(1-8)Online publication date: 30-Jun-2024
https://doi.org/10.1109/IJCNN60899.2024.10650167
Mansouri BJahedibashiz ZYoshioka MKiseleva JAliannejadi M(2023)Clarifying Questions in Math Information RetrievalProceedings of the 2023 ACM SIGIR International Conference on Theory of Information Retrieval10.1145/3578337.3605123(149-158)Online publication date: 9-Aug-2023
https://dl.acm.org/doi/10.1145/3578337.3605123
Show More Cited By

Index Terms

An ALBERT-based Similarity Measure for Mathematical Answer Retrieval
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
  2. Machine learning
    1. Machine learning approaches
      1. Neural networks
2. Information systems
  1. Information retrieval
    1. Retrieval models and ranking

Recommendations

Transformer-Encoder-Based Mathematical Information Retrieval
Experimental IR Meets Multilinguality, Multimodality, and Interaction
Abstract
Mathematical Information Retrieval (MIR) deals with the task of finding relevant documents that contain text and mathematical formulas. Therefore, retrieval systems should not only be able to process natural language, but also mathematical and ...
Retrieval models for question and answer archives
SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval

Retrieval in a question and answer archive involves finding good answers for a user's question. In contrast to typical document retrieval, a retrieval model for this task can exploit question similarity as well as ranking the associated answers. In this ...
Incorporating rich features to boost information retrieval performance

Research highlights We propose a regression-based re-ranking framework that can take into account rich features for boosting information retrieval (IR) performance. A set of salient features that may affect IR performance are investigated. Extensive ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

SIGIR '21: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval

July 2021

2998 pages

ISBN:9781450380379

DOI:10.1145/3404835

General Chairs:
Fernando Diaz
(Google)
,
Chirag Shah
University of Washington
,
Torsten Suel
New York University
,
Program Chairs:
Pablo Castells
Universidad Autónoma de Madrid, Amazon
,
Rosie Jones
Spotify
,
Tetsuya Sakai
Waseda University

Copyright © 2021 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGIR: ACM Special Interest Group on Information Retrieval

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 July 2021

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper

Funding Sources

German Research Foundation

Conference

SIGIR '21

Sponsor:

SIGIR

SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval

July 11 - 15, 2021

Virtual Event, Canada

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

8
Total Citations
View Citations
264
Total Downloads

Downloads (Last 12 months)25
Downloads (Last 6 weeks)2

Reflects downloads up to 03 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Mansouri BMaarefdoust RHui Yang GWang HHan SHauff CZuccon GZhang Y(2024)Using Large Language Models for Math Information RetrievalProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657907(2693-2697)Online publication date: 10-Jul-2024
https://dl.acm.org/doi/10.1145/3626772.3657907
Zeng BTian X(2024)Retrieval and Sorting of Scientific Documents Based on Stacked Embedding and Hybrid Attention Model2024 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN60899.2024.10650167(1-8)Online publication date: 30-Jun-2024
https://doi.org/10.1109/IJCNN60899.2024.10650167
Mansouri BJahedibashiz ZYoshioka MKiseleva JAliannejadi M(2023)Clarifying Questions in Math Information RetrievalProceedings of the 2023 ACM SIGIR International Conference on Theory of Information Retrieval10.1145/3578337.3605123(149-158)Online publication date: 9-Aug-2023
https://dl.acm.org/doi/10.1145/3578337.3605123
Zhong WLin SYang JLin JChen HDuh WHuang HKato MMothe JPoblete B(2023)One Blade for One Purpose: Advancing Math Information Retrieval using Hybrid SearchProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3591746(141-151)Online publication date: 19-Jul-2023
https://dl.acm.org/doi/10.1145/3539618.3591746
Sukmandhani AArifin YZarlis MBudiharto W(2023)Recent Trends for Text Summarization in Scientific Documents2023 IEEE 9th International Conference on Computing, Engineering and Design (ICCED)10.1109/ICCED60214.2023.10425025(1-6)Online publication date: 7-Nov-2023
https://doi.org/10.1109/ICCED60214.2023.10425025
Zhong WXie YLin J(2023)Answer Retrieval for Math Questions Using Structural and Dense RetrievalExperimental IR Meets Multilinguality, Multimodality, and Interaction10.1007/978-3-031-42448-9_18(209-223)Online publication date: 18-Sep-2023
https://dl.acm.org/doi/10.1007/978-3-031-42448-9_18
Reusch AAmigo ECastells PGonzalo JCarterette BCulpepper JKazai G(2022)Pre-Training for Mathematics-Aware RetrievalProceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3477495.3531680(3496-3496)Online publication date: 6-Jul-2022
https://dl.acm.org/doi/10.1145/3477495.3531680
Xingguang LZhenbo CZhengyuan SHaoxin ZHangcheng MXuesong XGang X(2022)Building a Question Answering System for the Manufacturing DomainIEEE Access10.1109/ACCESS.2022.319167810(75816-75824)Online publication date: 2022
https://doi.org/10.1109/ACCESS.2022.3191678

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten