research-article

Graph-based Similarity for Document Retrieval in the Biomedical Domain

Authors:
Adelaida Zuluaga Cajiao

Department of Electrical and Electronic Engineering.Universidad de los Andes,Colombia., Universidad de los Andes, Colombia

Department of Electrical and Electronic Engineering.Universidad de los Andes,Colombia., Universidad de los Andes, Colombia
View Profile

,
Andres Rosso Mateus

School of Exact Sciences and Engineering, Universidad Sergio Arboleda, Colombia

School of Exact Sciences and Engineering, Universidad Sergio Arboleda, Colombia
View Profile

ICMLT '22: Proceedings of the 2022 7th International Conference on Machine Learning TechnologiesMarch 2022Pages 180–184https://doi.org/10.1145/3529399.3529428

Published:10 June 2022Publication History

ICMLT '22: Proceedings of the 2022 7th International Conference on Machine Learning Technologies

Pages 180–184

ABSTRACT

The growing amount of available data in the biomedical domain turns out to be beneficial for decision-making, but a sufficiently accurate DR system is required. Plenty of NLP techniques and models have been proposed for semantic similarity in DR, but few of them have been able to consider the variations of the language and relationship between distant words in texts. This work is focused on formulating a Graph-based Similarity for DR method (GBS-DR) for the biomedical domain and comparing the obtained results with traditional DR paradigms. The graph-based methods were selected to prove the importance of analyzing the semantic, syntactic, and long-distant word relationships in texts. It will be demonstrated that through the graph's topology the system can extract the structural information of documents, which solves relevant issues that are faced in this research area.

CCS CONCEPTS • Information Systems • Information Retrieval • Retrieval Models and Ranking • Learning to Rank

References

V.Boteva, D.Gholipour, A.Sokolov, and S.Riezler. “Full-Text Learning to Rank Dataset for Medical Information Retrieval”(2016)Google Scholar
S.Zhao, C.Su, A.Sboner and F.Wang. “GRAPHENE: A Precise Biomedical Literature Retrieval Engine with Graph Augmented Deep Learning and External Knowledge Empowerment” (2019)Google Scholar
G.Brokos, P.Malakasiotis and I.Androutsopoulos., “Using Centroids of Word Embeddings and Word Mover's Distance for Biomedical Document Retrieval in Question Answering” (2016).Google Scholar
T.Zhang, B.Liu, D.Niu, K.Lai and Y.Xu. “Multiresolution Graph Attention Networks for Relevance Matching” (2018)Google ScholarDigital Library
T. Mikolov, K. Chen, G. Corrado and J. Dean, Efficient Estimation of Word Representations in Vector Space” (2013) International Conference on Learning Representations (2013)Google Scholar
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł. & Polosukhin, I. (2017). Attention is all you need. Advances in Neural Information Processing Systems (p./pp. 5998–6008)Google Scholar
X.Yu,W.Xu, Z.Cui and S.Wu1,L.Wang “Graph-based Hierarchical Relevance Matching Signals for Ad-hoc Retrieval” (2021)Google ScholarDigital Library
J.Frej, J.Chevallet, D.Schwab, “Knowledge Based Transformer Model for Information Retrieval” Joint Conference of the Information Retrieval Communities in Europe (CIRCLE 2020), (2020), Samatan, France. ffhal-03263784fGoogle Scholar
M. Zuckerman and M.Last “Using Graphs for Word Embedding with En-hanced Semantic Relations” (2019)Google Scholar
K.M. Svore and C. J. C. Burges “A Machine Learning Approach for Improved BM25 Retrieval” (2009)Google ScholarDigital Library
T.Tan “Evolution of Language Models: N-Grams, Word Embeddings, Attention & Transformers” (2020)Google Scholar
C.Nicholson. “A Beginner's Guide to Attention Mechanisms and Memory Networks” (2019)Google Scholar

Index Terms

Graph-based Similarity for Document Retrieval in the Biomedical Domain

Index terms have been assigned to the content through auto-classification.

Recommendations

Lexical ambiguity and information retrieval

Lexical ambiguity is a pervasive problem in natural language processing. However, little quantitative information is available about the extent of the problem or about the impact that it has on information retrieval systems. We report on an analysis of ...
Read More
Non-relevance Feedback for Document Retrieval
KAM '09: Proceedings of the 2009 Second International Symposium on Knowledge Acquisition and Modeling - Volume 02

We need to find documents that relate to human interesting from a large data set of documents. The relevance feedback method needs a set of relevant and non-relevant documents to work usefully. However, the initial retrieved documents, which are ...
Read More
A passage retrieval method based on probabilistic information retrieval model and UMLS concepts in biomedical question answering

Display Omitted A new passage retrieval method is proposed for biomedical question answering system.It is based on PubMed and UMLS similarity to retrieve relevant documents.Stanford CoreNLP sentence length is used as passage length in this work.It uses ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

ICMLT '22: Proceedings of the 2022 7th International Conference on Machine Learning Technologies
March 2022
291 pages
ISBN:9781450395748
DOI:10.1145/3529399

Copyright © 2022 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 10 June 2022
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Biomedical Literature
Document Retrieval
Graphs
Natural Language Processing
Search Engines
Qualifiers
- research-article
- Research
- Refereed limited
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 46
  Total Downloads
- Downloads (Last 12 months)13
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

Graph-based Similarity for Document Retrieval in the Biomedical Domain

ICMLT '22: Proceedings of the 2022 7th International Conference on Machine Learning Technologies

ABSTRACT

References

Cited By

Index Terms

Recommendations

Lexical ambiguity and information retrieval

Non-relevance Feedback for Document Retrieval

A passage retrieval method based on probabilistic information retrieval model and UMLS concepts in biomedical question answering

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

HTML Format

Caption

Graph-based Similarity for Document Retrieval in the Biomedical Domain

ICMLT '22: Proceedings of the 2022 7th International Conference on Machine Learning Technologies

ABSTRACT

References

Cited By

Index Terms

Recommendations

Lexical ambiguity and information retrieval

Non-relevance Feedback for Document Retrieval

A passage retrieval method based on probabilistic information retrieval model and UMLS concepts in biomedical question answering

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

HTML Format

Share this Publication link

Share on Social Media