ABSTRACT
The growing amount of available data in the biomedical domain turns out to be beneficial for decision-making, but a sufficiently accurate DR system is required. Plenty of NLP techniques and models have been proposed for semantic similarity in DR, but few of them have been able to consider the variations of the language and relationship between distant words in texts. This work is focused on formulating a Graph-based Similarity for DR method (GBS-DR) for the biomedical domain and comparing the obtained results with traditional DR paradigms. The graph-based methods were selected to prove the importance of analyzing the semantic, syntactic, and long-distant word relationships in texts. It will be demonstrated that through the graph's topology the system can extract the structural information of documents, which solves relevant issues that are faced in this research area.
CCS CONCEPTS • Information Systems • Information Retrieval • Retrieval Models and Ranking • Learning to Rank
- V.Boteva, D.Gholipour, A.Sokolov, and S.Riezler. “Full-Text Learning to Rank Dataset for Medical Information Retrieval”(2016)Google Scholar
- S.Zhao, C.Su, A.Sboner and F.Wang. “GRAPHENE: A Precise Biomedical Literature Retrieval Engine with Graph Augmented Deep Learning and External Knowledge Empowerment” (2019)Google Scholar
- G.Brokos, P.Malakasiotis and I.Androutsopoulos., “Using Centroids of Word Embeddings and Word Mover's Distance for Biomedical Document Retrieval in Question Answering” (2016).Google Scholar
- T.Zhang, B.Liu, D.Niu, K.Lai and Y.Xu. “Multiresolution Graph Attention Networks for Relevance Matching” (2018)Google ScholarDigital Library
- T. Mikolov, K. Chen, G. Corrado and J. Dean, Efficient Estimation of Word Representations in Vector Space” (2013) International Conference on Learning Representations (2013)Google Scholar
- Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł. & Polosukhin, I. (2017). Attention is all you need. Advances in Neural Information Processing Systems (p./pp. 5998–6008)Google Scholar
- X.Yu,W.Xu, Z.Cui and S.Wu1,L.Wang “Graph-based Hierarchical Relevance Matching Signals for Ad-hoc Retrieval” (2021)Google ScholarDigital Library
- J.Frej, J.Chevallet, D.Schwab, “Knowledge Based Transformer Model for Information Retrieval” Joint Conference of the Information Retrieval Communities in Europe (CIRCLE 2020), (2020), Samatan, France. ffhal-03263784fGoogle Scholar
- M. Zuckerman and M.Last “Using Graphs for Word Embedding with En-hanced Semantic Relations” (2019)Google Scholar
- K.M. Svore and C. J. C. Burges “A Machine Learning Approach for Improved BM25 Retrieval” (2009)Google ScholarDigital Library
- T.Tan “Evolution of Language Models: N-Grams, Word Embeddings, Attention & Transformers” (2020)Google Scholar
- C.Nicholson. “A Beginner's Guide to Attention Mechanisms and Memory Networks” (2019)Google Scholar
Index Terms
- Graph-based Similarity for Document Retrieval in the Biomedical Domain
Recommendations
Lexical ambiguity and information retrieval
Lexical ambiguity is a pervasive problem in natural language processing. However, little quantitative information is available about the extent of the problem or about the impact that it has on information retrieval systems. We report on an analysis of ...
Non-relevance Feedback for Document Retrieval
KAM '09: Proceedings of the 2009 Second International Symposium on Knowledge Acquisition and Modeling - Volume 02We need to find documents that relate to human interesting from a large data set of documents. The relevance feedback method needs a set of relevant and non-relevant documents to work usefully. However, the initial retrieved documents, which are ...
A passage retrieval method based on probabilistic information retrieval model and UMLS concepts in biomedical question answering
Display Omitted A new passage retrieval method is proposed for biomedical question answering system.It is based on PubMed and UMLS similarity to retrieve relevant documents.Stanford CoreNLP sentence length is used as passage length in this work.It uses ...
Comments