DOI: 10.1145/3132847.3133149 · CIKM Conference Proceedings · Short Paper

Denoising Clinical Notes for Medical Literature Retrieval with Convolutional Neural Model

Published: 6 November 2017

ABSTRACT

The rapid growth of the medical literature poses a significant challenge for physicians, who repeatedly report struggling to keep up to date with research developments. This gap is one of the main obstacles to integrating recent advances in clinical research into day-to-day practice. Hence the need for clinical decision support (CDS) search systems that, given a clinical note describing a patient, retrieve highly relevant medical literature. However, clinical notes are inherently noisy and thus unfit to be used as queries as-is. In this work, we present a convolutional neural model that improves the representation of clinical notes, making them suitable for document retrieval. The system is designed to predict, for each clinical note term, its importance in relevant documents. The approach was evaluated on the 2016 TREC CDS dataset, where it achieved a 37% improvement in infNDCG over state-of-the-art query reduction methods and a 27% improvement over the best previously known method for the task.
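The core idea of scoring each clinical note term by its predicted importance, then keeping only the high-scoring terms as a reduced query, can be sketched as follows. This is a minimal illustrative sketch, not the authors' actual architecture: the toy terms, embedding dimensions, filter shapes, and the sigmoid scoring head are all assumptions made for demonstration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical setup: each clinical-note term is a d-dimensional embedding;
# a 1-D convolution over the term sequence scores every term.
d, width, n_filters = 50, 3, 8                   # embedding size, window, filters
terms = ["pt", "presents", "with", "acute", "chest", "pain"]
E = rng.standard_normal((len(terms), d))         # stand-in term embeddings

W = rng.standard_normal((n_filters, width, d)) * 0.1  # convolution filters
v = rng.standard_normal(n_filters) * 0.1              # scoring weights

def term_importance(E, W, v):
    """Score each term with a same-padded 1-D convolution, ReLU, and sigmoid."""
    pad = W.shape[1] // 2
    Ep = np.pad(E, ((pad, pad), (0, 0)))         # zero-pad so every term gets a window
    scores = []
    for i in range(E.shape[0]):
        window = Ep[i:i + W.shape[1]]            # (width, d) slice centered on term i
        feats = np.maximum(0, np.einsum("fwd,wd->f", W, window))  # ReLU activations
        scores.append(1.0 / (1.0 + np.exp(-feats @ v)))           # sigmoid score
    return np.array(scores)

weights = term_importance(E, W, v)
# Keep only high-importance terms as the reduced query (threshold is arbitrary here):
reduced_query = [t for t, w in zip(terms, weights) if w > 0.5]
```

In a trained system, the filter and scoring weights would be learned from relevance data rather than randomly initialized, and the retained terms (or their weights) would be passed to the retrieval engine in place of the full noisy note.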


Published in

CIKM '17: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management
November 2017, 2604 pages
ISBN: 9781450349185
DOI: 10.1145/3132847
Copyright © 2017 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery, New York, NY, United States




        Acceptance Rates

CIKM '17 paper acceptance rate: 171 of 855 submissions, 20%
Overall acceptance rate: 1,861 of 8,427 submissions, 22%
