ABSTRACT
Accurately answering verbose queries that describe a clinical case and aim at finding articles in a collection of medical literature requires capturing many explicit and latent aspects of complex information needs underlying such queries. Proper representation of these aspects often requires query analysis to identify the most important query concepts as well as query transformation by adding new concepts to a query, which can be extracted from the top retrieved documents or medical knowledge bases. Traditionally, query analysis and expansion have been done separately. In this paper, we propose a method for representing verbose domain-specific queries based on weighted unigram, bigram, and multi-term concepts in the query itself, as well as extracted from the top retrieved documents and external knowledge bases. We also propose a graduated non-convexity optimization framework, which allows to unify query analysis and expansion by jointly determining the importance weights for the query and expansion concepts depending on their type and source. Experiments using a collection of PubMed articles and TREC Clinical Decision Support (CDS) track queries indicate that applying our proposed method results in significant improvement of retrieval accuracy over state-of-the-art methods for ad hoc and medical IR.
- A. R. Aronson and F.-M. Lang. An overview of MetaMap: historical perspective and recent advances. Journal of the American Medical Informatics Association, 17(3):229--236, 2010.Google ScholarCross Ref
- S. Balaneshin-kordan, A. Kotov, and R. Xisto. WSU-IR at TREC 2015 clinical decision support track: Joint weighting of explicit and latent medical query concepts from diverse sources. In Proceedings of TREC'15, 2015.Google Scholar
- M. Bendersky and W. B. Croft. Discovering key concepts in verbose queries. In Proceedings of SIGIR'08, pages 491--498, 2008. Google ScholarDigital Library
- M. Bendersky, D. Metzler, and W. B. Croft. Learning concept importance using a weighted dependence model. In Proceedings of WSDM'10, pages 31--40, 2010. Google ScholarDigital Library
- M. Bendersky, D. Metzler, and W. B. Croft. Parameterized concept weighting in verbose queries. In Proceedings of SIGIR'14, pages 605--614, 2011. Google ScholarDigital Library
- A. Blake and A. Zisserman. Visual reconstruction, volume 2. MIT press Cambridge, 1987. Google ScholarDigital Library
- S. Choi, J. Choi, S. Yoo, H. Kim, and Y. Lee. Semantic concept-enriched dependence model for medical information retrieval. Journal of Biomedical Informatics, 47:18--27, 2014.Google ScholarDigital Library
- J. I. Garcia-Gathright, F. Meng, and W. Hsu. UCLA at TREC 2014 clinical decision support track: Exploring language models, query expansion, and boosting. Proceedings of TREC'14, 2014.Google Scholar
- B. Koopman, G. Zuccon, P. Bruza, L. Sitbon, and M. Lawley. Information retrieval as semantic inference: a graph inference model applied to medical search. Information Retrieval Journal, 19(1--2):6--37, 2016. Google ScholarDigital Library
- B. Koopman, G. Zuccon, A. Nguyen, D. Vickers, L. Butt, and P. Bruza. Exploiting SNOMED CT concepts & relationships for clinical information retrieval: Australian e-Health Research Centre and Queensland University of Technology at the TREC 2012 medical track. Proceedings of TREC'12, 2012.Google Scholar
- A. Kotov and C. Zhai. Tapping into knowledge base for concept feedback: leveraging conceptnet to improve search results for difficult queries. In Proceedings of WSDM'12, pages 403--412, 2012. Google ScholarDigital Library
- V. Lavrenko and W. B. Croft. Relevance based language models. In Proceedings of SIGIR'01, pages 120--127, 2001. Google ScholarDigital Library
- N. Limsopatham, C. Macdonald, and I. Ounis. Inferring conceptual relationships to improve medical records search. In Proceedings of OAIR'13, pages 1--8, 2013. Google ScholarDigital Library
- J. Lin and D. Demner-Fushman. The role of knowledge in conceptual retrieval: a study in the domain of clinical medicine. In Proceedings of SIGIR'06, pages 99--106, 2006. Google ScholarDigital Library
- Z. Lu, W. Kim, and W. J. Wilbur. Evaluation of query expansion using mesh in PubMed. Information retrieval, 12(1):69--80, 2009. Google ScholarDigital Library
- D. Metzler and W. B. Croft. A Markov random field model for term dependencies. In Proceedings of SIGIR'05, pages 472--479, 2005. Google ScholarDigital Library
- D. Metzler and W. B. Croft. Latent concept expansion using Markov random fields. In Proceedings of SIGIR'07, pages 311--318, 2007. Google ScholarDigital Library
- D. Metzler and W. B. Croft. Linear feature-based models for information retrieval. Information Retrieval, 10(3):257--274, 2007. Google ScholarDigital Library
- D. A. Metzler, W. B. Croft, and A. McCallum. Direct maximization of rank-based metrics for information retrieval. Technical report, CIIR, 2005.Google Scholar
- H. Mobahi and J. W. Fisher III. On the link between gaussian homotopy continuation and convex envelopes. In EMMCVPR'15, pages 43--56, 2015.Google ScholarCross Ref
- W. Morgan, W. Greiff, and J. Henderson. Direct maximization of average precision by hill-climbing, with a comparison to a maximum entropy approach. In Proceedings of NAACL-HLT'04, pages 93--96, 2004. Google ScholarDigital Library
- A. Mourao, F. Martins, and J. Magalhaes. NovaSearch at TREC 2014 clinical decision support track. Proceedings of TREC'14, 2014.Google Scholar
- K. Roberts, M. S. Simpson, E. Voorhees, and W. R. Hersh. Overview of the trec 2015 clinical decision support track. Proceedings of TREC'15, 2015.Google Scholar
- R. W. Schafer. What is a Savitzky-Golay filter?{lecture notes}. IEEE Signal Processing Magazine, 28(4):111--117, 2011.Google ScholarCross Ref
- W. Shen, J.-Y. Nie, X. Liu, and X. Liui. An investigation of the effectiveness of concept-based approach in medical information retrieval GRIUM@ CLEF2014eHealthTask 3. Proceedings of the ShARe/CLEF eHealth Evaluation Lab, 2014.Google Scholar
- M. S. Simpson, E. M. Voorhees, and W. Hersh. Overview of the trec 2014 clinical decision support track. Proceedings of TREC'14, 2014.Google Scholar
- C. A. Sneiderman, D. Demner-Fushman, M. Fiszman, N. C. Ide, and T. C. Rindflesch. Knowledge-based methods to help clinicians find answers in MEDLINE. Journal of the American Medical Informatics Association, 14(6):772--780, 2007.Google ScholarCross Ref
- L. Soldaini, A. Cohan, A. Yates, N. Goharian, and O. Frieder. Query reformulation for clinical decision support search. Proceedings of TREC'14, 2014.Google Scholar
- L. Soldaini, A. Cohan, A. Yates, N. Goharian, and O. Frieder. Retrieving medical literature for clinical decision support. In Advances in Information Retrieval, pages 538--549. Springer, 2015.Google ScholarCross Ref
- P. Sondhi, J. Sun, C. Zhai, R. Sorrentino, and M. S. Kohn. Leveraging medical thesauri and physician feedback for improving medical literature retrieval for case queries. Journal of the American Medical Informatics Association, 19(5):851--858, 2012.Google ScholarCross Ref
- P. Srinivasan. Retrieval feedback in MEDLINE. Journal of the American Medical Informatics Association, 3:157--167, 1996.Google ScholarCross Ref
- C. Wang and R. Akella. Concept-based relevance models for medical and semantic information retrieval. In Proceedings of CIKM'15, pages 173--182, 2015. Google ScholarDigital Library
- Y. Wang and H. Fang. Exploring the query expansion methods for concept based representation. Proceedings of TREC'14, 2014.Google Scholar
- Z. Xie, Y. Xia, and Q. Zhou. Incorporating semantic knowledge with MRF term dependency model in medical document retrieval. In NLPCC'15, pages 219--228. Springer, 2015. Google ScholarDigital Library
- Y. Xu, G. J. Jones, and B. Wang. Query dependent pseudo-relevance feedback based on wikipedia. In Proceedings of SIGIR'02, pages 59--66, 2009. Google ScholarDigital Library
- C. Zhai and J. Lafferty. Two-stage language models for information retrieval. In Proceedings of SIGIR'02, pages 49--56, 2002. Google ScholarDigital Library
- M. Zhong and X. Huang. Concept-based biomedical text retrieval. In Proceedings of SIGIR'06, pages 723--724, 2006. Google ScholarDigital Library
- W. Zhou, C. Yu, N. Smalheiser, V. Torvik, and J. Hong. Knowledge-intensive conceptual retrieval and passage extraction of biomedical literature. In Proceedings of SIGIR'07, pages 655--662, 2007. Google ScholarDigital Library
Index Terms
- Optimization Method for Weighting Explicit and Latent Concepts in Clinical Decision Support Queries
Recommendations
Medical Question Answering for Clinical Decision Support
CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge ManagementThe goal of modern Clinical Decision Support (CDS) systems is to provide physicians with information relevant to their management of patient care. When faced with a medical case, a physician asks questions about the diagnosis, the tests, or treatments ...
Bayesian approach to incorporating different types of biomedical knowledge bases into information retrieval systems for clinical decision support in precision medicine
Graphical abstractDisplay Omitted
AbstractBy providing clinicians with information regarding treatment options for molecular sub-types of complex diseases with genetic origin, such as cancer, information retrieval (IR) systems play an important role in precision medicine. In ...
A Fixed-Point Method for Weighting Terms in Verbose Informational Queries
CIKM '14: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge ManagementThe term weighting and document ranking functions used with informational queries are typically optimized for cases in which queries are short and documents are long. It is reasonable to assume that the presence of a term in a short query reflects some ...
Comments