skip to main content
10.1145/2970398.2970418acmconferencesArticle/Chapter ViewAbstractPublication PagesictirConference Proceedingsconference-collections
research-article

Optimization Method for Weighting Explicit and Latent Concepts in Clinical Decision Support Queries

Published:12 September 2016Publication History

ABSTRACT

Accurately answering verbose queries that describe a clinical case and aim at finding articles in a collection of medical literature requires capturing many explicit and latent aspects of complex information needs underlying such queries. Proper representation of these aspects often requires query analysis to identify the most important query concepts as well as query transformation by adding new concepts to a query, which can be extracted from the top retrieved documents or medical knowledge bases. Traditionally, query analysis and expansion have been done separately. In this paper, we propose a method for representing verbose domain-specific queries based on weighted unigram, bigram, and multi-term concepts in the query itself, as well as extracted from the top retrieved documents and external knowledge bases. We also propose a graduated non-convexity optimization framework, which allows to unify query analysis and expansion by jointly determining the importance weights for the query and expansion concepts depending on their type and source. Experiments using a collection of PubMed articles and TREC Clinical Decision Support (CDS) track queries indicate that applying our proposed method results in significant improvement of retrieval accuracy over state-of-the-art methods for ad hoc and medical IR.

References

  1. A. R. Aronson and F.-M. Lang. An overview of MetaMap: historical perspective and recent advances. Journal of the American Medical Informatics Association, 17(3):229--236, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  2. S. Balaneshin-kordan, A. Kotov, and R. Xisto. WSU-IR at TREC 2015 clinical decision support track: Joint weighting of explicit and latent medical query concepts from diverse sources. In Proceedings of TREC'15, 2015.Google ScholarGoogle Scholar
  3. M. Bendersky and W. B. Croft. Discovering key concepts in verbose queries. In Proceedings of SIGIR'08, pages 491--498, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. Bendersky, D. Metzler, and W. B. Croft. Learning concept importance using a weighted dependence model. In Proceedings of WSDM'10, pages 31--40, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. M. Bendersky, D. Metzler, and W. B. Croft. Parameterized concept weighting in verbose queries. In Proceedings of SIGIR'14, pages 605--614, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. A. Blake and A. Zisserman. Visual reconstruction, volume 2. MIT press Cambridge, 1987. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. S. Choi, J. Choi, S. Yoo, H. Kim, and Y. Lee. Semantic concept-enriched dependence model for medical information retrieval. Journal of Biomedical Informatics, 47:18--27, 2014.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J. I. Garcia-Gathright, F. Meng, and W. Hsu. UCLA at TREC 2014 clinical decision support track: Exploring language models, query expansion, and boosting. Proceedings of TREC'14, 2014.Google ScholarGoogle Scholar
  9. B. Koopman, G. Zuccon, P. Bruza, L. Sitbon, and M. Lawley. Information retrieval as semantic inference: a graph inference model applied to medical search. Information Retrieval Journal, 19(1--2):6--37, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. B. Koopman, G. Zuccon, A. Nguyen, D. Vickers, L. Butt, and P. Bruza. Exploiting SNOMED CT concepts & relationships for clinical information retrieval: Australian e-Health Research Centre and Queensland University of Technology at the TREC 2012 medical track. Proceedings of TREC'12, 2012.Google ScholarGoogle Scholar
  11. A. Kotov and C. Zhai. Tapping into knowledge base for concept feedback: leveraging conceptnet to improve search results for difficult queries. In Proceedings of WSDM'12, pages 403--412, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. V. Lavrenko and W. B. Croft. Relevance based language models. In Proceedings of SIGIR'01, pages 120--127, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. N. Limsopatham, C. Macdonald, and I. Ounis. Inferring conceptual relationships to improve medical records search. In Proceedings of OAIR'13, pages 1--8, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. J. Lin and D. Demner-Fushman. The role of knowledge in conceptual retrieval: a study in the domain of clinical medicine. In Proceedings of SIGIR'06, pages 99--106, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Z. Lu, W. Kim, and W. J. Wilbur. Evaluation of query expansion using mesh in PubMed. Information retrieval, 12(1):69--80, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. D. Metzler and W. B. Croft. A Markov random field model for term dependencies. In Proceedings of SIGIR'05, pages 472--479, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. D. Metzler and W. B. Croft. Latent concept expansion using Markov random fields. In Proceedings of SIGIR'07, pages 311--318, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. D. Metzler and W. B. Croft. Linear feature-based models for information retrieval. Information Retrieval, 10(3):257--274, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. D. A. Metzler, W. B. Croft, and A. McCallum. Direct maximization of rank-based metrics for information retrieval. Technical report, CIIR, 2005.Google ScholarGoogle Scholar
  20. H. Mobahi and J. W. Fisher III. On the link between gaussian homotopy continuation and convex envelopes. In EMMCVPR'15, pages 43--56, 2015.Google ScholarGoogle ScholarCross RefCross Ref
  21. W. Morgan, W. Greiff, and J. Henderson. Direct maximization of average precision by hill-climbing, with a comparison to a maximum entropy approach. In Proceedings of NAACL-HLT'04, pages 93--96, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. A. Mourao, F. Martins, and J. Magalhaes. NovaSearch at TREC 2014 clinical decision support track. Proceedings of TREC'14, 2014.Google ScholarGoogle Scholar
  23. K. Roberts, M. S. Simpson, E. Voorhees, and W. R. Hersh. Overview of the trec 2015 clinical decision support track. Proceedings of TREC'15, 2015.Google ScholarGoogle Scholar
  24. R. W. Schafer. What is a Savitzky-Golay filter?{lecture notes}. IEEE Signal Processing Magazine, 28(4):111--117, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  25. W. Shen, J.-Y. Nie, X. Liu, and X. Liui. An investigation of the effectiveness of concept-based approach in medical information retrieval GRIUM@ CLEF2014eHealthTask 3. Proceedings of the ShARe/CLEF eHealth Evaluation Lab, 2014.Google ScholarGoogle Scholar
  26. M. S. Simpson, E. M. Voorhees, and W. Hersh. Overview of the trec 2014 clinical decision support track. Proceedings of TREC'14, 2014.Google ScholarGoogle Scholar
  27. C. A. Sneiderman, D. Demner-Fushman, M. Fiszman, N. C. Ide, and T. C. Rindflesch. Knowledge-based methods to help clinicians find answers in MEDLINE. Journal of the American Medical Informatics Association, 14(6):772--780, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  28. L. Soldaini, A. Cohan, A. Yates, N. Goharian, and O. Frieder. Query reformulation for clinical decision support search. Proceedings of TREC'14, 2014.Google ScholarGoogle Scholar
  29. L. Soldaini, A. Cohan, A. Yates, N. Goharian, and O. Frieder. Retrieving medical literature for clinical decision support. In Advances in Information Retrieval, pages 538--549. Springer, 2015.Google ScholarGoogle ScholarCross RefCross Ref
  30. P. Sondhi, J. Sun, C. Zhai, R. Sorrentino, and M. S. Kohn. Leveraging medical thesauri and physician feedback for improving medical literature retrieval for case queries. Journal of the American Medical Informatics Association, 19(5):851--858, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  31. P. Srinivasan. Retrieval feedback in MEDLINE. Journal of the American Medical Informatics Association, 3:157--167, 1996.Google ScholarGoogle ScholarCross RefCross Ref
  32. C. Wang and R. Akella. Concept-based relevance models for medical and semantic information retrieval. In Proceedings of CIKM'15, pages 173--182, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Y. Wang and H. Fang. Exploring the query expansion methods for concept based representation. Proceedings of TREC'14, 2014.Google ScholarGoogle Scholar
  34. Z. Xie, Y. Xia, and Q. Zhou. Incorporating semantic knowledge with MRF term dependency model in medical document retrieval. In NLPCC'15, pages 219--228. Springer, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Y. Xu, G. J. Jones, and B. Wang. Query dependent pseudo-relevance feedback based on wikipedia. In Proceedings of SIGIR'02, pages 59--66, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. C. Zhai and J. Lafferty. Two-stage language models for information retrieval. In Proceedings of SIGIR'02, pages 49--56, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. M. Zhong and X. Huang. Concept-based biomedical text retrieval. In Proceedings of SIGIR'06, pages 723--724, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. W. Zhou, C. Yu, N. Smalheiser, V. Torvik, and J. Hong. Knowledge-intensive conceptual retrieval and passage extraction of biomedical literature. In Proceedings of SIGIR'07, pages 655--662, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Optimization Method for Weighting Explicit and Latent Concepts in Clinical Decision Support Queries

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      ICTIR '16: Proceedings of the 2016 ACM International Conference on the Theory of Information Retrieval
      September 2016
      318 pages
      ISBN:9781450344975
      DOI:10.1145/2970398

      Copyright © 2016 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 12 September 2016

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      ICTIR '16 Paper Acceptance Rate41of79submissions,52%Overall Acceptance Rate209of482submissions,43%

      Upcoming Conference

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader