Skip to main content

Learning2extract for Medical Domain Retrieval

  • Conference paper
  • First Online:
Book cover Information Retrieval Technology (AIRS 2017)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10648))

Included in the following conference series:

Abstract

Search is important in medical domain. For example, physicians need to search for literature to support their decisions when they diagnose the patients, especially for the complicated cases. Even though they could manually input the queries, it is not an easy task because queries are expected to include enough information about the patients. Therefore, the queries tend to be verbose. However, those verbose queries may not work well since the search engine would favor documents covering every term in the query, but not the ones which are truly important. Existing work on verbose query processing in Web search has studied the similar problem, but the methods are not applicable to the medical domain because of the complexity of the medical queries and the lack of domain-specific features. In this work, we propose a set of new features to capture the importance of the terms which are helpful for medical retrieval, i.e., Key Terms, from verbose queries. Experiment results on the TREC Clinical Decision Support collections show that the improvement of using the selected Key Terms over the baseline methods is statistically significant.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://metamap.nlm.nih.gov/.

  2. 2.

    https://www.oxforddictionaries.com/.

  3. 3.

    https://www.merriam-webster.com/.

References

  1. Gupta, M., Bendersky, M.: Information retrieval with verbose queries. In: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2015), New York, NY, USA, pp. 1121–1124. ACM (2015)

    Google Scholar 

  2. Bendersky, M., Croft, W.B.: Discovering key concepts in verbose queries. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), pp. 491–498 (2008)

    Google Scholar 

  3. Jones, R., Fain, D.C.: Query word deletion prediction. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2003), New York, NY, USA, pp. 435–436. ACM (2003)

    Google Scholar 

  4. Soldaini, L., Cohan, A., Yates, A., Goharian, N., Frieder, O.: Retrieving medical literature for clinical decision support. In: Hanbury, A., Kazai, G., Rauber, A., Fuhr, N. (eds.) ECIR 2015. LNCS, vol. 9022, pp. 538–549. Springer, Cham (2015). doi:10.1007/978-3-319-16354-3_59

    Google Scholar 

  5. Kumaran, G., Carvalho, V.R.: Reducing long queries using query quality predictors. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2009), pp. 564–571 (2009)

    Google Scholar 

  6. Turney, P.D.: Learning algorithms for keyphrase extraction. Inf. Retr. 2(4), 303–336 (2000)

    Article  Google Scholar 

  7. Díaz-Galiano, M.C., Martín-Valdivia, M., Ureña López, L.A.: Query expansion with a medical ontology to improve a multimodal information retrieval system. Comput. Biol. Med. 39(4), 396–403 (2009)

    Article  Google Scholar 

  8. Martinez, D., Otegi, A., Soroa, A., Agirre, E.: Improving search over electronic health records using umls-based query expansion through random walks. J. Biomed. Inform. 51, 100–106 (2014)

    Article  Google Scholar 

  9. Yang, C., He, B., Xu, J.: Integrating feedback-based semantic evidence to enhance retrieval effectiveness for clinical decision support. In: Chen, L., Jensen, C.S., Shahabi, C., Yang, X., Lian, X. (eds.) APWeb-WAIM 2017. LNCS, vol. 10367, pp. 153–168. Springer, Cham (2017). doi:10.1007/978-3-319-63564-4_13

    Chapter  Google Scholar 

  10. Zhu, D., Carterette, B.: Combining multi-level evidence for medical record retrieval. In: Proceedings of the 2012 International Workshop on Smart Health and Wellbeing (SHB 2012), pp. 49–56 (2012)

    Google Scholar 

  11. Limsopatham, N., Macdonald, C., Ounis, I.: Aggregating evidence from hospital departments to improve medical records search. In: Serdyukov, P., et al. (eds.) ECIR 2013. LNCS, vol. 7814, pp. 279–291. Springer, Heidelberg (2013). doi:10.1007/978-3-642-36973-5_24

    Chapter  Google Scholar 

  12. Wang, Y., Fang, H.: Exploring the query expansion methods for concept based representation. In: TREC 2014 (2014)

    Google Scholar 

  13. Limsopatham, N., Macdonald, C., Ounis, I.: Learning to combine representations for medical records search. In: Proceedings of SIGIR 2013 (2013)

    Google Scholar 

  14. Qi, Y., Laquerre, P.F.: Retrieving medical records: NEC Labs America at TREC 2012 medical record track. In: TREC 2012 (2012)

    Google Scholar 

  15. Koopman, B., Zuccon, G., Nguyen, A., Vickers, D., Butt, L., Bruza, P.D.: Exploiting SNOMED CT concepts & relationships for clinical information retrieval: Australian e-health research centre and Queensland University of Technology at the TREC 2012 medical track. In: TREC 2012 (2012)

    Google Scholar 

  16. Wang, Y., Liu, X., Fang, H.: A study of concept-based weighting regularization for medical records search. In: ACL 2014 (2014)

    Google Scholar 

  17. Wang, Y., Fang, H.: Extracting useful information from clinical notes. In: TREC 2016 (2016)

    Google Scholar 

Download references

Acknowledgments

This research was supported by the U.S. National Science Foundation under IIS-1423002.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yue Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wang, Y., Lu, K., Fang, H. (2017). Learning2extract for Medical Domain Retrieval. In: Sung, WK., et al. Information Retrieval Technology. AIRS 2017. Lecture Notes in Computer Science(), vol 10648. Springer, Cham. https://doi.org/10.1007/978-3-319-70145-5_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-70145-5_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-70144-8

  • Online ISBN: 978-3-319-70145-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics