Skip to main content

Designing a Novel Framework for Precision Medicine Information Retrieval

  • Conference paper
  • First Online:
Smart Health (ICSH 2018)

Abstract

Precision medicine information retrieval (PMIR) is about matching the most relevant scientific articles to an individual patient for reliable disease treatment. The corresponding Precision Medicine (PM) Track organized by 2017 Text REtrieval Conference [1] provides a test collection for evaluating the performance of PMIR techniques for finding reliable medical evidence. It significantly facilitates PMIR research and system development. However, the performance of current PMIR systems is still far from satisfactory. This study aims to investigate the application of the latest information retrieval and text mining techniques to PMIR. Based on a review of previous efforts and approaches, we propose three promising techniques: keyphrase extraction for indexing, hybrid query expansion including word embeddings, and retrieval results re-ranking with supervised regression analysis for PMIR. A novel framework for PMIR is therefore designed. A PMIR system based on this framework will be implemented and tested using 2017 and 2018 TREC Precision Medicine Track datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. TREC Precision Medicine/Clinical Decision Support Track. http://www.trec-cds.org/2017.html. Accessed 09 Apr 2018

  2. Roberts, K., et al.: Overview of the TREC 2017 precision medicine track. In: TREC, Gaithersburg, MD (2017)

    Google Scholar 

  3. Collins, F.S., Varmus, H.: A new initiative on precision medicine. N. Engl. J. Med. 372(9), 793–795 (2015)

    Article  Google Scholar 

  4. Frey, L.J., Bernstam, E.V., Denny, J.C.: Precision medicine informatics. J. Am. Med. Inform. Assoc. 23(4), 668–670 (2016)

    Article  Google Scholar 

  5. Aronson, S.J., Rehm, H.L.: Building the foundation for genomics in precision medicine. Nature 526(7573), 336–342 (2015)

    Article  Google Scholar 

  6. National Research Council: Toward precision medicine: building a knowledge network for biomedical research and a new taxonomy of disease. National Academies Press, Washington DC (2011)

    Google Scholar 

  7. The Twenty-Sixth Text REtrieval Conference (TREC 2017) Proceedings. https://trec.nist.gov/pubs/trec26/trec2017.html. Accessed 09 Apr 2018

  8. Paschea, E., et al. Customizing a variant annotation-support tool: an inquiry into probability ranking principles for TREC precision medicine. In: TREC, Gaithersburg, MD (2017)

    Google Scholar 

  9. Jo, S.H., Lee, K.S.: CBNU at TREC 2017 precision medicine track. In: TREC, Gaithersburg, MD (2017)

    Google Scholar 

  10. Nguyen, V., Karimi, S., Falamaki, S., Molla-Aliod, D., Paris, C., Wan, S.: CSIRO at 2017 TREC precision medicine track. In: TREC, Gaithersburg, MD (2017)

    Google Scholar 

  11. Foroutan Eghlidi, N., Griner, J., Mesot, N., von Werra, L., Eickhoff, C.: ETH Zurich at TREC precision medicine 2017. In: TREC, Gaithersburg, MD (2017)

    Google Scholar 

  12. Wu, J., Ma, X., Fan, W.: HokieGo at 2017 PM task: genetic programming based re-ranking method in biomedical information retrieval. In: TREC, Gaithersburg, MD (2017)

    Google Scholar 

  13. García, P.L., Oleynik, M., Kasáč, Z., Schulz, S.: TREC 2017 precision medicine - medical university of Graz. In: TREC, Gaithersburg, MD (2017)

    Google Scholar 

  14. Wang, Y., Komandur-Elayavilli, R., Rastegar-Mojarad, M., Liu, H.: Leveraging both structured and unstructured data for precision information retrieval. In: TREC, Gaithersburg, MD (2017)

    Google Scholar 

  15. Yin, T., Wu, D.T., Vydiswaran, V.V.: Retrieving documents based on gene name variations: MedIER at TREC 2017 precision medicine track. In: TREC, Gaithersburg, MD (2017)

    Google Scholar 

  16. Przybyla, P., Soto, A.J., Ananiadou, S.: Identifying personalised treatments and clinical trials for precision medicine using semantic search with thalia. In: TREC, Gaithersburg, MD (2017)

    Google Scholar 

  17. Cieślewicz, A., Dutkiewicz, J., Jędrzejek, C.: POZNAN contribution to TREC PM 2017. In: TREC, Gaithersburg, MD (2017)

    Google Scholar 

  18. Ling, Y., et al.: A hybrid approach to precision medicine-related biomedical article retrieval and clinical trial matching. In: TREC, Gaithersburg, MD (2017)

    Google Scholar 

  19. Li, C., He, B., Sun, Y., Xu, J.: UCAS at TREC-2017 precision medicine track. In: TREC, Gaithersburg, MD (2017)

    Google Scholar 

  20. Mahmood, A.A., et al.: UD_GU_BioTM at TREC 2017: precision medicine track. In: TREC, Gaithersburg, MD (2017)

    Google Scholar 

  21. Wang, Y., Fang, H.: Combining term-based and concept-based representation for clinical retrieval. In: TREC, Gaithersburg, MD (2017)

    Google Scholar 

  22. Noh, J., Kavuluru, R.: Team UKNLP at TREC 2017 precision medicine track: a knowledge-based IR system with tuned query-time boosting. In: TREC, Gaithersburg, MD (2017)

    Google Scholar 

  23. Viswavarapu, L.K., Chen, J., Cleveland, A., Chen, H.: UNT precision medicine information retrieval at TREC 2017. In: TREC, Gaithersburg, MD (2017)

    Google Scholar 

  24. Goodwin, T.R., Skinner, M.A., Harabagiu, S.M.: UTD HLTRI at TREC 2017: precision medicine track. In: TREC, Gaithersburg, MD (2017)

    Google Scholar 

  25. Azzopardi, L., et al.: The lucene for information access and retrieval research (LIARR) workshop at SIGIR 2017. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, Shinjuku, Tokyo, Japan, pp. 1429–1430. ACM (2017)

    Google Scholar 

  26. Yang, P., Fang, H., Lin, J.: Anserini: enabling the use of lucene for information retrieval research. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, Shinjuku, Tokyo, Japan, pp. 1253–1256. ACM (2017)

    Google Scholar 

  27. Cormack, G.V., Clarke, C.L., Buettcher, S.: Reciprocal rank fusion outperforms condorcet and individual rank learning methods. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, Boston, MA, USA, pp. 758–759. ACM (2009)

    Google Scholar 

  28. Li, P.V., Thomas, P., Hawking, D.: Merging algorithms for enterprise search. In: Proceedings of the 18th Australasian Document Computing Symposium, pp. 42–49. ACM, New York (2013)

    Google Scholar 

  29. Nguyen, V., Karimi, S., Falamaki, S., Paris, C.: Benchmarking clinical decision support search. arXiv preprint arXiv:1801.09322 (2018)

  30. Previde, P., et al.: GeneDive: a gene interaction search and visualization tool to facilitate precision medicine. In: Proceedings of the Pacific Symposium on Biocomputing, pp. 590–601. World Scientific, Kohala Coast (2018)

    Google Scholar 

  31. Gonzalez-Hernandez, G., Sarker, A., O’Connor, K., Greene, C., Liu, H.: Advances in text mining and visualization for precision medicine. In: Proceedings of the Pacific Symposium on Biocomputing, pp. 590–601. World Scientific, Kohala Coast (2018)

    Google Scholar 

  32. Balaneshin-kordan, S., Kotov, A.: Optimization method for weighting explicit and latent concepts in clinical decision support queries. In: Proceedings of the 2016 ACM International Conference on the Theory of Information Retrieval, Newark, Delaware, USA, pp. 241–250. ACM (2016)

    Google Scholar 

  33. Wang, H., Zhang, Q., Yuan, J.: Semantically enhanced medical information retrieval system: a tensor factorization based approach. IEEE Access 5, 7584–7593 (2017)

    Article  Google Scholar 

  34. Bird, S., Loper, E.: NLTK: the natural language toolkit. In: Proceedings of the ACL 2004 on Interactive poster and demonstration sessions, Barcelona, Spain. Association for Computational Linguistics (2004)

    Google Scholar 

  35. Demner-Fushman, D., Rogers, W.J., Aronson, A.R.: MetaMap Lite: an evaluation of a new Java implementation of MetaMap. J. Am. Med. Inform. Assoc. 24(4), 841–844 (2017)

    Google Scholar 

  36. Medelyan, O., Frank, E., Witten, I.H.: Human-competitive tagging using automatic keyphrase extraction. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore. Association for Computational Linguistics (2009)

    Google Scholar 

  37. Crimp, R., Trotman, A.: Automatic term reweighting for query expansion. In: Proceedings of the 22nd Australasian Document Computing Symposium, Brisbane, QLD, Australia. ACM (2017)

    Google Scholar 

  38. Meng, R., Zhao, S., Han, S., He, D., Brusilovsky, P., Chi, Y.: Deep keyphrase generation. arXiv preprint arXiv:1704.06879 (2017)

  39. Kathait, S.S., Tiwari, S., Varshney, A., Sharma, A.: Unsupervised key-phrase extraction using noun phrases. Int. J. Comput. Appl. 162(1), 1–5 (2017)

    Google Scholar 

  40. Habibi, M., Weber, L., Neves, M., Wiegandt, D.L., Leser, U.: Deep learning with word embeddings improves biomedical named entity recognition. Bioinformatics 33(14), i37–i48 (2017)

    Article  Google Scholar 

  41. Holzinger, A.: Interactive machine learning for health informatics: when do we need the human-in-the-loop? Brain Inform. 3(2), 119–131 (2016)

    Article  Google Scholar 

  42. Allison, P.D.: Change scores as dependent variables in regression analysis. Sociol. Methodol. 20, 93–114 (1990)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jiangping Chen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chen, H., Ding, J., Chen, J., Cao, G. (2018). Designing a Novel Framework for Precision Medicine Information Retrieval. In: Chen, H., Fang, Q., Zeng, D., Wu, J. (eds) Smart Health. ICSH 2018. Lecture Notes in Computer Science(), vol 10983. Springer, Cham. https://doi.org/10.1007/978-3-030-03649-2_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-03649-2_16

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-03648-5

  • Online ISBN: 978-3-030-03649-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics