Abstract
Precision medicine information retrieval (PMIR) is about matching the most relevant scientific articles to an individual patient for reliable disease treatment. The corresponding Precision Medicine (PM) Track organized by 2017 Text REtrieval Conference [1] provides a test collection for evaluating the performance of PMIR techniques for finding reliable medical evidence. It significantly facilitates PMIR research and system development. However, the performance of current PMIR systems is still far from satisfactory. This study aims to investigate the application of the latest information retrieval and text mining techniques to PMIR. Based on a review of previous efforts and approaches, we propose three promising techniques: keyphrase extraction for indexing, hybrid query expansion including word embeddings, and retrieval results re-ranking with supervised regression analysis for PMIR. A novel framework for PMIR is therefore designed. A PMIR system based on this framework will be implemented and tested using 2017 and 2018 TREC Precision Medicine Track datasets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
TREC Precision Medicine/Clinical Decision Support Track. http://www.trec-cds.org/2017.html. Accessed 09 Apr 2018
Roberts, K., et al.: Overview of the TREC 2017 precision medicine track. In: TREC, Gaithersburg, MD (2017)
Collins, F.S., Varmus, H.: A new initiative on precision medicine. N. Engl. J. Med. 372(9), 793–795 (2015)
Frey, L.J., Bernstam, E.V., Denny, J.C.: Precision medicine informatics. J. Am. Med. Inform. Assoc. 23(4), 668–670 (2016)
Aronson, S.J., Rehm, H.L.: Building the foundation for genomics in precision medicine. Nature 526(7573), 336–342 (2015)
National Research Council: Toward precision medicine: building a knowledge network for biomedical research and a new taxonomy of disease. National Academies Press, Washington DC (2011)
The Twenty-Sixth Text REtrieval Conference (TREC 2017) Proceedings. https://trec.nist.gov/pubs/trec26/trec2017.html. Accessed 09 Apr 2018
Paschea, E., et al. Customizing a variant annotation-support tool: an inquiry into probability ranking principles for TREC precision medicine. In: TREC, Gaithersburg, MD (2017)
Jo, S.H., Lee, K.S.: CBNU at TREC 2017 precision medicine track. In: TREC, Gaithersburg, MD (2017)
Nguyen, V., Karimi, S., Falamaki, S., Molla-Aliod, D., Paris, C., Wan, S.: CSIRO at 2017 TREC precision medicine track. In: TREC, Gaithersburg, MD (2017)
Foroutan Eghlidi, N., Griner, J., Mesot, N., von Werra, L., Eickhoff, C.: ETH Zurich at TREC precision medicine 2017. In: TREC, Gaithersburg, MD (2017)
Wu, J., Ma, X., Fan, W.: HokieGo at 2017 PM task: genetic programming based re-ranking method in biomedical information retrieval. In: TREC, Gaithersburg, MD (2017)
García, P.L., Oleynik, M., Kasáč, Z., Schulz, S.: TREC 2017 precision medicine - medical university of Graz. In: TREC, Gaithersburg, MD (2017)
Wang, Y., Komandur-Elayavilli, R., Rastegar-Mojarad, M., Liu, H.: Leveraging both structured and unstructured data for precision information retrieval. In: TREC, Gaithersburg, MD (2017)
Yin, T., Wu, D.T., Vydiswaran, V.V.: Retrieving documents based on gene name variations: MedIER at TREC 2017 precision medicine track. In: TREC, Gaithersburg, MD (2017)
Przybyla, P., Soto, A.J., Ananiadou, S.: Identifying personalised treatments and clinical trials for precision medicine using semantic search with thalia. In: TREC, Gaithersburg, MD (2017)
Cieślewicz, A., Dutkiewicz, J., Jędrzejek, C.: POZNAN contribution to TREC PM 2017. In: TREC, Gaithersburg, MD (2017)
Ling, Y., et al.: A hybrid approach to precision medicine-related biomedical article retrieval and clinical trial matching. In: TREC, Gaithersburg, MD (2017)
Li, C., He, B., Sun, Y., Xu, J.: UCAS at TREC-2017 precision medicine track. In: TREC, Gaithersburg, MD (2017)
Mahmood, A.A., et al.: UD_GU_BioTM at TREC 2017: precision medicine track. In: TREC, Gaithersburg, MD (2017)
Wang, Y., Fang, H.: Combining term-based and concept-based representation for clinical retrieval. In: TREC, Gaithersburg, MD (2017)
Noh, J., Kavuluru, R.: Team UKNLP at TREC 2017 precision medicine track: a knowledge-based IR system with tuned query-time boosting. In: TREC, Gaithersburg, MD (2017)
Viswavarapu, L.K., Chen, J., Cleveland, A., Chen, H.: UNT precision medicine information retrieval at TREC 2017. In: TREC, Gaithersburg, MD (2017)
Goodwin, T.R., Skinner, M.A., Harabagiu, S.M.: UTD HLTRI at TREC 2017: precision medicine track. In: TREC, Gaithersburg, MD (2017)
Azzopardi, L., et al.: The lucene for information access and retrieval research (LIARR) workshop at SIGIR 2017. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, Shinjuku, Tokyo, Japan, pp. 1429–1430. ACM (2017)
Yang, P., Fang, H., Lin, J.: Anserini: enabling the use of lucene for information retrieval research. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, Shinjuku, Tokyo, Japan, pp. 1253–1256. ACM (2017)
Cormack, G.V., Clarke, C.L., Buettcher, S.: Reciprocal rank fusion outperforms condorcet and individual rank learning methods. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, Boston, MA, USA, pp. 758–759. ACM (2009)
Li, P.V., Thomas, P., Hawking, D.: Merging algorithms for enterprise search. In: Proceedings of the 18th Australasian Document Computing Symposium, pp. 42–49. ACM, New York (2013)
Nguyen, V., Karimi, S., Falamaki, S., Paris, C.: Benchmarking clinical decision support search. arXiv preprint arXiv:1801.09322 (2018)
Previde, P., et al.: GeneDive: a gene interaction search and visualization tool to facilitate precision medicine. In: Proceedings of the Pacific Symposium on Biocomputing, pp. 590–601. World Scientific, Kohala Coast (2018)
Gonzalez-Hernandez, G., Sarker, A., O’Connor, K., Greene, C., Liu, H.: Advances in text mining and visualization for precision medicine. In: Proceedings of the Pacific Symposium on Biocomputing, pp. 590–601. World Scientific, Kohala Coast (2018)
Balaneshin-kordan, S., Kotov, A.: Optimization method for weighting explicit and latent concepts in clinical decision support queries. In: Proceedings of the 2016 ACM International Conference on the Theory of Information Retrieval, Newark, Delaware, USA, pp. 241–250. ACM (2016)
Wang, H., Zhang, Q., Yuan, J.: Semantically enhanced medical information retrieval system: a tensor factorization based approach. IEEE Access 5, 7584–7593 (2017)
Bird, S., Loper, E.: NLTK: the natural language toolkit. In: Proceedings of the ACL 2004 on Interactive poster and demonstration sessions, Barcelona, Spain. Association for Computational Linguistics (2004)
Demner-Fushman, D., Rogers, W.J., Aronson, A.R.: MetaMap Lite: an evaluation of a new Java implementation of MetaMap. J. Am. Med. Inform. Assoc. 24(4), 841–844 (2017)
Medelyan, O., Frank, E., Witten, I.H.: Human-competitive tagging using automatic keyphrase extraction. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore. Association for Computational Linguistics (2009)
Crimp, R., Trotman, A.: Automatic term reweighting for query expansion. In: Proceedings of the 22nd Australasian Document Computing Symposium, Brisbane, QLD, Australia. ACM (2017)
Meng, R., Zhao, S., Han, S., He, D., Brusilovsky, P., Chi, Y.: Deep keyphrase generation. arXiv preprint arXiv:1704.06879 (2017)
Kathait, S.S., Tiwari, S., Varshney, A., Sharma, A.: Unsupervised key-phrase extraction using noun phrases. Int. J. Comput. Appl. 162(1), 1–5 (2017)
Habibi, M., Weber, L., Neves, M., Wiegandt, D.L., Leser, U.: Deep learning with word embeddings improves biomedical named entity recognition. Bioinformatics 33(14), i37–i48 (2017)
Holzinger, A.: Interactive machine learning for health informatics: when do we need the human-in-the-loop? Brain Inform. 3(2), 119–131 (2016)
Allison, P.D.: Change scores as dependent variables in regression analysis. Sociol. Methodol. 20, 93–114 (1990)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Chen, H., Ding, J., Chen, J., Cao, G. (2018). Designing a Novel Framework for Precision Medicine Information Retrieval. In: Chen, H., Fang, Q., Zeng, D., Wu, J. (eds) Smart Health. ICSH 2018. Lecture Notes in Computer Science(), vol 10983. Springer, Cham. https://doi.org/10.1007/978-3-030-03649-2_16
Download citation
DOI: https://doi.org/10.1007/978-3-030-03649-2_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-03648-5
Online ISBN: 978-3-030-03649-2
eBook Packages: Computer ScienceComputer Science (R0)