Multilayer Data and Document Stratification for Comorbidity Analysis

Heffernan, Kevin; Liò, Pietro; Teufel, Simone

doi:10.1007/978-3-319-67834-4_17

Part of the book series: Lecture Notes in Computer Science (LNBI, volume 10477)

Included in the following conference series:

International Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics

Abstract

In this work, we introduce two novel contributions to the study of comorbidity. The first is a new method for finding disease correlations using multiple information sources. In the era of big data, methods such as evidence synthesis enable researchers to exploit many freely available information sources to enrich their analyses. This forms the basis for our method: in lieu of examining a single form of evidence, we introduce a novel combination of sources that provides an indirect association between patient genetic data and the scientific literature. Our second contribution is a new method for stratifying the scientific literature when searching for newly discovered disease correlations. Because the volume of published biomedical literature has increased dramatically, a clinician cannot read every relevant article. We therefore propose a new way of refining the literature search space to discover recently introduced disease correlations. Results show that our system can produce reasonable hypotheses for disease correlations, and that document stratification is an important aspect to take into account when using the scientific literature.

Notes

1. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE38642

Acknowledgements

This work has been supported by the EPSRC. We thank the reviewers for their helpful comments.

Author information

Authors and Affiliations

Computer Laboratory, University of Cambridge, 15 JJ Thomson Avenue, Cambridge, CB3 0FD, UK
Kevin Heffernan, Pietro Liò & Simone Teufel

Corresponding author

Correspondence to Kevin Heffernan.

Editor information

Editors and Affiliations

Computing Science and Mathematics, University of Stirling, Stirling, United Kingdom
Andrea Bracciali
School of Informatics, University of Edinburgh, Edinburgh, United Kingdom
Giulio Caravagna
Department of Computer Science, Brunel University London, Uxbridge, Middlesex, United Kingdom
David Gilbert
Department of Management and Innovation Systems DISA-MIS, University of Salerno, Fisciano, Italy
Roberto Tagliaferri

About this paper

Cite this paper

Heffernan, K., Liò, P., Teufel, S. (2017). Multilayer Data and Document Stratification for Comorbidity Analysis. In: Bracciali, A., Caravagna, G., Gilbert, D., Tagliaferri, R. (eds) Computational Intelligence Methods for Bioinformatics and Biostatistics. CIBB 2016. Lecture Notes in Computer Science, vol 10477. Springer, Cham. https://doi.org/10.1007/978-3-319-67834-4_17

DOI: https://doi.org/10.1007/978-3-319-67834-4_17
Published: 17 October 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-67833-7
Online ISBN: 978-3-319-67834-4
eBook Packages: Computer Science, Computer Science (R0)
