demonstration

DIDO: a disease-determinants ontology from web sources

Published: 28 March 2011

ABSTRACT

This paper introduces DIDO, a system providing convenient access to knowledge about factors involved in human diseases, automatically extracted from textual Web sources. The knowledge base is bootstrapped by integrating entities from hand-crafted sources like MeSH and OMIM. As these sources contain few relationships between different types of biomedical entities, DIDO employs flexible and robust pattern learning and constraint-based reasoning methods to automatically extract new relational facts from textual sources. These facts can then be iteratively added to the knowledge base. The result is a semantic graph of typed entities and relations between diseases, their symptoms, and their factors, with emphasis on environmental factors but also covering molecular determinants. We demonstrate the value of DIDO for knowledge discovery about causal factors and properties of complex diseases, including factor-disease chains.
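
To make the idea of a semantic graph of typed entities and of factor-disease chains more concrete, the following is a minimal sketch in Python. It is not DIDO's data model or code: the class `FactGraph`, the entity names (`factor_A`, `gene_B`, `disease_X`, `disease_Y`), the type labels, and the relation names are all hypothetical placeholders chosen for illustration, and the chain search is a plain bounded depth-first traversal rather than the constraint-based reasoning the abstract refers to.

```python
class FactGraph:
    """Toy graph of typed entities and relational facts (illustrative only)."""

    def __init__(self):
        self.types = {}   # entity name -> type label
        self.edges = {}   # subject -> list of (relation, object)

    def add_entity(self, name, etype):
        self.types[name] = etype

    def add_fact(self, subj, relation, obj):
        # Require typed entities so every fact connects known node types.
        if subj not in self.types or obj not in self.types:
            raise KeyError("both entities must be typed before adding a fact")
        self.edges.setdefault(subj, []).append((relation, obj))

    def chains(self, start, target_type, max_len=3):
        """Return all cycle-free paths (lists of triples) of at most max_len
        edges that lead from `start` to an entity of `target_type`."""
        results = []

        def walk(node, path, seen):
            if path and self.types[node] == target_type:
                results.append(list(path))
            if len(path) == max_len:
                return
            for relation, nxt in self.edges.get(node, ()):
                if nxt not in seen:
                    path.append((node, relation, nxt))
                    walk(nxt, path, seen | {nxt})
                    path.pop()

        walk(start, [], {start})
        return results


# Hypothetical placeholder facts, not extracted from any real source.
g = FactGraph()
g.add_entity("factor_A", "environmental_factor")
g.add_entity("gene_B", "gene")
g.add_entity("disease_X", "disease")
g.add_entity("disease_Y", "disease")
g.add_fact("factor_A", "affects", "gene_B")
g.add_fact("gene_B", "associated_with", "disease_X")
g.add_fact("disease_X", "increases_risk_of", "disease_Y")

found = g.chains("factor_A", "disease")
for chain in found:
    print(" -> ".join([chain[0][0]] + [obj for _, _, obj in chain]))
```

In this sketch, a chain is simply a path whose last node has the requested type, so a query starting from an environmental factor surfaces both the disease it reaches through an intermediate gene and a further disease linked to the first one.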

    • Published in

      WWW '11: Proceedings of the 20th international conference companion on World wide web
      March 2011
      552 pages
      ISBN: 9781450306379
      DOI: 10.1145/1963192

      Copyright © 2011 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Acceptance Rates

      Overall Acceptance Rate: 1,899 of 8,196 submissions, 23%
