skip to main content
10.1145/1651318.1651329acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

MKEM: a multi-level knowledge emergence model for mining undiscovered public knowledge

Published:06 November 2009Publication History

ABSTRACT

Since Swanson proposed the Undiscovered Public Knowledge (UPK) model, there have been many approaches to uncover UPK by mining the biomedical literature. These earlier works, however, required substantial manual intervention to reduce the number of possible connections and are mainly applied to disease-effect relation. With the advancement in biomedical science, it has become imperative to extract and combine information from multiple disjoint researches, studies and articles to infer new hypothesesand expand knowledge. In this paper, we propose MKEM, a Multi-level Knowledge Emergence Model, to discover implicit relationships using Natural Language Processing techniques such as Link Grammar and Ontologies such as Unified Medical Language System (UMLS) MetaMap. The contribution of MKEM is as follows: First, we propose a flexible knowledge emergence model to extract implicit relationships across different levels such as molecular level for gene and protein and Phenomic level for disease and treatment. Second, we employ MetaMap for tagging biological concepts. Third, we provide an empirical and systematic approach to discover novel relationships. Our experiments show that MKEM is a powerful tool to discover hidden relationships residing in extracted entities that were represented by our Substance-Effect-Process-Disease-Body Part (SEPDB) model.

References

  1. Swanson, DR. 1986. Fish-oil, Raynaud's Syndrome, and undiscovered public knowledge. Perspectives in Biology and Medicine, 30(1):7--18.Google ScholarGoogle ScholarCross RefCross Ref
  2. Swanson, DR. 1986. Undiscovered public knowledge. Library Quarterly. 56(2):103--118.Google ScholarGoogle ScholarCross RefCross Ref
  3. Swanson, DR. 2001. On the fragmentation of knowledge, the connection explosion and assembling other people's ideas. Bull. Amer. Soc. Inf. Sci. Technol. (Feb 2001) Vol. 27, no 3, pp. 12--14.Google ScholarGoogle Scholar
  4. Lindsay, R. K, and Gordon, M. D. 1999. Literature-based discovery by lexical statistics. Journal of the American Society for Information Science, 50(7):574--587, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Pratt, Wanda and Yetisgen-Yildiz, Meliha. 2003. LitLinker: capturing connections across the biomedical literature. K-CAP'03 (Oct. 23-25, 2003) pp. 105--112, Sanibel Island, FL. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Srinivasan, P., 2004. Text mining: Generating hypotheses from MEDLINE. Journal of the American Society for Information Science, 2004, Vol. 55, No. 4, pp. 396--413 Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Weeber, M., Vos, R., Klein, H., de Jong-Van den Berg, L.T.W., Aronson, A&Molema, G. 2003. Generating hypotheses by discovering implicit associations in the literature: A case report for new potential therapeutic uses for Thalidomide. Journal of the American Medical Informatics Association, 10(3):252--259, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  8. Hristovski D, Stare J, Peterlin B, and Dzeroski S. 2001. Supporting discovery in medicine by association rule mining in Medline and UMLS. Medinfo. 2001, 10(Pt 2), 1344--8.Google ScholarGoogle Scholar
  9. Atkinson R and Rivas A. 2008. Discovering Novel Causal Patterns from Biomedical Natural-Language Texts using Bayesian Nets. IEEE Transactions on Information technology in Biomedicine (November 2008), Vol 12, No. 6. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Srinivasan, P. 2004. Text mining: Generating hypotheses from MEDLINE, Journal of the American Society for Information Science, 2004, Vol. 55, No. 4, pp. 396--413 Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Agrawal, R., et al. 1995. Fast Discovery of Association Rules, Advances in Knowledge Discovery and Data Mining, U. Fayyad, et al., Editors. 1995, AAAI/MIT Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Gordon, MD, Dumais, S. 1998. Using latent semantic indexing for literature based discovery. Journal of the American Society for Information Science, 49 : 674--685. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Gordon, MD, Lindsay, RK. 1996. Toward discovery support systems: a replication, re-examination and extension of Swanson's work on literature-based discovery of a connection between Raynaud's and fish oil. Journal of the American Society for Information Science, 47 : 116--128. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Lindsay, RK, Gordon, MD. 1999 Literature-based discovery by lexical statistics. Journal of the American Society for Information Science, 50: 574--587. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Small Molecule Subgraph Detector --http://www.ebi.ac.uk/thorntonsrv/software/SMSD/Google ScholarGoogle Scholar
  16. Hu X., Yoo I., Song M., Zhang Y., Song I-Y. 2005. Mining Undiscovered Public Knowledge from Complementary and Non-interactive Biomedical Literature through Semantic Pruning. in ACM Fourteen Conference on Information and Knowledge Management (ACM CIKM 2005). Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Dae-Hee Lee, Juong G Rhee and Yong J Lee. 2009. Reactive oxygen species up-regulate p53 and Puma; a possible mechanism for apoptosis during combined treatment with TRAIL and wogonin. British Journal of Pharmacology (2009)Google ScholarGoogle Scholar
  18. Mao Li, Zhuo Zhang, Donald L. Hill, Xinbin Chen, Hui Wang, and Ruiwen Zhang. 2005. Genistein, a Dietary Isoflavone, Down-Regulates the MDM2 Oncogene at Both Transcriptional and Posttranslational Levels. Cancer Res. 2005 65: (18).Google ScholarGoogle Scholar
  19. Srinivasan, Padmini, and Thomas C. Rindflesch. 2002. Exploring text mining from MEDLINE. Proceedings of the 2002 AMIA Annual Symposium, 722--6.Google ScholarGoogle Scholar
  20. Rindflesch TC, Fiszman M. 2003. The Interaction of Domain Knowledge and Linguistic Structure in Natural Language Processing: Interpreting Hypernymic Propositions. In Biomedical Text Journal of Biomedical Informatics. 2003;36(6):462--77. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Fiszman Marcelo, Thomas C. Rindflesch, Halil Kilicoglu. 2003. Integrating a hypernymic proposition interpreter into a semantic processor for biomedical text. Proceedings of the 2003 AMIA Annual Symposium.Google ScholarGoogle Scholar

Index Terms

  1. MKEM: a multi-level knowledge emergence model for mining undiscovered public knowledge

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in
              • Published in

                cover image ACM Conferences
                DTMBIO '09: Proceedings of the third international workshop on Data and text mining in bioinformatics
                November 2009
                106 pages
                ISBN:9781605588032
                DOI:10.1145/1651318

                Copyright © 2009 ACM

                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 6 November 2009

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • research-article

                Acceptance Rates

                DTMBIO '09 Paper Acceptance Rate8of18submissions,44%Overall Acceptance Rate41of247submissions,17%

                Upcoming Conference

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader