Skip to main content

Biomedical Scientific Textual Data Types and Processing

  • Reference work entry
Encyclopedia of Database Systems
  • 170 Accesses

Synonyms

Scientific knowledge bases; Biomedical literature; MEDLINE/PubMed; Curation; Annotation; Information retrieval; Information retrieval models/metrics/operations; Indexing; Semi-structured text retrieval; Text extraction; Text mining; Web search and crawling

Definition

Vast amounts of biomedical scientific information and knowledge are recorded in text [1,7]. Various scientific textual data in the biomedical domain may generally be disseminated through the following resources [7,11]: biomedical literature (e.g., original reports and summaries of research in journals, books, reports, and guidelines), biological databases (e.g., annotations in gene/protein databases), patient records (e.g., clinical narrative reports), and web content.

A variety of techniques have been applied to identify, extract, manage, integrate and exploit knowledge from biomedical text. Some researchers [11] divide biomedical scientific textual data processing into three major activities as shown in Figure 1...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 2,500.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Recommended Reading

  1. H., Chen C., Friedman W., and Hersh S.S. (eds.) Fuller Medical Informatics: Knowledge Management and Data Mining in Biomedicine. Springer, Secaucus, NJ, 2005.

    Google Scholar 

  2. Cohen A.M. and Hersh W.R. A survey of current work in biomedical text mining. Brief Bioinform., 6(1):57–71, 2005.

    Article  Google Scholar 

  3. Donaldson I., Martin J., deBruijn B., Wolting C., Lay V., Tuekam B., Zhang S., Baskin B., Bader G., Michalickova K., et al. PreBIND and Textomy – mining the biomedical literature for protein-protein interactions using a support vector machine. BMC Bioinformatics, 4:11, 2003.

    Article  Google Scholar 

  4. Friedman C., Kra P., Yu H., Krauthammer M., and Rzhetsky A. GENIES: a natural-language processing system for the extraction of molecular pathways from journal articles. Bioinformatics, 17 (Suppl 1):S74–S82, 2001.

    Google Scholar 

  5. Gaizauskas R., Demetriou G., Artymiuk P.J., and Willett P. Protein structures and information extraction from biological texts: the PASTA system. Bioinformatics, 19(1):135–143, 2003.

    Article  Google Scholar 

  6. Hearst M. Untangling text data mining. In Proc. 27th Annual Meeting of the Assoc. for Computational Linguistics, 1999.

    Google Scholar 

  7. Hersh W. Information Retrieval: A Health and Biomedical Perspective. Springer, NY, 2003.

    Google Scholar 

  8. Hersh W., Cohen A., Roberts P., and Rekapalli H.K. TREC 2006 genomics track overview. In Proc. TREC 2006. Available at: http://trec.nist.gov/pubs/trec15/papers/GEO06. OVERVIEW.pdf

  9. Hoffmann R. and Valencia A. July 2004.A gene network for navigating the literature. Nat. Genet., 36(7):664,

    Article  Google Scholar 

  10. Hristovski D. and Peterlin B. Literature-based disease candidate gene discovery. In Proc. Medinfo. American Medical Informatics Association, Bethesda, 2004, p. 1649.

    Google Scholar 

  11. Natarajan J., Berrar D., Hack C.J., and Dubizky W. Knowledge discovery in biology and biotechnology texts: a review of techniques, evaluation strategies, and applications. Crit. Rev. Biotechnol., (25):31–52, 2005.

    Article  Google Scholar 

  12. Smalheiser N. and Swanson D. Using ARROWSMITH: a computer-assisted approach to formulating and assessing scientific hypotheses. Comput. Methods Programs Biomed., 57:149–153, 1998.

    Article  Google Scholar 

  13. Swanson D.R. Complementary structure in disjoint science literatures. In Proc. 23rd Annual Int. ACM SIGIR Conf. on Research and Development in Information Retrieval, 1990, pp. 280–289.

    Google Scholar 

  14. Yeh A.S., Hirschman L., and Morgan A.A. Evaluation of text data mining for database curation: lessons learned from the KDD Challenge Cup. Bioinformatics, 19 (Suppl 1):i331–i339, 2003.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer Science+Business Media, LLC

About this entry

Cite this entry

Zhou, L., Xu, H. (2009). Biomedical Scientific Textual Data Types and Processing. In: LIU, L., ÖZSU, M.T. (eds) Encyclopedia of Database Systems. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-39940-9_495

Download citation

Publish with us

Policies and ethics