Skip to main content

Abstract

In this work the problem associated to the obtaining of protein patterns associated with certain cancer types starting from biomedical texts is presented. The research is based on the study of the application of text mining and retrieval techniques to biomedical texts and its adaptation to this problem.Our goal is to annotate a significant corpus of biomedical texts, select the more relevant ones and to train machine learning methods to automatically categorize them along certain dimensions that we have previously defined. The idea behind this project is to identify a group of proteins associated with different cancer types.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Zhou, D., He, Y.: Extracting Interactions between Proteins from the Literature. J. Biomed. Informatics 41(2), 393–407 (2008)

    Article  Google Scholar 

  2. Corney, D.P.A., Buxton, B.F., Langdon, W.B., Jones, D.T.: BioRAT: Extracting Biological Information from Full-length Papers. Bioinformatics 20(17), 3206–3213 (2004)

    Article  Google Scholar 

  3. Rindflesch, T.C., Tanabe, L., Weinstein, J.N., Hunter, L.: EDGAR: Extraction of Drugs, Genes and Relations from the Biomedical Literature. In: Pacific Symp. Biocomputing, pp. 515–524 (2000)

    Google Scholar 

  4. Rzhetsky, A., Iossifov, I., Koike, T., Krauthammer, M., Kra, P.: GeneWays: a System for Extracting, Analyzing, Visualizing, and Integrating Molecular Pathway Data. J. Biomed. Informatics 37(1), 43–53 (2004)

    Article  Google Scholar 

  5. Hagit, S., Feldman, R.: Mining the Biomedical Literature in the Genomic Era: An Overview. J. Comp. Biology 10(6), 821–855 (2003)

    Article  Google Scholar 

  6. Chapman, W.W., Bridewell, W., Hanbury, P., Cooper, G.F., Buchanan, B.G.: A simple algorithm for identifying negated findings and diseases in discharge summaries. J. Biomed. Inform. 34, 301–310 (2001)

    Article  Google Scholar 

  7. Ruch, P.: Automatic Assignment of Biomedical Categories: Toward a Generic Approach. Bioinformatics 22(6), 658–664 (2006)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Carrera, P.V. et al. (2009). Applying Text Mining to Search for Protein Patterns. In: Omatu, S., et al. Distributed Computing, Artificial Intelligence, Bioinformatics, Soft Computing, and Ambient Assisted Living. IWANN 2009. Lecture Notes in Computer Science, vol 5518. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02481-8_145

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-02481-8_145

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-02480-1

  • Online ISBN: 978-3-642-02481-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics