Using the PRIDE Proteomics Identifications Database for Knowledge Discovery and Data Analysis

Protocol in: Proteome Bioinformatics

Part of the book series: Methods in Molecular Biology™ (MIMB, volume 604)

Abstract

The PRIDE Proteomics Identifications Database provides users with the ability to explore and compare mass spectrometry-based proteomics experiments that reveal details of the protein expression found in a broad range of taxonomic groups, tissues and disease states. A PRIDE experiment typically includes identifications of proteins, peptides and protein modifications. Many of the submitted experiments also include processed peak lists representing the mass spectra that provide the evidence for these identifications.

Since the inception of the PRIDE project, a number of tools supporting submission of data to PRIDE have been developed. Of particular note is the “PRIDE Converter”, which at the time of writing has become the tool most frequently used to produce PRIDE submissions.

The PRIDE XML format has been expanded so that submitters can annotate fragment ion information onto peptide identifications and onto the fragmentation spectra that provide the experimental evidence for these peptides. A novel algorithm for annotating fragment ion information onto peptides and their evidential mass spectra has also been developed; it will ultimately provide a route for evaluating the quality of peptide identifications arising from tandem mass spectrometry. This algorithm allows potential fragment ions to be visualised on the identified mass spectra, even where no such information has been submitted.
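
As an illustration of what fragment ion annotation involves, the following minimal Python sketch computes theoretical singly charged b- and y-ion m/z values for an unmodified peptide and matches them against an observed peak list. It is not the PRIDE algorithm, whose details are not given in this abstract; the function names, the peak-list representation (a plain list of m/z values) and the 0.5 m/z matching tolerance are all assumptions made for the example.

```python
# Monoisotopic residue masses (Da) for the 20 standard amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
PROTON = 1.007276   # mass of a proton (Da)
WATER = 18.010565   # monoisotopic mass of H2O (Da)


def fragment_ions(peptide):
    """Return (b_ions, y_ions): singly charged m/z values, unmodified peptide."""
    masses = [RESIDUE_MASS[aa] for aa in peptide]

    # b ions: N-terminal prefixes (the full-length prefix is excluded).
    b_ions, running = [], 0.0
    for mass in masses[:-1]:
        running += mass
        b_ions.append(running + PROTON)

    # y ions: C-terminal suffixes, which retain one water.
    y_ions, running = [], 0.0
    for mass in reversed(masses[1:]):
        running += mass
        y_ions.append(running + WATER + PROTON)

    return b_ions, y_ions


def annotate(peaks, peptide, tol=0.5):
    """Label observed peaks lying within `tol` m/z of a theoretical ion.

    `peaks` is a list of observed m/z values (hypothetical input format).
    Returns a list of (ion_label, observed_mz) pairs.
    """
    b_ions, y_ions = fragment_ions(peptide)
    theoretical = [(f"b{i}", mz) for i, mz in enumerate(b_ions, 1)]
    theoretical += [(f"y{i}", mz) for i, mz in enumerate(y_ions, 1)]

    matches = []
    for label, mz in theoretical:
        closest = min(peaks, key=lambda p: abs(p - mz), default=None)
        if closest is not None and abs(closest - mz) <= tol:
            matches.append((label, closest))
    return matches
```

Calling `annotate(observed_mz_values, "PEPTIDE")` would return the labels of any b or y ions found in the spectrum. A production approach would also need to handle modifications, multiple charge states and neutral losses, none of which this sketch attempts.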

In this chapter, we describe how PRIDE can be applied as a research tool and how the experiments in PRIDE can be compared and analysed. We also explore how complex queries can be constructed using the PRIDE BioMart. Finally, we describe how the user can integrate PRIDE data with annotation from other resources using federated BioMart queries.

Corresponding author

Correspondence to Philip Jones.

Copyright information

© 2010 Humana Press, a part of Springer Science+Business Media, LLC

About this protocol

Cite this protocol

Jones, P., Martens, L. (2010). Using the PRIDE Proteomics Identifications Database for Knowledge Discovery and Data Analysis. In: Hubbard, S., Jones, A. (eds) Proteome Bioinformatics. Methods in Molecular Biology™, vol 604. Humana Press. https://doi.org/10.1007/978-1-60761-444-9_20

  • DOI: https://doi.org/10.1007/978-1-60761-444-9_20

  • Publisher Name: Humana Press

  • Print ISBN: 978-1-60761-443-2

  • Online ISBN: 978-1-60761-444-9

  • eBook Packages: Springer Protocols
