Abstract
The PRIDE Proteomics Identifications Database provides users with the ability to explore and compare mass spectrometry-based proteomics experiments that reveal details of the protein expression found in a broad range of taxonomic groups, tissues and disease states. A PRIDE experiment typically includes identifications of proteins, peptides and protein modifications. Many of the submitted experiments also include processed peak lists representing the mass spectra that provide the evidence for these identifications.
Since the inception of the PRIDE project, a number of tools supporting submission of data to PRIDE have been developed. Of particular note is the “PRIDE Converter” that has become the tool most frequently used for the production of PRIDE submissions at the time of writing.
The PRIDE XML format has been expanded to provide submitters with the capacity to annotate fragment ion information on to peptide identifications and the fragmentation spectra that provide the experimental evidence for these peptides. A novel algorithm for annotating fragment ion information on to peptides and their evidential mass spectra has also been developed that will ultimately provide a route for evaluating the quality of peptide identifications arising from tandem mass spectrometry. This algorithm allows the visualisation of potential fragment ions on to the identified mass spectra, even where no such information has been submitted.
In this chapter, we describe how PRIDE can be applied as a research tool and how the experiments in PRIDE can be compared and analysed. We also explore how complex queries can be constructed using the PRIDE BioMart. Finally, we will describe how the user can integrate PRIDE data with annotation from other resources, using federated BioMart queries.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Martens, L., Hermjakob, H., Jones, P. et al. (2005) Pride: the proteomics identifications database. Proteomics 5, 3537-45.
Jones, P., Côté, R.G., Cho, S.Y. et al. (2008) Pride: new developments and new datasets. Nucleic Acids Res 36, D878-83.
Kasprzyk, A., Keefe, D., Smedley, D. et al. (2004) Ensmart: a generic system for fast and flexible access to biological data. Genome Res 14, 160-9.
Benson, D.A., Karsch-Mizrachi, I., Lipman, D.J. et al. (2000) Genbank. Nucleic Acids Res 28, 15-18.
Wheeler, DL., Chappey, C., Lash, AE. et al. (2000) Database resources of the national center for biotechnology information. Nucleic Acids Res 28, 10-14.
Côté, R.G., Jones, P., Martens, L. et al. (2008) The ontology lookup service: more data and better tools for controlled vocabulary queries. Nucleic Acids Res 36, W372-6.
Jones, P. and Côté, R. (2008) The pride proteomics identifications database: data submission, query, and dataset comparison. Methods Mol Biol 484, 287-303.
Siepen, J.A., Swainston, N., Jones, A.R. et al. (2007) An informatic pipeline for the data capture and submission of quantitative proteomic data using itraq. Proteome Sci 5, 4.
(2007) Democratizing proteomics data. Nat Biotechnol 25, 262.
(2007) Time for leadership. Nat Biotechnol 25, 821.
(2007) Mind the technology gap. Nat Methods 4, 765.
(2008) Thou shalt share your data. Nat Methods 5, 209.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Humana Press, a part of Springer Science+Business Media, LLC
About this protocol
Cite this protocol
Jones, P., Martens, L. (2010). Using the PRIDE Proteomics Identifications Database for Knowledge Discovery and Data Analysis. In: Hubbard, S., Jones, A. (eds) Proteome Bioinformatics. Methods in Molecular Biology™, vol 604. Humana Press. https://doi.org/10.1007/978-1-60761-444-9_20
Download citation
DOI: https://doi.org/10.1007/978-1-60761-444-9_20
Published:
Publisher Name: Humana Press
Print ISBN: 978-1-60761-443-2
Online ISBN: 978-1-60761-444-9
eBook Packages: Springer Protocols