Using the PRIDE Proteomics Identifications Database for Knowledge Discovery and Data Analysis

Jones, Philip; Martens, Lennart

doi:10.1007/978-1-60761-444-9_20

Philip Jones³ &
Lennart Martens³

Part of the book series: Methods in Molecular Biology™ ((MIMB,volume 604))

4740 Accesses
5 Citations

Abstract

The PRIDE Proteomics Identifications Database provides users with the ability to explore and compare mass spectrometry-based proteomics experiments that reveal details of the protein expression found in a broad range of taxonomic groups, tissues and disease states. A PRIDE experiment typically includes identifications of proteins, peptides and protein modifications. Many of the submitted experiments also include processed peak lists representing the mass spectra that provide the evidence for these identifications.

Since the inception of the PRIDE project, a number of tools supporting submission of data to PRIDE have been developed. Of particular note is the “PRIDE Converter” that has become the tool most frequently used for the production of PRIDE submissions at the time of writing.

The PRIDE XML format has been expanded to provide submitters with the capacity to annotate fragment ion information on to peptide identifications and the fragmentation spectra that provide the experimental evidence for these peptides. A novel algorithm for annotating fragment ion information on to peptides and their evidential mass spectra has also been developed that will ultimately provide a route for evaluating the quality of peptide identifications arising from tandem mass spectrometry. This algorithm allows the visualisation of potential fragment ions on to the identified mass spectra, even where no such information has been submitted.

In this chapter, we describe how PRIDE can be applied as a research tool and how the experiments in PRIDE can be compared and analysed. We also explore how complex queries can be constructed using the PRIDE BioMart. Finally, we will describe how the user can integrate PRIDE data with annotation from other resources, using federated BioMart queries.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Protocol: USD 49.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 159.00; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Martens, L., Hermjakob, H., Jones, P. et al. (2005) Pride: the proteomics identifications database. Proteomics 5, 3537-45.
Article CAS PubMed Google Scholar
Jones, P., Côté, R.G., Cho, S.Y. et al. (2008) Pride: new developments and new datasets. Nucleic Acids Res 36, D878-83.
Article CAS PubMed Google Scholar
Kasprzyk, A., Keefe, D., Smedley, D. et al. (2004) Ensmart: a generic system for fast and flexible access to biological data. Genome Res 14, 160-9.
Article CAS PubMed Google Scholar
Benson, D.A., Karsch-Mizrachi, I., Lipman, D.J. et al. (2000) Genbank. Nucleic Acids Res 28, 15-18.
Article CAS PubMed Google Scholar
Wheeler, DL., Chappey, C., Lash, AE. et al. (2000) Database resources of the national center for biotechnology information. Nucleic Acids Res 28, 10-14.
Article CAS PubMed Google Scholar
Côté, R.G., Jones, P., Martens, L. et al. (2008) The ontology lookup service: more data and better tools for controlled vocabulary queries. Nucleic Acids Res 36, W372-6.
Article PubMed Google Scholar
Jones, P. and Côté, R. (2008) The pride proteomics identifications database: data submission, query, and dataset comparison. Methods Mol Biol 484, 287-303.
Article CAS PubMed Google Scholar
Siepen, J.A., Swainston, N., Jones, A.R. et al. (2007) An informatic pipeline for the data capture and submission of quantitative proteomic data using itraq. Proteome Sci 5, 4.
Article PubMed Google Scholar
(2007) Democratizing proteomics data. Nat Biotechnol 25, 262.
Google Scholar
(2007) Time for leadership. Nat Biotechnol 25, 821.
Google Scholar
(2007) Mind the technology gap. Nat Methods 4, 765.
Google Scholar
(2008) Thou shalt share your data. Nat Methods 5, 209.
Google Scholar

Download references

Author information

Authors and Affiliations

EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
Philip Jones & Lennart Martens

Authors

Philip Jones
View author publications
You can also search for this author in PubMed Google Scholar
Lennart Martens
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Philip Jones .

Editor information

Editors and Affiliations

Fac. Life Sciences, University of Manchester, Oxford Rd., Manchester, M13 9PT, United Kingdom
Simon J. Hubbard
Fac. Veterinary Science, University of Liverpool, Liverpool, L69 7ZJ, United Kingdom
Andrew R. Jones

Rights and permissions

Reprints and permissions

Copyright information

About this protocol

Cite this protocol

Jones, P., Martens, L. (2010). Using the PRIDE Proteomics Identifications Database for Knowledge Discovery and Data Analysis. In: Hubbard, S., Jones, A. (eds) Proteome Bioinformatics. Methods in Molecular Biology™, vol 604. Humana Press. https://doi.org/10.1007/978-1-60761-444-9_20

Download citation

DOI: https://doi.org/10.1007/978-1-60761-444-9_20
Published: 05 December 2009
Publisher Name: Humana Press
Print ISBN: 978-1-60761-443-2
Online ISBN: 978-1-60761-444-9
eBook Packages: Springer Protocols

Publish with us

Policies and ethics