Abstract
In proteomics, rapid technological advances are generating very large data sets. Consequently, in 2006 the European Commission funded a Coordination Action named ProDaC (Proteomics Data Collection) within the 6th EU Framework Programme to foster community-wide data collection and data sharing. The aims of ProDaC were the development of documentation and storage standards, the setup of a standardized data submission pipeline, and the collection of data.
To reach these goals, the work was structured into six thematic fields (work packages): Standards for Proteomics Data Representation, Standards Implementation, Data Integration Tools, Proteomics Repository Adaptation, Data Flow Management, and Proteomics Data Exploitation. The methods underlying each field and the results achieved are described in the following sections.
Abbreviations
- HUPO: Human Proteome Organisation
- PSI: Proteomics standards initiative
- DCC: Data collection center
- ProDaC: Proteomics data collection
- LIMS: Laboratory information management system
- XML: eXtensible markup language
- PRIDE: Proteomics identification database
- CV: Controlled vocabulary
Acknowledgements
This work was funded by ProDaC (European Commission project, 6th Framework Programme, project number LSHG-CT-2006-036814).
Copyright information
© 2010 Humana Press, a part of Springer Science+Business Media, LLC
About this protocol
Cite this protocol
Stephan, C., Eisenacher, M., Kohl, M., Meyer, H.E. (2010). Proteomics Data Collection (ProDaC): Publishing and Collecting Proteomics Data Sets in Public Repositories Using Standard Formats. In: Hubbard, S., Jones, A. (eds) Proteome Bioinformatics. Methods in Molecular Biology™, vol 604. Humana Press. https://doi.org/10.1007/978-1-60761-444-9_24
DOI: https://doi.org/10.1007/978-1-60761-444-9_24
Publisher Name: Humana Press
Print ISBN: 978-1-60761-443-2
Online ISBN: 978-1-60761-444-9
eBook Packages: Springer Protocols