skip to main content
10.1145/1854776.1854909acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
research-article

Using RDF for managing protein-protein interaction data

Published: 02 August 2010 Publication History

Abstract

Biological function is mediated and controlled to a large extent by interactions among proteins. The study of interactions about proteins has lead to the accumulation of a large amount of data, also referred to as protein-protein interaction (PPI) data. Such data, stored in publicly available databases, are often queried by using simple key-based query interfaces with little semantic. Current PPI databases enable the retrieval of one or more proteins that interact with a target protein. Nevertheless, a lot of biological information available on different sources (e.g. Gene Ontology) is not currently used in such databases.
Semantic Web technologies offer more possibilities for the storage, querying and mining of such data. Thus, the annotation of existing protein interaction databases with biological information and the application of Semantic web techniques for the storage and analysis may result in more powerful querying interfaces and may enable the development of novel algorithms for PPI analysis. In a previous work we showed how to use such annotations to realize an annotated PPI database, here we present the mapping of such relational database in RDF enabling the use of Semantic web techniques.
The main contributions of this paper are: (a) the annotation of a protein interaction database, (b) the mapping of resulting annotated data into Resource Description Framework (RDF), and (c) a semantic based querying interface. In particular, a semantic-based query interface enables users to query these data by using biological concepts.

References

[1]
D2RQ Treating Non-RDF Databases as Virtual RDF Graphs.
[2]
The home page of rdf-ppi project.
[3]
T. Aittokallio and B. Schwikowski. Graph-based methods for analysing networks in cell biology. Brief Bioinform, 7(3):243--255, 2006.
[4]
C. Alfarano, C. E. Andrade, K. Anthony, N. Bahroos, M. Bajec, K. Bantoft, D. Bete, B. Bobechko, K. Boutilier, E. Burgess, K. Buzadzija, C. R., D. C., I. Donaldson, D. Dorairajoo, M. J. Dumontie, M. R. Dumontier, V. Earles, R. Farral, H. Feldman, E. Garderman, Y. Gong, R. Gonzaga, V. Grytsan, E. Gryz, V. Gu, E. Haldorsen, A. Halupa, R. Haw, A. Hrvojic, L. Hurrell, R. Isserlin, F. Jack, F. Juma, A. Khan, T. Kon, S. Konopinsky, V. Le, E. Lee, S. Ling, M. Magidin, J. Moniakis, J. Montojo, S. Moore, B. Muskat, I. Ng, J. P. Paraiso, B. Parker, G. Pintilie, R. Pirone, J. J. Salama, S. Sgro, T. Shan, Y. Shu, J. Siew, D. Skinner, K. Snyder, R. Stasiuk, D. Strumpf, B. Tuekam, S. Tao, Z. Wang, M. White, R. Willis, C. Wolting, S. Wong, A. Wrong, C. Xin, R. Yao, Y. B., S. Zhang, K. Zheng, T. Pawson, B. F. Ouellette, and C. W. Hogue. The biomolecular interaction network database and related tools 2005 update. Nucleic Acids Res, 33(Database issue):418--424, January 2005.
[5]
D. Allemang and J. Hendler. Semantic Web for the Working Ontologist: Effective Modeling in RDFS and OWL. Morgan Kaufmann, May 2008.
[6]
R. Angles and C. Gutierrez. The expressive power of sparql. pages 114--129. 2008.
[7]
J. Broekstra, A. Kampman, and F. van Harmelen. Sesame: A generic architecture for storing and querying rdf and rdf schema, 2002.
[8]
L. Cabral, J. Domingue, E. Motta, T. Payne, and F. Hakimpour. Approaches to semantic web services: an overview and comparisons. pages 225--239. 2004.
[9]
M. Cannataro, P. Guzzi, and P. Veltri. Using ontologies for annotating and retrieving protein-protein interactions data. pages 1--5, aug. 2009.
[10]
P. Erdos and A. Renyi. On the evolution of random graphs. Publ. Math. Inst. Hung. Acad. Sci., 5:17--61, 1960.
[11]
M. A. Harris, J. Clark, A. Ireland, J. Lomax, M. Ashburner, R. Foulger, K. Eilbeck, S. Lewis, B. Marshall, C. Mungall, J. Richter, G. M. Rubin, J. A. Blake, C. Bult, M. Dolan, H. Drabkin, J. T. Eppig, D. P. Hill, L. Ni, M. Ringwald, R. Balakrishnan, J. M. Cherry, K. R. Christie, M. C. Costanzo, S. S. Dwight, S. Engel, D. G. Fisk, J. E. Hirschman, E. L. Hong, R. S. Nash, A. Sethuraman, C. L. Theesfeld, D. Botstein, K. Dolinski, B. Feierbach, T. Berardini, S. Mundodi, S. Y. Rhee, R. Apweiler, D. Barrell, E. Camon, E. Dimmer, V. Lee, R. Chisholm, P. Gaudet, W. Kibbe, R. Kishore, E. M. Schwarz, P. Sternberg, M. Gwinn, L. Hannick, J. Wortman, M. Berriman, V. Wood, P. Tonellato, P. Jaiswal, T. Seigfried, and R. White. The gene ontology (go) database and informatics resource. Nucleic Acids Res Nucleic Acids Res, 32(Database issue):258--61, January 2004.
[12]
Y. Ho, A. Gruhler, A. Heilbut, G. D. Bader, L. Moore, S.-L. Adams, A. Millar, P. Taylor, K. Bennett, K. Boutilier, L. Yang, C. Wolting, I. Donaldson, S. Schandorff, J. Shewnarane, M. Vo, J. Taggart, M. Goudreault, B. Muskat, C. Alfarano, D. Dewar, Z. Lin, K. Michalickova, A. R. Willems, H. Sassi, P. A. Nielsen, K. J. Rasmussen, J. R. Andersen, L. E. Johansen, L. H. Hansen, H. Jespersen, A. Podtelejnikov, E. Nielsen, J. Crawford, V. Poulsen, B. D. Sorensen, J. Matthiesen, R. C. Hendrickson, F. Gleeson, T. Pawson, M. F. Moran, D. Durocher, M. Mann, C. W. V. Hogue, D. Figeys, and M. Tyers. Systematic identification of protein complexes in saccharomyces cerevisiae by mass spectrometry. Nature, 415:180--183, 2002.
[13]
H. W. Mewes, D. Frishman, U. Gïeldener, G. Mannhaupt, K. Mayer, M. Mokrejs, B. Morgenstern, M. Mïensterkïetter, S. Rudd, and B. Weil. Mips: a database for genomes and protein sequences. Nucleic Acids Res, 30(1):31--34, January 2002.
[14]
M. Paolucci, T. Kawamura, T. Payne, and K. Sycara. Semantic matching of web services capabilities. pages 333--347. 2002.
[15]
S. Powers. Practical RDF. O'Reilly Media, Inc., 1st edition, August 2003.
[16]
L. Salwinski, C. S. Miller, A. J. Smith, F. K. Pettit, J. U. Bowie, and D. Eisenberg. The Database of Interacting Proteins: 2004 update. Nucl. Acids Res., 32(suppl1):D449--451, 2004.
[17]
J. I. Svihla M. Benchmarking rdf production tools. In: Proceedings of 18th International Conference on Database and Expert Systems Applications - DEXA. Heidelberg: Springer, 2007, (ISBN 978-3-540-74467-2.):700--710.
[18]
Y. Theoharis, V. Christophides, and G. Karvounarakis. Benchmarking database representations of rdf/s stores. pages 685--701. 2005.
[19]
P. Uetz, L. Giot, G. Cagney, T. Mansfield, R. Judson, J. Knight, D. Lockshon, V. Narayan, M. Srinivasan, and P. e. a. Pochart. A comprehensive analysis of protein-protein interactions in saccharomyces cerevisiae. Nature, 403:623--627, 2000.
[20]
D. B. West. Introduction to Graph Theory (2nd Edition). Prentice Hall, NY, August 2000.

Cited By

View all
  • (2015)Computational Methods for Modeling Biological Interaction NetworksPattern Recognition in Computational Molecular Biology10.1002/9781119078845.ch26(505-524)Online publication date: 18-Dec-2015
  • (2013)Evaluation of Protein-Protein Interaction Management SystemsProceedings of the 2013 24th International Workshop on Database and Expert Systems Applications10.1109/DEXA.2013.39(100-104)Online publication date: 26-Aug-2013

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
BCB '10: Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology
August 2010
705 pages
ISBN:9781450304382
DOI:10.1145/1854776
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 August 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. ontologies
  2. protein-protein interaction
  3. resource description framework
  4. terms

Qualifiers

  • Research-article

Conference

BCB'10
Sponsor:

Acceptance Rates

Overall Acceptance Rate 254 of 885 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)1
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2015)Computational Methods for Modeling Biological Interaction NetworksPattern Recognition in Computational Molecular Biology10.1002/9781119078845.ch26(505-524)Online publication date: 18-Dec-2015
  • (2013)Evaluation of Protein-Protein Interaction Management SystemsProceedings of the 2013 24th International Workshop on Database and Expert Systems Applications10.1109/DEXA.2013.39(100-104)Online publication date: 26-Aug-2013

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media