Abstract
Several of the approaches to data interoperation identified by Karp have been implemented for biological databases. We extend Karp’s approach to interoperation to cover not only protein databases but also knowledge bases and other information sources. This paper outlines an algebra for composing protein data sources, based on our existing work on the Protein Ontology (PO). We consider the case of establishing correspondences between various protein data sources using semantic relationships over the conceptual framework of PO. We provide a specific set of relationships over the PO framework that captures the data semantics needed to integrate information from diverse protein data sources. These relationships help define a semantic query algebra for PO that supports efficient reasoning over, and querying of, the instance store.
An erratum to this chapter can be found at http://dx.doi.org/10.1007/11915072_109.
References
Altman, R.B., Bada, M., Chai, X.J., Carillo, M.W., Chen, R.O., Abernethy, N.F.: RiboWeb: An Ontology-Based System for Collaborative Molecular Biology. IEEE Intelligent Systems 14, 68–76 (1999)
Ashburner, M., Ball, C.A., Blake, J.A., Butler, H., Cherry, J.M., Corradi, J., Dolinski, K.: Creating the Gene Ontology Resource: Design and Implementation. Genome Research 11, 1425–1433 (2001)
Bairoch, A., Apweiler, R.: The SWISS-PROT protein sequence data bank and its supplement TrEMBL. Nucleic Acids Research 25, 31–36 (1997)
Bernstein, F.C., Koetzle, T.F., Williams, G.J., Meyer, E.F., Brice, M.D., Rodgers, J.R., Kennard, O., Shimanouchi, T., Tasumi, M.: The Protein Data Bank: a computer-based archival file for macromolecular structures. Journal of Molecular Biology 112, 535–542 (1977)
Boeckmann, B., Bairoch, A., Apweiler, R., Blatter, M., Estreicher, A., Gasteiger, E., Martin, M.J., Michoud, K., Donovan, C., Phan, I., Pilbout, S., Schneider, M.: The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Research 31, 365–370 (2003)
Garavelli, J.S.: The RESID Database of Protein Modifications: 2003 developments. Nucleic Acids Research 31, 499–501 (2003)
Gyssens, M., Paredaens, J., Van Gucht, D.: A graph-oriented object database model. In: 9th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems. ACM Press, Nashville (1990)
Karp, P.D.: A strategy for database interoperation. Journal of Computational Biology 2, 573–583 (1996)
Lewis, S.E.: Gene Ontology: looking backwards and forwards. Genome Biology 6, 103.1–103.4 (2004)
Mani, I., Hu, Z., Hu, W.: PRONTO: A Large-scale Machine-induced Protein Ontology. In: 2nd Standards and Ontologies for Functional Genomics Conference (SOFG 2004), UK (2004)
McKusick, V.A.: Mendelian Inheritance in Man. In: A Catalog of Human Genes and Genetic Disorders. Johns Hopkins University Press, Baltimore (1998)
Melnik, S.: Declarative mediation in distributed systems. In: 19th International Conference on Conceptual Modeling (ER 2000), Salt Lake City, Utah. Springer, Heidelberg (2000)
Murzin, A.G., Brenner, S.E., Hubbard, T., Chothia, C.: SCOP: A Structural Classification of Proteins Database for the Investigation of Sequences and Structures. Journal of Molecular Biology 247, 536–540 (1995)
Rajugan, R.: A Layered View Model for XML with Conceptual and Logical Extension, and its Applications, Faculty of Information Technology, University of Technology, Sydney (UTS), Australia, Sydney, PhD thesis, p. 460 (2006)
Sidhu, A.S., Dillon, T.S., Chang, E.: Ontological Foundation for Protein Data Models. In: 1st IFIP WG 2.12 & WG 12.4 International Workshop on Web Semantics (SWWS 2005), In conjunction with On The Move Federated Conferences (OTM 2005), Agia Napa, Cyprus. Springer, Heidelberg (2005a)
Sidhu, A.S., Dillon, T.S., Chang, E.: An Ontology for Protein Data Models. In: 27th Annual International Conference of the IEEE Engineering in Medicine and Biology Society 2005 (IEEE EMBC 2005), Shanghai, China. IEEE Press, Los Alamitos (2005b)
Sidhu, A.S., Dillon, T.S., Chang, E.: Advances in Protein Ontology Project. In: 19th IEEE International Symposium on Computer-Based Medical Systems (CBMS 2006), Salt Lake City, Utah. IEEE CS Press, Los Alamitos (2006a)
Sidhu, A.S., Dillon, T.S., Chang, E.: Integration of Protein Data Sources Through PO. In: Bressan, S., Küng, J., Wagner, R. (eds.) DEXA 2006. LNCS, vol. 4080, pp. 519–527. Springer, Heidelberg (2006b)
Sidhu, A.S., Dillon, T.S., Chang, E.: Protein Ontology: Data Integration using Protein Ontology. In: Ma, Z., Chen, J.Y. (eds.) Database Modeling in Biology: Practices and Challenges. Springer, New York (2006c)
Sidhu, A.S., Dillon, T.S., Sidhu, B.S., Setiawan, H.: A Unified Representation of Protein Structure Databases. In: Reddy, M.S., Khanna, S. (eds.) Biotechnological Approaches for Sustainable Development. Allied Publishers, India (2004)
Weissig, H., Bourne, P.E.: Protein structure resources. Acta Crystallographica Section D: Biological Crystallography 58, 908–915 (2002)
Westbrook, J., Feng, Z., Jain, S., Bhat, T.N., Thanki, N., Ravichandran, V., Gilliland, G.L., Bluhm, W.F., Weissig, H., Greer, D.S., Bourne, P.E., Berman, H.M.: The Protein Data Bank: unifying the archive. Nucleic Acids Research 30, 245–248 (2002)
Westbrook, J., Ito, N., Nakamura, H., Henrick, K., Berman, H.M.: PDBML: the representation of archival macromolecular structure data in XML. Bioinformatics 21, 988–992 (2005)
Wouters, C., Rajugan, R., Dillon, T.S., Rahayu, J.W.: Ontology Extraction Using Views for Semantic Web. In: Taniar, D., Rahayu, W. (eds.) Web Semantics and Ontology, pp. 1–40. Idea Group Publishing, USA (2006)
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sidhu, A.S., Dillon, T.S., Chang, E. (2006). Towards Semantic Interoperability of Protein Data Sources. In: Meersman, R., Tari, Z., Herrero, P. (eds) On the Move to Meaningful Internet Systems 2006: OTM 2006 Workshops. OTM 2006. Lecture Notes in Computer Science, vol 4278. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11915072_90
DOI: https://doi.org/10.1007/11915072_90
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-48273-4
Online ISBN: 978-3-540-48276-5
eBook Packages: Computer Science, Computer Science (R0)