Abstract
The post Human Genome Project era calls for reliable, integrated, flexible, and convenient data management techniques to facilitate research activities. Querying biological data that is large in volume and complex in structure such as 3D proteins requires expressive models to explicitly support and capture the semantics of the complex data. Protein 3D structure search and comparison not only enable us to predict unknown structures, but can also reveal distant evolutionary relationships that are otherwise undetectable, and perhaps suggest unsuspected functional properties. In this work, we model 3D protein structures by adding spatial semantics and constructs to represent the contributing forces such as hydrogen bonds and high-level structures such as protein secondary structures. This paper makes a contribution to modeling the specialty of life science data and develops methods to meet the novel challenges posed by such data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J.: Basic Local Alignment Search Tool. Journal of Molecular Biology 215, 403–410 (1990)
Berman, H., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I., Bourne, P.: The Protein Data Bank. Nucleic Acids Research 28(1), 235–242 (2000)
Bhat, T.N., Bourne, P., Feng, Z., Gilliland, G., Jain, S., Ravichandran, V., Schneider, B., Schneider, K., Thanki, N., Weissig, H., Westbrook, J., Berman, H., The, P.D.B.: data uniformity project. Nucleic Acids Research 29(1), 214–218 (2001)
Bourne, P., Berman, H., Mcmahob, B., Watenpaugh, K., Weissig, H., Fitzgerald, P.: The macromolecular CIF dictionary. Meth, Enzymol, 227, 571–590 (1997)
Bourne, P.E., Addess, K., Bluhm, W., Chen, L., Deshpande, N., Feng, Z., Fleri, W., Green, R., Merino-Ott, J., Townsend-Merino, W., Weissig, H., Westbrook, J., Berman, H.: The distribution and query systems of the RCSB Protein Data Bank. Nucleic Acids Research 32(Database Issue), D223–D225 (2004)
Branden, C., Tooze, J.: Introduction to Protein Structure. Garland Publishing, New York (1999)
Buttler, D., Coleman, M., Critchlow, T., Fileto, R., Han, W., Liu, L., Pu, C., Rocco, D., Xiong, L.: Querying Multiple Bioinformatics Information Sources: Can Semantic Web Research Help? SIGMOD Record 31(4) (2002)
Chen, J., Anderson, J.B., De Weese-Scott, C., Fedorova, N.D., Geer, L.Y., He, S., Hurwitz, D.I.: MMDB: Entrez’s 3D-structure database. Nucleic Acids Research 31(1), 474–477 (2003)
Chen, J.Y., Carlis, J.V.: Similar_Join: Extending DBMS with a Bio-specific Operator. In: Proceedings of the 2003 ACM Symposium on Applied Computing, Melbourne, FL, USA, pp. 109–114 (2003)
Chen, P.P.-S.: The Entity-Relationship Model-Toward a Unified View of Data. ACM Transactions on Database Systems 1(1), 9–36 (1976)
Epstein, C.J., Goldberger, R.F., Anfinsen, C.B.: Cold Spring Harbor Symp. Speech. Quant. Biol. 28, 439 (1963)
Gerstein, M.: Integrative database analysis in structural genomics. Nature Structural Biology, Structural genomics supplement (2000)
Gibrat, J.-F., Madej, T., Bryant, S.H.: Surprising similarities in structure comparison. Current Opinion in Structural Biology 6, 377–385 (1996)
Greer, D., Westbook, J., Bourne, P.: An ontology driven architecture for derived representations of macromolecular structure. Bioinformatics 18(9), 1280–1281 (2002)
Holm, L., Sander, C.: 3-D Lookup: Fast Protein Structur Database Searches at 90% Reliability. In: Third International Conference on Intelligent Systems for Molecular Biology, Robinson College, Cambridge, England, pp. 179–187. AAAI Press, Menlo Park (1995)
Kabsch, W., Sander, C.: Dictionary of Protein Secondary Structure: Pattern Recognition of Hydrogen-Bonded and Geometrical Features. Biopolymers 22, 2577–2683 (1983)
Leach, A., Gillet, V.: An Introduction to Chemoinformatics. Kluwer Academic Publishers, Dordrecht (2003)
Madej, T., Gibrat, J.-F., Bryant, S.H.: Threading a Databse of Protein Cores. Proteins: Structure, Function, and Genetics 23, 356–369 (1995)
Mizuguchi, K., Go, N.: Comparison of spatial arrangements of secondary structural elements in proteins. Protein Engineering 8(4), 353–362 (1995)
Murthy, M.R.N.: A fast methods of comparing protein structures. FEBS 168(1), 97–102 (1984)
Stone, J., Wu, X., Greenblatt, M.: A Semantic Network for Modeling Biological Knowledge in Multiple Databases, University of Vermont Computer Science Technical Report (2003)
Westbrook, J., Feng, Z., Chen, L., Yang, H., Berman, H.: The Protein Data Bank and Structural genomics. Nucleic Acids Research 31(1), 489–491 (2003)
Westbrook, J., Feng, Z., Jain, S., Bhat, T.N., Thanki, N., Ravichandran, V., Gilliland, G., Bluhm, W., Weissig, H., Greer, D., Bourne, P., Berman, H.: The Protein Data Bank: unifying the archive. Nucleic Acids Research 30(1), 245–248 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ram, S., Wei, W. (2004). Modeling the Semantics of 3D Protein Structures. In: Atzeni, P., Chu, W., Lu, H., Zhou, S., Ling, TW. (eds) Conceptual Modeling – ER 2004. ER 2004. Lecture Notes in Computer Science, vol 3288. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30464-7_52
Download citation
DOI: https://doi.org/10.1007/978-3-540-30464-7_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23723-5
Online ISBN: 978-3-540-30464-7
eBook Packages: Springer Book Archive