Skip to main content
Log in

Structure-based identification and clustering of protein families and superfamilies

  • Research Papers
  • Published:
Journal of Computer-Aided Molecular Design Aims and scope Submit manuscript

Summary

We describe an approach to protein structure comparison designed to detect distantly related proteins of similar fold, where the procedure must be sufficiently flexible to take into account the elasticity of protein folds without losing specificity. Protein structures are represented as a series of secondary structure elements, where for each element a local environment describes its relations with the elements that surround it. Secondary structures are then aligned by comparing their features and local environments. The procedure is illustrated with searches of a database of 468 protein structures in order to identify proteins of similar topology to porcine pepsin, porphobilinogen deaminase and serum amyloid P-component. In all cases the searches correctly identify protein structures of similar fold as the search proteins. Multiple cross-comparisons of protein structures allow the clustering of proteins of similar fold. This is exemplified with a clustering of α/β- and β-class protein structures. We discuss applications of the comparison and clustering of three-dimensional protein structures to comparative modelling and structure-based protein design.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. BrowneW.J., NorthA.C.T., PhillipsD.C., BrewK., VanamanT.C. and HillR.L., J. Mol. Biol., 42 (1969) 65.

    Google Scholar 

  2. GreerJ., J. Mol. Biol., 153 (1981) 1027.

    Google Scholar 

  3. BlundellT.L., SibandaB.L., SternbergM.J.E. and ThorntonJ.M., Nature, 326 (1987) 347.

    Google Scholar 

  4. SutcliffeM.J., HaneefI., CarneyD. and BlundellT.L., Protein Eng., 1 (1987) 377.

    Google Scholar 

  5. SutcliffeM.J., HaneefI. and BlundellT.L., Protein Eng., 1 (1987) 385.

    Google Scholar 

  6. JohnsonM.S., OveringtonJ.P. and BlundellT.L., J. Mol. Biol., 231 (1993) 735.

    Google Scholar 

  7. OveringtonJ.P., JohnsonM.S., ŠaliA. and BlundellT.L., Proc. R. Soc. London, Ser. B, 241 (1990) 132.

    Google Scholar 

  8. OveringtonJ.P., DonnellyD., JohnsonM.S., ŠaliA. and BlundellT.L., Protein Sci., 1 (1992) 216.

    Google Scholar 

  9. OveringtonJ.P., ZhuZ.-Y., ŠaliA., JohnsonM.S., SowdhaminiR., LouieG.V. and BlundellT.L., Biochem. Soc. Trans., 21 (1993) 597.

    Google Scholar 

  10. McLachlanA.D., J. Mol. Biol., 128 (1979) 49.

    Google Scholar 

  11. MatthewsB.W. and RossmannM.G., Methods Enzymol., 115 (1985) 397.

    Google Scholar 

  12. RossmannM.G. and ArgosP., J. Mol. Biol., 105 (1976) 75.

    Google Scholar 

  13. RossmannM.G. and ArgosP., J. Mol. Biol., 109 (1977) 99.

    Google Scholar 

  14. RemingtonS.J. and MatthewsB.W., Proc. Natl. Acad. Sci. USA, 75 (1978) 2180.

    Google Scholar 

  15. RemingtonS.J. and MatthewsB.W., J. Mol. Biol., 140 (1980) 77.

    Google Scholar 

  16. NeedlemanS.B. and WunschC.D., J. Mol. Biol., 48 (1970) 443.

    Google Scholar 

  17. SmithT.F. and WatermanM.S., J. Mol. Biol., 147 (1981) 195.

    Google Scholar 

  18. ArgosP., VingronM. and VogtG., Protein Eng., 4 (1991) 375.

    Google Scholar 

  19. ŠaliA. and BlundellT.L., J. Mol. Biol., 212 (1990) 403.

    Google Scholar 

  20. ZhuZ.-Y., ŠaliA. and BlundellT.L., Protein Eng., 5 (1992) 43.

    Google Scholar 

  21. TaylorW.R. and OrengoC.A., Protein Eng., 2 (1989) 505.

    Google Scholar 

  22. TaylorW.R. and OrengoC.A., J. Mol. Biol., 208 (1989) 1.

    Google Scholar 

  23. OrengoC.A. and TaylorW.R., J. Theor. Biol., 147 (1990) 517.

    Google Scholar 

  24. LeskA.M. and ChothiaC., J. Mol. Biol., 136 (1980) 225.

    Google Scholar 

  25. TramontanoA., ChothiaC. and LeskA.M., Protein Struct. Funct. Genet., 6 (1989) 382.

    Google Scholar 

  26. ChothiaC., LevittM. and RichardsonD., J. Mol. Biol., 105 (1977) 1.

    Google Scholar 

  27. SubbiahS., LaurentsD.V. and LevittM., Curr. Biol., 3 (1993) 1441.

    Google Scholar 

  28. VriendG. and SanderC., Protein Struct. Funct. Genet., 11 (1991) 52.

    Google Scholar 

  29. HolmL., OuzounisC., SanderC., TuparevG. and VriendG., Protein Sci., 1 (1992) 1691.

    Google Scholar 

  30. YeeD.P. and DillK.A., Protein Sci., 2 (1993) 884.

    Google Scholar 

  31. HolmL. and SanderC., J. Mol. Biol., 233 (1993) 123.

    Google Scholar 

  32. HolmL. and SanderC., FEBS Lett., 315 (1993) 301.

    Google Scholar 

  33. HolmL. and SanderC., Nature, 361 (1993) 309.

    Google Scholar 

  34. LeskA.M. and ChothiaC., J. Mol. Biol., 160 (1982) 325.

    Google Scholar 

  35. ChothiaC. and LeskA.M., J. Mol. Biol., 160 (1982) 309.

    Google Scholar 

  36. MurthyM.R.N., FEBS Lett., 168 (1984) 97.

    Google Scholar 

  37. RichardsF.M. and KundrotC.E., Protein Struct. Funct. Genet., 3 (1988) 71.

    Google Scholar 

  38. MitchellE.M., ArtymiukP.J., RiceD.W. and WillettP., J. Mol. Biol., 212 (1989) 151.

    Google Scholar 

  39. ArtymiukP.J., RiceD.W., MitchellE.M. and WillettP., Protein Eng., 4 (1989) 39.

    Google Scholar 

  40. ArtymiukP.J., GrindleyH.M., ParkJ.E., RiceD.W. and WillettP., FEBS Lett., 303 (1992) 48.

    Google Scholar 

  41. GrindleyH.M., ArtymiukP.J., RiceD.W. and WillettP., J. Mol. Biol., 229 (1993) 707.

    Google Scholar 

  42. OrengoC.A., BrownN.P. and TaylorW.R., Protein Struct. Funct. Genet., 14 (1992) 139.

    Google Scholar 

  43. OrengoC.A., FloresT.P., JonesD.T., TaylorW.R. and ThorntonJ.M., Curr. Biol., 3 (1993) 131.

    Google Scholar 

  44. OrengoC.A., FloresT.P., TaylorW.R. and ThorntonJ.M., Protein Eng., 6 (1993) 485.

    Google Scholar 

  45. KochI., KadenF. and SelbigJ., Protein Struct. Funct. Genet., 12 (1992) 314.

    Google Scholar 

  46. JohnsonM.S., SutcliffeM.J. and BlundellT.L., J. Mol. Evol., 30 (1990) 43.

    Google Scholar 

  47. JohnsonM.S., ŠaliA. and BlundellT.L., Methods Enzymol., 183 (1990) 670.

    Google Scholar 

  48. KabschW. and SanderC., Biopolymers, 22 (1983) 2577.

    Google Scholar 

  49. Smith, D.K. and Thornton, J.M., unpublished results.

  50. ChouK.-C., NemethyG. and ScheregaH.A., J. Am. Chem. Soc., 106 (1984) 3161.

    Google Scholar 

  51. SowdhaminiR., SrinivasanN., RamakrishnanC. and BalaramP., J. Mol. Biol., 223 (1992) 845.

    Google Scholar 

  52. OobatakeM. and OoiT., J. Theor. Biol., 67 (1977) 567.

    Google Scholar 

  53. BronC. and KerboschJ., Commun. Assoc. Comput. Machinery, 16 (1973) 575.

    Google Scholar 

  54. FredmanM.L., Bull. Math. Biol., 46 (1984) 553.

    Google Scholar 

  55. FelsensteinJ., Evolution, 39 (1985) 783.

    Google Scholar 

  56. Zhu, Z.-Y., unpublished results.

  57. FitchW.M. and MargoliashE., Science, 155 (1967) 279.

    Google Scholar 

  58. AndreevaN.S., FedorovA.A., GustchinaA.E., SchutzkeverN.E. and SafroM.G., Mol. Biol. (Moscow), 12 (1978) 704.

    Google Scholar 

  59. CooperJ.B., KhanG., TaylorG., TickleI.J. and BlundellT.L., J. Mol. Biol., 214 (1990) 199.

    Google Scholar 

  60. Abad-ZapateroC., RydelT.J. and EricksonJ., Protein Struct. Funct. Genet., 8 (1990) 62.

    Google Scholar 

  61. SieleckiA.R., HayakawaK., FujinagaM., MurphyM.E.P., FraserM., MuirA.K., CarilliC.T., LewickiJ.A., BaxterJ.D. and JamesM.N.G., Science, 243 (1989) 1346.

    Google Scholar 

  62. DhanarajV., DealwisC.G., FrazaoC., BadassoM., SibandaB.L., TickleI.J., CooperJ.B., DriessenH.P.C., NewmanM., AguilarC., WoodS.P., BlundellT.L., HobartP.M., GeogheganK.F., AmmiratiM.J., DanleyD.E., O'ConnorB.A. and HooverD.J., Nature, 357 (1992) 466.

    Google Scholar 

  63. GillilandG.L., WinborneE.L., NachmanJ. and WlodawerA., Protein Struct. Funct. Genet., 8 (1990) 82.

    Google Scholar 

  64. NewmanM., SafroM., FrazaoC., KhanG., ZdanovA., TickleI.J., BlundellT.L. and AndreevaN., J. Mol. Biol., 221 (1991) 1295.

    Google Scholar 

  65. NewmanM., WatsonF., RoychowdhuryP., JonesH., BadassoM., CleasbyA., WoodS.P., TickleI.J. and BlundellT.L., J. Mol. Biol., 230 (1993) 260.

    Google Scholar 

  66. BlundellT.L., JenkinsJ.A., SewellB.T., PearlL.H., CooperJ.B., TickleI.J., VeerapandianB. and WoodS.P., J. Mol. Biol., 211 (1990) 919.

    Google Scholar 

  67. JamesM.N.G. and SieleckiA.R., In JurnakF. and McPhersonA. (Eds.) Biological Macromolecules and Assemblies, Wiley, New York, NY, 1983, pp. 43–60.

    Google Scholar 

  68. SugunaK., PadlanE.A., SmithC.W., CarlsonW.D. and DaviesD.R., Proc. Natl. Acad. Sci. USA, 34 (1987) 7009.

    Google Scholar 

  69. Aguilar, C., Badasso, M., Cooper, J.B., Wood, S.P. and Blundell, T.L., in preparation.

  70. FitzgeraldP.M.D., McKeeverB.M., VanMiddlesworthJ.F., SpringerJ.P., HeimbachJ.C., LeuC.-T., HerberW.K., DixonR.A.F. and DarkeP.L., J. Biol. Chem., 265 (1990) 14209.

    Google Scholar 

  71. PearlL.H. and TaylorW.R., Nature, 329 (1987) 351.

    Google Scholar 

  72. MillerM., JaskolskiM., RaoJ.K.M., LeisJ. and wlodawerA., Nature, 337 (1989) 576.

    Google Scholar 

  73. JaskolskiM., MillerM., RaoJ.K.M., LeisJ. and WlodawerA., Biochemistry, 29 (1990) 5889.

    Google Scholar 

  74. NaviaM.A., FitzgeraldP.M.D., McKeeverB.M., LeuC.-T., HimbachJ.C., HerberW.K., SigalI.S., DarkeP.L. and SpringerJ.P., Nature, 337 (1989) 615.

    Google Scholar 

  75. WlodawerA., MillerM., JaskolskiM., SathyanaranaB.K., BaldwinE., WeberI.T., SelkL.M., ClawsonL., SchneiderJ. and KentS.B.H., Science, 245 (1989) 616.

    Google Scholar 

  76. LapattoR., BlundellT.L., HemmingsA., OveringtonJ., WilderspinA., WoodS., MersonJ.R., WhittleP.J., DanleyD.E., GeogheganK.F., HawrylikS.J., LeesS.E., ScheldK.G. and HobartP.M., Nature, 342 (1989) 299.

    Google Scholar 

  77. OlendorfD.H., FoundlingS.I., WendoloskiJ.J., SedlacekJ., StropP. and SalemmeF.R., Protein Struct. Funct. Genet., 14 (1992) 382.

    Google Scholar 

  78. LouieG.V., BrownlieP.D., LambertR., CooperJ.B., BlundellT.L., WoodS.P., WarrenM.J., WoodcockS.C. and JordanP.M., Nature, 359 (1992) 33.

    Google Scholar 

  79. BakerE.N., RumballS.V. and AndersonB.F., Trends Biochem. Sci., 12 (1987) 350.

    Google Scholar 

  80. AndersonB.F., BakerH.M., NorrisG.E., RiceD.W. and BakerE.N., J. Mol. Biol., 209 (1989) 711.

    Google Scholar 

  81. SarraR., GarrattR., GorinskyB., JhotiH. and LindleyP., Acta Crystallogr., B46 (1991) 763.

    Google Scholar 

  82. SpurlinoJ., LuG.-Y. and QuiochoF.A., J. Biol. Chem., 266 (1991) 5202.

    Google Scholar 

  83. SackJ.S., TrakhanovS.D., TsigannikI.H. and QuiochoF.A., J. Mol. Biol., 206 (1989) 193.

    Google Scholar 

  84. SackJ.S., SaperM.A. and QuiochoF.A., J. Mol. Biol., 206 (1989) 171.

    Google Scholar 

  85. QuiochoF.A. and VyasN.K., Nature, 310 (1984) 381.

    Google Scholar 

  86. VyasN.K., VyasM.N. and QuiochoF.A., Science, 242 (1988) 1290.

    Google Scholar 

  87. MowbrayS.L. and ColeL.B., J. Mol. Biol., 225 (1992) 155.

    Google Scholar 

  88. Emsley, J., White, H.E., O'Hara, B.P., Oliva, G., Srinivasan, N., Tickle, I.J., Blundell, T.L., Pepys, M.B. and Wood, S.P., Nature, (1994) in press.

  89. EinsparH., ParksE.H., SugunaK., SubramanianE. and SuddathF.L., J. Biol. Chem., 261 (1986) 16518.

    Google Scholar 

  90. HardmanK.D. and AinsworthC.F., Biochemistry, 11 (1972) 4910.

    Google Scholar 

  91. KeitelT., SimonO., BorrissR. and HeinemannU., Proc. Natl. Acad. Sci. USA, 90 (1993) 5287.

    Google Scholar 

  92. Srinivasan, N., White, H.E. and Blundell, T.L., in preparation.

  93. MeyerE., ColeG., RadhakrishnanR. and EppO., Acta Crystallogr., B44 (1988) 26.

    Google Scholar 

  94. MoultJ., SussmanF. and JamesM.N.G., J. Mol. Biol., 182 (1985) 555.

    Google Scholar 

  95. VanRoeyP. and BeermanT.A., Proc. Natl. Acad. Sci. USA, 86 (1989) 6587.

    Google Scholar 

  96. PletnevV.Z., KuzinA.P. and MalininaL.V., Bioorg. Khim., 8 (1982) 1637.

    Google Scholar 

  97. SuhS.W., BathM.A., NaivaG.H., CohenG.H., RaoD.N., RudikoffS. and DaviesD.R., Protein Struct. Funct. Genet., 1 (1986) 74.

    Google Scholar 

  98. SkarzynskiT., MoodyP.C.E. and WonacottA.J., J. Mol. Biol., 193 (1987) 171.

    Google Scholar 

  99. HallM.D., LevittD.G. and BanaszakL.J., J. Mol. Biol., 226 (1992) 867.

    Google Scholar 

  100. VolzK. and MatsumuraP., J. Biol. Chem., 266 (1991) 15511.

    Google Scholar 

  101. StockA., MottonenJ.M., StockJ. and SchuttC.E., Nature, 344 (1989) 745.

    Google Scholar 

  102. PaiE.F., KrengelU., PetskoG.A., GoodyR.S., KabschW. and WittinghoferA., EMBO J., 9 (1990) 2351.

    Google Scholar 

  103. laCourT.F.M., NyborgJ., ThirupS. and ClarkB.F.C., EMBO J., 4 (1985) 2385.

    Google Scholar 

  104. SmithW.W., BurnetR.M., DarlingG.D. and LudwigM.L., J. Mol. Biol., 117 (1977) 195.

    Google Scholar 

  105. ShirakiaharaY. and EvansP.R., J. Mol. Biol., 204 (1988) 973.

    Google Scholar 

  106. EvansP.R., FarrantsG.W. and HudsonP.J., Phil. Trans. R. Soc. London, Ser. B., 53 (1981) 53.

    Google Scholar 

  107. StehleT. and SchulzG.E., J. Mol. Biol., 224 (1992) 1127.

    Google Scholar 

  108. BohnJ.T., FilmanD.J., MatthewsD.A., HamlinR.C. and KrautJ., J. Biol. Chem., 257 (1982) 13560.

    Google Scholar 

  109. SrinivasanN. and BlundellT.L., Protein Eng., 6 (1993) 501.

    Google Scholar 

  110. ŠaliA. and BlundellT.L., J. Mol. Biol., 234 (1993) 779.

    Google Scholar 

  111. ŠaliA., OveringtonJ.P., JohnsonM.S. and BlundellT.L., Trends Biochem. Sci., 15 (1990) 235.

    Google Scholar 

  112. JonesD.T., TaylorW.R. and ThorntonJ.M., Nature, 358 (1992) 86.

    Google Scholar 

  113. BowieJ.U., LüthyR. and EisenbergD., Science, 253 (1991) 164.

    Google Scholar 

  114. Sowdhamini, R. and Rufino, S.D., in preparation.

  115. EvansS.V., J. Mol. Graphics, 11 (1993) 134.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Rufino, S.D., Blundell, T.L. Structure-based identification and clustering of protein families and superfamilies. J Computer-Aided Mol Des 8, 5–27 (1994). https://doi.org/10.1007/BF00124346

Download citation

  • Received:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF00124346

Key words

Navigation