Abstract
We have implemented exhaustive genetic sequence alignment codes on a variety of high performance computers. In this article, we compare and contrast the implementation issues encountered on different high performance computer architecture and the approaches used to overcome these problems. In addition, we discuss advanced sequence alignment techniques, including context sensitive and multiple sequence alignments.
Similar content being viewed by others
References
Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J. (1990). J. Mol. Biol. 215, 403–410.
Brutlag, D.L., Dautricourt, J.P., Diaz, R., Fier, J., Moxon, B. and Stamm, R. (1993). Computers and Chemistry 17, 203–207.
Collins, J., and Reddaway, S. (1990). “High Efficiency Sequence Database Searching: Using the Distributed Array Processor” in Computers and DNA Bell, G, and Marr T. Eds, Addison-Wesley, pp 85–92.
Dayhoff, M.O., Schwartz, R.M., Orcutt, B.C. (1978). In “Atlas of Protein Sequence and Structure” 5(3) M.O. Dayhoff (ed.), 345–352.
Deshpande, A.S., Richards, D.S., and Pearson, W.R. (1991). Comput. Applic. Biosci. 7, 237–247.
George, D.G., Barker, W.C. and Hunt, L.T. (1990). Methods in Enzymology 183, 333–351.
Gotoh, O., and Tagashira, Y. (1993). Nucleic Acids Res., 14, 57–64.
Gupta, S.K., Kececioglu, J.D., and Schaeffer, A.A. (1995). J. Computational Biol. 2, 459–472.
Henikoff S. and Henikoff, J. (1992). Proc. Natl. Acad. Sci. USA 89, 10915–10919.
Hirschberg, D.S. (1975). Communications of the ACM 18, 341–343.
Huang, X, Hardison, R., and Miller, W. (1990). Comput. Applic. Biosci. 6, 373–381.
Jones, R. (1990). Thinking machines Corporation Technical Report CB90-3.
Jones, R. Taylor, W, Zhang, X, Mesirov, J., and Lander, E. (1990). “Protein Sequence Comparison on the Connection Machine CM-2” in Computers and DNA Bell, G, and Marr T. Eds, Addison-Wesley, pp 99–107.
Lander, E.S., Mesirov, J.P., and Taylor, W.IV, (1988). “Protein Sequence Comparison on a Data Parallel Computer” Proceedings of the International Conference on Parallel Processing, pp 257–268.
Lipman, D.J., Wilbur, W.J., Smith, T.F., and Waterman, M.S. (1984). Nuc Acids Res, 12, 215–226.
Lipman, D.J., Altschul, S.F., and Kececioglu, J.D. (1989). Proc. Natl. Acad. Sci. USA. 86, 4412–4415.
McClain, W.H., and Nicholas, H.B. Jr. (1987). Discriminating between transfer RNA molecurles. J. Mol. Biol. 194, 635–642.
Needleman, S.B. and Wunsch, C.D. (1970). J. Mol. Biol. 48, 443–453.
Nicholas, H.B. Jr., and McClain, W.H. (1987). An algorithm for discriminating sequences and its application to yeast transfer RNA. Comput. Applic. BioSci., 3, 177–181.
Nicholas, H., Giras, G., Hartonas-Garmhausen, V., Kopko, M., Maher, C., and Ropelewski, A. (1991). Distributing the comparison of dna and protein sequences across heterogeneous supercomputers. Supercomputing '91 Proceedings, pp 139–146.
Notredame, C. and Higgens, D. (1996). Nucleic Acids Res. 24, 1515–1524.
Pearson, W. (1990). Methods in Enzymology, 183, 63–98.
Peltola, H., Soderland, H., and Ukkonen, E. (1986). Nuc. Acid Res. 14, 99–107.
Ropelewski, A., Nicholas, H., and Fish, S. (1995). Distributed Sequence Analysis Virtual Environments and Distributed Computing at SC'95 GII Testbed and HPC Challenge Applications on the I-WAY, 72.
Smith, T.F., and Waterman, M.S. (1981). J. Mol. Biol. 147, 195–197.
Smith, T.F., Waterman, M.S., and Burks, C. (1985). Nuc. Acid Res. 13, 645–656.
Waterman, M.S. and Eggert, M. (1987). J. Mol. Biol. 197, 723–728.
Wimberly, F.C., Lambert, M.H., Nystrom, N.A., Ropelewski, A.J., and Young, W.S. (1996). Parallel Computing 22, 1073–1089.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Ropelewski, A., Nicholas, H. & Deerfield, D. Implementation of Genetic Sequence Alignment Programs on Supercomputers. The Journal of Supercomputing 11, 237–253 (1997). https://doi.org/10.1023/A:1007955824098
Issue Date:
DOI: https://doi.org/10.1023/A:1007955824098