Abstract
Finding regions of similarity between two data streams is a computational intensive and memory consuming problem, which refers as sequence alignment for biological sequences. Smith-Waterman algorithm is an optimal method of finding the local sequence alignment. It requires a large amount of computation and memory space, and is also constrained by the memory access speed of the Graphics Processing Units (GPUs) global memory when accelerating by using GPUs. Since biologists are commonly concerned with one or a few species in their research areas, SpecAlign is proposed to accelerate Smith-Waterman alignment of species-based protein sequences within the available GPU memory. It is designed to provide the best alignments of all the database sequences aligned on GPU. The new implementation improves performance by optimizing the organization of database, increasing GPU threads for every database sequence, and reducing the number of memory accesses to alleviate memory bandwidth bottleneck. Experimental results show that SpecAlign improves the performance by about 32 % on average when compared with CUDASW++2.0 and DOPA with Ssearch trace for 100 shortlisted sequences on NVIDIA GTX295. It also outperforms CUDASW++2.0 with Ssearch trace for 100 shortlisted sequences by about 52 % on NVIDIA GTX460.
Similar content being viewed by others
References
Ino, F., Munekawa, Y., Hagihara, K.: Sequence homology search using fine-grained cycle sharing of idle GPUs. IEEE Trans. Parallel Distrib Syst 23(4), 751–759 (2011)
Hasan, L., Kentie, M., Al-Ars, Z.: DOPA: GPU-based protein alignment using database and memory access optimizations. BMC Res. Notes 4(1), 261–272 (2011)
Hains, D., Cashero, Z., Ottenberg, M., Bohm, W., Rajopadhye, S.: Improving CUDASW++, a parallelization of Smith-Waterman for CUDA enabled devices. In: Proceedings of 10th IEEE International Workshop on High Performance Computational Biology (HICOMB’11), pp. 490–501. Shanghai, China (2011)
Sandes, E.F.d.O., Melo, A.C.M.A.d.: CUDAlign: Using GPU to accelerate the comparison of megabase genomic sequences. In: Proceedings of 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP’10), pp. 137–146. Bangalore, India (2010)
Yang, L., Huang, W., Johnson, J., Vaidya, S.: GPU accelerated Smith-Waterman. In: Proceedings of 6th International Conference on Computational Science (ICCS’06). Springer, Reading, UK (2006)
Khajeh-Saeed, A., Poole, S., Perot, J.B.: Acceleration of the Smith-Waterman algorithm using single and multiple graphics processors. J. Comput. Phys. 229(11), 4247–4258 (2010)
Sandes EFdO, Melo ACMAd, : Smith-Waterman alignment of huge sequences with GPU in linear space. In: Proceedings of IEEE International Parallel and Distributed Processing Symposium (IPDPS’11), pp. 1199–1211. Anchorage, AK (2011)
Blazewicz, J., Frohmberg, W., Kierzynka, M., Pesch, E., Wojciechowski, P.: Protein alignment algorithms with an efficient backtracking routine on multiple GPUs. BMC Bioinforma. 12(1), 181–197 (2011)
Lv, F., Nevatia, R.: Single view human action recognition using key pose matching and Viterbi path searching. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR ’07). Minneapolis, Minnesota, USA (2007)
Liu, W., Schmidt, B., Voss, G., Schroder, A., Muller-Wittig, W.: Bio-sequence satabase scanning on a GPU. In: Proceedings of IEEE International Parallel and Distributed Processing, Symposium (IPDPS’06) (2006)
Manavski, S.A., Valle, G.: CUDA compatible GPU cards as efficient hardware accelerators for Smith-Waterman sequence alignment. BMC Bioinforma. 9(suppl. 2), S10 (2008)
Liu, Y., Maskell, D.L., Schmidt, B.: CUDASW++: Optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units. BMC Res. Notes 2(1), 73–82 (2009)
Liu, Y., Schmidt, B., Maskell, D.L.: CUDASW++2.0: enhanced Smith-Waterman protein database search on CUDA-enabled GPUs based on SIMT and virtualized SIMD abstractions. BMC Res. Notes 3(1), 93 (2010)
Nawaz, Z., Nadeem, M., Someren, H.V., Bertels, K.: A parallel FPGA design of the Smith-Waterman traceback. In: Proceedings of 2010 International Conference on Field-Programmable Technology (FPT’10), pp. 454–459. Beijing, China (2010)
UniProtKB. Uniprotkb/swiss-prot protein knowledgebase. http://web.expasy.org/docs/relnotes/relstat.html
Acknowledgments
This work is supported by the National 863 R&D Program of China under grant No.2012AA01A306, the National Natural Science Foundation of China under grant No.61133008. We are grateful to the anonymous reviewers for their helpful comments.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Feng, X., Jin, H., Zheng, R. et al. Accelerating Smith-Waterman Alignment of Species-Based Protein Sequences on GPU. Int J Parallel Prog 43, 359–380 (2015). https://doi.org/10.1007/s10766-013-0284-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10766-013-0284-3