Skip to main content
Log in

Accelerating Smith-Waterman Alignment of Species-Based Protein Sequences on GPU

  • Published:
International Journal of Parallel Programming Aims and scope Submit manuscript

Abstract

Finding regions of similarity between two data streams is a computational intensive and memory consuming problem, which refers as sequence alignment for biological sequences. Smith-Waterman algorithm is an optimal method of finding the local sequence alignment. It requires a large amount of computation and memory space, and is also constrained by the memory access speed of the Graphics Processing Units (GPUs) global memory when accelerating by using GPUs. Since biologists are commonly concerned with one or a few species in their research areas, SpecAlign is proposed to accelerate Smith-Waterman alignment of species-based protein sequences within the available GPU memory. It is designed to provide the best alignments of all the database sequences aligned on GPU. The new implementation improves performance by optimizing the organization of database, increasing GPU threads for every database sequence, and reducing the number of memory accesses to alleviate memory bandwidth bottleneck. Experimental results show that SpecAlign improves the performance by about 32 % on average when compared with CUDASW++2.0 and DOPA with Ssearch trace for 100 shortlisted sequences on NVIDIA GTX295. It also outperforms CUDASW++2.0 with Ssearch trace for 100 shortlisted sequences by about 52 % on NVIDIA GTX460.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15

Similar content being viewed by others

References

  1. Ino, F., Munekawa, Y., Hagihara, K.: Sequence homology search using fine-grained cycle sharing of idle GPUs. IEEE Trans. Parallel Distrib Syst 23(4), 751–759 (2011)

    Article  Google Scholar 

  2. Hasan, L., Kentie, M., Al-Ars, Z.: DOPA: GPU-based protein alignment using database and memory access optimizations. BMC Res. Notes 4(1), 261–272 (2011)

    Google Scholar 

  3. Hains, D., Cashero, Z., Ottenberg, M., Bohm, W., Rajopadhye, S.: Improving CUDASW++, a parallelization of Smith-Waterman for CUDA enabled devices. In: Proceedings of 10th IEEE International Workshop on High Performance Computational Biology (HICOMB’11), pp. 490–501. Shanghai, China (2011)

  4. Sandes, E.F.d.O., Melo, A.C.M.A.d.: CUDAlign: Using GPU to accelerate the comparison of megabase genomic sequences. In: Proceedings of 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP’10), pp. 137–146. Bangalore, India (2010)

  5. Yang, L., Huang, W., Johnson, J., Vaidya, S.: GPU accelerated Smith-Waterman. In: Proceedings of 6th International Conference on Computational Science (ICCS’06). Springer, Reading, UK (2006)

  6. Khajeh-Saeed, A., Poole, S., Perot, J.B.: Acceleration of the Smith-Waterman algorithm using single and multiple graphics processors. J. Comput. Phys. 229(11), 4247–4258 (2010)

    Article  MATH  MathSciNet  Google Scholar 

  7. Sandes EFdO, Melo ACMAd, : Smith-Waterman alignment of huge sequences with GPU in linear space. In: Proceedings of IEEE International Parallel and Distributed Processing Symposium (IPDPS’11), pp. 1199–1211. Anchorage, AK (2011)

  8. Blazewicz, J., Frohmberg, W., Kierzynka, M., Pesch, E., Wojciechowski, P.: Protein alignment algorithms with an efficient backtracking routine on multiple GPUs. BMC Bioinforma. 12(1), 181–197 (2011)

    Google Scholar 

  9. Lv, F., Nevatia, R.: Single view human action recognition using key pose matching and Viterbi path searching. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR ’07). Minneapolis, Minnesota, USA (2007)

  10. Liu, W., Schmidt, B., Voss, G., Schroder, A., Muller-Wittig, W.: Bio-sequence satabase scanning on a GPU. In: Proceedings of IEEE International Parallel and Distributed Processing, Symposium (IPDPS’06) (2006)

  11. Manavski, S.A., Valle, G.: CUDA compatible GPU cards as efficient hardware accelerators for Smith-Waterman sequence alignment. BMC Bioinforma. 9(suppl. 2), S10 (2008)

  12. Liu, Y., Maskell, D.L., Schmidt, B.: CUDASW++: Optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units. BMC Res. Notes 2(1), 73–82 (2009)

    Google Scholar 

  13. Liu, Y., Schmidt, B., Maskell, D.L.: CUDASW++2.0: enhanced Smith-Waterman protein database search on CUDA-enabled GPUs based on SIMT and virtualized SIMD abstractions. BMC Res. Notes 3(1), 93 (2010)

    Google Scholar 

  14. Nawaz, Z., Nadeem, M., Someren, H.V., Bertels, K.: A parallel FPGA design of the Smith-Waterman traceback. In: Proceedings of 2010 International Conference on Field-Programmable Technology (FPT’10), pp. 454–459. Beijing, China (2010)

  15. UniProtKB. Uniprotkb/swiss-prot protein knowledgebase. http://web.expasy.org/docs/relnotes/relstat.html

Download references

Acknowledgments

This work is supported by the National 863 R&D Program of China under grant No.2012AA01A306, the National Natural Science Foundation of China under grant No.61133008. We are grateful to the anonymous reviewers for their helpful comments.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hai Jin.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Feng, X., Jin, H., Zheng, R. et al. Accelerating Smith-Waterman Alignment of Species-Based Protein Sequences on GPU. Int J Parallel Prog 43, 359–380 (2015). https://doi.org/10.1007/s10766-013-0284-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10766-013-0284-3

Keywords

Navigation