Optimizing Multiple Pairwise Alignment of Genomic Sequences in Multicore Clusters

Montañola, Alberto; Roig, Concepció; Hernández, Porfidio

doi:10.1007/978-3-319-07581-5_15

Alberto Montañola⁶,
Concepció Roig⁶ &
Porfidio Hernández⁷

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 294))

1147 Accesses

Abstract

Multiple sequence alignment (MSA), used in biocomputing to study similarities between different genomic sequences, is known to require important memory and computation resources. Determining the efficient amount of resources to allocate is important to avoid waste of them, thus reducing the economical costs required in running for example a specific cloud instance. The pairwise alignment is the initial key step of the MSA problem, which will compute all pair alignments needed. We present a method to determine the optimal amount of memory and computation resources to allocate by the pairwise alignment, and we will validate it through a set of experimental results for different possible inputs. These allow us to determine the best parameters to configure the applications in order to use effectively the available resources of a given system.

This work was supported by the MEyC-Spain under contract TIN 2011-28689-C02-02 and Consolider CSD2007-0050. The CUR of DIUE of GENCAT and the European Social Fund.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Breaking the computational barriers of pairwise genome comparison

Article Open access 11 August 2015

Optimization of consistency-based multiple sequence alignment using Big Data technologies

Article 24 May 2018

Fast and Accurate Multiple Sequence Alignment with MSAProbs-MPI

References

Montañola, A., Roig, C., Hernández, P.: Pairwise Sequence Alignment Method for Distributed Shared Memory Systems. In: 21st Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP), pp. 432–436 (2013)
Google Scholar
Edgar, R.C., Batzoglou, S.: Multiple sequence alignment. Current Opinion in Structural Biology 16(3), 368–373 (2006), http://dx.doi.org/10.1016/j.sbi.2006.04.004 ISSN 0959-440X
Daugelaite, J., O’ Driscoll, A., Sleator, R.D.: An Overview of Multiple Sequence Alignments and Cloud Computing in Bioinformatics. ISRN Biomathematics 2013, Article ID 615630, 14 pages (2013)
Google Scholar
Notredame, C., Higgins, D.G., Heringa, J.: T-Coffee: A novel method for fast and accurate multiple sequence alignment. J. Mol. Biol. 302(1), 205–217 (2000), PMID: 10964570 (PubMed - indexed for MEDLINE)
Google Scholar
Smith, T.F., Waterman, M.S.: Identification of Common Molecular Subsequences. Journal of Molecular Biology 147, 195–197 (1981), doi:10.1016/0022-2836(81)90087-5
Article Google Scholar
Needleman, S.B., Wunsch, C.D.: A general method applicable to the search for similarities in the amino acid sequence of two proteins. Journal of Molecular Biology 48(3), 443–453 (1970), doi:10.1016/0022-2836(70)90057-4; PMID 5420325
Google Scholar
Larkin, M.A., Blackshields, G., Brown, N.P., Chenna, R., McGettigan, P.A., McWillian, H., Valentin, F., Wallace, I.M., Wilm, A., Lopez, R., Thompson, J.D., Gibson, T.J., Higgins, D.G.: ClustalW and ClustalX version 2. Bioinformatics 23(21), 2947–2948 (2007)
Article Google Scholar
Edgar, R.C.: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research 32(5), 1792–1797 (2004)
Article Google Scholar
Zola, J., Yang, X., Rospondek, S., Aluru, S.: Parallel T-Coffee: A Parallel Multiple Sequence Aligner. In: Proc. of ISCA PDCS-2007, pp. 248–253 (2007)
Google Scholar
Li, K.B.: ClustalW-MPI: ClustalW analysis using distributed and parallel computing. Bioinformatics 19, 1585–1586 (2003)
Article Google Scholar
Farrar, M.: Striped Smith-Waterman speeds database searches six times over other SIMD implementations. Bioinformatics 23(2), 156–161 (2007), doi:10.1093/bioinformatics; first published online (November 16, 2006)
Google Scholar
Liu, Y., Maskell, D.L., Schmidt, B.: CUDASW++: optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units. BMC Research Notes 2, 73 (2009)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Computer Science Department, Universitat de Lleida, Lleida, Spain
Alberto Montañola & Concepció Roig
Computer Architecture and Operating Systems Department, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Barcelona, Spain
Porfidio Hernández

Authors

Alberto Montañola
View author publications
You can also search for this author in PubMed Google Scholar
Concepció Roig
View author publications
You can also search for this author in PubMed Google Scholar
Porfidio Hernández
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alberto Montañola .

Editor information

Editors and Affiliations

EMBL Outstation - Hinxton, European Bioinformatics Institute, Hinxton, United Kingdom
Julio Saez-Rodriguez
Department of Informatics, University of Minho, Braga, Portugal
Miguel P. Rocha
Department of Informatics Campus Universitario As Lagoas s/n, University of Vigo, Ourense, Spain
Florentino Fdez-Riverola
Department of Computing Science, University of Salamanca, Salamanca, Spain
Juan F. De Paz Santana

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Montañola, A., Roig, C., Hernández, P. (2014). Optimizing Multiple Pairwise Alignment of Genomic Sequences in Multicore Clusters. In: Saez-Rodriguez, J., Rocha, M., Fdez-Riverola, F., De Paz Santana, J. (eds) 8th International Conference on Practical Applications of Computational Biology & Bioinformatics (PACBB 2014). Advances in Intelligent Systems and Computing, vol 294. Springer, Cham. https://doi.org/10.1007/978-3-319-07581-5_15

Download citation

DOI: https://doi.org/10.1007/978-3-319-07581-5_15
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07580-8
Online ISBN: 978-3-319-07581-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics