Emerging Methodologies in Multiple Sequence Alignment Using High Throughput Data

Guzman, Francisco M. Ortuño; Rojas, I.; Pomares, H.; Urquiza, J. M.; Florido, J. P.

doi:10.1007/978-3-642-19914-1_25

Emerging Methodologies in Multiple Sequence Alignment Using High Throughput Data

Francisco M. Ortuño Guzman⁶,
I. Rojas⁶,
H. Pomares⁶,
J. M. Urquiza⁶ &
…
J. P. Florido⁶

Conference paper

826 Accesses
1 Citations

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 93))

Abstract

New computational methodologies are increasingly being demanded in Bioinformatics due to the amount of data provided by high-throughput experiments. One of these approaches is multiple sequence alignment since feature integration is necessary to obtain more accurate and faster alignments. Alignments of nucleotide and protein sequences can help us to understand tasks like biological functions or structures in these molecules. Recent applications tend to use more available data that represent similarity among sequences: homologies, structures, functions, domains, motifs, etc. Thus, we present a review of current methods in multiple sequence alignments and their improvements integrating accurately and efficiently these heterogeneous data.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Pei, J.M.: Multiple protein sequence alignment. Current Opinion in Structural Biology 18(3), 382–386 (2008)
Article MathSciNet Google Scholar
Kemena, C., Notredame, C.: Upcoming challenges for multiple sequence alignment methods in the high-throughput era. Bioinformatics 25(19), 2455–2465 (2009)
Article Google Scholar
Althaus, E., Caprara, A., Lenhof, H.P., Reinert, K.: A branch-and-cut algorithm for multiple sequence alignment. Mathematical Programming 105(2-3), 387–425 (2006)
Article MathSciNet MATH Google Scholar
Thompson, J.D., Higgins, D.G., Gibson, T.J.: Clustal-w - improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Research 22(22), 4673–4680 (1994)
Article Google Scholar
Edgar, R.C.: Muscle: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research 32(5), 1792–1797 (2004)
Article Google Scholar
Lassmann, T., Sonnhammer, E.L.L.: Kalign - an accurate and fast multiple sequence alignment algorithm. Bmc Bioinformatics 6 (2005)
Google Scholar
Katoh, K., Misawa, K., Kuma, K., Miyata, T.: Mafft: a novel method for rapid multiple sequence alignment based on fast fourier transform. Nucleic Acids Research 30(14), 3059–3066 (2002)
Article Google Scholar
Taheri, J., Zomaya, A.Y.: Rbt-ga: a novel metaheuristic for solving the multiple sequence alignment problem. Bmc Genomics 10 (2009)
Google Scholar
Do, C.B., Mahabhashyam, M.S.P., Brudno, M., Batzoglou, S.: Probcons: Probabilistic consistency-based multiple sequence alignment. Genome Research 15(2), 330–340 (2005)
Article Google Scholar
Chen, W.Y., Liao, B., Zhu, W., Xiang, X.Y.: Multiple sequence alignment algorithm based on a dispersion graph and ant colony algorithm. Journal of Computational Chemistry 30(13), 2031–2038 (2009)
Article Google Scholar
Notredame, C., Higgins, D.G., Heringa, J.: T-coffee: A novel method for fast and accurate multiple sequence alignment. Journal of Molecular Biology 302(1), 205–217 (2000)
Article Google Scholar
O’Sullivan, O., Suhre, K., Abergel, C., Higgins, D.G., Notredame, C.: 3dcoffee: Combining protein sequences and structures within multiple sequence alignments. Journal of Molecular Biology 340(2), 385–395 (2004)
Article Google Scholar
Armougom, F., Moretti, S., Poirot, O., Audic, S., Dumas, P., Schaeli, B., Keduas, V., Notredame, C.: Expresso: automatic incorporation of structural information in multiple sequence alignments using 3d-coffee. Nucleic Acids Research 34, W604–W608 (2006)
Article Google Scholar
Taylor, W.R., Orengo, C.A.: Protein-structure alignment. Journal of Molecular Biology 208(1), 1–22 (1989)
Article Google Scholar
Shi, J.Y., Blundell, T.L., Mizuguchi, K.: Fugue: Sequence-structure homology recognition using environment-specific substitution tables and structure-dependent gap penalties. Journal of Molecular Biology 310(1), 243–257 (2001)
Article Google Scholar
Pei, J.M., Grishin, N.V.: Promals: towards accurate multiple sequence alignments of distantly related proteins. Bioinformatics 23(7), 802–808 (2007)
Article Google Scholar
Wallace, I.M., O’Sullivan, O., Higgins, D.G., Notredame, C.: M-coffee: combining multiple sequence alignment methods with t-coffee. Nucleic Acids Research 34(6), 1692–1699 (2006)
Article Google Scholar
Thompson, J.D., Koehl, P., Ripp, R., Poch, O.: Balibase 3.0: Latest developments of the multiple sequence alignment benchmark. Proteins-Structure Function and Bioinformatics 61(1), 127–136 (2005)
Article Google Scholar
Raghava, G.P.S., Searle, S.M.J., Audley, P.C., Barber, J.D., Barton, G.J.: Oxbench: A benchmark for evaluation of protein multiple sequence alignment accuracy. Bmc Bioinformatics 4 (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Architecture and Computer Technology, University of Granada, Spain
Francisco M. Ortuño Guzman, I. Rojas, H. Pomares, J. M. Urquiza & J. P. Florido

Authors

Francisco M. Ortuño Guzman
View author publications
You can also search for this author in PubMed Google Scholar
I. Rojas
View author publications
You can also search for this author in PubMed Google Scholar
H. Pomares
View author publications
You can also search for this author in PubMed Google Scholar
J. M. Urquiza
View author publications
You can also search for this author in PubMed Google Scholar
J. P. Florido
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dep. Informática / CCTC, Universidade do Minho, 4710 - 057, Braga, Portugal
Miguel P. Rocha
Department of Computing Science and Control Faculty of Science, University of Salamanca, Plaza de la Merced S/N, 37008, Salamanca, Spain
Juan M. Corchado Rodríguez
Edificio Politécnico, ESEI: Escuela Superior de Ingeniería Informática, 32004, Ourense, Spain
Florentino Fdez-Riverola
Structural Biology and BioComputing Programme (CNIO), Spanish National Cancer Research Centre, Melchor Fdez Almagro 3, 28029, Madrid, Spain
Alfonso Valencia

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Guzman, F.M.O., Rojas, I., Pomares, H., Urquiza, J.M., Florido, J.P. (2011). Emerging Methodologies in Multiple Sequence Alignment Using High Throughput Data. In: Rocha, M.P., Rodríguez, J.M.C., Fdez-Riverola, F., Valencia, A. (eds) 5th International Conference on Practical Applications of Computational Biology & Bioinformatics (PACBB 2011). Advances in Intelligent and Soft Computing, vol 93. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19914-1_25

Download citation

DOI: https://doi.org/10.1007/978-3-642-19914-1_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19913-4
Online ISBN: 978-3-642-19914-1
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics