Improved Genetic Algorithm for Multiple Sequence Alignment Using Segment Profiles (GASP)

Lv, Yanping; Li, Shaozi; Zhou, Changle; Guo, Wenzhong; Xu, Zhengming

doi:10.1007/11811305_43

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4093)

Included in the following conference series:

International Conference on Advanced Data Mining and Applications

Abstract

This paper presents a novel genetic algorithm (GA) for multiple sequence alignment in protein analysis. First, and most significantly, it uses segment profiles both to generate a diversified initial population and to prevent crossover and mutation operations from destroying conserved regions. Because segment profiles contain rich local information, they also speed up convergence. Second, it introduces the norMD function into a genetic algorithm as the measure used to score multiple alignments. Finally, to address premature convergence, an improved progressive method is used to optimize the highest-scoring individual of each new generation. The new algorithm is compared with the ClustalX and T-Coffee programs on several test cases from the BAliBASE benchmark alignment database. The experimental results show that it can yield better performance on data sets with long sequences, regardless of sequence similarity.
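
The ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions: a simple sum-of-pairs score stands in for norMD, a fixed set of "protected" column indices stands in for the conserved regions identified by segment profiles, and a mutation-only loop that keeps a single best individual stands in for the full population-based GA with crossover. All function and parameter names (`sum_of_pairs`, `shift_gap`, `evolve`, `protected`) are hypothetical.

```python
import random

GAP = "-"


def sum_of_pairs(alignment, match=1, mismatch=-1, gap=-2):
    """Stand-in objective: column-wise sum-of-pairs score (NOT norMD)."""
    score = 0
    for column in zip(*alignment):
        for i in range(len(column)):
            for j in range(i + 1, len(column)):
                a, b = column[i], column[j]
                if a == GAP and b == GAP:
                    continue  # gap-gap pairs contribute nothing
                if a == GAP or b == GAP:
                    score += gap
                elif a == b:
                    score += match
                else:
                    score += mismatch
    return score


def shift_gap(row, protected, rng):
    """Mutation: swap one gap with an adjacent residue.

    Columns listed in `protected` (an assumed stand-in for conserved
    regions) are never touched, and residue order is preserved.
    """
    candidates = [
        (i, j)
        for i in range(len(row))
        if row[i] == GAP and i not in protected
        for j in (i - 1, i + 1)
        if 0 <= j < len(row) and row[j] != GAP and j not in protected
    ]
    if not candidates:
        return row  # nothing can move without touching a protected column
    i, j = rng.choice(candidates)
    chars = list(row)
    chars[i], chars[j] = chars[j], chars[i]
    return "".join(chars)


def evolve(alignment, protected, generations=200, seed=0):
    """Mutation-only loop: keep a mutant only if it does not lower the score."""
    rng = random.Random(seed)
    best = list(alignment)
    best_score = sum_of_pairs(best)
    for _ in range(generations):
        child = list(best)
        k = rng.randrange(len(child))
        child[k] = shift_gap(child[k], protected, rng)
        child_score = sum_of_pairs(child)
        if child_score >= best_score:
            best, best_score = child, child_score
    return best, best_score


if __name__ == "__main__":
    start = ["ACGT-", "A-CGT", "ACGT-"]
    final, score = evolve(start, protected={0})
    print(sum_of_pairs(start), score)
    print("\n".join(final))
```

Because mutants are accepted only when the score does not drop, the returned score is never lower than the starting score. A loop of this kind can stall in a local optimum, which is the premature convergence problem that the paper addresses with its progressive refinement step.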

Author information

Authors and Affiliations

Intelligent Information Technology Lab., Department of Computer Science, Xiamen University, Xiamen, 361005, China
Yanping Lv, Shaozi Li, Changle Zhou & Zhengming Xu
Department of Computer Science, Fuzhou University, Fuzhou, 350002, China
Wenzhong Guo

Editor information

Editors and Affiliations

School of Information Technology and Electronic Engineering, The University of Queensland, Queensland, Australia
Xue Li
University of Alberta, Canada
Osmar R. Zaïane
Northwest Polytechnical University, China
Zhanhuai Li

About this paper

Cite this paper

Lv, Y., Li, S., Zhou, C., Guo, W., Xu, Z. (2006). Improved Genetic Algorithm for Multiple Sequence Alignment Using Segment Profiles (GASP). In: Li, X., Zaïane, O.R., Li, Z. (eds) Advanced Data Mining and Applications. ADMA 2006. Lecture Notes in Computer Science, vol 4093. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11811305_43

DOI: https://doi.org/10.1007/11811305_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37025-3
Online ISBN: 978-3-540-37026-0
eBook Packages: Computer Science, Computer Science (R0)
