skip to main content
10.1145/2001576.2001624acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
research-article

A genetic algorithm to enhance transmembrane helices prediction

Published: 12 July 2011 Publication History

Abstract

A transmembrane helix (TMH) topology prediction is becoming a central problem in bioinformatics because the structure of TM proteins is difficult to determine by experimental means. Therefore, methods which could predict the TMHs topologies computationally are highly desired. In this paper we introduce TMHindex, a method for detecting TMH segments solely by the amino acid sequence information. Each amino acid in a protein sequence is represented by a Compositional Index deduced from a combination of the difference in amino acid appearances in TMH and non-TMH segments in training protein sequences and the amino acid composition information. Furthermore, genetic algorithm was employed to find the optimal threshold value to separate TMH segments from non-TMH segments. The method successfully predicted 376 out of the 378 TMH segments in 70 testing protein sequences. The level of accuracy achieved using TMHindex in comparison to recent methods for predicting the topology of TM proteins is a strong argument in favor of our method.

References

[1]
M. Cserzo, F. Eisenhaber, B. Eisenhaber, and I. Simon. ™ or not ™: transmembrane protein prediction with low false positive rate using DAS-™filter. Bioinformatics, 20(1):136--137, 2004.
[2]
T. Nugent and D. T. Jones. Transmembrane protein topology prediction using support vector machines. BMC Bioinformatics, 10, 2009.
[3]
R. Y. Kahsay, G. R. Gao, and L. Liao. An improved hidden markov model for transmembrane protein detection and topology prediction and its applications to complete genomes. Bioinformatics, 21(9):1853--1858, 2005.
[4]
M. G. Claros and Gunnar von Heijne. Toppred II: an improved software for membrane protein structure predictions. Computer Applications in the Biosciences, 10(6):685--686, 1994.
[5]
Takatsugu Hirokawa, S. Boon-Chieng, and Shigeki Mitaku. SOSUI: classification and secondary structure prediction system for membrane proteins. Bioinformatics, 14(4):378--379, 1998.
[6]
H. Shen and J. J. Chou. Membrain: Improving the accuracy of predicting transmembrane helices. Plos One, 6(3), 2008.
[7]
G. E. Tusnady and I. Simon. Principles governing amino acid composition of integral membrane proteins: application to topology prediction. J Mol Biol., 289(2):489--506, 1998.
[8]
A. Krogh, B. Larsson, G. von Heijne, and E. L. Sonnhammer. Predicting transmembrane protein topology with a hidden markov model: Application to complete genomes. J. Mol. Biol., 305(3):567--580, 2001.
[9]
H. Zhou and Y. Zhou. Predicting the topology of transmembrane helical proteins using mean burial propensity and a hidden-markov-model-based method. Protein Sci., 12(7):1547--1555, 2003.
[10]
L. Käll, A. Krogh, and E. L. Sonnhammer. A combined transmembrane topology and signal peptide prediction method. J. Mol. Biol., (338):1027--1036, 2004.
[11]
B. Rost, R. Casadio, and P. Fariselli. Refining neural network predictions for helical transmembrane proteins by dynamic programming. In David J. States, Pamkaj Agarwal, Terry Gaasterland, Lawrence Hunter, and Randall Smith, editors, Proceedings of the Fourth International Conference on Intelligent Systems for Molecular Biology, pages 192--200, Menlo Park, June 12--15 1996. AAAI Press.
[12]
T. Nugent and D.T. Jones. Transmembrane protein topology prediction using support vector machines. BMC Bioinformatics, 26(159), 2009.
[13]
M. Suyama and O. Ohara. Domcut: prediction of inter-domain linker regions in amino acid sequences. Bioinformatics, 19(5):673--674, 2003.
[14]
F. Tekaia, E. Yeramian, and B. Dujon. Amino acid composition of genomes, lifestyles of organisms, and evolutionary trends: a global picture with correspondence analysis. Gene, 297(4):51--60, 2002.
[15]
S. Roy, D. Martinez, O. Platero, T. Lane, and M. Werner-Washburne. Exploiting amino acid composition for predicting protein-protein interactions. PLoS One, 11(4), 2009.
[16]
E. Falkenauer. Genetic algorithms and grouping problems. John Wiley and Sons, 1998.
[17]
J.H. Holland. Adaptation in Natural Artificial Systems. University of Michigan Press, 1975.
[18]
R. Garey. Computers and Intractability: A guide to the theory of NP-completeness. Freeman and Co, 1979.
[19]
Salah Bouktif, Faheem Ahmed, Issa Khalil, and Giuliano Antoniol. A novel composite model approach to improve software quality prediction. Information & Software Technology, 52(12):1298--1311, 2010.
[20]
E. L. Sonnhammer, G. von Heijne, and A. Krogh. A hidden markov model for predicting transmembrane helices in protein sequences. In Proceedings of the 6th International Conference on Intelligent Systems for Molecular Biology (ISMB-98), Montréal, Québec, Canada, June 28 - July 1, 1998, pages 175--182. AAAI, 1998.
[21]
J. M. Cuthbertson, D. A. Doyle, and M. S. Sansom. Transmembrane helix prediction: a comparative evaluation and analysis. Protein Eng Des Sel., 18(6):295--308, 2005.
[22]
S. Jayasinghe, K. Hristova, and S.H. White. Energetics, stability, and prediction of transmembrane helices. J. Mol. Biol., 312(5):927--934, 2001.

Cited By

View all
  • (2016)The use of data mining techniques to predict mortality and length of stay in an ICU2016 12th International Conference on Innovations in Information Technology (IIT)10.1109/INNOVATIONS.2016.7880045(1-5)Online publication date: Nov-2016
  • (2016)Novel Tree-Based Proximity Search with SMOTE and Compositional Indexing Techniques for Protein Domain Identification2016 27th International Workshop on Database and Expert Systems Applications (DEXA)10.1109/DEXA.2016.029(71-75)Online publication date: Sep-2016
  • (2015)Protein Inter‐Domain Linker PredictionPattern Recognition in Computational Molecular Biology10.1002/9781119078845.ch12(212-235)Online publication date: 18-Dec-2015
  • Show More Cited By

Index Terms

  1. A genetic algorithm to enhance transmembrane helices prediction

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    GECCO '11: Proceedings of the 13th annual conference on Genetic and evolutionary computation
    July 2011
    2140 pages
    ISBN:9781450305570
    DOI:10.1145/2001576
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 July 2011

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. amino acid composition
    2. genetic algorithm
    3. membrane protein
    4. transmembrane helices

    Qualifiers

    • Research-article

    Conference

    GECCO '11
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)6
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 05 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2016)The use of data mining techniques to predict mortality and length of stay in an ICU2016 12th International Conference on Innovations in Information Technology (IIT)10.1109/INNOVATIONS.2016.7880045(1-5)Online publication date: Nov-2016
    • (2016)Novel Tree-Based Proximity Search with SMOTE and Compositional Indexing Techniques for Protein Domain Identification2016 27th International Workshop on Database and Expert Systems Applications (DEXA)10.1109/DEXA.2016.029(71-75)Online publication date: Sep-2016
    • (2015)Protein Inter‐Domain Linker PredictionPattern Recognition in Computational Molecular Biology10.1002/9781119078845.ch12(212-235)Online publication date: 18-Dec-2015
    • (2013)Prediction of protein inter-domain linkers using compositional index and simulated annealingProceedings of the 15th annual conference companion on Genetic and evolutionary computation10.1145/2464576.2482740(1603-1608)Online publication date: 6-Jul-2013
    • (2013)WRF-TMH: predicting transmembrane helix by fusing composition index and physicochemical properties of amino acidsAmino Acids10.1007/s00726-013-1466-444:5(1317-1328)Online publication date: 14-Mar-2013

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media