Abstract
Long non-coding RNAs (lncRNAs), each with >200 nucleotides in length, constitute a large portion of the human transcriptome. Although recent studies indicate that lncRNAs play key roles in gene regulation, development and disease, the RNA functional motifs are still poorly understood. Most of the existing algorithms for motif finding are severely limited in scalability with regards to sequence and motif size. In this study, we propose a novel genetic algorithm for discriminative motif identification capable of handling large input sequences and motif sizes by utilizing genetic operators to learn and evolve in response to the input sequences. We utilize our method on long non-coding RNA (lncRNA) transcripts as a test case to identify functional motifs associated with subcellular localization. Our methodology shows high accuracy and the ability to identify functional motifs associated with subcellular localization in lncRNAs, which recapitulates a previous experimental study.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bailey, T.L.: DREME: motif discovery in transcription factor ChIP-seq data. Bioinformatics 27, 1653–1659 (2011). doi:10.1093/bioinformatics/btr261
Derrien, T., Johnson, R., Bussotti, G., et al.: The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression. Genome Res. 22, 1775–1789 (2012). doi:10.1101/gr.132159.111
Che, D., Song, Y., Rasheed, K.: MDGA: Motif discovery using a genetic algorithm. In: Proceedings 2005 Conference on Genetic and Evolutionary Computation, pp. 447–452 (2005)
Shannon, C.E.: A mathematical theory of communication. Bell Syst. Tech. J. 27, 379–423 (1948). doi:10.1145/584091.584093
R Core Team. R: a language and environment for statistical computing. R Foundation for Statistical Computing (2015). http://www.r-project.org/
Scrucca, L.: GA: a package for genetic algorithms in R. J. Stat. Softw. 53(4), 1–37 (2013)
Hansen, L., Mariño-ramÃrez, L., Landsman, D.: Differences in local genomic context of bound and unbound motifs. Gene 506, 125–134 (2012). doi:10.1016/j.gene.2012.06.005
Zhang, B., Gunawardane, L., Niazi, F., et al.: A novel RNA motif mediates the strict nuclear localization of a long noncoding RNA. Mol. Cell. Biol. 34, 2318–2329 (2014). doi:10.1128/MCB.01673-13
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Gudenas, B.L., Wang, L. (2017). A Genetic Algorithm for Finding Discriminative Functional Motifs in Long Non-coding RNAs. In: Cai, Z., Daescu, O., Li, M. (eds) Bioinformatics Research and Applications. ISBRA 2017. Lecture Notes in Computer Science(), vol 10330. Springer, Cham. https://doi.org/10.1007/978-3-319-59575-7_43
Download citation
DOI: https://doi.org/10.1007/978-3-319-59575-7_43
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59574-0
Online ISBN: 978-3-319-59575-7
eBook Packages: Computer ScienceComputer Science (R0)