Abstract
Predicting the oncogenic potential of a gene fusion transcript is an important and challenging task in the study of cancer development. To date, the available approaches mostly rely on protein domain analysis to produce a probability score describing the oncogenic potential of a gene fusion. In this paper, a Convolutional Neural Network model is proposed to classify gene fusions as oncogenic or non-oncogenic, exploiting only the protein sequence, without protein domain information. The proposed model obtained an accuracy close to 90% on a dataset of fused sequences.
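To make the idea of a sequence-only input concrete, the following is a minimal sketch of how a fused protein sequence might be turned into a fixed-size numeric matrix that a convolutional network could consume. The abstract does not specify the encoding the authors used, so one-hot encoding, the function name `one_hot_encode`, the `max_len` parameter, and the truncation and zero-padding policy are all illustrative assumptions rather than the paper's method.

```python
# Hypothetical sketch: one-hot encoding of a protein sequence as CNN input.
# The encoding scheme, padding policy, and names are assumptions, not the
# method described in the paper.

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids
AA_INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}


def one_hot_encode(sequence, max_len):
    """Return a max_len x 20 matrix (list of lists) of 0/1 values.

    Sequences longer than max_len are truncated; shorter ones are
    zero-padded. Unknown symbols (e.g. 'X') become all-zero rows.
    """
    matrix = [[0] * len(AMINO_ACIDS) for _ in range(max_len)]
    for pos, residue in enumerate(sequence.upper()[:max_len]):
        idx = AA_INDEX.get(residue)
        if idx is not None:
            matrix[pos][idx] = 1
    return matrix


# Toy example: a 4-residue sequence padded to length 6.
encoded = one_hot_encode("MKTX", max_len=6)
```

Each row of the resulting matrix corresponds to one sequence position and each column to one amino acid, which is the kind of layout a one-dimensional convolution can slide over along the sequence axis.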
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Lovino, M., Urgese, G., Macii, E., di Cataldo, S., Ficarra, E. (2020). Predicting the Oncogenic Potential of Gene Fusions Using Convolutional Neural Networks. In: Raposo, M., Ribeiro, P., Sério, S., Staiano, A., Ciaramella, A. (eds) Computational Intelligence Methods for Bioinformatics and Biostatistics. CIBB 2018. Lecture Notes in Computer Science, vol 11925. Springer, Cham. https://doi.org/10.1007/978-3-030-34585-3_24
DOI: https://doi.org/10.1007/978-3-030-34585-3_24
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-34584-6
Online ISBN: 978-3-030-34585-3
eBook Packages: Computer Science, Computer Science (R0)