Abstract
Finding new statistical approaches to high throughput data analysis is a very hot topic nowadays. Such a data needs dedicated methods and algorithms of analysis due to huge number of features, but often also due to a small number of samples. Methylation data are also very special, because of dependencies between features and their neighbourhood. There is a need to find a novel, data driven algorithm for these data owing to big variety of distributions data sets. Purpose of this method is detection of regions with different levels of demethylation. From the biological point of view, the most important genome regions are TSS (transcription start site) regions. Hypermethylation of these part of a gene leads to repression and thus stop the gene expression. This phenomenon often happens in cancer disease and impairs a number of molecular processes in the cell. The proposed algorithm is performed for AML patients data in comparison to healthy control. By combination of statistics methods and mathematical modelling together, it enables detection of demethylated regions or DNA and their classification as low, medium or high demethylated.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Zemach, A., McDaniel, I.E., Silva, P., Zilberman, D.: Genome-wide evolutionary analysis of eukaryotic DNA methylation. Science 328(5980), 916–919 (2010)
Gonzalo, S.: Epigenetic alterations in aging. J. Appl. Physiol. 109(2), 586–597 (2010)
Du, P., Bourgon, R.: MethyAnalysis: DNA methylation data analysis and visualization, R package version 1.10.0 (2014)
Sofer, T., Schifano, E.D., Hoppin, J.A., Hou, L., Baccarelli, A.A.: A-clustering: a novel method for the detection of co-regulated methylation regions, and regions associated with exposure. Bioinformatics 29, 2884–2891 (2013)
Jaffe, A.E., Murakami, P., Lee, H., Leek, J.T., Fallin, M.D., Feinberg, A.P., Irizarry, R.A.: Bump hunting to identify differentially methylated regions in epigenetic epidemiology studies. Int. J. Epidemiol. 41, 200–209 (2012)
Butcher, L.M., Beck, S.: Probe Lasso: a novel method to rope in differentially methylated regions with 450 K DNA methylation data. Methods 72, 21–28 (2015)
Jung, N., Dai, B., Gentles, A.J., Majeti, R., Feinberg, A.P.: An LSC epigenetic signature is largely mutation independent and implicates the HOXA cluster in AML pathogenesis. Nat. Commun. 6, 8489 (2015)
Aryee, M.J., et al.: Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics 30(10), 1363–1369 (2014)
Sandoval, J., Heyn, H., Moran, S., Serra-Musach, J., Pujana, M.A., Bibikova, M., Esteller, M.: Validation of a DNA methylation microarray for 450,000 CpG sites in the human genome. Epigenetics 6(6), 692–702 (2011)
Houseman, E.A., et al.: Model-based clustering of DNA methylation array data: a re-cursive-partitioning algorithm for high-dimensional data arising as a mixture of beta distri-butions. BMC Bioinform. 9(1), 365 (2008)
Hodges Jr., J.L., Lehmann, E.L.: Estimates of location based on rank tests. Ann. Math. Stat. 34, 598–611 (1963)
McLachlan, G., Peel, D.: Finite Mixture Models. Wiley, New York (2004)
Claeskens, G., Hjort, N.L.: Model Selection and Model Averaging, vol. 330. Cambridge University Press, Cambridge (2008)
Huberty, C.J.: Applied Discriminant Analysis, vol. 297. Wiley, New York (1994)
Mann, H.B., Whitney, D.R.: On a test of whether one of two random variables is stochastically larger than the other. Ann. Math. Stat. 18(1), 50–60 (1947)
Stouffer, S.A., Suchman, E.A., DeVinney, L.C., Star, S.A., Williams Jr., R.M.: The American soldier: adjustment during army life. (Studies in social psychology in World War II), vol. 1 (1949)
Kuerbitz, S.J., Pahys, J., Wilson, A., Compitello, N., Gray, T.A.: Hypermethylation of the imprinted NNAT locus occurs frequently in pediatric acute leukemia. Carcinogenesis 23(4), 559–564 (2002)
Acknowledgements
This work was financed by SUT grant no. BKM/506/RAU1/2016/t.29 (AC) and SUT grant no. 02/010/BK_16/3015 (JP). All the calculations were carried out using infrastructure funded by GeCONiI project (POIG.02.03.01-24-099/13).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Cecotka, A., Polanska, J. (2017). Novel Method of Identifying DNA Methylation Fingerprint of Acute Myeloid Leukaemia. In: Fdez-Riverola, F., Mohamad, M., Rocha, M., De Paz, J., Pinto, T. (eds) 11th International Conference on Practical Applications of Computational Biology & Bioinformatics. PACBB 2017. Advances in Intelligent Systems and Computing, vol 616. Springer, Cham. https://doi.org/10.1007/978-3-319-60816-7_23
Download citation
DOI: https://doi.org/10.1007/978-3-319-60816-7_23
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-60815-0
Online ISBN: 978-3-319-60816-7
eBook Packages: EngineeringEngineering (R0)