Abstract
This article describes the use of a comprehensive approach for the identification of differentiating gene related blocks of SNPs based on Fisher’s p-value integration with a pooled correlation approximation. This pre-selection step is proposed as an alternative to advanced haplotype analyses for computational complexity reduction. The method, previously used for pathway regulation inference in eQTL data, is with the necessary modification especially suited for high-dimensional population genetics studies with a case/control design, where extensive numbers of SNPs are identified, leading to numerous haplotype blocks to be tested. This approach extends standard allele frequency analysis to more advanced haplotype identification. The novel method succeeds at reducing the runtime while maintaining a high level of biological result accuracy when compared against the exact test for haplotypes.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Cui, Y., Li, S., Williams, B.L.: A combined \( p \)-value approach to infer pathway regulations in eQTL mapping. Stat. Interface 4(3), 389–401 (2011)
Excoffier, L., Laval, G., Schneider, S.: Arlequin (version 3.0): an integrated software package for population genetics data analysis. Evol. Bioinform. 1, 47–50 (2005)
Fisher, R.A.: Statistical methods for research workers. In: Kotz, S., Johnson, N.L. (eds.) Breakthroughs in Statistics, pp. 66–70. Springer, New York (1992)
Goudet, J., Raymond, M., de Meeüs, T., Rousset, F.: Testing differentiation in diploid populations. Genetics 144(4), 1933–1940 (1996)
Hedges, L., Olkin, I.: Statistical Methods for Meta-Analysis. Elsevier Science, New York (2014)
McKenna, A., Hanna, M., Banks, E., Sivachenko, A., Cibulskis, K., Kernytsky, A., Garimella, K., Altshuler, D., Gabriel, S., Daly, M., et al.: The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20(9), 1297–1303 (2010)
Raymond, M., Rousset, F.: An exact test for population differentiation. Evolution 49(6), 1280–1283 (1995)
Satterthwaite, F.E.: An approximate distribution of estimates of variance components. Biom. Bull. 2(6), 110–114 (1946)
Scheet, P., Stephens, M.: A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase. Am. J. Hum. Genet. 78(4), 629–644 (2006)
Acknowledgements
This work was funded by The Polish National Centre for Research and Development grant no. PBS3/A7/29/2015/ID-247184 (AP) and National Science Centre, Poland grant no. 2015/19/B/ST6/01736 (JP). Calculations were carried out using GeCONiI infrastructure (POIG02.03.01-24-099).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Papiez, A. et al. (2019). Can an Integrative SNP Approach Substitute Standard Identification in Comprehensive Case/Control Analyses?. In: Fdez-Riverola, F., Mohamad, M., Rocha, M., De Paz, J., González, P. (eds) Practical Applications of Computational Biology and Bioinformatics, 12th International Conference. PACBB2018 2018. Advances in Intelligent Systems and Computing, vol 803. Springer, Cham. https://doi.org/10.1007/978-3-319-98702-6_15
Download citation
DOI: https://doi.org/10.1007/978-3-319-98702-6_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-98701-9
Online ISBN: 978-3-319-98702-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)