Abstract
Haplotypes, defined as sets of DNA polymorphism markers physically located on a single chromosome, have attracted rapidly growing interest owing to their potential value in disease gene identification and in pharmacogenomics. Because molecular haplotyping methods remain too costly for routine use, statistical techniques for haplotype inference have emerged as the most time- and cost-efficient approach. This chapter explains the statistical theory and algorithms behind several in silico haplotype phasing strategies; reviews the partition-ligation idea for dealing with a large number of linked SNP marker loci; and proposes new methods for handling genotype uncertainty in genotyping machine output as well as in pooled marker data. We also discuss the application of haplotype information to disease mutation detection in case-control designs and the impact of haplotype information on locus estimation accuracy. As an illustration, we applied the haplotyping tool PL-EM jointly with the LD mapping algorithm BLADE to a case-control study of the SNP markers surrounding the Alzheimer disease susceptibility gene APOE.
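To make the idea of statistical haplotype inference concrete, the following is a minimal sketch of an expectation-maximization (EM) estimate of haplotype frequencies for just two biallelic SNPs, assuming Hardy-Weinberg equilibrium and error-free genotypes. It is not the PL-EM implementation discussed in the chapter, and it omits partition-ligation entirely; the function name `em_two_snp` and the genotype coding (count of allele "1" at each SNP) are assumptions made for illustration.

```python
from collections import Counter

HAPLOTYPES = ("00", "01", "10", "11")


def em_two_snp(genotypes, n_iter=100, tol=1e-10):
    """Estimate two-SNP haplotype frequencies by EM.

    genotypes: list of (g1, g2) tuples, each entry the count (0, 1, 2)
    of allele '1' at that SNP. Hypothetical coding, for illustration only.
    """
    counts = Counter(genotypes)
    n_chrom = 2 * len(genotypes)

    # Haplotype counts contributed by unambiguous genotypes
    # (every genotype except the double heterozygote (1, 1)).
    base = dict.fromkeys(HAPLOTYPES, 0.0)
    for (g1, g2), n in counts.items():
        if (g1, g2) == (1, 1):
            continue
        hap_a = f"{int(g1 == 2)}{int(g2 == 2)}"
        hap_b = f"{int(g1 >= 1)}{int(g2 >= 1)}"
        base[hap_a] += n
        base[hap_b] += n

    n_dh = counts.get((1, 1), 0)  # number of double heterozygotes
    freq = dict.fromkeys(HAPLOTYPES, 0.25)  # uniform starting point

    for _ in range(n_iter):
        # E-step: probability that a double heterozygote is 00/11
        # rather than 01/10, given the current frequencies.
        cis = freq["00"] * freq["11"]
        trans = freq["01"] * freq["10"]
        w = cis / (cis + trans) if cis + trans > 0 else 0.5

        # M-step: re-estimate frequencies from expected haplotype counts.
        new = {
            "00": (base["00"] + n_dh * w) / n_chrom,
            "11": (base["11"] + n_dh * w) / n_chrom,
            "01": (base["01"] + n_dh * (1 - w)) / n_chrom,
            "10": (base["10"] + n_dh * (1 - w)) / n_chrom,
        }
        delta = max(abs(new[h] - freq[h]) for h in HAPLOTYPES)
        freq = new
        if delta < tol:
            break
    return freq


if __name__ == "__main__":
    sample = [(0, 0)] * 4 + [(2, 2)] * 4 + [(1, 1)] * 2
    print(em_two_snp(sample))
```

In this toy sample the homozygous individuals carry only the 00 and 11 haplotypes, so the EM iterations assign the two ambiguous double heterozygotes almost entirely to the 00/11 phase. With many linked SNPs the number of candidate haplotypes grows exponentially, which is the scaling problem that partition-ligation is meant to address.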
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Niu, T., Lu, X., Kang, H., Qin, Z.S., Liu, J.S. (2004). Haplotype Inference and Its Application in Linkage Disequilibrium Mapping. In: Istrail, S., Waterman, M., Clark, A. (eds) Computational Methods for SNPs and Haplotype Inference. RSNPsH 2002. Lecture Notes in Computer Science, vol 2983. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24719-7_4
DOI: https://doi.org/10.1007/978-3-540-24719-7_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21249-2
Online ISBN: 978-3-540-24719-7
eBook Packages: Springer Book Archive