skip to main content
10.1145/2506583.2506592acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
tutorial

Haplotype-based prediction of gene alleles using pedigrees and SNP genotypes

Published: 22 September 2013 Publication History

Abstract

Computational methods for gene allele prediction have been proposed to substitute dedicated and expensive assays with cheaper in-silico analyses that operate on routinely collected data, such as SNP genotypes. Most of these methods are tailored to the needs and characteristics of human genetic studies where they achieve good prediction accuracy. However, genomic analyses are becoming increasingly important in livestock species too. For livestock species generally the underlying---usually quite large and complex---pedigree is known and available; this information is not fully exploited by current allele prediction methods.
In this paper, we propose a new gene allele prediction method based on a simple, but robust, combinatorial formulation for the problem of discovering haplotype-allele associations. The inherent uncertainty of the haplotype inference process is reduced by taking into account the inheritance of gene alleles across the population pedigree while genotypes are phased. The accuracy of the method has been extensively evaluated on a representative real-world livestock dataset under several scenarios and choices of parameters. The median error rate ranged from 0.0537 to 0.0896, with an average of 0.0678; this is 21% better than another state-of-the-art prediction algorithm that does not use the pedigree information. The experimental results support the validity of the proposed approach and, in particular, of the use of pedigree information in gene allele predictions.

References

[1]
G. Abecasis, S. Cherny, W. Cookson, and L. Cardon. Merlin--rapid analysis of dense genetic maps using sparse gene flow trees. Nature Genetics, 30(1):97--101, 2002.
[2]
F. Ayele, E. Hailu, C. Finan, A. Aseffa, G. Davey, M. Newport, C. Rotimi, and A. Adeyemo. Prediction of HLA class II alleles using SNPs in an african population. PLoS ONE, 7(6):e40206, 2012.
[3]
N. Bacciu, L. Bomba, E. Nicolazzi, M. Milanesi, and P. Ajmone-Marsan. Linkage disequilibrium in BTA6 for the italian holstein breed using a 54k and 800k SNP-chip. In Proceedings of the 16th QTL-MAS workshop, Alghero - Italy, 2012.
[4]
J. Bohmanova, M. Sargolzaei, and F. Schenkel. Characteristics of linkage disequilibrium in north american holsteins. BMC Genomics, 11(1):421, 2010.
[5]
A. Dilthey, S. Leslie, L. Moutsianas, J. Shen, C. Cox, M. Nelson, and G. McVean. Multi-population classical HLA type imputation. PLoS Computational Biology, 9(2):e1002877, 2013.
[6]
C. Elsik, R. Tellam, K. Worley, et al. The genome sequence of taurine cattle: a window to ruminant biology and evolution. Science, 324(5926):522--528, 2009.
[7]
M. Groenen, A. Archibald, H. Uenishi, C. Tuggle, Y. Takeuchi, M. Rothschild, et al. Analyses of pig genomes provide insight into porcine demography and evolution. Nature, 491(7424):393--398, 2012.
[8]
L. Groeneveld, J. Lenstra, H. Eding, M. Toro, B. Scherf, D. Pilling, et al. Genetic diversity in farm animals--a review. Animal Genetics, 41(s1):6--31, 2010.
[9]
D. Gudbjartsson, T. Thorvaldsson, A. Kong, G. Gunnarsson, and A. Ingolfsdottir. Allegro version 2. Nature Genetics, 37(10):1015--1016, Oct. 2005.
[10]
D. Habier, J. Tetens, F. Seefried, P. Lichtner, and G. Thaller. The impact of genetic relationship information on genomic breeding values in german holstein cattle. Genetics Selection Evolution, 42(1):5, 2010.
[11]
B. Hayes, P. Bowman, A. Chamberlain, M. Goddard, et al. Invited review: Genomic selection in dairy cattle: Progress and challenges. J. of Dairy Science, 92(2):433, 2009.
[12]
B. Hayes, P. Bowman, A. Chamberlain, K. Verbyla, M. Goddard, et al. Accuracy of genomic breeding values in multi-breed dairy cattle populations. Genetics Selection Evolution, 41(1):51, 2009.
[13]
J. Li and T. Jiang. Computing the minimum recombinant haplotype configuration from incomplete genotype data on a pedigree by integer linear programming. J. of Computational Biology, 12(6):719--739, 2005.
[14]
S. Li, H. Wang, A. Smith, B. Zhang, X. Zhang, G. Schoch, et al. Predicting multiallelic genes using unphased and flanking single nucleotide polymorphisms. Genetic Epidemiology, 35(2):85--92, 2011.
[15]
N. Littlestone and M. Warmuth. The weighted majority algorithm. In Foundations of Computer Science, 1989., 30th Annual Symposium on, pages 256--261. IEEE, 1989.
[16]
M. McCarthy, G. Abecasis, L. Cardon, D. Goldstein, J. Little, J. Ioannidis, and J. Hirschhorn. Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nature Reviews Genetics, 9(5):356--369, 2008.
[17]
V. Pacheco Contreras, D. Lourenco Jaramillo, G. Parra Bracamonte, J. Martínez González, and A. Sifuentes Rinćon. Convenient genotyping of nine bovine κ-casein variants. Electronic J. of Biotechnology, 14(4):12--12, 2011.
[18]
Y. Pirola, P. Bonizzoni, and T. Jiang. An efficient algorithm for haplotype inference on pedigrees with recombinations and mutations. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 9(1):12--25, 2012.
[19]
Y. Pirola, G. Della Vedova, S. Biffani, A. Stella, and P. Bonizzoni. A fast and practical approach to genotype phasing and imputation on a pedigree with erroneous and incomplete information. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 9(6):1582--1594, Nov.--Dec. 2012.
[20]
M. Pszczola, T. Strabel, H. Mulder, and M. Calus. Reliability of direct genomic values for animals with different relationships within and to the reference population. J. of Dairy Science, 95(1):389--400, 2012.
[21]
M. Setty, A. Gusev, and I. Pe'er. HLA type inference via haplotypes identical by descent. J. of Computational Biology, 18(3):483--493, 2011.
[22]
A. Summer, E. Santus, L. Casanova, H. Joerg, A. Rossoni, C. Nicoletti, et al. Short communication: Characterization of a monoclonal antibody for κ-casein b of cow's milk. J. of Dairy Science, 93(2):796--800, 2010.
[23]
P. VanRaden. Efficient methods to compute genomic predictions. J. of Dairy Science, 91(11):4414--4423, 2008.
[24]
C. Walsh, T. Guinee, W. Reville, D. Harrington, J. Murphy, B. O'Kennedy, and R. FitzGerald. Influence of κ-casein genetic variant on rennet gel microstructure, cheddar cheesemaking properties and casein micelle size. International Dairy Journal, 8(8):707--714, 1998.
[25]
M. Xie, J. Li, and T. Jiang. Accurate HLA type inference using a weighted similarity graph. BMC Bioinformatics, 11(Suppl 11):S10, 2010.
[26]
K. Zhang and H. Zhao. A comparison of several methods for haplotype frequency estimation and haplotype reconstruction for tightly linked markers from general pedigrees. Genetic Epidemiology, 30(5):423--437, 2006.
[27]
X. Zhang, S. Li, H. Wang, J. Hansen, and L. Zhao. Empirical evaluations of analytical issues arising from predicting HLA alleles using multiple SNPs. BMC Genetics, 12(1):39, 2011.

Cited By

View all
  • (2018)Genetic Improvement in South African Livestock: Can Genomics Bridge the Gap Between the Developed and Developing Sectors?Frontiers in Genetics10.3389/fgene.2018.003319Online publication date: 23-Aug-2018
  • (2018)Genome-Wide Association Study for Susceptibility to and Recoverability From Mastitis in Danish Holstein CowsFrontiers in Genetics10.3389/fgene.2018.001419Online publication date: 24-Apr-2018
  • (2016)Use of SNP genotypes to identify carriers of harmful recessive mutations in cattle populationsBMC Genomics10.1186/s12864-016-3218-917:1Online publication date: 3-Nov-2016
  • Show More Cited By

Index Terms

  1. Haplotype-based prediction of gene alleles using pedigrees and SNP genotypes

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      BCB'13: Proceedings of the International Conference on Bioinformatics, Computational Biology and Biomedical Informatics
      September 2013
      987 pages
      ISBN:9781450324342
      DOI:10.1145/2506583
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 22 September 2013

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Computational biology
      2. SNP
      3. genotypes
      4. haplotype
      5. pedigree

      Qualifiers

      • Tutorial
      • Research
      • Refereed limited

      Conference

      BCB'13
      Sponsor:
      BCB'13: ACM-BCB2013
      September 22 - 25, 2013
      Wshington DC, USA

      Acceptance Rates

      BCB'13 Paper Acceptance Rate 43 of 148 submissions, 29%;
      Overall Acceptance Rate 254 of 885 submissions, 29%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 03 Mar 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2018)Genetic Improvement in South African Livestock: Can Genomics Bridge the Gap Between the Developed and Developing Sectors?Frontiers in Genetics10.3389/fgene.2018.003319Online publication date: 23-Aug-2018
      • (2018)Genome-Wide Association Study for Susceptibility to and Recoverability From Mastitis in Danish Holstein CowsFrontiers in Genetics10.3389/fgene.2018.001419Online publication date: 24-Apr-2018
      • (2016)Use of SNP genotypes to identify carriers of harmful recessive mutations in cattle populationsBMC Genomics10.1186/s12864-016-3218-917:1Online publication date: 3-Nov-2016
      • (2015)Challenges and opportunities in genetic improvement of local livestock breedsFrontiers in Genetics10.3389/fgene.2015.000336Online publication date: 25-Feb-2015
      • (2015)On the Fixed Parameter Tractability and Approximability of the Minimum Error Correction ProblemCombinatorial Pattern Matching10.1007/978-3-319-19929-0_9(100-113)Online publication date: 16-Jun-2015

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media