Abstract
A correlated probit model approximation for conditional probabilities (Mendell and Elston 1974) is used to estimate, by maximum likelihood, the variance for binary matched pairs data. Using asymptotic data, the bias of the estimates is shown to be small for a wide range of intra-class correlations and incidences. The approximation is also compared with other improved approximations that have recently been published or implemented. For the small sample examples presented, it shows a substantial advantage over these alternatives. The method is extended to allow covariates for each observation and to allow fitting by iteratively reweighted least squares.
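As a rough illustration of the kind of approximation the abstract refers to, the sketch below implements the commonly cited bivariate form of the Mendell and Elston approximation: the conditional distribution of the second standard normal liability, given that the first exceeds its threshold, is replaced by a normal distribution with the same mean and variance. The function names (`norm_pdf`, `norm_sf`, `mendell_elston_joint`, `exact_joint_zero_threshold`) and the parameterisation by thresholds `h1`, `h2` and correlation `r` are assumptions made for this example. It is not the authors' implementation and does not perform the maximum likelihood estimation of the variance.

```python
import math


def norm_pdf(x):
    """Standard normal density."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def norm_sf(x):
    """Standard normal upper tail probability P(Z > x)."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))


def mendell_elston_joint(h1, h2, r):
    """Approximate P(X1 > h1, X2 > h2) for standard bivariate normal
    liabilities with correlation r (illustrative sketch only)."""
    p1 = norm_sf(h1)                  # marginal P(X1 > h1)
    a1 = norm_pdf(h1) / p1            # E[X1 | X1 > h1]
    cond_mean = r * a1                # E[X2 | X1 > h1]
    # Var[X2 | X1 > h1] = 1 - r^2 * a1 * (a1 - h1)
    cond_var = 1.0 - r * r * a1 * (a1 - h1)
    # Treat X2 | X1 > h1 as normal with the moments above.
    p2_given_1 = norm_sf((h2 - cond_mean) / math.sqrt(cond_var))
    return p1 * p2_given_1


def exact_joint_zero_threshold(r):
    """Exact P(X1 > 0, X2 > 0), available in closed form at zero thresholds."""
    return 0.25 + math.asin(r) / (2.0 * math.pi)


if __name__ == "__main__":
    for r in (0.0, 0.25, 0.5, 0.75):
        approx = mendell_elston_joint(0.0, 0.0, r)
        exact = exact_joint_zero_threshold(r)
        print(f"r={r:.2f}  approx={approx:.4f}  exact={exact:.4f}")
```

At zero correlation the approximation reduces to the product of the two marginal probabilities. At zero thresholds it can be checked against the closed-form orthant probability, which gives a quick sense of how the error grows with the correlation.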
References
Breslow N.E. and Clayton D.G. 1993. Approximate inference in generalized linear mixed models. Journal of the American Statistical Association 88: 9–25.
Breslow N.E. and Lin X. 1995. Bias correction in generalized linear mixed models with a single component of dispersion. Biometrika 82: 81–91.
Browne W.J. and Draper D. 2002. A comparison of Bayesian and likelihood methods for fitting multilevel models. Journal of the Royal Statistical Society A (forthcoming).
Engel B. and Keen A. 1994. A simple approach for the analysis of generalized linear mixed models. Statistica Neerlandica 48: 1–22.
Engel B. and Buist W.G. 1997. Bias reduction of heritability estimates in threshold models. Ph.D. Thesis, Agricultural University, Wageningen, 143–151.
Engel B., Buist W.G., and Visscher A. 1995. Inference for threshold models with variance components from the generalized linear mixed model perspective. Genetics Selection Evolution 27: 15–32.
Gilmour A.R., Anderson R.D. and Rae A.L. 1985. The analysis of binomial data by a generalized linear mixed model. Biometrika 72: 593–599.
Goldstein H. 1991. Nonlinear multilevel models with an application to discrete response data. Biometrika 76: 622–623.
Goldstein H. 1997. Consistent estimates for multilevel generalised linear models using an iterated bootstrap. Multilevel Modelling Newsletter 8(1): 3–6.
Goldstein H. and Rasbash J. 1996. Improved approximations for multilevel models with binary responses. Journal of the Royal Statistical Society A 159: 505–513.
Goldstein H., Rasbash J., Plewis I., Draper D., Browne W., Yang M., Woodhouse G. and Healy M. 1998. A user's guide to MLwiN, multilevel models project. Institute of Education, London.
Hinde J.P. 1982. Compound Poisson regression models. In Gilchrist R. (Ed.), GLIM82, Springer-Verlag, New York.
Hoeschele I. and Tier B. 1995. Estimation of variance components of threshold characters by marginal posterior modes and means via Gibbs sampling. Genetics Selection Evolution 27: 519–540.
Kuk A.Y.C. 1995. Asymptotically unbiased estimation in generalized linear models with random effects. Journal of the Royal Statistical Society B 57: 395–407.
Lee Y. and Nelder J.A. 1996. Hierarchical generalized linear models. Journal of the Royal Statistical Society B 58: 619–678.
McCulloch C.E. 1994. Maximum likelihood variance components estimation for binary data. Journal of the American Statistical Association 89: 330–335.
McGilchrist C.A. 1994. Estimation in generalized mixed models. Journal of the Royal Statistical Society B 56: 61–69.
Mendell N.R. and Elston R.C. 1974. Multifactorial qualitative traits: Genetic analysis and prediction of recurrence risks. Biometrics 30: 41–57.
Molenberghs G., Fitzmaurice G.M. and Lipsitz S.R. 1996. Efficient estimation of the intraclass correlation for a binary trait. Journal of Agricultural, Biological, and Environmental Statistics 1: 78–96.
Moreno C., Sorensen D., García-Cortés L.A., Varona L. and Altarriba J. 1997. On biased inferences about variance components in the binary threshold model. Genetics Selection Evolution 29: 145–160.
Nelder J.A. 1993. The K system for GLMs in Genstat. Technical Report TRI/93. Numerical Algorithms Group, Oxford.
Ochi Y. and Prentice R.L. 1984. Likelihood inference in a correlated probit regression model. Biometrika 71: 531–543.
Petherick J.C., Seawright E., Waddington D., Duncan I.J.H. and Murphy L.B. 1995. The role of perception in the causation of dustbathing behaviour in domestic fowl. Animal Behaviour 49: 1521–1530.
Raudenbush S.W., Yang M-L. and Yosef M. 2000. Maximum likelihood for hierarchical models via high-order multivariate Laplace approximations. Journal of Computational and Graphical Statistics, forthcoming.
Rodriguez G. and Goldman N. 1995. An assessment of estimation procedures for multilevel models with binary responses. Journal of the Royal Statistical Society A 158: 73–89.
Schall R. 1991. Estimation in generalized linear models with random effects. Biometrika 78: 719–727.
Thompson R. and Baker R.J. 1981. Composite link functions in generalised linear models. Journal of the Royal Statistical Society C 30: 125–131.
Waddington D., Welham S.J., Gilmour A.R. and Thompson R. 1994. Comparisons of some GLMM estimators for a simple binomial model. Genstat Newsletter 30: 13–24.
Welham S.J. 1993. Procedure GLMM. In: R.W. Payne and G.M. Arnold (Eds.), Genstat 5 Procedure Library Manual, Release 2[3]. Numerical Algorithms Group, Oxford.
Williams D.A. 1988. Extra-binomial variation in toxicology. Proceedings of the Fourteenth International Biometric Conference, Namur, Belgium, 301–313.
Zeger S.L. and Karim M.R. 1991. Generalized linear models with random effects: A Gibbs sampling approach. Journal of the American Statistical Association 86: 79–86.
Cite this article
Waddington, D., Thompson, R. Using a correlated probit model approximation to estimate the variance for binary matched pairs. Statistics and Computing 14, 83–90 (2004). https://doi.org/10.1023/B:STCO.0000021406.25797.98