Detecting random-effects model misspecification via coarsened data

https://doi.org/10.1016/j.csda.2010.06.012Get rights and content

Abstract

Mixed effects models provide a suitable framework for statistical inference in a wide range of applications. The validity of likelihood inference for this class of models usually depends on the assumptions on random effects. We develop diagnostic tools for detecting random-effects model misspecification in a rich class of mixed effects models. These methods are illustrated via simulation and application to soybean growth data.

Introduction

Mixed effects models are widely used in statistical applications in biology, agriculture, sociology, and environmental science, where clustered data are often collected. Under the framework of mixed effects models, it is straightforward to make prediction at the inter-cluster level as well as the intra-cluster level. Davidian and Giltinan (2003) and McCulloch et al. (2008) provide a comprehensive survey of mixed effects models. A major concern in this realm lies in random-effects assumptions. Hartford and Davidian (2000), Heagerty and Kurland (2001), Agresti et al. (2004) and Litière et al., 2007, Litière et al., 2008 showed various adverse effects of erroneous random-effects assumptions on inference.

Semiparametric and nonparametric methods have been developed to relax the parametric assumptions on random effects. Davidian and Gallant (1993) used a flexible, smooth density to characterize the distribution of random effects. Fattinger et al. (1995) used spline functions to transform the random effects, initially assumed to be normal, so that the resulting random effects can follow an arbitrary distribution dictated by data. Lai and Shih (2003) took a nonparametric approach to estimate the distribution of random effects. The price one pays to avoid suspicious parametric assumptions by using these methods is often heavy computation. Some of these methods only provide discrete estimation for the random-effects distribution. This can be unsatisfactory when the distributional characteristics of random effects are of interest. Because a parsimonious parametric model can be appealing for its simplicity and potential gain in efficiency, techniques to check parametric assumptions on random effects in mixed effects models are valuable.

There are three main lines of development thus far for assessing the validity of model specification. The first line of work is based on the information matrix equivalence under a correct model (White, 1981, White, 1982). Focusing on the variance–covariance structure for the random effects in nonlinear mixed models (NLMM), Vonesh et al. (1996) compared two variance–covariance estimators that are equivalent under the correct model. Litière (2007) developed a series of tests motivated by the information matrix equivalence to diagnose random-effects misspecification in linear mixed models (LMM) and generalized linear mixed models (GLMM). The second line of development aims at obtaining an empirical distribution of the random effects. For example, Lange and Ryan (1989) estimated the cumulative distribution of random effects using the empirical Bayes estimates of individual random effects. Ritz (2004) extended their results and derived a weighted empirical process for the random effects. Waagepetersen (2006) simulated random effects from the conditional distribution of random effects given the observed data in GLMM. The third line of research has some similarity with the first line but is relatively underdeveloped. Instead of comparing two information matrices, one compares two estimators for the fixed effects, with one robust to model misspecification and the other sensitive to it. This idea was once prevalent in the econometrics community (Hausman, 1978) and was recently used by Tchetgen and Coull (2006) to construct a test for a specific form of GLMM.

These existing diagnostic methods are either too general to provide guidance for correction when misspecification is detected or too specific to allow immediate extension to complex models or to detect departures other than normality assumption. Admittedly, as shown by Verbeke and Molenberghs (submitted for publication), without assuming other component models in a hierarchical mixed effects model correctly specified, random-effects assumptions are unverifiable. With multiple uncertain component models, one can test the sensitivity of inference to multiple model assumptions but may not be able to single out the assumptions on random effects. If one has more confidence in the other component models than in the random effects, then it is possible to verify random-effects assumptions by studying the robustness of inference under these assumptions. This is the underlying philosophy of the methods discussed in this article. As a starting point, Huang (2009) provided empirical evidence that MLEs in a GLMM for binary response are not robust to data grouping in the presence of random-effects model misspecification. Motivated by this finding, she constructed diagnostic tests based on the discrepancy between the observed-data MLEs and the MLEs from the induced grouped data. Compared to the aforementioned existing methods, her tests can be more informative because the outcomes of the tests can depend on the source of model misspecification besides its existence.

In this article, we shed further light on the idea motivating Huang’s method, which is the content of Section 2. In Section 3, this idea is extended to a richer class of mixed effects models where the response is not binary. New test statistics that are computationally more efficient than those in Huang (2009) are defined in Section 4, where simulation studies are also presented. In Section 5, the diagnostic techniques are applied to a real data example. Concluding remarks and future research topics are given in Section 6.

Section snippets

Test statistics

Denote by Ω the p×1 vector of unknown parameters in the model, by Ω˜ and Ω˜c the limiting MLE based on the observed data and that based on the grouped data, respectively, where the limit is taken as the number of clusters, m, tends to infinity and the size of the cluster is bounded. Under a correct model, it is expected that Ω˜=Ω˜c, and erroneous model assumptions can result in Ω˜Ω˜c. Following this claim, Huang (2009) considered testing H0:Ω˜=Ω˜c, and proposed test statistics that compare the

Model specification

Now consider an NLMM with the intra-cluster model given by Yij=f(Xij,βi)+ϵij,i=1,,m,j=1,,ni, where Xij denotes the covariates associated with observation j in cluster i,βi is the cluster-specific parameter vector for cluster i,f() is a function known up to βi, and ϵij is the intra-cluster random errors. Define Xi=(Xi1T,,XiniT)T,ϵi=(ϵi1,,ϵini)T, and assume ϵi|βiN{0,Ri(βi,ξ)}, where ξ denotes parameters in the variance–covariance matrix of ϵi besides βi. The inter-cluster model is assumed

New test statistics

To lessen computational burden, we construct a new set of test statistics that is free of Ωˆc. This feature is especially appealing when fitting the model for the coarsened data is more numerically problematic than fitting the model for the observed data. The motivation of these test statistics is that, under H0, because both Ωˆ and Ωˆc are consistent, the solution to the normal score equation associated with the observed data should approximately solve the normal score equation for the

Application to soybean growth data

We now apply the diagnostic method to a soybean growth data set in Davidian and Giltinan (1995, Chapter 11, Section 11.2). In this study, from each of m=48 plots of soybean, the average leaf weight per plant is collected at different time points after planting. Let Yij denote the average leaf weight per plant in plot i at time tij, for i=1,,48 and j=1,,ni, where ni varies between 8 and 10. We posit the following growth model, Yij=βi11+exp{βi3(tij/10β2i)}+ϵij,i=1,,m,j=1,,ni, where βi=(βi1,βi

Discussion

We revisit the method for diagnosing random-effects model misspecification in GLMM for binary response proposed by Huang (2009) and further explore the theoretical driven force of the method. We then extend the method to a much richer class of mixed effects models for non-binary response and show that, using coarsened data, one can reveal random-effects model misspecification. For heteroscedastic mixed effects models, one may be able to identify the source of misspecification from the pattern

References (35)

  • P.J. Heagerty et al.

    Misspecified maximum likelihood estimates and generalized linear mixed models

    Biometrika

    (2001)
  • D.F. Heitjan et al.

    Ignorability and coarse data

    The Annals of Statistics

    (1991)
  • N.J. Horton et al.

    Goodness-of fit for GEE: an example with mental health service utilization

    Statistics in Medicine

    (1999)
  • D.W. Hosmer et al.

    Goodness of fit tests for the multiple logistic regression model

    Communications in Statistics, Theory and Method

    (1980)
  • X. Huang

    Diagnosis of random-effect model misspecification in generalized linear mixed models for binary response

    Biometrics

    (2009)
  • T.L. Lai et al.

    Nonparametric estimation in nonlinear mixed effects models

    Biometrika

    (2003)
  • N. Lange et al.

    Assessing normality in random effects models

    Annals of Statistics

    (1989)
  • Cited by (18)

    • Residual analysis of linear mixed models using a simulation approach

      2012, Computational Statistics and Data Analysis
      Citation Excerpt :

      Nobre and Singer also summarize areas of application for each type of residual defined for LMMs, e.g. checking linearity of effects, assessing the covariance structure for individual subjects, checking for outliers, and assessing normality and homoscedasticity of residuals. Recently, Huang (2011) extended a diagnosing-method for the misspecification of random-effects in generalized LMMs (GLMM) suitable for binary response and based on data-coarsening to a much wider class of GLMMs with non-binary response. The assumed normality of residual errors and normality of random effects may be assessed with quantile–quantile (QQ) plots (Pinheiro and Bates, 2000).

    • Generalized linear models with clustered data: Fixed and random effects models

      2011, Computational Statistics and Data Analysis
      Citation Excerpt :

      It is more restrictive regarding clustering complexity, but it can fit models with non-Gaussian (e.g., Cauchy or Logistic) distributions for the random intercepts. A method for detecting a random effects model misspecification (Huang, 2011) is useful in this context. With glmmML, the null hypothesis of no clustering can also be tested by bootstrapping.

    View all citing articles on Scopus
    View full text