On estimation and influence diagnostics for the Grubbs’ model under heavy-tailed distributions
Introduction
The problem of comparing the precision and accuracy of different measuring instruments may appear in various scientific applications like engineering (Grubbs, 1948, Grubbs, 1973) and medicine (Barnett, 1969). Taking measurements of the same unknown characteristic from different individuals or experimental units has been the usual way for comparing the instruments. These may differ in some aspects such as cost, speed and convenience. The relative quality in the measurements is evaluated considering the precision and bias of the different instruments.
The assessment of robustness aspects of the parameter estimates in statistical models has been an important concern of various researchers in the last decades. The deletion methodology, which consists of studying the impact on the parameter estimates after dropping individual observations, is probably the most employed technique to detect influential observations (see, for example, Cook and Weisberg (1982) and Chatterjee and Hadi (1988)). Nevertheless, the local influence procedure (Cook, 1986), that investigates the influence of small perturbations in the model/data on the parameter estimates, has received increasing attention in the last 20 years, mainly due to its’ flexibility in constructing different kinds of graphics and its’ applicability in various statistical models (see discussion in Cook (1997)). In particular, Galea et al. (2002) and Lachos et al. (2007) applied the methodology in normal comparative calibration and Grubbs’ models, notifying under some usual perturbation schemes the well known lack of robustness of the least-squares estimates against outlying observations.
Several methodologies have been proposed to attenuate the influence of outlying observations on the parameter estimates under normality, such as modifications of the least-squares methodology (see, for instance, Huber (1981)). Other approaches that assume heavy-tailed error distributions, for which the maximum likelihood estimates appear to be robust against extreme observations, have been proposed (see, for example, Galea et al. (2005)). In this work, we will assume scale mixtures of normal distributions (Andrews and Mallows, 1974) for the accommodation of extreme and outlying observations in the Grubbs’ model. Properties of distributions in this class, such as Student-t, power exponential and contaminated normal may be found in Andrews and Mallows (1974) and Lange and Sinsheimer (1993). In this paper, scale mixtures of normal distributions are assumed for the Grubbs’ model, and the hierarchical representation proposed by Pinheiro et al. (2001) is considered. Our aim is to apply the local influence method in the Grubbs’ model under heavy-tailed distributions in order to assess the influence of minor perturbations on the model/data, our results are generalizations of the results obtained by Lachos et al. (2007). The rest of the paper is organized as follows: In Section 2 some inferential aspects are discussed and an EM-type algorithm is developed for the parameter estimation; Section 3 introduces the local influence methodology (Cook, 1986, Zhu and Lee, 2001). The normal curvature for some usual perturbation schemes is derived in Section 4. The methodology is illustrated in Section 5 in which Grubbs’ models under normal and scale mixtures of normal distributions are compared according to the robustness aspects of the maximum likelihood estimates. Finally, some concluding remarks are given in Section 6.
Section snippets
Model description
Grubbs, 1948, Grubbs, 1973, Grubbs, 1983 proposes a linear model for comparing different instruments, in which the characteristic of the th experimental unit is measured once by all the instruments. The model assumes the form
where denotes the measurement of the th instrument for the th experimental unit and is called additive bias. The measurement errors are assumed to be independent of the random variables . In addition, one has that
Local influence
The aim of local influence (Cook, 1986) is to investigate the behavior of some influence measure when small perturbations are made into the model/data, where is a -dimensional vector of perturbations restricted to some open subset . In this work we assess local influence by using an appropriate measure based on the complete log-likelihood function and particularly recommended for incomplete data.
Let and be the perturbed log-likelihood functions for observed and
Curvature derivation
In this section we will derive the normal curvature for the Grubbs’ model by considering the hierarchical formulation given in (5). We will compute and by using results of matrix differentiation described in Magnus and Neudecker (1988). Details on the differential calculations for the matrices and under different perturbation schemes are given in Appendix B.
Application
We consider Grubbs’ model given in (2) with the following hierarchical formulation: where , and denote the distribution function for the mixture variable , .
In our analysis we suppose that the mixture variables follows a Gamma distribution, Beta, discrete and point mass in , that is, the marginal response follows a Student-t, slash, contaminated normal and normal distribution,
Concluding remarks
In this work, we have discussed the parameter estimation in the Grubbs’ model under a class of distributions that presents heavier tails than the normal ones. Through a local influence study some aspects of robustness of the maximum likelihood estimators under the scale mixture of normal distributions were noted. Explicit expressions are obtained for matrix under different perturbation schemes considered. It is noted, however that other perturbation schemes can be considered in analogous way.
Acknowledgements
This work was partially supported by the FONDECYT grants 11075071 and 1070919, Chile and CNPq and FAPESP, Brazil. The authors thank the Associate Editor and a referee for valuable suggestions.
References (40)
- et al.
Influence analysis of nonlinear mixed-effects models
Computational Statistics & Data Analysis
(2004) - et al.
Assessment of local influence in elliptical linear models with longitudinal structure
Computational Statistics & Data Analysis
(2007) - et al.
Inference about comparative precision in linear structural relationships
Journal of Statistical Planning and Inference
(1986) - et al.
Scale mixtures of normal distributions
Journal of the Royal Statistical Society, Series B
(1974) Simultaneous pairwise linear structural relationships
Biometrics
(1969)An efficient scores test for comparing several measuring devices
Journal of Quality Technology
(2001)- et al.
Structural comparative calibration using the EM algorithm
Journal of Applied Statistics
(1995) - et al.
One structural comparative calibration under a t-model
Computational Statistics
(1996) - et al.
Sensitivity Analysis in Linear Regression
(1988) - et al.
Test for precision and accuracy of multiple measuring devices
Technometrics
(1993)
Assessment of local influence (with discussion)
Journal of the Royal Statistical Society, Ser. B
Residuals and Influence in Regression
Maximum likelihood from incomplete data via the EM algorithm (with discussion)
Journal of the Royal Statistical Society, Ser. B
Symmetric Multivariate and Related Distributions
Multivariate Student-t regression models: Pitfalls and inference
Biometrika
Local influence in comparative calibration models
Biometrical Journal
Local influence in comparative calibration models under elliptical t-distributions
Biometrical Journal
On estimating precision of measuring instruments and product variability
Journal of the American Statistical Association
Cited by (24)
Conditional likelihood inference in a heteroscedastic functional measurement error model
2023, Chemometrics and Intelligent Laboratory SystemsOutliers detection method of multiple measuring points of parameters in power plant units
2015, Applied Thermal EngineeringCitation Excerpt :Based on the research of Grubbs, the distribution of test statistics for outlier detection drawn from heavy-tailed distributions was studied in literature [26], which extend classical results of outlier test statistics for the finite-variance case to the heavy-tailed infinite variance case. And literature [27] indicates Grubbs' model is based on the assumption that random terms have a normal distribution, and Osorio et al. proposed robustness aspects of parameter estimates to identify outliers under heavy-tailed distributions. Facing these conclusions, robust statistical outlier detection methods are more applicable for parameters in thermal system with small number of measuring points, because few or no assumptions about the distribution of data set is needed to make, and robust methods do not rely on distribution parameters.
Robust linear functional mixed models
2015, Journal of Multivariate AnalysisCitation Excerpt :Ultrastructural, structural and functional elliptical measurement error models are investigated in [7,3,2,39]. It is also the case that recent statistical literature has experienced growing interest in elliptical generalizations of normal models [34,4], perhaps influenced by the development of more efficient computational procedures. The paper is organized as follows.
On estimation of a heteroscedastic measurement error model under heavy-tailed distributions
2012, Computational Statistics and Data AnalysisInfluence analyses of skew-normal/independent linear mixed models
2010, Computational Statistics and Data AnalysisMultivariate Regression with Stable Errors Using Order Statistics
2022, Fluctuation and Noise Letters