Weighted empirical likelihood estimates and their robustness properties

https://doi.org/10.1016/j.csda.2006.07.032Get rights and content

Abstract

Maximum likelihood methods are by far the most popular methods for deriving statistical estimators. However, parametric likelihoods require distributional specifications. The empirical likelihood is a nonparametric likelihood function that does not require such distributional assumptions, but is otherwise analogous to its parametric counterpart. Both likelihoods assume that the random variables are independent with a common distribution. A nonparametric likelihood function for data that are independent, but not necessarily identically distributed is introduced. The contaminated normal density is used to compare the robustness properties of weighted empirical likelihood estimators to empirical likelihood estimators. It is shown that as the contamination level of the sample increases, the root mean squared error of the empirical likelihood estimator for the mean increases. Conversely, the root mean squared error of the weighted empirical likelihood estimator for the mean remains closer to the theoretical root mean squared error.

Introduction

Likelihood-based methods are effective in finding efficient estimators, constructing tests with good power properties, and quantifying uncertainty through confidence intervals and confidence regions (Owen, 2001). Nevertheless, when the distribution is misspecified, parametric likelihood-based estimates are inefficient. In this case, the empirical likelihood is preferred because it does not require such distributional assumptions. Weighted empirical likelihood requires even fewer distributional assumptions since it relaxes the identical distributed assumption. Hence, it can offset problems arising from auxiliary information such as contamination by modifying the constraints or objective function. We develop weighted empirical likelihood for point estimators which extends to confidence intervals, confidence regions, and likelihood-based methods.

Empirical likelihood methods extend classical maximum likelihood methods for random samples from a common distribution of known functional form to the situation where the form of the distribution is unknown. In many applications, the observations may be from different distributions having a common mean. For example, if two types of measuring instruments have different variabilities and these instruments are used to obtain data, a fraction of these data comes from a more variable distribution. Classical robustness considered the problem of estimating the mean in one such case. In particular, Tukey (1960) introduced the contaminated normal family of densities CNγ,σ2:fγ,σ(x)=12π(1-γ)e-x2/2+1σ2πγe-x2/2σ2,where γ, 0γ1 is the contamination parameter, and σ is the scale parameter. The robustness implications of contaminated data were studied by Lehmann (1983), Gastwirth and Cohen (1970), Andrews and Mallows (1974), and more recently by Taskinen et al. (2003). They also arise in regression (Sinha and Wiens, 2002), and financial data (Ellis et al., 2003).

We compare the properties of the weighted empirical likelihood estimate to those of the usual empirical likelihood, the trimmed mean, and the Winsorized mean in the context of robust estimation. Results indicate that the weighted empirical likelihood estimate is somewhat more robust to contamination than the usual empirical likelihood estimate. As the sample size increases, weighted empirical likelihood estimators are comparable to the trimmed mean and the Winsorized mean. The trimmed mean and Winsorized mean's disadvantages are that the amount of trim and the number of observations replaced are somewhat arbitrary.

In Section 2, we review the empirical likelihood approach and introduce the weighted empirical likelihood for the mean. The major difference between weighted empirical likelihood and empirical likelihood is that the former incorporates a weight vector that weighs each observation's contribution to the likelihood function. Section 3 details how to obtain the weight vector, presents the nonlinear programming problem, then solves it to obtain the weighted empirical likelihood estimator for the mean. Section 4 explores the properties of the weighted empirical likelihood estimator when estimating the mean in the contaminated normal model. Concluding remarks are presented in Section 5.

Section snippets

Empirical likelihood

For a random sample X=X1,X2,,Xn of size n, the parametric likelihoodL(θ)=Lθ;X1,X2,,Xn,θΘ=i=1nfXi;θifX1,X2,,Xnare independentis a function of θ which takes values in the space Θ. When the density function f(X;θ) is unknown one can use the empirical likelihood method (Owen, 1988, Owen, 2001).

Definition 2.1

Suppose X1,X2,,Xn is a random sample from an unknown distribution. Let the parameter θ denote the mean. Suppose pi is the probability mass placed on Xi, i=1npi=1, pi0. Let t(p)=i=1npiXi denote the

The weight vector w

Consider the case where σ>1. We use the data-labeling rule of Tietjen and Moore (1972) to detect the contamination. After detecting the contamination we define the weight vectorw=wb+cd,where

  • wb=(1/n,,1/n)

  • c is a scalar

  • d is a vector of unit length.

Theorem 3.1

If X=X1,X2,,Xn is an sample of size n, and k is the number of points labeled as contamination, then d is the vector of unit length defined asd=1knn-ky,where y is the rank ordered vector defined as kn-kkn-k-1-1.The rank ordered vector y contains n-k

Empirical results

The root mean squared error of an estimator, the square root of the average squared difference between an estimator and a parameter, is a good measure of the performance of an estimator. To compare robustness properties of the weighted empirical likelihood estimator for the mean to those of the empirical likelihood estimator for the mean, trimmed mean, and Winsorized mean, we compute point estimators for the mean for the stated estimators for various sample sizes. We then compare the root mean

Discussion

We construct the weighted empirical likelihood function, a nonparametric likelihood function that is suitable for data that are independent but not necessarily identically distributed. We compare the robustness properties of weighted empirical likelihood to those of empirical likelihood, trimmed mean, and Winsorized mean by comparing the root mean squared errors of the mean for each method. Using the contaminated normal distribution, we showed that as the level of contamination increases, the

Acknowledgments

The authors would like to thank Professor Joseph Gastwirth of the Department of Statistics, George Washington University for suggestions regarding robustness. The authors also thank Professors David W. Scott and Katherine Ensor of the Department of Statistics, Rice University, and Professor Yin Zhang of the Department of Computational and Applied Mathematics, Rice University.

References (13)

  • S. Taskinen et al.

    Sign test of independence between two independent random vectors

    Statist. Probab. Lett.

    (2003)
  • D.F. Andrews et al.

    Scale mixtures of normal distributions

    J. Roy. Statist. Soc. Ser. B

    (1974)
  • E. Choi et al.

    Rendering parametric procedures more robust by empirically tilting the model

    Biometrika

    (2000)
  • S. Ellis et al.

    Fitting a pareto-normal-pareto distribution to the residuals of financial data

    Comput. Statist.

    (2003)
  • J. Gastwirth et al.

    Small sample behavior of some robust linear estimators of location

    J. Amer. Statist. Assoc.

    (1970)
  • Glenn, N., 2002. Robust empirical likelihood. Ph.D. Thesis, Rice University, Houston,...
There are more references available in the full text version of this article.

Cited by (0)

View full text