Abstract
In this paper, we study doubly robust estimation and robust empirical likelihood of regression parameter for generalized linear models with missing responses. A doubly robust estimating equation is proposed to estimate the regression parameter, and the resulting estimator has consistency and asymptotic normality, regardless of whether the assumed model contains the true model. A robust empirical log-likelihood ratio statistic for the regression parameter is constructed, showing that the statistic weakly converges to the standard \(\chi ^2\) distribution. The result can be directly used to construct the confidence region of the regression parameter. A method for selecting the tuning parameters of \(\psi \)-function is also given. Simulation studies show the robustness of the estimator of the regression parameter and evaluate the performance of the robust empirical likelihood method. A real data example shows that the proposed method is feasible.
Similar content being viewed by others
References
Arnold, S.F.: The Theory of Linear Models and Multivariate Analysis. Wiley, New York (1981)
Bang, H., Robins, J.M.: Doubly robust estimation in missing data and causal inference models. Biometrics 61, 962–973 (2005)
Cantoni, E., Ronchetti, E.: Robust inference for generalized linear models. J. Am. Stat. Assoc. 96, 1022–1030 (2001)
Kim, J., Basak, J.M., Holtzman, D.M.: The role of apolipoprotein E in Alzheimer’s disease. Neuron 63, 287–303 (2009)
Little, R.J.A., Rubin, D.B.: Statistical Analysis With Missing data. John Wiley & Sons Inc, New York (1987)
Liu, C.C., Kanekiyo, T., Xu, H.X., Bu, G.J.: Apolipoprotein E and Alzheimer disease: risk, mechanisms, and therapy. Nat. Rev. Neurol. 9, 106–118 (2013)
Lo, S.N., Ronchetti, E.: Robust and accurate inference for generalized linear models. J. Multivar. Anal. 100, 2126–2136 (2009)
Moustaki, I., Victoria-Feser, M.P.: Bounded-influence robust estimation in generalized linear latent variable models. J. Am. Stat. Assoc. 101, 644–653 (2006)
Noh, M., Lee, Y.: Robust modeling for inference from generalized linear model classes. J. Am. Stat. Assoc. 102, 1059–1072 (2007)
Orsini, N., Bellocco, R., Sjolander, A.: Doubly robust estimation in generalized linear models. Stata J. 13, 185–205 (2013)
Owen, A.B.: Empirical likelihood ratio confidence regions. Ann. Stat. 18, 90–120 (1990)
Qin, J., Lawless, J.: Empirical likelihood and general estimating equations (in likelihood and related topics). Ann. Stat. 22, 300–325 (1994)
Robins, J.M., Rotnitzky, A.: Discussion on “Celebrating the new millennium’’ by Bickel, P. J. and Kwon, J. Stat. Sin. 11, 920–926 (2001)
Varatharajah, Y., Ramanan, V.K., Iyer, R., Vemuri, P.: Predicting short-term MCI-to-AD progression using imaging, CSF, genetic factors, cognitive resilience, and demographics. Sci. Rep. 9(1), 2235 (2019)
Welsh, A.H.: On M-processes and M-estimation. Ann. Stat. 17, 337–361 (1989). [Correction 18, 1500 (1990)]
Xue, D., Xue, L.G., Cheng, W.H.: Empirical likelihood for generalized linear models with missing responses. J. Stat. Plan. Inference 141, 2007–2020 (2011)
Acknowledgements
The author gratefully acknowledges Associate Editor and one anonymous referee for their helpful comments which improved the presentation of the manuscript. The research was supported by the National Natural Science Foundation of China (11971001). Data used was obtained from the data base of Alzheimer’s Disease Neuroimaging Initiative and was provided by Dr. Chunling Li of the Hong Kong Polytechnic University.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Appendix. Proofs of Theorems
Appendix. Proofs of Theorems
In this appendix, we prove Theorems 1 and 2. The following Lemmas 1–4 are useful for proving these Theorems. Their proofs are given in the supplementary material.
Lemma 1
Suppose that conditions (C1)–(C7) hold. Then
where \({{{\mathcal {B}}}}_n=\{\beta |\,\Vert \beta -\beta _0\Vert \le cn^{-1/2}\}\) for a constant \(c>0\), \({\widehat{Q}}(\beta )\) is defined by (2.6),
and \(r_i(\beta )\) is defined by (2.1).
Lemma 2
Suppose that conditions (C1)–(C7) hold. Then
where \({\widehat{Q}}(\beta _0)\), \(Q_n(\beta _0)\) and \(B(\beta _0)\) are defined in (2.6), Lemma 1 and Theorem 1, respectively.
Lemma 3
Suppose that conditions (C1)–(C7) hold. Then
where \({\hat{\eta _i}}(\beta _0)\) and \(B(\beta _0)\) are defined in (2.9) and Theorem 1, respectively.
Lemma 4
Suppose that conditions (C1)–(C7) hold. Then
where \({\hat{\eta _i}}(\beta _0)\) is defined in (2.9)
Now we turn back to prove Theorems 1 and 2.
Proof of Theorem 1
We now prove the asymptotic normality of \({{\hat{\beta }}}\). The proof is divided into two steps: step (I) provides the existence of the estimator \({{\hat{\beta }}}\), and step (II) proves the asymptotic normality of \({{\hat{\beta }}}\).
(I) The existence of the estimator of \(\beta _0\). We prove the following fact: Under conditions (C1)–(C8) and with probability one there exists an estimator of \(\beta _0\) solving the estimating equation (2.6) in \({{{\mathcal {B}}}}_n^*\), where \({{{\mathcal {B}}}}_n^*=\big \{\beta |\, \Vert \beta -\beta _0\Vert =Mn^{-1/2}\big \}\) for some positive constant M. From Lemma 2 we obtain that uniformly for \(\beta \in {{{\mathcal {B}}}}_n^*\),
where \(A(\beta _0)\) is defined in Theorem 1. Therefore, we have, uniformly for \(\beta \in {{{\mathcal {B}}}}_n^*\),
We note that the above formula is dominated by the term \(\sim M^2\) because \(\sqrt{n}\Vert \beta -\beta _0\Vert =M\), whereas \(|n(\beta -\beta _0)^TQ_n(\beta _0)|=MO_P(1)\), and \(n(\beta -\beta _0)A(\beta _0)(\beta -\beta _0)\sim M^2\). So, for any given \(\eta >0\), if M is chosen large enough, then it will follows that uniformly for \(\beta \in {{{\mathcal {B}}}}_n^*\), \(n(\beta -\beta _0){\widehat{Q}}(\beta )<0\) on an event with probability \(1-\eta \). From the arbitrariness of \(\eta \), we can prove the existence of the estimator of \(\beta _0\) in \({{{\mathcal {B}}}}_n^*\) as in the proof of Theorem 5.1 of Welsh (1989). The details are omitted.
(II) The asymptotic normality. From step (I) we find that \({{\hat{\beta }}}\) is a solution in \({{{\mathcal {B}}}}_n^*\) to the equation \({\widehat{Q}}(\beta )=0\), namely \({\widehat{Q}}({{\hat{\beta }}})=0\), where \({\widehat{Q}}(\beta )\) is defined in (2.6). From (A.1) we have
and hence
Theorem 1 follows from (A.2), Lemma 2 and Slutsky’s theorem. \(\square \)
Proof of Theorem 2
By the Lagrange multiplier method, \({\hat{l}}(\beta _0)\) can be represented as
where \(\lambda =\lambda (\beta )\) is a \(d\times 1\) vector given as the solution to
By Lemmas 2–4, and using the same arguments as are used in the proof of (2.8) in Owen (1990), we can show that
Applying the Taylor formula to (A.3), and invoking Lemmas 2–4 and (A.4), we get
Note that \(h(\lambda )=0\). It follows that
This together with Lemmas 2–4 and (A.4) proves that
and
Therefore, from (A.5)–(A.7) we have
where \({\widehat{Q}}(\beta _0)\) is defined in (2.6). This, together with (A.8), Lemmas 2 and 3 as well as Slutsky’s theorem, proves Theorem 2. \(\square \)
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Xue, L. Doubly robust estimation and robust empirical likelihood in generalized linear models with missing responses. Stat Comput 34, 39 (2024). https://doi.org/10.1007/s11222-023-10347-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11222-023-10347-8