Accurate higher-order likelihood inference on $$P(Y

Cortese, Giuliana; Ventura, Laura

doi:10.1007/s00180-012-0343-z

Accurate higher-order likelihood inference on $P(Y<X)$

Original Paper
Published: 17 July 2012

Volume 28, pages 1035–1059, (2013)
Cite this article

Computational Statistics Aims and scope Submit manuscript

Giuliana Cortese¹ &
Laura Ventura¹

306 Accesses
9 Citations
Explore all metrics

Abstract

The stress-strength reliability $R=P(Y<X)$, where $X$ and $Y$ are independent continuous random variables, has obtained wide attention in many areas of application, such as in engineering statistics and biostatistics. Classical likelihood-based inference about $R$ has been widely examined under various assumptions on $X$ and $Y$. However, it is well-known that first order inference can be inaccurate, in particular when the sample size is small or in presence of unknown parameters. The aim of this paper is to illustrate higher-order likelihood-based procedures for parametric inference in small samples, which provide accurate point estimators and confidence intervals for $R$. The proposed procedures are illustrated under the assumptions of Gaussian and exponential models for $(X,Y)$. Moreover, simulation studies are performed in order to study the accuracy of the proposed methodology, and an application to real data is discussed. An implementation of the proposed method in the R software is provided.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Estimation of $$P[Y<Z]$$ under Geometric-Lindley model

Article 07 January 2023

Bhupendra Singh, Amit Singh Nayal & Abhishek Tyagi

Estimation of the reliability parameter for a Poisson-exponential stress-strength model

Article 25 December 2023

Alessandro Barbiero

Statistical Inferences of $$R=P(X<Y)$$ for Exponential Distribution Based on Generalized Order Statistics

Article 24 April 2019

M. J. S. Khan & Bushra Khatoon

References

Bamber D (1975) The area above the ordinal dominance graph and the area below the receiver operating characteristic graph. J Math Psychol 12:387–415
Article MathSciNet MATH Google Scholar
Barndorff-Nielsen OE (1983) On a formula for the distribution of the maximum likelihood estimator. Biometrika 70:343–365
Article MathSciNet MATH Google Scholar
Barndorff-Nielsen OE (1991) Modified signed log likelihood ratio. Biometrika 78:557–563
Article MathSciNet MATH Google Scholar
Barndorff-Nielsen OE, Chamberlin S (1994) Stable and invariant adjusted directed likelihoods. Biometrika 81:485–499
Article MathSciNet MATH Google Scholar
Barndorff-Nielsen OE, Cox DR (1994) Inference and asymptotics. Chapman and Hall, London
MATH Google Scholar
Brazzale AR, Davison AC, Reid N (2007) Applied asymptotics. Case-studies in small sample statistics. Cambridge University Press, Cambridge
Book MATH Google Scholar
Efron B, Hinkley D (1978) Assessing the accuracy of the maximum likelihood estimator: observed versus expected Fisher information. Biometrika 65:457–482
Article MathSciNet MATH Google Scholar
Faraggi D (2000) The effect of random measurement error on receiver operating characteristic (ROC) curves. Stat Med 19:61–70
Article Google Scholar
Fraser DAS, Reid N, Wu J (1999) A simple general formula for tail probabilities for frequentist and Bayesian inference. Biometrika 86:249–264
Article MathSciNet MATH Google Scholar
Giummolé F, Ventura L (2002) Practical point estimation from higher-order pivots. J Stat Comput Simul 72:419–430
Article MathSciNet MATH Google Scholar
Guttman I, Johnson RA, Bhattacharyya GK, Reiser B (1988) Confidence limits for stress-strength models with explanatory variables. Technometrics 30:161–168
Article MathSciNet MATH Google Scholar
Hanley JA (1989) Receiver operating characteristic (ROC) methodology: the state of the art. Crit Rev Diagn Imaging 29:307–335
Google Scholar
Jiang L, Wong ACM (2008) A note on inference for $P(X<Y)$ for right truncated exponentially distributed data. Stat Pap 49:637–651
Article MathSciNet MATH Google Scholar
Kotz S, Lumelskii Y, Pensky M (2003) The stress-strength model and its generalizations. Theory and applications. World Scientific, Singapore
Book MATH Google Scholar
Newcombe RG (1998) Two-sided confidence intervals for the single proportion: comparison of seven methods. Stat Med 17:857–872
Article Google Scholar
Obuchowski NA, Lieber ML (1998) Confidence intervals for the receiver operating characteristic area in studies with small samples. Acad Radiol 5:561–571
Article Google Scholar
Pace L, Salvan A (1997) Principles of statistical inference. World Scientific, Singapore
MATH Google Scholar
Pace L, Salvan A (1999) Point estimation based on confidence intervals. J Stat Comput Simul 64:1–21
Article MathSciNet MATH Google Scholar
R Development Core Team (2010) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0, http://www.R-project.org
Reid N, Fraser DAS (2010) Mean likelihood and higher order approximations. Biometrika 97:159–170
Article MathSciNet MATH Google Scholar
Reiser B (2000) Measuring the effectiveness of diagnostic markers in the presence of measurement error through the use of ROC curves. Stat Med 19:2115–2129
Article Google Scholar
Reiser B, Faraggi D (1997) Confidence intervals for the generalized ROC criterion. Biometrics 53:644–652
Article MATH Google Scholar
Schisterman EF, Faraggi D, Reiser B (2004) Adjusting the generalized ROC curve for covariates. Stat Med 23:3319–3331
Article Google Scholar
Severini TA (1999) An empirical adjustment to the likelihood ratio statistic. Biometrika 86:235–247
Article MathSciNet MATH Google Scholar
Severini TA (2000) Likelihood methods in statistics. Oxford University Press, New York
MATH Google Scholar
Skovgaard IM (1996) A general large deviation approximation to one parameter tests. Bernoulli 2:145–165
Article MathSciNet MATH Google Scholar
Wolfe DA, Hogg RV (1971) On constructing statistics and reporting data. Am Stat 25:27–30
Article Google Scholar
Zweig MH, Campbell G (1993) Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine. Clin Chem 39:561–577
Google Scholar

Download references

Acknowledgments

This research was supported by grants from MIUR, and from University of Padova (Progetto di Ateneo 2011), Italy. The authors thank the Referees and the Associate Editor for their useful comments and suggestions, which lead to an improved version of the paper.

Author information

Authors and Affiliations

Department of Statistical Sciences, University of Padua, Via Cesare Battisti 241, 35121, Padua, Italy
Giuliana Cortese & Laura Ventura

Authors

Giuliana Cortese
View author publications
You can also search for this author in PubMed Google Scholar
Laura Ventura
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Giuliana Cortese.

Appendices

Appendix A: Computation of statistics for the Gaussian and exponential models

Under the assumption of Gaussian distributions with equal variances for $Y$ and $X$ as described in Sect. 3.1, in order to compute the Wald statistic, the profile observed information is obtained from (5). Its expression is

$$\begin{aligned} j_p(\hat{\psi }) = \frac{\hat{G}}{(n\, \hat{\lambda }_2)^2 \, +\, n_1\, n_2\, (\hat{\lambda }_1 \hat{\lambda }_2\, -\, D(\hat{\theta }))^2} \, , \end{aligned}$$

(17)

where $\hat{G} =2 n n_1 n_2 \hat{\lambda }_2^2 \hat{d}^2$, with $\hat{d} = \partial \varPhi ^{-1}(\hat{\psi })/\partial \hat{\psi }= 1/\phi (\varPhi ^{-1}(\hat{\psi }))$ and $\phi (\cdot )$ standard normal probability density function.

In the same example, for computing the adjustment term $q$ of the $r_p^*$ statistic, given in (7), we have

$$\begin{aligned} \frac{ |\ell _{\lambda ;\hat{\lambda }}(\hat{\theta }_\psi )|}{(|j(\hat{\theta })| |j_{\lambda \lambda }(\hat{\theta }_\psi )|)^{1/2}} = \frac{2 \hat{\lambda }_2^2 \left(2 n_1 n_2 \hat{B}^2 +(n \hat{\lambda }_2)^2 + n_1 n_2 \hat{B} (\tilde{\lambda }_1 \tilde{\lambda }_2 - D(\hat{\theta }_{\psi })) \right) }{\tilde{\lambda }_2^2\, \left( \hat{G}\, (12 n_1 n_2 \hat{B}^2 + 8\hat{C}^2 - 2 n^2 (\tilde{\lambda }_2^2 - 3\hat{\lambda }_2^2) - 8n \hat{C}) \right)^{1/2} } \, , \end{aligned}$$

with $\hat{B} = \hat{\lambda }_1 \hat{\lambda }_2 -D(\hat{\theta })$, $\hat{C} = n_1 \hat{\lambda }_1 \hat{\lambda }_2 + n_2 D(\hat{\theta })$ and $\hat{F} = n_1\tilde{\lambda }_1 \tilde{\lambda }_2 \hat{\lambda }_1 \hat{\lambda }_2 + n_2 D(\hat{\theta }) D(\hat{\theta }_{\psi })$. The tilde above the parameters indicates an estimate constrained to a fixed $\psi $, while the hat indicates an MLE estimate. The sample space derivatives needed for computing the $q$ term are

$$\begin{aligned} \ell _{;\hat{\psi }}(\hat{\theta })=0, \ \quad \ell _{;\hat{\lambda }}(\hat{\theta })=\left( \ 0, \ -\frac{n}{\hat{\lambda }_2} \ \right), \ \quad \ell _{;\hat{\psi }}(\hat{\theta }_\psi )= \frac{2 n_2 \hat{\lambda }_2}{\tilde{\lambda }_2^2} K [ \tilde{\lambda }_2 b - D(\hat{\theta })], \end{aligned}$$

with $K= \partial \varPhi ^{-1} (\hat{\psi }) / \partial \hat{\psi }$.

The sample space derivatives $ \ell _{;\hat{\lambda }}(\hat{\theta }_\psi ) = ( \ell _{;\hat{\lambda }_1}, \ \ell _{;\hat{\lambda }_2} ) $ are computed as

$$\begin{aligned}&\displaystyle \ell _{;\hat{\lambda }_1}(\hat{\theta }_\psi ) = \frac{-2 \hat{\lambda }_2}{\tilde{\lambda }_2^2} \left[n_1 \hat{\lambda }_1 \hat{\lambda }_2 + n_2 D(\hat{\theta }) - n_1 \tilde{\lambda }_1 \tilde{\lambda }_2 - n_2 \tilde{\lambda }_2 b\right],\\&\displaystyle \ell _{;\hat{\lambda }_2}(\hat{\theta }_\psi ) = \frac{-1}{\tilde{\lambda }_2^2} \left[n \hat{\lambda }_2 + 2 n_1 \hat{\lambda }_1^2 \hat{\lambda }_2 + \frac{2 n_2 D(\hat{\theta })^2}{\hat{\lambda }_2} - 2 n_1 \tilde{\lambda }_1 \tilde{\lambda }_2 \hat{\lambda }_1 - \frac{2 n_2 \tilde{\lambda }_2 b D(\hat{\theta })}{\hat{\lambda }_2}\right], \end{aligned}$$

while $ \ell _{\lambda ;\hat{\psi }}(\hat{\theta }_\psi ) = ( \ell _{\lambda _1;\hat{\psi }}, \ \ell _{\lambda _2;\hat{\psi }} ) $ are given as

$$\begin{aligned} \ell _{\lambda _1;\hat{\psi }}(\hat{\theta }_\psi )=\frac{2 n_2 \hat{\lambda }_2 K}{\tilde{\lambda }_2}, \ \quad \ell _{\lambda _2;\hat{\psi }}(\hat{\theta }_\psi )= \frac{2 n_2 \hat{\lambda }_2 K}{\tilde{\lambda }_2} \left[\frac{2 D(\hat{\theta })}{\tilde{\lambda }_2} - b\right], \end{aligned}$$

with $b = \varPhi ^{-1} (\psi ) + \tilde{\lambda }_1 $.

The quantity $ \ell _{\lambda ;\hat{\lambda }}(\hat{\theta }_\psi )$ is equal to the $(d-1) $ square matrix

$$\begin{aligned} \begin{bmatrix} \ell _{\lambda _1;\hat{\lambda }_1}&\quad \ell _{\lambda _1;\hat{\lambda }_2} \\ \ell _{\lambda _2;\hat{\lambda }_1}&\quad \ell _{\lambda _2;\hat{\lambda }_2}. \\ \end{bmatrix} \end{aligned}$$

The sample space derivatives therein can be computed from the full likelihood function

$$\begin{aligned} \ell (\theta )&= \ell (\psi ,\lambda _1,\lambda _2)\\&= -n log \lambda _2 - \frac{1}{\lambda _2^2} \left[\frac{n \hat{\lambda }_2^2}{2} + n_1 (\lambda _1 \lambda _2 - \hat{\lambda }_1 \hat{\lambda }_2)^2 + n_2 (D(\hat{\theta }) - D(\theta ))^2 \right], \end{aligned}$$

which only depends on the MLE estimates and the parameter $\theta $. Thus, we have

$$\begin{aligned}&\displaystyle \ell _{\lambda _1;\hat{\lambda }_1}(\hat{\theta }_\psi )= \frac{2n \hat{\lambda }_2}{\tilde{\lambda }_2} \ \qquad \qquad \ell _{\lambda _1;\hat{\lambda }_2}(\hat{\theta }_\psi )= \frac{2}{\tilde{\lambda }_2} \left[n_1 \hat{\lambda }_1 + n_2 \frac{D(\hat{\theta })}{\hat{\lambda }_2}\right]\\&\displaystyle \ell _{\lambda _2;\hat{\lambda }_1}(\hat{\theta }_\psi )= \frac{2 \hat{\lambda }_2}{\tilde{\lambda }_2^3} \left[2 n_1 \hat{\lambda }_1 \hat{\lambda }_2 + 2 n_2 D(\hat{\theta }) - n_2 b \tilde{\lambda }_2 - n_1 \tilde{\lambda }_1 \tilde{\lambda }_2 \right]\\&\displaystyle \ell _{\lambda _2;\hat{\lambda }_2}(\hat{\theta }_\psi )= \frac{2}{\hat{\lambda }_2 \tilde{\lambda }_2^3} \left[n \hat{\lambda }_2^2 + 2n_1 \hat{\lambda }_1^2 \hat{\lambda }_2^2 + 2n_2 D(\hat{\theta })^2 - n_1 \tilde{\lambda }_1 \tilde{\lambda }_2 \hat{\lambda }_1 \hat{\lambda }_2 - n_2 b \tilde{\lambda }_2 D(\hat{\theta }) \right]. \end{aligned}$$

Appendix B: Implementation in R

We present the R code for the estimation of $R$ under a parametric setting. Continuous quantities of interest can be assumed as exponential variables or Gaussian variables with either equal or different variances. The R package ProbYX can be downloaded from the webpage http://homes.stat.unipd.it/gcortese under the heading ‘Software’.

The two data sets from two different populations are called, respectively, ydat and xdat. MLEs for the parameter of interest $\psi $ and the nuisance parameter $\lambda $ can be obtained from the following code

MLEs(ydat,xdat,distr),

where distr can be set equal to either "exp", "norm_EV" or "norm_DV", according as the distributions assumed for the continuous markers are exponential or Gaussian with equal or unequal variances, respectively. The log-likelihood can be computed for a given set of values of $\psi $ and $\lambda $ by means of the function

loglik(ydat,xdat,lambda,psi,distr),

where lambda and psi are, respectively, $\lambda $ and $\psi $. For the case of Gaussian distributions with different variances the following simpler reparameterisation has been used

$$\begin{aligned} \psi = R(\theta ) = \varPhi \left( \frac{\mu _2 - \mu _1}{\sqrt{\sigma _1^2 + \sigma _2^2}} \right) \qquad \text{ and} \qquad \lambda =(\lambda _1,\lambda _2,\lambda _3) = (\mu _1, \sigma _1^2, \sigma _2^2) \, , \end{aligned}$$

with $\sigma _1^2$, $\sigma _2^2$ variances of the Gaussian variables $Y$ and $X$, respectively.

Point estimates and confidence intervals for $R$ are obtained by using the R function Prob as follows:

Prob(ydat,xdat,distr,method,level),

where the argument method, set to be equal to either “Wald”, “RP” or “RPstar”, allows to choose confidence intervals based on the Wald, $r_p$ or $r_p^*$ statistic, respectively (see formula (3) and (4) and Sect. 3). When the methods “Wald” or “RP” are selected, point estimate for $\psi $ corresponds to the MLE, while the method “RPstar” yields the $r_p^*$-based point estimate for $\psi $ as presented in Sect. 3. The confidence level $(1-\alpha )$ can be decided by setting level $=\alpha $.

The Wald, $r_p$ and $r_p^*$ statistics can also be computed for a given value of $\psi $ by applying, respectively, the R functions

${\mathtt {w <- wald(ydat,xdat,psi,distr)} },$
${\mathtt {r <- rp(ydat,xdat,psi,distr)} },$
${\mathtt {r\_star <- rpstar(ydat,xdat,psi,distr)} }$.

The steps 4–6, given at the end of Sect. 3, for application of the higher-order procedures based on the signed log-likelihood ratio statistic $r_p^*$, can be implemented by means of the following R commands:

${\mathtt {smoother <- smooth.spline(V.r\_star,r\_star.range)} }$
${\mathtt {psi1 <- predict(smoother,z1)\$y} }$
${\mathtt {psi2 <- predict(smoother,z2)\$y} }$
${\mathtt {hatpsi <- predict(smoother,0)\$y} }$

where V.r_star is the vector of values of $r_p^*$, r_star, calculated by applying the R function rpstar repeatedly on an appropriate range r_star.range of values of $\psi $, psi1 and psi2 are the limits $\psi ^*_1$ and $\psi ^*_2$ of the confidence interval corresponding to the percentiles z1 $=z_{1-\alpha /2}$ and z2 $=z_{\alpha /2}$, and hatpsi is the point estimate $\hat{\psi }^*$.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cortese, G., Ventura, L. Accurate higher-order likelihood inference on $P(Y<X)$ . Comput Stat 28, 1035–1059 (2013). https://doi.org/10.1007/s00180-012-0343-z

Download citation

Received: 22 October 2010
Accepted: 12 June 2012
Published: 17 July 2012
Issue Date: June 2013
DOI: https://doi.org/10.1007/s00180-012-0343-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Accurate higher-order likelihood inference on \(P(Y<X)\)

Abstract

Access this article

Similar content being viewed by others

Estimation of $$P[Y<Z]$$ under Geometric-Lindley model

Estimation of the reliability parameter for a Poisson-exponential stress-strength model

Statistical Inferences of $$R=P(X<Y)$$ for Exponential Distribution Based on Generalized Order Statistics

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix A: Computation of statistics for the Gaussian and exponential models

Appendix B: Implementation in R

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Accurate higher-order likelihood inference on \(P(Y<X)\)

Abstract

Access this article

Similar content being viewed by others

Estimation of $$P[Y<Z]$$ under Geometric-Lindley model

Estimation of the reliability parameter for a Poisson-exponential stress-strength model

Statistical Inferences of $$R=P(X<Y)$$ for Exponential Distribution Based on Generalized Order Statistics

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix A: Computation of statistics for the Gaussian and exponential models

Appendix B: Implementation in R

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation