Skip to main content
Log in

Accurate higher-order likelihood inference on \(P(Y<X)\)

  • Original Paper
  • Published:
Computational Statistics Aims and scope Submit manuscript

Abstract

The stress-strength reliability \(R=P(Y<X)\), where \(X\) and \(Y\) are independent continuous random variables, has obtained wide attention in many areas of application, such as in engineering statistics and biostatistics. Classical likelihood-based inference about \(R\) has been widely examined under various assumptions on \(X\) and \(Y\). However, it is well-known that first order inference can be inaccurate, in particular when the sample size is small or in presence of unknown parameters. The aim of this paper is to illustrate higher-order likelihood-based procedures for parametric inference in small samples, which provide accurate point estimators and confidence intervals for \(R\). The proposed procedures are illustrated under the assumptions of Gaussian and exponential models for \((X,Y)\). Moreover, simulation studies are performed in order to study the accuracy of the proposed methodology, and an application to real data is discussed. An implementation of the proposed method in the R software is provided.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

References

  • Bamber D (1975) The area above the ordinal dominance graph and the area below the receiver operating characteristic graph. J Math Psychol 12:387–415

    Article  MathSciNet  MATH  Google Scholar 

  • Barndorff-Nielsen OE (1983) On a formula for the distribution of the maximum likelihood estimator. Biometrika 70:343–365

    Article  MathSciNet  MATH  Google Scholar 

  • Barndorff-Nielsen OE (1991) Modified signed log likelihood ratio. Biometrika 78:557–563

    Article  MathSciNet  MATH  Google Scholar 

  • Barndorff-Nielsen OE, Chamberlin S (1994) Stable and invariant adjusted directed likelihoods. Biometrika 81:485–499

    Article  MathSciNet  MATH  Google Scholar 

  • Barndorff-Nielsen OE, Cox DR (1994) Inference and asymptotics. Chapman and Hall, London

    MATH  Google Scholar 

  • Brazzale AR, Davison AC, Reid N (2007) Applied asymptotics. Case-studies in small sample statistics. Cambridge University Press, Cambridge

    Book  MATH  Google Scholar 

  • Efron B, Hinkley D (1978) Assessing the accuracy of the maximum likelihood estimator: observed versus expected Fisher information. Biometrika 65:457–482

    Article  MathSciNet  MATH  Google Scholar 

  • Faraggi D (2000) The effect of random measurement error on receiver operating characteristic (ROC) curves. Stat Med 19:61–70

    Article  Google Scholar 

  • Fraser DAS, Reid N, Wu J (1999) A simple general formula for tail probabilities for frequentist and Bayesian inference. Biometrika 86:249–264

    Article  MathSciNet  MATH  Google Scholar 

  • Giummolé F, Ventura L (2002) Practical point estimation from higher-order pivots. J Stat Comput Simul 72:419–430

    Article  MathSciNet  MATH  Google Scholar 

  • Guttman I, Johnson RA, Bhattacharyya GK, Reiser B (1988) Confidence limits for stress-strength models with explanatory variables. Technometrics 30:161–168

    Article  MathSciNet  MATH  Google Scholar 

  • Hanley JA (1989) Receiver operating characteristic (ROC) methodology: the state of the art. Crit Rev Diagn Imaging 29:307–335

    Google Scholar 

  • Jiang L, Wong ACM (2008) A note on inference for \(P(X<Y)\) for right truncated exponentially distributed data. Stat Pap 49:637–651

    Article  MathSciNet  MATH  Google Scholar 

  • Kotz S, Lumelskii Y, Pensky M (2003) The stress-strength model and its generalizations. Theory and applications. World Scientific, Singapore

    Book  MATH  Google Scholar 

  • Newcombe RG (1998) Two-sided confidence intervals for the single proportion: comparison of seven methods. Stat Med 17:857–872

    Article  Google Scholar 

  • Obuchowski NA, Lieber ML (1998) Confidence intervals for the receiver operating characteristic area in studies with small samples. Acad Radiol 5:561–571

    Article  Google Scholar 

  • Pace L, Salvan A (1997) Principles of statistical inference. World Scientific, Singapore

    MATH  Google Scholar 

  • Pace L, Salvan A (1999) Point estimation based on confidence intervals. J Stat Comput Simul 64:1–21

    Article  MathSciNet  MATH  Google Scholar 

  • R Development Core Team (2010) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0, http://www.R-project.org

  • Reid N, Fraser DAS (2010) Mean likelihood and higher order approximations. Biometrika 97:159–170

    Article  MathSciNet  MATH  Google Scholar 

  • Reiser B (2000) Measuring the effectiveness of diagnostic markers in the presence of measurement error through the use of ROC curves. Stat Med 19:2115–2129

    Article  Google Scholar 

  • Reiser B, Faraggi D (1997) Confidence intervals for the generalized ROC criterion. Biometrics 53:644–652

    Article  MATH  Google Scholar 

  • Schisterman EF, Faraggi D, Reiser B (2004) Adjusting the generalized ROC curve for covariates. Stat Med 23:3319–3331

    Article  Google Scholar 

  • Severini TA (1999) An empirical adjustment to the likelihood ratio statistic. Biometrika 86:235–247

    Article  MathSciNet  MATH  Google Scholar 

  • Severini TA (2000) Likelihood methods in statistics. Oxford University Press, New York

    MATH  Google Scholar 

  • Skovgaard IM (1996) A general large deviation approximation to one parameter tests. Bernoulli 2:145–165

    Article  MathSciNet  MATH  Google Scholar 

  • Wolfe DA, Hogg RV (1971) On constructing statistics and reporting data. Am Stat 25:27–30

    Article  Google Scholar 

  • Zweig MH, Campbell G (1993) Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine. Clin Chem 39:561–577

    Google Scholar 

Download references

Acknowledgments

This research was supported by grants from MIUR, and from University of Padova (Progetto di Ateneo 2011), Italy. The authors thank the Referees and the Associate Editor for their useful comments and suggestions, which lead to an improved version of the paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Giuliana Cortese.

Appendices

Appendix A: Computation of statistics for the Gaussian and exponential models

Under the assumption of Gaussian distributions with equal variances for \(Y\) and \(X\) as described in Sect. 3.1, in order to compute the Wald statistic, the profile observed information is obtained from (5). Its expression is

$$\begin{aligned} j_p(\hat{\psi }) = \frac{\hat{G}}{(n\, \hat{\lambda }_2)^2 \, +\, n_1\, n_2\, (\hat{\lambda }_1 \hat{\lambda }_2\, -\, D(\hat{\theta }))^2} \, , \end{aligned}$$
(17)

where \(\hat{G} =2 n n_1 n_2 \hat{\lambda }_2^2 \hat{d}^2\), with \(\hat{d} = \partial \varPhi ^{-1}(\hat{\psi })/\partial \hat{\psi }= 1/\phi (\varPhi ^{-1}(\hat{\psi }))\) and \(\phi (\cdot )\) standard normal probability density function.

In the same example, for computing the adjustment term \(q\) of the \(r_p^*\) statistic, given in (7), we have

$$\begin{aligned} \frac{ |\ell _{\lambda ;\hat{\lambda }}(\hat{\theta }_\psi )|}{(|j(\hat{\theta })| |j_{\lambda \lambda }(\hat{\theta }_\psi )|)^{1/2}} = \frac{2 \hat{\lambda }_2^2 \left(2 n_1 n_2 \hat{B}^2 +(n \hat{\lambda }_2)^2 + n_1 n_2 \hat{B} (\tilde{\lambda }_1 \tilde{\lambda }_2 - D(\hat{\theta }_{\psi })) \right) }{\tilde{\lambda }_2^2\, \left( \hat{G}\, (12 n_1 n_2 \hat{B}^2 + 8\hat{C}^2 - 2 n^2 (\tilde{\lambda }_2^2 - 3\hat{\lambda }_2^2) - 8n \hat{C}) \right)^{1/2} } \, , \end{aligned}$$

with \(\hat{B} = \hat{\lambda }_1 \hat{\lambda }_2 -D(\hat{\theta })\), \(\hat{C} = n_1 \hat{\lambda }_1 \hat{\lambda }_2 + n_2 D(\hat{\theta })\) and \(\hat{F} = n_1\tilde{\lambda }_1 \tilde{\lambda }_2 \hat{\lambda }_1 \hat{\lambda }_2 + n_2 D(\hat{\theta }) D(\hat{\theta }_{\psi })\). The tilde above the parameters indicates an estimate constrained to a fixed \(\psi \), while the hat indicates an MLE estimate. The sample space derivatives needed for computing the \(q\) term are

$$\begin{aligned} \ell _{;\hat{\psi }}(\hat{\theta })=0, \ \quad \ell _{;\hat{\lambda }}(\hat{\theta })=\left( \ 0, \ -\frac{n}{\hat{\lambda }_2} \ \right), \ \quad \ell _{;\hat{\psi }}(\hat{\theta }_\psi )= \frac{2 n_2 \hat{\lambda }_2}{\tilde{\lambda }_2^2} K [ \tilde{\lambda }_2 b - D(\hat{\theta })], \end{aligned}$$

with \(K= \partial \varPhi ^{-1} (\hat{\psi }) / \partial \hat{\psi }\).

The sample space derivatives \( \ell _{;\hat{\lambda }}(\hat{\theta }_\psi ) = ( \ell _{;\hat{\lambda }_1}, \ \ell _{;\hat{\lambda }_2} ) \) are computed as

$$\begin{aligned}&\displaystyle \ell _{;\hat{\lambda }_1}(\hat{\theta }_\psi ) = \frac{-2 \hat{\lambda }_2}{\tilde{\lambda }_2^2} \left[n_1 \hat{\lambda }_1 \hat{\lambda }_2 + n_2 D(\hat{\theta }) - n_1 \tilde{\lambda }_1 \tilde{\lambda }_2 - n_2 \tilde{\lambda }_2 b\right],\\&\displaystyle \ell _{;\hat{\lambda }_2}(\hat{\theta }_\psi ) = \frac{-1}{\tilde{\lambda }_2^2} \left[n \hat{\lambda }_2 + 2 n_1 \hat{\lambda }_1^2 \hat{\lambda }_2 + \frac{2 n_2 D(\hat{\theta })^2}{\hat{\lambda }_2} - 2 n_1 \tilde{\lambda }_1 \tilde{\lambda }_2 \hat{\lambda }_1 - \frac{2 n_2 \tilde{\lambda }_2 b D(\hat{\theta })}{\hat{\lambda }_2}\right], \end{aligned}$$

while \( \ell _{\lambda ;\hat{\psi }}(\hat{\theta }_\psi ) = ( \ell _{\lambda _1;\hat{\psi }}, \ \ell _{\lambda _2;\hat{\psi }} ) \) are given as

$$\begin{aligned} \ell _{\lambda _1;\hat{\psi }}(\hat{\theta }_\psi )=\frac{2 n_2 \hat{\lambda }_2 K}{\tilde{\lambda }_2}, \ \quad \ell _{\lambda _2;\hat{\psi }}(\hat{\theta }_\psi )= \frac{2 n_2 \hat{\lambda }_2 K}{\tilde{\lambda }_2} \left[\frac{2 D(\hat{\theta })}{\tilde{\lambda }_2} - b\right], \end{aligned}$$

with \(b = \varPhi ^{-1} (\psi ) + \tilde{\lambda }_1 \).

The quantity \( \ell _{\lambda ;\hat{\lambda }}(\hat{\theta }_\psi )\) is equal to the \((d-1) \) square matrix

$$\begin{aligned} \begin{bmatrix} \ell _{\lambda _1;\hat{\lambda }_1}&\quad \ell _{\lambda _1;\hat{\lambda }_2} \\ \ell _{\lambda _2;\hat{\lambda }_1}&\quad \ell _{\lambda _2;\hat{\lambda }_2}. \\ \end{bmatrix} \end{aligned}$$

The sample space derivatives therein can be computed from the full likelihood function

$$\begin{aligned} \ell (\theta )&= \ell (\psi ,\lambda _1,\lambda _2)\\&= -n log \lambda _2 - \frac{1}{\lambda _2^2} \left[\frac{n \hat{\lambda }_2^2}{2} + n_1 (\lambda _1 \lambda _2 - \hat{\lambda }_1 \hat{\lambda }_2)^2 + n_2 (D(\hat{\theta }) - D(\theta ))^2 \right], \end{aligned}$$

which only depends on the MLE estimates and the parameter \(\theta \). Thus, we have

$$\begin{aligned}&\displaystyle \ell _{\lambda _1;\hat{\lambda }_1}(\hat{\theta }_\psi )= \frac{2n \hat{\lambda }_2}{\tilde{\lambda }_2} \ \qquad \qquad \ell _{\lambda _1;\hat{\lambda }_2}(\hat{\theta }_\psi )= \frac{2}{\tilde{\lambda }_2} \left[n_1 \hat{\lambda }_1 + n_2 \frac{D(\hat{\theta })}{\hat{\lambda }_2}\right]\\&\displaystyle \ell _{\lambda _2;\hat{\lambda }_1}(\hat{\theta }_\psi )= \frac{2 \hat{\lambda }_2}{\tilde{\lambda }_2^3} \left[2 n_1 \hat{\lambda }_1 \hat{\lambda }_2 + 2 n_2 D(\hat{\theta }) - n_2 b \tilde{\lambda }_2 - n_1 \tilde{\lambda }_1 \tilde{\lambda }_2 \right]\\&\displaystyle \ell _{\lambda _2;\hat{\lambda }_2}(\hat{\theta }_\psi )= \frac{2}{\hat{\lambda }_2 \tilde{\lambda }_2^3} \left[n \hat{\lambda }_2^2 + 2n_1 \hat{\lambda }_1^2 \hat{\lambda }_2^2 + 2n_2 D(\hat{\theta })^2 - n_1 \tilde{\lambda }_1 \tilde{\lambda }_2 \hat{\lambda }_1 \hat{\lambda }_2 - n_2 b \tilde{\lambda }_2 D(\hat{\theta }) \right]. \end{aligned}$$

Appendix B: Implementation in R

We present the R code for the estimation of \(R\) under a parametric setting. Continuous quantities of interest can be assumed as exponential variables or Gaussian variables with either equal or different variances. The R package ProbYX can be downloaded from the webpage http://homes.stat.unipd.it/gcortese under the heading ‘Software’.

The two data sets from two different populations are called, respectively, ydat and xdat. MLEs for the parameter of interest \(\psi \) and the nuisance parameter \(\lambda \) can be obtained from the following code

MLEs(ydat,xdat,distr),

where distr can be set equal to either "exp", "norm_EV" or "norm_DV", according as the distributions assumed for the continuous markers are exponential or Gaussian with equal or unequal variances, respectively. The log-likelihood can be computed for a given set of values of \(\psi \) and \(\lambda \) by means of the function

loglik(ydat,xdat,lambda,psi,distr),

where lambda and psi are, respectively, \(\lambda \) and \(\psi \). For the case of Gaussian distributions with different variances the following simpler reparameterisation has been used

$$\begin{aligned} \psi = R(\theta ) = \varPhi \left( \frac{\mu _2 - \mu _1}{\sqrt{\sigma _1^2 + \sigma _2^2}} \right) \qquad \text{ and} \qquad \lambda =(\lambda _1,\lambda _2,\lambda _3) = (\mu _1, \sigma _1^2, \sigma _2^2) \, , \end{aligned}$$

with \(\sigma _1^2\), \(\sigma _2^2\) variances of the Gaussian variables \(Y\) and \(X\), respectively.

Point estimates and confidence intervals for \(R\) are obtained by using the R function Prob as follows:

Prob(ydat,xdat,distr,method,level),

where the argument method, set to be equal to either “Wald”, “RP” or “RPstar”, allows to choose confidence intervals based on the Wald, \(r_p\) or \(r_p^*\) statistic, respectively (see formula (3) and (4) and Sect. 3). When the methods “Wald” or “RP” are selected, point estimate for \(\psi \) corresponds to the MLE, while the method “RPstar” yields the \(r_p^*\)-based point estimate for \(\psi \) as presented in Sect. 3. The confidence level \((1-\alpha )\) can be decided by setting level \(=\alpha \).

The Wald, \(r_p\) and \(r_p^*\) statistics can also be computed for a given value of \(\psi \) by applying, respectively, the R functions

  • \({\mathtt {w <- wald(ydat,xdat,psi,distr)} },\)

  • \({\mathtt {r <- rp(ydat,xdat,psi,distr)} },\)

  • \({\mathtt {r\_star <- rpstar(ydat,xdat,psi,distr)} }\).

The steps 4–6, given at the end of Sect. 3, for application of the higher-order procedures based on the signed log-likelihood ratio statistic \(r_p^*\), can be implemented by means of the following R commands:

  • \({\mathtt {smoother <- smooth.spline(V.r\_star,r\_star.range)} }\)

  • \({\mathtt {psi1 <- predict(smoother,z1)\$y} }\)

  • \({\mathtt {psi2 <- predict(smoother,z2)\$y} }\)

  • \({\mathtt {hatpsi <- predict(smoother,0)\$y} }\)

where V.r_star is the vector of values of \(r_p^*\), r_star, calculated by applying the R function rpstar repeatedly on an appropriate range r_star.range of values of \(\psi \), psi1 and psi2 are the limits \(\psi ^*_1\) and \(\psi ^*_2\) of the confidence interval corresponding to the percentiles z1 \(=z_{1-\alpha /2}\) and z2 \(=z_{\alpha /2}\), and hatpsi is the point estimate \(\hat{\psi }^*\).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cortese, G., Ventura, L. Accurate higher-order likelihood inference on \(P(Y<X)\) . Comput Stat 28, 1035–1059 (2013). https://doi.org/10.1007/s00180-012-0343-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00180-012-0343-z

Keywords

Navigation