Skip to main content
Log in

Computation of exact confidence limits from discrete data

  • Published:
Computational Statistics Aims and scope Submit manuscript

Abstract

Suppose that the data have a discrete distribution determined by (∞, ψ) where θ is the scalar parameter of interest and ψ is a nuisance parameter vector. The Buehler 1 - α upper confidence limit for θ is as small as possible, subject to the constraints that (a) its coverage probability is at least 1 - α and (b) it is a nondecreasing function of a pre-specified statisticT. This confidence limit has important biostatistical and reliability applications. The main result of the paper is that for a wide class of models (including binomial and Poisson), parameters of interest 9 and statisticsT (which possess what we call the “logical ordering” property) there is a dramatic increase in the ease with which this upper confidence limit can be computed. This result is illustrated numerically for θ a difference of binomial probabilities. Kabaila & Lloyd (2002) also show that ifT is poorly chosen then an assumption required for the validity of the formula for this confidence limit may not be satisfied. We show that for binomial data this assumption must be satisfied whenT possesses the “logical ordering” property.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Figure 1
Figure 2
Figure 3

Similar content being viewed by others

References

  • Agresti, A., & Min, Y. (2001), ‘On small-sample confidence intervals for parameters in discrete distributions’,Biometrics 55, 1202–1209.

    MathSciNet  MATH  Google Scholar 

  • Buehler R. J. (1957), ‘Confidence intervals for the product of two binomial parameters’,Journal of the American Statistical Association 52, 482–493.

    Article  MathSciNet  Google Scholar 

  • Chan, I.S.F., & Zhang, Z. (1999), ‘Test-based exact confidence intervals for the difference of two binomial proportions’,Biometrics 55, 1202–1209.

    Article  Google Scholar 

  • Cox, D.R. & Hinkley, D.V. (1974), Theoretical Statistics, Chapman & Hall.

  • Crowder M.J., Kimber, A.C. & Smith, R.L. (1991), Statistical Analysis of Reliability Data, Chapman & Hall.

  • Harris B. & Soms A.P. (1991), ‘Theory and counterexamples for confidence limits on system reliability’,Statistics & Probability Letters 11, 411–417.

    Article  MathSciNet  Google Scholar 

  • Kabaila P. (1998), ‘The choice of statistic on which to base tight upper confidence limits’,Australian & New Zealand Journal of Statistics 40, 189–196.

    Article  MathSciNet  Google Scholar 

  • Kabaila P. (2001), ‘Better Buehler confidence limits’,Statistics & Probability Letters 52, 145–154.

    Article  MathSciNet  Google Scholar 

  • Kabaila P. (2005), ‘Computation of exact confidence intervals from discrete data using studentized test statistics’, to appear in Statistics and Computing.

  • Kabaila P., & Lloyd C.J. (1997), ‘Tight upper confidence limits from discrete data’,Australian Journal of Statistics 37, 193–204.

    Article  MathSciNet  Google Scholar 

  • Kabaila P., & Lloyd C.J. (2002), ‘The importance of the designated statistic on Buehler upper limits on a system failure probability’,Technometrics 44, 390–395.

    Article  MathSciNet  Google Scholar 

  • Kabaila P., & Lloyd C.J. (2003), ‘The efficiency of Buehler confidence limits’,Statistics & Probability Letters 65, 21–28.

    Article  MathSciNet  Google Scholar 

  • Kabaila P., & Lloyd C.J. (2004a), ‘Buehler confidence limits and nesting’,Australian & New Zealand Journal of Statistics 46, 463–469.

    Article  MathSciNet  Google Scholar 

  • Kabaila P., & Lloyd C.J. (2004b), ‘Buehler limits from approximate upper limits: a numerical investigation of the role of nominal coverage’, to appear in Journal of Applied Statistical Science.

  • Lloyd C.J., & Kabaila P. (2003), ‘On the optimality and limitations of Buehler bounds’,Australian & New Zealand Journal of Statistics 45, 167–174.

    Article  MathSciNet  Google Scholar 

  • Mehrotra, D. V., Chan, I. S. F. & Berger, R. L. (2003), ‘A cautionary note on exact unconditional inference for a difference between two independent binomial proportions’,Biometrics 59, 441–450.

    Article  MathSciNet  Google Scholar 

  • Santner, T.J., & Snell, M.K. (1980), ‘Small-sample confidence intervals for P1 - P2 and P1/p2 in 2 × 2 contingency tables’,Journal of the American Statistical Association 75, 386–394.

    MathSciNet  MATH  Google Scholar 

  • Soms A.P. (1989), ‘Exact confidence intervals, based on the Z statistic, for the difference between two proportions’,Communications in Statistics: Theory and Methods 18, 1325–1856.

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgment

The research reported in this paper was supported by an Australian Research Council grant.

Author information

Authors and Affiliations

Authors

Appendices

Appendix A: Proof of Theorem 3.1

Proof of Theorem 3.1 (a).

If j is such that b(λ) is a nonincreasing and nonconstant function of λj (where λ = (λ1,…, λm)) then define \(\tilde{Y}_{j}=-Y_{j}\), \(\tilde{y}_{j}=-y_{j}\) and \(\tilde{\lambda}_{j}=-\lambda_{j}\); otherwise define \(\tilde{Y}_{j}=Y_{j}\), \(\tilde{y}_{j}=y_{j}\) and \(\tilde{\lambda}_{j}=\lambda_{j}(j=1, \ldots, m)\). Let \(\tilde{F}_{i}\left(\cdot | \tilde{\lambda}_{i}\right)\) denote the distribution function of \(\tilde{Y}_{i}(i=1, \ldots, m)\). Also let \(\tilde{\lambda}=\left(\tilde{\lambda}_{1}, \ldots, \tilde{\lambda}_{m}\right)\) and \(\tilde{y}=\left(\tilde{y}_{1}, \ldots, \tilde{y}_{m}\right)\). Define \(\tilde{b}(\tilde{\lambda})\) to be that function of \(\tilde{\lambda}\) such that \(\tilde{b}(\tilde{\lambda})=b(\lambda)\) for all λA. Also define \(\tilde{t}(\tilde{y})\) to be that function of \(\tilde{y}\) such that \(\tilde{b}(\tilde{y})=b(y)\) for all \(y \in \mathcal{Y}_{1} \times \cdots \times \mathcal{Y}_{m}\).

\(\tilde{Y}_{1}, \ldots, \tilde{Y}_{m}\) are independent. For each yR, \(\tilde{F}_{i}\left(y | \tilde{\lambda}_{i}\right)\) is a nonincreasing function of \(\tilde{\lambda}_{i}(i=1, \ldots, m)\). Also, \(\tilde{b}(\tilde{\lambda})\) and \(\tilde{t}(\tilde{y})\) are nondecreasing in each of the components of \(\tilde{\lambda}=\left(\tilde{\lambda}_{1}, \ldots, \tilde{\lambda}_{m}\right)\) and \(\tilde{y}=\left(\tilde{y}_{1}, \ldots, \tilde{y}_{m}\right)\) respectively.

Proof of Theorem 3.1 (b).

By condition B2, t(y1,…, ym) is a nondecreasing function of y1,…, ym. Fix \(y \in \mathcal{Y}\) and i ∈ {1,…,m}. Let \(\mathcal{C}_{i}\) denote the set of (y1,…, yi‒ 1, yi +1, …, ym)’s, where \(y_{j} \in \mathcal{Y}_{j}\) (j = 1,…,m), such that \(\left(y_{1}, \ldots, y_{i-1}, y_{i}^{*}, y_{i+1}, \ldots, y_{m}\right) \in D(y)\) for some \(y_{i}^{*} \in \mathcal{Y}_{i}\). Fix \(\left(y_{1}, \ldots, y_{i-1}, y_{i+1}, \dots, y_{m}\right) \in \mathcal{C}_{i}\). Observe that by condition B2 the set of all \(z \in \mathcal{Y}_{i}\) such that \(\left(y_{1}, \dots, y_{i-1}, z, y_{i+1}, \dots, y_{m}\right) \in D(y)\) is the set of all \(z \in \mathcal{Y}_{i}\) less than or equal to some \(k \in \mathcal{Y}_{i}\). Define G to be the set \(\left\{\left(y_{1}, \ldots, y_{i-1}, z, y_{i+1}, \ldots, y_{m}\right) : z \in \mathcal{Y}_{i}, z \leq k\right\}\), It now follows from Assumption A that P(YG ∣ λ) is a nonincreasing function of λi for each fixed (λ1,…, λi ‒1, yi +1, …, ym) This argument applies for each fixed i and each fixed element of \(\mathcal{C}_{i}\). Hence P(YD(y) ∣ λ) is a nonincreasing function of λi (i = 1,…, m).

Fix θ1, θ2B satisfying θ1 > θ2. By condition C, there exists a function d such that for every λ = (λ1,…, λm) ∈ A satisfying b(λ) = θ1, \(\lambda^{*}=\left(\lambda_{1}^{*}, \ldots, \lambda_{m}^{*}\right)=d(\lambda)\) satisfies \(\lambda_{i}^{*} \leq \lambda_{i}(i=1, \ldots, m)\) and b(λ*) = θ2. For each λ ∈ A satisfying b(λ) = θ1,

$$P(Y \in D(y) | \lambda) \leq P(Y \in D(y)) | d(\lambda) ).$$

Hence

$$\begin{array}{l}{\sup \left\{P(Y \in D(y) | \lambda) : b(\lambda)=\theta_{1}, \lambda \in A\right\}} \\ { \leq \sup \left\{P(Y \in D(y) | d(\lambda)) : b(\lambda)=\theta_{1}, \lambda \in A\right\}} \\ { \leq \sup \left\{P(Y \in D(y) | \lambda) : b(\lambda)=\theta_{2}, \lambda \in A\right\}}.\end{array}$$

In other words, supψ P(T ≤ t(y) ∣ θ, ψ) is a nonincreasing function of θ. This argument applies for every \(y \in \mathcal{Y}\).

Appendix B: Proof of Theorem 3.2

We may suppose, without loss of generality, that b(λ) is nondecreasing in each of the components of λ = (λ1,…, λm). This is because if b(λ) is nonincreasing in λj then nj ‒ ∼ Binomial(nj, 1 ‒ λj) and b(λ) is nondecreasing in 1 ‒ λj. Define y* = (0,…, 0). Since T = t(Y) satisfies the “logical ordering” property, t(y) achieves its minimum value at y = y*. Thus

$$P(T \leq t(y) | \lambda) \geq P\left(Y=y^{*} | \lambda\right)=\prod_{i=1}^{m}\left(1-\lambda_{i}\right)^{n_{i}}$$

In some cases, such as θ = λ1λ2, we need to disallow some boundary values of λi so that (θ, ψ) is a one-one function of λ = (λ1,…, λm). We therefore restrict attention to λ = (λ1,…, λm) ∈ (0, 1)m. Clearly, we may choose λ = (λ1,…, λm) ∈ (0, 1)m such that \(\prod\nolimits_{i=1}^{m}\left(1-\lambda_{i}\right)^{n_{i}}>\alpha\). Hence there exists (θ, ψ) such that P(Y = y*θ, ψ) > α. Therefore there exists θ such that supψ P(T ≤ t(y) ∣θ, ψ) > α. This argument holds for every

$$y \in \mathcal{Y}$$

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kabaila, P. Computation of exact confidence limits from discrete data. Computational Statistics 20, 401–414 (2005). https://doi.org/10.1007/BF02741305

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02741305

Keywords

Navigation