Optimal testing strategies for large, sparse multinomial models
Section snippets
Introduction and notation
The analysis of categorical data, and associated statistical tests, has been debated extensively in the literature. Three key questions arise from the debate:
- 1.
What is the most appropriate test statistic to use, in terms of attained significance level and power?
- 2.
Do asymptotic results apply, or should one use an exact test?
- 3.
Should one perform a conditional or unconditional analysis?
Much of the theoretical and simulation work to address these issues has been restricted to consideration of relatively
Statistical issues
Many authors have studied tests involving categorical data and the related issues presented in this paper. Much of the work to date has been in the context of relatively small contingency tables, or perhaps 2×n tables. Recent extensions have been made to higher dimension tables (Zelterman et al., 1995; Parshall et al., 1999). In addition, a number of authors have reviewed tests for HWE, which present essentially the same categorical data problem. Again, much of this work has been limited to
Statistical methods for unconditional tests
The presence of nuisance parameters in the null hypothesis given by Eq. (2) makes the problem of testing for HWE with many alleles more complex. One does not want to specify the allele frequencies; only the functional form of HWE needs to hold. A true exact unconditional test for HWE calculates (Suissa and Shuster, 1985; Berger and Boos, 1994), where T represents the test statistic and t is its observed value. The calculation of the supremum over a parameter space of just
Results
In this section, we summarize the results by comparing conditional and unconditional tests, exact and asymptotic tests, and the test statistics themselves. Table 4 lists attained significance levels for each of the 11 tests, and for each combination of settings for n and . Table 5, Table 6 list the empirical power when f=0.05 and 0.10, respectively, for each test and for each combination of n and . Results for f=0.01 (not shown) are similar with respect to guidelines, with an obvious
Acknowledgements
This work was supported in part by National Institutes of Health grant GM32518.
References (35)
- et al.
An empirical investigation of some effects of sparseness in contingency tables
Comput. Statist. Data Anal.
(1987) In dispraise of the exact test
J. Statist. Plann. Inference
(1978)- et al.
Testing the goodness of fit of the multinomial distribution based on graphical representation
Comput. Statist. Data Anal.
(1987) A survey of exact inference for contingency tables
Statist. Sci.
(1992)- et al.
Some exact conditional tests of independence for R× C cross-classification tables
Psychometrika
(1977) - et al.
P values maximized over a confidence set for the nuisance parameter
J. Amer. Statist. Assoc.
(1994) - Budowle, B., Moretti, T.R., 1999. Genotype profiles for six population groups at the 13 CODIS short tandem repeat core...
- Budowle, B., Monson, K.L., Anoe, K.S., Baechtel, F.S., Bergman, D.L., Buel E., Campbell, P.A., Clement, M.E., Coey,...
- et al.
Statistical power of an exact test of Hardy–Weinberg proportions of genotypic data at a multiallelic locus
Hum. Heredity
(1994) A comparison of the X2, , and multinomial probability criteria for significance tests when expected frequencies are small
J. Amer. Statist. Assoc.
(1976)
Some methods for strengthening the common χ2 tests
Biometrics
Pearson's X2 and the loglikelihood ratio statistic G2a comparative review
Internat. Statist. Rev.
Multinomial goodness of fit tests
J. Roy. Statist. Soc. B
The appropriateness of some common procedures for testing the equality of two independent binomial populations
Amer. Statist.
A comparison of tests for Hardy–Weinberg equilibrium
Biometrics
P-valuesinterpretation and methodology
Amer. Statist.
Performing the exact test of Hardy–Weinberg proportion for multiple alleles
Biometrics
Cited by (7)
Testing departure from Hardy-Weinberg proportions
2017, Methods in Molecular BiologyTesting departure from hardy-weinberg proportions
2012, Methods in Molecular BiologyExact tests for Hardy-Weinberg proportions
2009, GeneticsChoice of test for association in small sample unordered r × c tables
2007, Statistics in Medicine