The split-up algorithm: a fast symbolic method for computing p-values of distribution-free statistics

van de Wiel, Mark

doi:10.1007/s180-001-8328-6

The split-up algorithm: a fast symbolic method for computing p-values of distribution-free statistics

Published: 04 November 2019

Volume 16, pages 519–538, (2001)
Cite this article

Computational Statistics Aims and scope Submit manuscript

Mark van de Wiel¹

577 Accesses
Explore all metrics

Summary

Many distribution-free statistics have the drawback that computing exact p-values under the null hypothesis is an intensive task. When the sample sizes are small or the number of ties is large, approximations are often unsatisfactory. Moreover, tables of exact critical values are not available for conditional rank statistics (ties, censoring), for rank statistics with arbitrary regression constants, or for permutation test statistics. In those cases, it is important to have a fast algorithm for computing exact p-values. We present a new algorithm and apply it to a large class of distribution-free one-sample, two-sample and serial statistics. The algorithm is based on splitting the probability generating function of the test statistic into two parts. We compare the speed of this “split-up algorithm” to that of existing procedures and we conclude that our new algorithm is faster in many cases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Asymptotic Expansions for Several GEL-Based Test Statistics and Hybrid Bartlett-Type Correction with Bootstrap

Bayesian empirical likelihood methods for quantile comparisons

Article 10 April 2017

A new approach to distribution free tests in contingency tables

Article 08 September 2016

References

Gibbons, J.D., and J.W. Pratt (1975). P-values: interpretation and methodology. American Statistician, 29, 20–25.
MATH Google Scholar
Good, P.I. (1994). Permutation Tests, a Practical Guide to Resampling Methods for Testing Hypotheses. Springer-Verlag, New York.
MATH Google Scholar
Hallin, M., and G. Mélard (1988). Rank-based tests for randomness against first-order serial dependence. Journal of the American Statistical Association, 83, 1117–1128.
Article MathSciNet Google Scholar
Hallin, M., and M.L. Puri (1988). Optimal rank-based procedures for time series analysis: testing an ARMA model against other ARMA models. Annals of Statistics, 16, 402–432.
Article MathSciNet Google Scholar
Hallin, M., and M.L. Puri (1991). Time series analysis via rank order theory: signed-rank tests for ARMA models. Journal of Multivariate Analysis, 39, 1–29.
Article MathSciNet Google Scholar
Kendall, M.G., and A. Stuart (1977). The Advanced Theory of Statistics, Volume 2. Charles Griffin & Co., London.
MATH Google Scholar
Mehta, C.R., N.R. Patel, and L.J. Wei (1988). Constructing exact significance tests with restricted randomization rules. Biometrika, 75, 295–302.
Article Google Scholar
Pagano, M., and D. Tritchler (1983). On obtaining permutation distributions in polynomial time. Journal of the American Statistical Association, 78, 435–441.
Article MathSciNet Google Scholar
Prentice, R.L., and P. Marek (1979). A qualitative discrepancy between censored data rank tests. Biometrics, 35, 861–867.
Article Google Scholar
Puri, M.L., and P.K. Sen (1985). Nonparametric Methods in General Linear Models. Wiley, New York.
MATH Google Scholar
Streitberg, B., and J. Röhmel (1986). Exact distributions for permutation and rank tests: an introduction to some recently published algorithms. Statistical Software Newsletter, 12, 10–17.
Google Scholar
Streitberg, B., and J. Röhmel (1987). Exakte Verteilungen für Rang-und Randomisierungstests im allgemeinen c-Stichprobenproblem. EDV in Medizin und Biologie, 18, 12–19.
Google Scholar
Van de Wiel, M.A. (1996). Computing exact distributions of rank statistics with computer algebra. Master’s thesis, Eindhoven University of Technology, The Netherlands.
Google Scholar
Van de Wiel, M.A. (1998). Exact distributions of two-sample rank statistics and block rank statistics using computer algebra. Technical Report Memorandum COSOR 98-14, Eindhoven University of Technology, The Netherlands.
Van de Wiel, M.A., and A. Di Bucchianico (2001). Fast computation of the exact null distribution of Spearman’s rho and Page’s Lstatistic for samples with and without ties. Journal of Statistical Planning and Inference, 92, 133–145.
Article MathSciNet Google Scholar
Wald, A., and J. Wolfowitz (1943). An exact test for randomness in the nonparametric case based on serial correlation. Annals of Mathematical Statistics, 14, 378–388.
Article MathSciNet Google Scholar

Download references

Acknowledgements

I would like to thank Alessandro Di Bucchianico and the referees of this journal for their useful comments on earlier versions of this paper.

Author information

Authors and Affiliations

Department of Mathematics and Computing Science, Eindhoven University of Technology, P.O. Box 513, 5600 MB, Eindhoven, The Netherlands
Mark van de Wiel

Authors

Mark van de Wiel
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Mark van de Wiel.

Rights and permissions

Reprints and permissions

About this article

Cite this article

van de Wiel, M. The split-up algorithm: a fast symbolic method for computing p-values of distribution-free statistics. Computational Statistics 16, 519–538 (2001). https://doi.org/10.1007/s180-001-8328-6

Download citation

Published: 04 November 2019
Issue Date: December 2001
DOI: https://doi.org/10.1007/s180-001-8328-6

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The split-up algorithm: a fast symbolic method for computing p-values of distribution-free statistics

Summary

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Asymptotic Expansions for Several GEL-Based Test Statistics and Hybrid Bartlett-Type Correction with Bootstrap

Bayesian empirical likelihood methods for quantile comparisons

A new approach to distribution free tests in contingency tables

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now