skip to main content
10.1145/2745802.2747956acmotherconferencesArticle/Chapter ViewAbstractPublication PageseaseConference Proceedingsconference-collections
research-article

Robust statistical methods: why, what and how: keynote

Published:27 April 2015Publication History

ABSTRACT

This keynote discusses the need for more robust statistical methods. For visualizing data I suggest using Kernel density plots rather than box plots. For parametric analysis, I propose more robust measures of central location such as trimmed means, which can support reliable tests of the differences between the central location of two or more samples. In addition, I also recommend non-parametric effect sizes such as Cliff's δ and Brunner and Munzel's p-hat that avoid some of the problems with rank-based non-parametric methods.

References

  1. Andrea Arcuri and Lionel Briand (2011). A practical guide for using statistical tests to assess randomized algorithms in software engineering. ACM/IEEE International Conference on Software Engineering (ICSE). Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Box, G. E. P. (1954) Some theorems on quadratic forms applied in the study of analysis of variance problems: 1: Effect of inequality of variance in the one-way model. Annals of Mathematics Statistics, 25, pp. 290--302Google ScholarGoogle ScholarCross RefCross Ref
  3. Edgar Brunner and Ullrich Munzel and Madan L. Puri. (2002) The multivariate nonparametric Behrens-Fisher problem. Journal of Statistical Planning and Inference, 108, pp. 37--52.Google ScholarGoogle ScholarCross RefCross Ref
  4. Norman Cliff (1993) Dominance Statistics: Ordinal Analyses to Answer Ordinal Questions. Psychological Bulletin, 114(3), pp. 494--509.Google ScholarGoogle ScholarCross RefCross Ref
  5. Mohamed El-Attar (2014) Using SMCD to reduce inconsistencies in misuse case models: A subject-based empirical evaluation. The Journal of Systems and Software, 87, pp. 104--118. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Barbara Kitchenham (1996) Software metrics Measurement for Software Process Improvement. NCC Blackwell. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Jeffrey D. Kromrey and Kristine Y. Hogarty and John M. Ferron and Constance V. Hines and Melinda R. Hess (2005) Robustness in Meta-Analysis: An Empirical Comparison of Point and Interval Estimates of Standardized Mean Differences and Cliff's Delta. Joint Statistical Meetings. Minneapolis, MN.Google ScholarGoogle Scholar
  8. L. Madeyski and M. Jureczko, (2014) "Which Process Metrics Can Significantly Improve Defect Prediction Models? An Empirical Study," Software Quality Journal, published online Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Philip H. Ramsey (1980) Exact type 1 error rates for robustness of Student's t test with unequal variances. Journal of Educational and Behavioral Statistics. 5(4), pp. 337--346.Google ScholarGoogle ScholarCross RefCross Ref
  10. Andrew F. Tappenden and James Miller (2014) Automated cookie collection testing. ACM Transactions on Software Engineering and Methodology (TOSEM). 23(1) Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. B. L. Welch (1938) The Significance of the Difference Between Two Means when the Population Variances are Unequal, Biometrika, 350(3/4), pp. 350--362.Google ScholarGoogle ScholarCross RefCross Ref
  12. Rand W. Wilcox (2012) Modern Statistics for the Social and Behavioural Sciences. CRC Press.Google ScholarGoogle Scholar
  13. Rand R. Wilcox (2012) Introduction to Robust Estimation and Hypothesis Testing. 3rd Edition. Academic Press/Elsevier.Google ScholarGoogle Scholar
  14. Rand R. Wilcox and H. J. Keselman. (2003) Modern Robust Data Analysis Methods: Measures of Central Tendency. Psychological Methods, 8(3), pp. 254--274.Google ScholarGoogle ScholarCross RefCross Ref
  15. Rand R. Wilcox and Ventura L. Charlina and Karen L. Thompson. (1986) New Monte Carlo results on the robustness of the ANOVA F, W and F* statistics. Communications in Statistics - Simulation and Computation. 15(4), pp. 933--943.Google ScholarGoogle ScholarCross RefCross Ref
  16. András Vargha and Harold D. Delaney (2000) A Critique and Improvement of the CL Common Language Effect Size Statistics of McGraw and Wong. Journal of Educational and Behavioral Statistics, 25(2) pp. 101--132.Google ScholarGoogle Scholar
  17. Yuen, K. K. (1974) The two sample trimmed t for unequal population variables. Biometrika, 61, pp. 165--170.Google ScholarGoogle ScholarCross RefCross Ref
  18. D. W. Zimmerman (2000) Statistical Significance Levels of Nonparametric Tests Biased by Heterogeneous Variances of Treatment Groups. The Journal of General Psychology, 127(4).Google ScholarGoogle ScholarCross RefCross Ref
  19. D. W. Zimmerman (1993) Rank transformations and the power of the Student t test and Welch t test for non-normal populations with unequal variances. Canadian Journal of Experimental Psychology, 47(3), pp. 523--539.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Robust statistical methods: why, what and how: keynote

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      EASE '15: Proceedings of the 19th International Conference on Evaluation and Assessment in Software Engineering
      April 2015
      305 pages
      ISBN:9781450333504
      DOI:10.1145/2745802

      Copyright © 2015 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 27 April 2015

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      EASE '15 Paper Acceptance Rate20of65submissions,31%Overall Acceptance Rate71of232submissions,31%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader