Skip to main content

What to Do Instead of Null Hypothesis Significance Testing or Confidence Intervals

  • Conference paper
  • First Online:
Beyond Traditional Probabilistic Methods in Economics (ECONVN 2019)

Part of the book series: Studies in Computational Intelligence ((SCI,volume 809))

Included in the following conference series:

Abstract

Based on the banning of null hypothesis significance testing and confidence intervals in Basic and Applied Psychology (2015), this presentation focusses on alternative ways for researchers to think about inference. One section reviews literature on the a priori procedure. The basic idea, here, is that researchers can perform much inferential work before the experiment. Furthermore, this possibility changes the scientific philosophy in important ways. A second section moves to what researchers should do after they have collected their data, with an accent on obtaining a better understanding of the obtained variance. Researchers should try out a variety of summary statistics, instead of just one type (such as means), because seemingly conceptually similar summary statistics nevertheless can imply very different qualitative stories. Also, rather than engage in the typical bipartite distinction between variance due to the independent variable and variance not due to the independent variable; a tripartite distinction is possible that divides variance not due to the independent variable into variance due to systematic or random factors, with important positive consequences for researchers. Finally, the third major section focusses on how researchers should or should not draw causal conclusions from their data. This section features a discussion of within-participants causation versus between-participants causation, with an accent on whether the type of causation specified in the theory is matched or mismatched by the type of causation tested in the experiment. There also is a discussion of causal modeling approaches, with criticisms. The upshot is that researchers could do much more a priori work, and much more a posteriori work too, to maximize the scientific gains they obtain from their empirical research.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    Nguyen (2016) provided an informative theoretical perspective on the ban.

  2. 2.

    Of course, the null hypothesis significance testing procedure does not test the hypothesis of interest but rather the null hypothesis that is not of interest, which is one of the many criticisms to which the procedure has been subjected. But as the present focus is on what to do instead, I will not focus on these criticisms. The interested reader can consult Trafimow and Earp (2017).

  3. 3.

    In addition, \( \omega \) is of more interest than \( \sigma \) though this is not of great importance yet.

  4. 4.

    The reader may wonder why skewness increases precision. For a quantitative answer, see Trafimow et al. (in press). For a qualitative answer, simply look up pictures of skew-normal distributions (contained in Trafimow et al., among other places). Observe that as the absolute magnitude of skewness increases, the bulk of the distributions become taller and narrower. Hence, sampling precision increases.

  5. 5.

    For skew-normal distributions it makes more sense to consider the square of the scale than to consider the square of the standard deviation, known as the variance. But researchers are used to variance and variance is sufficient to make the necessary points in this section.

  6. 6.

    I provide all the equations necessary to calculate the adjusted success rate in Trafimow (2017b).

References

  • Blanca, M.J., Arnau, J., López-Montiel, D., Bono, R., Bendayan, R.: Skewness and kurtosis in real data samples. Methodol. Eur. J. Res. Methods Behav. Soc. Sci. 9(2), 78–84 (2013)

    Google Scholar 

  • Cain, M.K., Zhang, Z., Yuan, K.H.: Behav. Res. Methods 49(5), 1716–1735 (2017)

    Article  Google Scholar 

  • Earp, B.D., Trafimow, D.: Replication, falsification, and the crisis of confidence in social psychology. Front. Psychol. 6(621), 1–11 (2015)

    Google Scholar 

  • Fishbein, M., Ajzen, I.: Predicting and changing behavior: The Reasoned Action Approach. Psychology Press (Taylor & Francis), New York (2010)

    Google Scholar 

  • Gillies, D.: Philosophical theories of probability. Routledge, London (2000)

    Google Scholar 

  • Grice, J.W., Cohn, A., Ramsey, R.R., Chaney, J.M.: On muddled reasoning and mediation modeling. Basic Appl. Soc. Psychol. 37(4), 214–225 (2015)

    Article  Google Scholar 

  • Gulliksen, H.: Theory of Mental Tests. Lawrence Erlbaum Associates Publishers, Hillsdale (1987)

    Google Scholar 

  • Ho, A.D., Yu, C.C.: Descriptive statistics for modern test score distributions: Skewness, kurtosis, discreteness, and ceiling effects. Educ. Psychol. Measur. 75(3), 365–388 (2015)

    Article  Google Scholar 

  • Kline, R.B.: The mediation myth. Basic Appl. Soc. Psychol. 37(4), 202–213 (2015)

    Article  Google Scholar 

  • Lord, F.M., Novick, M.R.: Statistical theories of mental test scores. Addison-Wesley, Reading (1968)

    MATH  Google Scholar 

  • Micceri, T.: The unicorn, the normal curve, and other improbable creatures. Psychol. Bull. 105(1), 156–166 (1989)

    Article  Google Scholar 

  • Nguyen, H.T.: On evidential measures of support for reasoning with integrated uncertainty: a lesson from the ban of P-values in statistical inference. In: Huynh, V.N. et al. (Eds.) Integrated Uncertainty in Knowledge Modeling and Decision Making, Lecture notes in Artificial Intelligence, vol, 9978, pp. 3–15. Springer, Cham (2016)

    Google Scholar 

  • Spirtes, P., Glymour, C., Scheines, R.: Causation, Prediction, and Search. The MIT Press, Cambridge (2000)

    MATH  Google Scholar 

  • Tate, C.U.: On the overuse and misuse of mediation analysis: it may be a matter of timing. Basic Appl. Soc. Psychol. 37(4), 235–246 (2015)

    Article  MathSciNet  Google Scholar 

  • Thoemmes, F.: Reversing arrows in mediation models does not distinguish plausible models. Basic Appl. Soc. Psychol. 37(4), 226–234 (2015)

    Article  Google Scholar 

  • Trafimow, D.: Editorial. Basic Appl. Soc. Psychol. 36(1), 1–2 (2014)

    Article  MathSciNet  Google Scholar 

  • Trafimow, D.: Introduction to special issue: what if planetary scientists used mediation analysis to infer causation? Basic Appl. Soc. Psychol. 37(4), 197–201 (2015)

    Article  Google Scholar 

  • Trafimow, D.: Using the coefficient of confidence to make the philosophical switch from a posteriori to a priori inferential statistics. Educ. Psychol. Measur. 77(5), 831–854 (2017a)

    Article  Google Scholar 

  • Trafimow, D.: Comparing the descriptive characteristics of the adjusted success rate to the phi coefficient, the odds ratio, and the difference between conditional proportions. Int. J. Stat. Adv. Theory Appl. 1(1), 1–19 (2017b)

    Google Scholar 

  • Trafimow, D.: The probability of simple versus complex causal models in causal analyses. Behav. Res. Methods 49(2), 739–746 (2017c)

    Article  Google Scholar 

  • Trafimow, D.: Some implications of distinguishing between unexplained variance that is systematic or random. Educ. Psychol. Measur. 78(3), 482–503 (2018)

    Article  Google Scholar 

  • Trafimow, D.: My ban on null hypothesis significance testing and confidence intervals. Studies in Computational Intelligence (in press a)

    Google Scholar 

  • Trafimow, D.: An a priori solution to the replication crisis. Philos. Psychol. 31(8), 1188–1214 (2018)

    Article  Google Scholar 

  • Trafimow, D., Amrhein, V., Areshenkoff, C.N., Barrera-Causil, C.J., Beh, E.J., Bilgiç, Y.K., Bono, R., Bradley, M.T., Briggs, W.M., Cepeda-Freyre, H.A., Chaigneau, S.E., Ciocca, D.R., Correa, J.C., Cousineau, D., de Boer, M.R., Dhar, S.S., Dolgov, I., Gómez-Benito, J., Grendar, M., Grice, J.W., Guerrero-Gimenez, M.E., Gutiérrez, A., Huedo-Medina, T.B., Jaffe, K., Janyan, A., Karimnezhad, A., Korner-Nievergelt, F., Kosugi, K., Lachmair, M., Ledesma, R.D., Limongi, R., Liuzza, M.T., Lombardo, R., Marks, M.J., Meinlschmidt, G., Nalborczyk, L., Nguyen, H.T., Ospina, R., Perezgonzalez, J.D., Pfister, R., Rahona, J.J., Rodríguez-Medina, D.A., Romão, X., Ruiz-Fernández, S., Suarez, I., Tegethoff, M., Tejo, M., van de Schoot, R., Vankov, I.I., Velasco-Forero, S., Wang, T., Yamada, Y., Zoppino, F.C.M., Marmolejo-Ramos, F.: Manipulating the alpha level cannot cure significance testing. Front. Psychol. 9, 699 (2018a)

    Article  Google Scholar 

  • Trafimow, D., Clayton, K.D., Sheeran, P., Darwish, A.-F.E., Brown, J.: How do people form behavioral intentions when others have the power to determine social consequences? J. Gen. Psychol. 137, 287–309 (2010)

    Article  Google Scholar 

  • Trafimow, D., Kiekel, P.A., Clason, D.: The simultaneous consideration of between-participants and within-participants analyses in research on predictors of behaviors: the issue of dependence. Eur. J. Soc. Psychol. 34, 703–711 (2004)

    Article  Google Scholar 

  • Trafimow, D., MacDonald, J.A.: Performing inferential statistics prior to data collection. Educ. Psychol. Measur. 77(2), 204–219 (2017)

    Article  Google Scholar 

  • Trafimow, D., Marks, M.: Editorial. Basic Appl. Soc. Psychol. 37(1), 1–2 (2015)

    Article  MathSciNet  Google Scholar 

  • Trafimow, D., Marks, M.: Editorial. Basic Appl. Soc. Psychol. 38(1), 1–2 (2016)

    Article  MathSciNet  Google Scholar 

  • Trafimow, D., Wang, T., Wang, C.: Means and standard deviations, or locations and scales? That is the question! New Ideas Psychol. 50, 34–37 (2018b)

    Article  Google Scholar 

  • Trafimow, D., Wang, T., Wang, C.: From a sampling precision perspective, skewness is a friend and not an enemy! Educ. Psychol. Meas. (in press)

    Google Scholar 

  • Trueblood, J.S., Busemeyer, J.R.: A quantum probability account of order effects in inference. Cogn. Sci. 35, 1518–1552 (2011)

    Article  Google Scholar 

  • Trueblood, J.S., Busemeyer, J.R.: A quantum probability model of causal reasoning. Front. Psychol. 3, 138 (2012)

    Article  Google Scholar 

  • Valentine, J.C., Aloe, A.M., Lau, T.S.: Life after NHST: How to describe your data without “p-ing” everywhere. Basic Appl. Soc. Psychol. 37(5), 260–273 (2015)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to David Trafimow .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Trafimow, D. (2019). What to Do Instead of Null Hypothesis Significance Testing or Confidence Intervals. In: Kreinovich, V., Thach, N., Trung, N., Van Thanh, D. (eds) Beyond Traditional Probabilistic Methods in Economics. ECONVN 2019. Studies in Computational Intelligence, vol 809. Springer, Cham. https://doi.org/10.1007/978-3-030-04200-4_8

Download citation

Publish with us

Policies and ethics