ABSTRACT
In pharmaceutical and medical studies, randomized controlled trials (RCTs) aim to prove that a new treatment has better or superior efficacy than standard treatment or placebo. In fact, RCTs can also be used to evaluate the efficacy of a new treatment having similar or equivalence efficacy, or not worse or non-inferior efficacy depending on the objectives of the research. Meanwhile, the non-inferiority trials are more frequently found in research. However, the equivalence trials are another efficacy that RCTs would like to know sometimes. The purposes of this article are to provide a basic understanding for readers about the distinctions among the types of research, statistical hypothesis testing, the interpretation of hypothesis testing as well as the differences between statistical significance and clinical significance and also introduce an innovative equivalence test calls 2-df for shift-scale equivalence test.
- Zhong, Baoliang. "How to calculate sample size in randomized controlled trial?." Journal of thoracic disease 1, no. 1 (2009): 51.Google Scholar
- Pocock, Stuart J., John JV McMurray, and Tim J. Collier. "Making sense of statistics in clinical trial reports: part 1 of a 4-part series on statistics for clinical trials." Journal of the American College of Cardiology 66, no. 22 (2015): 2536-2549.Google ScholarCross Ref
- Wellek, Stefan. Testing statistical hypotheses of equivalence and noninferiority. CRC press, 2010.Google ScholarCross Ref
- Mariani, Alessandro Wasum, and Paulo Manuel Pêgo-Fernandes. "Statistical significance and clinical significance." Sao Paulo Medical Journal 132, no. 2 (2014): 71-72.Google ScholarCross Ref
- Kaewkungwal J, Singhasivanon P. Textbook of Clinical. Research. Pittisuttithum P, editor.1st ed. Bangkok: Amarin Printing & Publishing; 2011.Google Scholar
- Shafiq, Nusrat, and Samir Malhotra. "Superiority trials: raising the bar of null hypothesis statistical testing." BMJ Evidence-Based Medicine 20, no. 5 (2015): 154-155.Google ScholarCross Ref
- Food and drug administration (FDA). Guidance for industry: Non-inferiority clinical trials to establish effectiveness, 2016. [cited 2019 Sep 20]. Available from: https://www.fda.gov/media/78504/downloadGoogle Scholar
- Schumi, Jennifer, and Janet T. Wittes. "Through the looking glass: understanding non-inferiority." Trials 12, no. 1 (2011): 1-12.Google ScholarCross Ref
- Dal-Ré, Rafael, Perrine Janiaud, and John PA Ioannidis. "Real-world evidence: How pragmatic are randomized controlled trials labeled as pragmatic?." BMC medicine 16, no. 1 (2018): 1-6.Google ScholarCross Ref
- Committee for Proprietary Medicinal Products (CPMP). "Points to consider on switching between superiority and non‐inferiority." British journal of clinical pharmacology 52, no. 3 (2001): 223-228.Google Scholar
- Lehmann, Erich L., and Joseph P. Romano. Testing statistical hypotheses. Springer Science & Business Media, 2006.Google Scholar
- Schuirmann, Donald J. "A comparison of the two one-sided tests procedure and the power approach for assessing the equivalence of average bioavailability." Journal of pharmacokinetics and biopharmaceutics 15, no. 6 (1987): 657-680.Google ScholarCross Ref
- Wanitjirattikal, Puntipa. "Comparing The Equivalence Testing With Using Two One-Sided Test, Square Root Of F Distribution And 2-DF For Shift-Scale-Equivalence Testing." In ECMS, pp. 400-406. 2019.Google Scholar
Recommendations
On the Two-Sample Randomisation Test for IR Evaluation
SIGIR '21: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information RetrievalWhile previous work in comparing statistical significance tests for IR system evaluation have focused on paired data tests (e.g., for evaluating two systems using a common test collection), two-sample tests must be used when the reproducibility of IR ...
Topic Set Size Design for Paired and Unpaired Data
ICTIR '18: Proceedings of the 2018 ACM SIGIR International Conference on Theory of Information RetrievalTopic set size design is an approach to determining the sample sizes of an experiment (e.g., number of topics) based on a statistical requirement, namely a desired statistical power or a cap on the confidence interval (CI) width for the difference in ...
On the invariance, coincidence, and statistical equivalence of the GLRT, rao test, and wald test
Three common techniques to discriminate between alternatives in a binary hypothesis testing problem are: the generalized likelihood ratio test (GLRT), the Rao test, and theWald test. In this paper, we investigate some characteristics of the ...
Comments