Nonparametric procedures for partially paired data in two groups
Introduction
In typical two-arm randomized controlled trials or observational studies, the efficacy of the active treatment is validated by comparing it with a control treatment. Such trials are typically conducted in a pre-post setting. The primary question of interest is whether the treatment is effective. The two assessments (pre and post) of the outcome can be viewed as levels of a within-subject factor and the two groups as levels of a between-subject factor. Therefore, in the terminology of repeated measures analysis, the primary question of interest becomes investigation of time-by-treatment interaction (Davis, 2002). In practice, data may not be available at either of the assessment points. Such instances may arise in randomized incomplete block designs or meta analysis. For an example of a randomized incomplete block design (RIBD), consider the special case of two units per block. These units could be two organs of the same person or a pair of twins. Suppose a unit is randomly selected from the two units and a treatment is applied to it whereas the other unit does not receive the treatment. For some of the blocks, it may happen that one of the units may not be available due to reasons other than treatment assignment or aspects unrelated to the outcome of interest, for example, due to transportation problems or work schedules. This missing mechanism is completely at random in the sense that data are missing for reasons unrelated to the study variables. This design leads to data for which the proposed methods can be applied. For another example, suppose the data to be analyzed are obtained from multiple sources where for some of the sources both versions of a treatment have been applied and for the others only one version of the treatment is applied. In the classical linear model paradigm, the corresponding models would include a random cluster (block) effect. In the nonparametric setting, the random effects are typically accounted for by allowing a general dependence (covariance) model.
Under the assumption of normality, data analysis for the above inferential problem can be carried out by a test with equal or unequal variances as appropriate. In the absence of normality, Generalized Estimating Equations (GEE) or Generalized Linear Mixed Model (GLMM) can be applied if the data fit certain parametric models (Diggle et al., 2013). When data are not measured using a metric scale (such as ordered categorical data), or when data have heavy tails or are skewed for any parametric model to be appropriate, nonparametric methods are preferred. Often, nonparametric methods have as much power as parametric methods when the assumptions of parametric methods are met, and have even more power when the data come from non-normal populations (DeWayne et al., 2010).
Modern nonparametric methods forego parametric or semi-parametric assumptions by formulating hypotheses in terms of distribution functions or some suitable functionals of the distribution functions. These functionals, referred to as nonparametric relative effects, are also used to quantify the magnitude of the effects of interest (Brunner et al., 2002). The use of empirical distribution functions to estimate these functionals leads to rank-based methods in a natural way. Although most of the rank-based procedures seem to sacrifice too much of the basic information in the samples, theoretical efficiency investigations have shown that this is not the case (Hodges and Lehmann, 1956, Hallin and Tribel, 2000).
Nonparametric methods for dependent data have been developed in a body of literature spanning over four decades. One of the earlier attempts was the use of rank-based methods for repeated measures by Brunner and Neumann (1982), which was later generalized by Thompson, 1990, Thompson, 1991 under continuity assumption. The idea to formulate nonparametric hypotheses for the marginal models in terms of the distribution functions, where marginal distributions are used to describe treatment effects and to formulate hypotheses, was introduced by Akritas and Arnold (1994), and further developed in the mixed effects model context by Akritas and Brunner (1997). These ideas were elaborated for repeated measures design by Brunner et al. (1999). Motivated by the difficulty in interpreting alternatives when hypotheses are formulated in terms of distribution functions, Konietschke et al. (2012) developed methods for repeated measures in one group for hypotheses formulated in terms of nonparametric relative effects. These methods allow construction of confidence intervals and they also address the nonparametric Behrens–Fisher problem (Brunner and Munzel, 2000) in the sense that the distribution of the data in the various groups could still be different under the null hypothesis. The results of Konietschke et al. (2012) were later generalized to a factorial design setup by Brunner et al. (2017).
The seemingly elementary data analysis problem discussed above could quickly turn into a challenge if the data are subject to missing values and the assumption of normality is grossly violated. In today’s data collection methods, unavailability of data from study units due to, for example, failure of devices or progression of disease, happen quite commonly. Data could be missing completely at random (MCAR ) for a reason unrelated to the study variables or missing at random (MAR) for a reason unrelated to the actual missing value, but may be related to other study variables. In this paper we focus on the former type. Albeit simple, sometimes this missing type can be motivated from design and survey perspectives, for example, see the discussion at the end of Section 2. See also Fong et al. (2017), Fuchs et al. (2017), Samawi and Vogel, 2014, Samawi and Vogel, 2015 and Xu and Harrar (2012) for examples in medicine and public health.
Under the assumptions of parametric models, Expectation–Maximization (EM) or Multiple Imputation (MI) algorithms (Little and Rubin, 2002) can be applied in conjunction with the assumed models for the complete data. Recent semi-parametric approaches that use all available information, but at least require the existence of the first few moments, include Xu and Harrar, 2012, Samawi and Vogel, 2014, Samawi and Vogel, 2015, Amro and Pauly (2017), and the references therein. The methods of Samawi and Vogel, 2014, Samawi and Vogel, 2015 are in the context of partially paired data and work by combining paired and independent sample tests for the paired and unpaired portion of the data, respectively. Apart from using permutations to determine the null distribution, Amro and Pauly (2017) also combine statistics separately calculated from the complete and incomplete data.
Nonparametric methods for incomplete paired data received the attention of researchers fairly recently (Akritas et al., 2002, Akritas et al., 2006, Konietschke et al., 2012, Samawi and Vogel, 2014, Samawi and Vogel, 2015, Fong et al., 2017). The papers by Akritas et al., 2002, Akritas et al., 2006 introduced nonparametric tests for one and multiple-group paired data under the so-called random missingness. However, the hypotheses of interest are formulated in terms of marginal distributions and the asymptotic variances are estimated under the null hypotheses. Therefore, the tests cannot be used for construction of confidence intervals. The methods of Fong et al. (2017) and Samawi and Vogel, 2014, Samawi and Vogel, 2015, which are constructed by combining the Wilcoxon-Signed-Rank and Wilcoxon–Mann–Whitney tests, also suffer from this very problem. Further, these methods are intended for partially paired data in one group. Konietschke et al. (2012) considered a nonparametric method for partially paired data. The strengths of their method are that hypotheses are formulated in terms of the nonparametric relative effect and the asymptotic variance of the test statistic is derived under general conditions. Therefore, the asymptotic results can be used to construct confidence intervals. However, as the method is designed for a single group situation and it cannot be applied to compare treatments in two-arms and two-assessments studies. Hence, this paper attempts to address the shortcoming outlined above. More specifically, this paper aims to develop a nonparametric test for two-arm trials where the outcome, assessed at two time points, is subject to missing values. The nonparametric method accommodates binary, ordered categorical, discrete and continuous data in a unified manner. The approach can also be used to construct a confidence interval for the primary effect of interest, which is the interaction effect. Unlike some of the other methods (e.g. Samawi and Vogel, 2014, Samawi and Vogel, 2015, Xu and Harrar, 2012, Amro and Pauly, 2017), the new method compares the complete and incomplete pieces of the data to construct a test and a confidence interval. For further clarity we display the schematic layout of the dataset in the paper as given in Table 1.
The remainder of the paper is organized as follows. The statistical model and the nonparametric effect size are detailed in Section 2. An estimator for the nonparametric effect is proposed and a rank-based expression for the estimator is derived. The asymptotic theory for the nonparametric method, which constitutes the main contribution of the paper, is presented in Section 3. The application of the theory for carrying out a significance test and constructing a confidence interval calls for derivation of a consistent estimator for the asymptotic variance. This is done in Section 4. The applications of the asymptotic theory for tests and confidence intervals for the nonparametric effect are outlined in Section 5. More importantly, Section 5 proposes a small-sample test and a confidence interval. Simulation studies are carried out in Section 6 to evaluate the numerical accuracy of the asymptotic results and finite sample approximations. Another aim of Section 6 is to numerically compare the performance of the nonparametric test developed and to compare its finite sample approximation with competing parametric method in terms of size and power of tests. We close Section 6 by analyzing real datasets from public health to illustrate the application of the methods. Further discussion of the methods, conclusions and recommendations are provided in Section 7. All proofs and technical details are placed in the Appendix.
Section snippets
Model and interaction-effect size measure
Suppose there are subjects from group that can be partitioned into complete cases with measurements at both time points, incomplete cases that are observed only at time point , and incomplete cases that are observed only at time point . For instance, denotes the number of incomplete subjects in group 1 at time point 1. Denote the paired observations from the complete cases by for , and . Further, denote the observations from
Asymptotic theory
Under the asymptotic framework of Assumption 1, the estimator is asymptotically unbiased and strongly consistent. These facts are stated and proved in Proposition 3.
Proposition 3 Under Assumption 1, is an asymptotically unbiased estimator of . is a strongly consistent estimator of , i.e. .
Next, we investigate the asymptotic distribution of . Our method of derivation involves obtaining an asymptotic equivalent version of which can be expressed as the sum of
Estimation of the asymptotic variance
In order to be able to use the asymptotic distribution result for statistical inference, we need a consistent estimator of . From the Weak Law of Large Numbers, we know that for where However, and cannot directly be used to construct a consistent estimator of because they are defined in terms of unobservable random variables and . In the following, we
Test procedures and confidence intervals
Recall that in Eq. (2), the quantity is defined as an interaction effect. So, means that there is no interaction effect, which in turn is equivalent to saying . Therefore, testing the hypothesis of no interaction effect can be formulated as The statistic and its limiting distribution, under Assumption 1, are which can be used to test the hypothesis of no interaction effect. The asymptotic
Simulation study
In this subsection we investigate the numerical accuracy of the asymptotic results of Sections 3 Asymptotic theory, 4 Estimation of the asymptotic variance via a simulation study. We also use simulations to perform numerical comparisons of the nonparametric test proposed in this paper with parametric tests for the same problem developed elsewhere. We carry out comparisons in terms of the size and power of the tests. In all the simulations, the run size is 10,000. The methods compared are:
- :
Discussion and conclusion
We have developed a nonparametric method for partially paired (correlated) data. The method is particularly suited to the situation where two treatments are compared by assessing outcomes at two occasions, and it is designed for the situation where data are available at either one of the occasions. The procedures proposed in the manuscript are geared towards testing significance and constructing confidence interval for the difference in the change induced by the two treatments. The method can
Acknowledgments
The authors are grateful to the associate editor and the three anonymous referees for critically reading the original version of the manuscript and making valuable suggestions that led to substantial improvements. The authors are also thankful to the editor for the orderly handling of the manuscript. The authors would like to express their gratitude to Prof. Derek S. Young for his willingness and time to read through the final version of manuscript and make editorial suggestions. Part of the
References (38)
- et al.
A unified approach to rank tests for mixed models
J. Statist. Plann. Inference
(1997) - et al.
Rank-score tests in factorial designs with repeated measures
J. Multivariate Anal.
(1999) - et al.
Motivational interviewing for smoking cessation in college students: a group randomized controlled trial
Prev. Med.
(2010) - et al.
Ranking procedures for matched pairs with missing data – asymptotic theory and a small sample approximation
Comput. Statist. Data Anal.
(2012) - et al.
Asthma randomized trial of indoor wood smoke (ARTIS): rationale and methods
Contemp. Clin. Trials
(2012) Asymptotic distribution of rank statistics under dependencies with multivariate application
J. Multivariate Anal.
(1990)- et al.
Nonparametric analysis of factorial designs with random missingness: bivariate data
J. Amer. Statist. Assoc.
(2006) - et al.
Fully nonparametric hypotheses for factorial designs I: multivariate repeated measures designs
J. Amer. Statist. Assoc.
(1994) - et al.
A nonparametric approach to matched pairs with missing data
Sociol. Methods Res.
(2002) - et al.
Permuting incomplete paired data: a novel exact and asymptotic correct randomization test
J. Stat. Comput. Simul.
(2017)
A class of distributions which includes the normal ones
Scand. J. Stat.
Distributions generated by perturbation of symmetry with emphasis on a multivariate skew t-distribution
J. R. Stat. Soc. Ser. B Stat. Methodol.
Probability and Measure
Box-type approximations in nonparametric factorial designs
J. Amer. Statist. Assoc.
Nonparametric Analysis of Longitudinal Data in Factorial Experiments
Rank-based procedures in factorial designs: hypotheses about non-parametric treatment effects
J. R. Stat. Soc. Ser. B Stat. Methodol.
The nonparametric Behrens-Fisher problem: asymptotic theory and a small-sample approximation
Biom. J.
Rank tests for correlated random variables
Biom. J.
Statistical Methods for the Analysis of Repeated Measurements
Cited by (6)
Nonparametric methods for clustered data in pre-post intervention design
2023, Journal of Statistical Planning and InferenceNonparametric Finite Mixture: Applications in Overcoming Misclassification Bias
2023, Journal of the American Statistical AssociationRobust testing procedures for scale differences in paired data
2023, Journal of Statistical Computation and SimulationRanking procedures for repeated measures designs with missing data: Estimation, testing and asymptotic theory
2022, Statistical Methods in Medical ResearchAsymptotic-based bootstrap approach for matched pairs with missingness in a single arm
2021, Biometrical JournalIntegration of statistical models and computer simulation in environmental accidents: A study on leakage of red mud in the Pará river, Amazon, Brazil
2021, Journal of the Brazilian Chemical Society