A method for detecting hidden additivity in two-factor unreplicated experiments

https://doi.org/10.1016/j.csda.2013.05.002Get rights and content

Abstract

Assessment of interaction in unreplicated two-factor experiments is a challenging problem that has received considerable attention in the literature. A model is proposed in which the levels of one factor belong in two or more groups. Within each group the effects of the two factors are additive but the groups may interact with the ungrouped factor. This structure is called “hidden additivity” if group membership is latent. To identify plausible groupings a search is performed over the space of all possible configurations, or placement of units into two or more groups. A multiplicity-adjusted all-configurations maximum interaction F (ACMIF) test to detect hidden additivity is developed. The method is illustrated using two data sets taken from the literature and a third taken from a recent study of copy number variation due to lymphoma. A simulation study demonstrates the power of the test for hidden additivity and compares it with other well-known tests from the literature.

Introduction

Consider a crossed two-way replicated experiment with factors A and B. The analysis of variance (ANOVA) model for response yijk is yijk=μ+αi+βj+(αβ)ij+ϵijk, where i=1,,a,j=1,,b and k=1,,n are indices for levels of A, levels of B, and experimental units, or replicates, respectively. The experimental errors are typically assumed ϵijkiidN(0,σ2). The αi and βj terms quantify the main effects of each factor. The (αβ)ij terms represent interaction parameters, which allow for the possibility that the effects of one factor on the response depend on the level of the other factor. The concept of interaction is fundamental in statistical analysis and determining its presence or absence is important for correct interpretation of data (Berrington de González and Cox, 2007).

In the unreplicated case where n=1, the interaction mean square term is usually used to estimate σ2. This leaves no information to assess interaction.

Cox (1984) provides a general overview of interaction effects. Many authors have posited restricted forms of interaction which are applicable to the unreplicated case. Alin and Kurt (2006) provides an overview of many existing methods to detect interaction in the unreplicated case, including reviews of Tukey (1949), Mandel, 1961, Mandel, 1971, Snee (1982), Johnson and Graybill (1972), Tusell, 1987, Tusell, 1990 (also see  Tusell, 1992), and Boik (1993).

Ghosh and Sharma (1963), Anscombe and Tukey (1963), Cressie (1978), Koziol (1989), McDonald (1972), and Zafar-Yab (1993) discuss and extend the one degree of freedom test proposed by Tukey (1949). Other techniques which detect non-additivity include those proposed by Gollob (1968), Hegemann and Johnson, 1976a, Hegemann and Johnson, 1976b, Milliken and Rasmuson (1977), Piepho (1994), Speed and Speed (1994), Pardo and Pardo (2005), Barker et al. (2009), and Post and Bondell (2011). These approaches generally assume a specific form of non-additivity and they differ in statistical power to detect various non-additive forms.

A common approach when analyzing unreplicated data is to assume additivity, or that (αβ)ij0. An example is the model commonly used to analyze data from the randomized complete block design (RCBD): yij=μ+αi+βj+ϵij. If the two factors in (2) genuinely interact then this model is misspecified, which will result in biased estimation. As an alternative to the additivity assumption, suppose that the levels of factor B fall into a smaller number of groups such that within these groups the effect of factor A is constant across levels of factor B, but interaction between group and factor A is possible. Fig. 1 illustrates this idea. We refer to this structure as “hidden additivity” and regard group membership as a latent variable. For a review of models involving latent variables, see Guo et al. (2006).

In Section  2, a test for interaction is developed based on the idea of hidden additivity and latent variables. All possible group assignments are considered and the configuration which provides the most evidence of hidden additivity is identified. A formal test of the null hypothesis of no interaction is developed by comparing the maximum A by Group interaction F ratio to an appropriate critical threshold. In an approach closely related to the test developed in Section  2, Kharrati-Kopaei and Sadooghi-Alvandi (2007) develops a procedure based on all of the possible configurations in which the levels of factor B in (2) can be split into two sub-tables. This approach is discussed in Section  2.

In Section  3, two data sets taken from the literature are shown to exhibit the hidden additivity form of interaction. The third example is a genomic study of copy number variation in dogs with lymphoma. This study serves as a rich source of two-factor unreplicated data including thousands of response variables (gene signal intensities). In Section  4, the power of the proposed method is compared with some well-known methods under several forms of non-additivity. Section  5 includes closing remarks.

Section snippets

Proposed method

Consider the class of two-way unreplicated designs, i.e. a single observation yij is available for each i=1,,a and j=1,,b. For convenience we refer to the levels of factor B as the “blocks”, and factor A as the “treatment”, although the method applies generally to two-way unreplicated experiments. Let mij=E[yij] and m(j)=(m1j,,maj)T, i.e. the jth column vector of the array of means. Under the additive model (2), m(j)=α+(μ+βj)1, where α=(α1,,αa)T and 1 is an a-dimensional vector of ones. The

Dog lymphoma

Copy number variation (CNV) is a form of genetic variation in which there is a duplicated segment of DNA. Often the number of copies of this duplicated DNA can vary between normal and tumor cells. Genetic CNV between normal and tumor cells may suggest genetic instability induced by the tumor, which may provide insight into breakdowns in genetic regulation within tumor cells. Table 1 and Fig. 1 provide data and an interaction plot for a study of CNV among six dogs. Both normal and tumor cells

Simulation

A simulation study was conducted using SAS version 9.3 to compare the performance of ACMIF using Bonferroni and simulated thresholds, the KKSA test, the one degree of freedom test, and the rows-linear test in terms of statistical power. In all simulations N=100,000 Monte Carlo (MC) data sets were generated with a=3 and b=7. Error variance was investigated at levels σ2={1,5,10}. For computational simplicity the block effect was modeled as a fixed effect for the methods presented. A significance

Discussion

Applying multiple tests for non-additivity

In the wheat, corn, and lymphoma examples, there is evidence of multiple types of interaction, some of which were detected using only a subset of the available methods. There are many ways to investigate interaction, some more suited to certain classes than others. Kharrati-Kopaei and Sadooghi-Alvandi (2007) point out that there is no test which has high power against all forms of non-additivity, and so multiple tests may be applied to screen for

Conclusion

Hidden additivity can be well explained in the context of a RCBD. It is a form of non-additivity where the blocks fall into a small number of latent groups. Within these groups the treatment effects are constant across blocks, but between groups, the treatment effects vary. The proposed method accounts for block variability and can be applied to two-way unreplicated studies outside of randomized block experiments. A method of detecting hidden additivity in unreplicated two-factor experiments

Acknowledgments

We would like to thank Drs. Matthew Breen, Rachael Thomas, David Dickey, Thomas Severini, and Chris Smith for their contributions to this work. We would like to thank the reviewers for their helpful suggestions.

References (33)

  • L. Pardo et al.

    Nonadditivity in loglinear models using ϕ-divergences and MLEs

    Journal of Statistical Planning and Inference

    (2005)
  • F. Tusell

    Testing for interaction in two-way ANOVA tables with no replication

    Computational Statistics and Data Analysis

    (1990)
  • A. Alin et al.

    Testing non-additivity (interaction) in two-way ANOVA tables with no replication

    Statistical Methods in Medical Research

    (2006)
  • F. Anscombe et al.

    The examination and analysis of residuals

    Technometrics

    (1963)
  • Barker, C., Stefanski, L., Osborne, J., 2009. The orthogonal interactions model for unreplicated factorial experiments,...
  • A. Berrington de González et al.

    Interpretation of interaction: a review

    The Annals of Applied Statistics

    (2007)
  • R. Boik

    Testing additivity in two-way classifications with no replications: the locally best invariant test

    Journal of Applied Statistics

    (1993)
  • O. Carter et al.

    Blast furnace slags as agricultural liming materials

    Agronomy Journal

    (1951)
  • Y. Chen et al.

    A new quantitative method for analysing unreplicated factorial designs

    Biometrical Journal

    (2004)
  • D. Cox

    Interaction

    International Statistical Review

    (1984)
  • D. Cox et al.

    Theoretical Statistics

    (1974)
  • N. Cressie

    Removing nonadditivity from two-way tables with one observation per cell

    Biometrics

    (1978)
  • M. Ghosh et al.

    Power of Tukey’s test for non-additivity

    Journal of the Royal Statistical Society. Series B

    (1963)
  • H. Gollob

    A statistical model which combines features of factor analytic and analysis of variance techniques

    Psychometrika

    (1968)
  • J. Guo et al.

    Latent class regressors on latent factors

    Biostatistics

    (2006)
  • V. Hegemann et al.

    On analyzing two-way AoV data with interaction

    Technometrics

    (1976)
  • The data set “CNVdata.csv” is available as an online supplement.

    View full text