Two-stage designs for identification and estimation of polynomial models

doi:10.1016/j.csda.2003.08.008

Computational Statistics & Data Analysis

Volume 46, Issue 3, 15 June 2004, Pages 531-546

https://doi.org/10.1016/j.csda.2003.08.008 Get rights and content

Abstract

A sequential approach to experimentation allows the investigator to adjust the design according to data obtained from earlier stages, as well as prioritize design objectives at each stage of the study. In this paper, we develop a two-stage design strategy for model identification and parameter estimation when the relationship between the response and the predictor is known to belong to a class of polynomial models. This approach is an extension of Montepiedra and Yeh's (Comm. Statist. Simulations Comput. 27 (1997) 377) two-stage design when there are only two candidate models. The two-stage strategy is compared to robust uni-stage designs for the case when the model is known to be polynomial up to the third order (cubic model). A generalization of the concept of expected relative efficiency is also provided.

Introduction

Suppose that an experimenter is interested in fitting a regression model such that the mean response function E[y(x)] belongs to the class of functions $F_{m} ={g_{l} : g_{l} (x)=θ_{l}^{T} f_{l} (x), l=0,1,…,m},$ where $θ_{l}^{T} =[θ_{0l},θ_{1l},…,θ_{ll}]∈ R^{l+1}$ and f_l^T(x)=[1,x,…,x^l]. Hence, E[y(x)] is believed to follow a polynomial of order m, at the most. The random response variable Y(x) is observed at different values of the controllable variable x, and these observations are independent and normally distributed with the same variance σ². An experimental design ξ is defined to be a probability measure with respect to x on a so-called design region X.

For the sake of simplicity, we shall restrict X to be the compact interval [−1,1]. The matrix defined by $M_{l} (ξ)= ∫ −1 1 f_{l} (x)f_{l}^{T} (x)ξ(d x)$ is called the information matrix of ξ under the model E[y(x)]=g_l(x). If ξ has masses ξ_i only at the support points x_i,i=1,…,n, and Nξ_i=n_i are integers, then one can interpret ξ to be an experimental design wherein N uncorrelated observations are taken, with n_i replications at each level x_i of the controllable variable. The covariance matrix of the least-squares estimator for the parameter vector θ_l is proportional to the inverse of the information matrix M_l(ξ).

An approximate optimal design maximizes or minimizes some sensible functional based on the information matrix or its inverse, in the set Ξ of all probability measures. Some of the more widely used criteria are, for instance, the log determinant (D-criterion), the trace (A-criterion) of the inverse of the information matrix and the maximum variance of the mean response on a region of interest {x∈S} (G-criterion). See Fedorov (1972) or Pukelsheim (1993) for details. These classical criteria, however, assume that the true form of the model is known.

In this paper, we consider the dual problem of finding designs that give good powers in identifying the degree of a polynomial model up to order m and at the same time provide competitive parameter estimates of the model ultimately chosen. Hence, these designs should exhibit desirable robustness properties for estimation under different models as well as for model identification. We propose a two-stage strategy, which extends the ideas of Montepiedra and Yeh (1997) to the case when there are multiple rival polynomial regression models. In their paper, they extol the philosophy of the sequential approach since it allows the flexibility of revising the design depending on the information provided by the observations from the previous stages. They provided a methodology that specifically addresses the case when there are only two rival models.

We describe in Section 2 an extension of Montepiedra and Yeh's (1997) two-stage strategy to the case when the multiple rival models fall in the class of polynomials F_m. Examples are provided for the case when m=3. Comparisons with existing uni-stage designs are made with respect to both the standard relative efficiencies and a proposed extension of the concept of expected relative efficiencies introduced in Montepiedra and Yeh (1997).

There exist in the literature some optimality criteria which are designed to possess some robustness properties, i.e., they perform well (in the sense of providing good efficiencies) under different estimation and/or identification scenarios. They will be called uni-stage designs in this paper because all observations will be collected at the same time. Each of these criteria falls under one of three categories, based on the inference goal on which the design will be used: (i) estimation, (ii) model identification, or (iii) both (i) and (ii).

Läuter (1974) suggested taking the weighted average of some reasonable functional of the information matrix under different rival models, such that the weights β^T=[β₀,β₁,…,β_m] quantify some prior belief about the order of the polynomial model. Essentially, β constitutes a probability measure on {0,1,…,m}. For the D-criterion, the design ξ_M,β is defined to be optimal if it maximizes $Ψ_{β} (ξ)= ∑ l=0 m β_{l} l+1 log |M_{l} (ξ)|.$

Dette and Studden (1995) proposed the maximization of a weighted p-mean of the relative efficiencies in the different models and derived a complete analytical solution to this problem. Optimization criterion (2) turns out to be a special case of this problem.

Since the regression model is only known to belong to the class F_m, designs which can help identify the actual order of the polynomial are also desirable. For instance, if the researcher is to employ a stepwise selection procedure after the data are collected, he would be interested in testing H₀:θ_ll=0 vs. H₁:θ_ll≠0 at an appropriate step in the model selection process. The power to reject H₀ if H₁ is true depends on the design ξ through the quantity: $δ_{ll} (ξ)= |M_{l} (ξ)| |M_{l−1} (ξ)| =(b_{l}^{T} M_{l}^{−1} (ξ)b_{l})^{−1},$ where $b_{l}^{T} =[0,0,…,0,1]∈ R^{l+1}$ . Hence, Dette (1994) considered the maximization of the weighted p-mean $Φ_{p,β}^{b} (ξ)= ∑ l=1 m β_{l} (b_{l}^{T} M_{l}^{−1} b_{l})^{−p}^{1/p},$ where β^T=(β₁,…,β_l) again denotes a set of priors and p is the order associated with the weighted mean. The corresponding D-criterion is obtained if one lets p approach 0 in the limit: $Φ_{β} (ξ)= ∑ l=1 m β_{l} ln δ_{ll} (ξ)$ and the design ξ_I,β is said to be optimal if it maximizes (4). Atkinson and Cox (1974) studied a special case of (3) and Dette (1995) proposed a maximin alternative for the case when p=−∞.

Dette (1993) suggested a mixture of criteria for model identification and parameter estimation: $Ψ_{β} (ξ)=(1−β) log |M_{m} (ξ)| |M_{m−1} (ξ)| + β m+1 log |M_{m} (ξ)|,$ where β∈[0,1]. This criterion reflects the need for a design that provides good parameter estimates for the model g_m(x) and at the same time gives good power for testing for θ_mm. It is clear from the formulation of the criterion that the optimal designs thus obtained will provide competitive efficiencies for estimation under model g_m(x) and for testing for θ_mm. For more general related criteria, the reader is referred to Dette and Franke 2000, Dette and Franke 2001. In this paper, all our comparisons are limited to uni-stage design criteria strictly for robust estimation or strictly for robust testing.

Section snippets

The two-stage robust design

We now utilize the ideas introduced in Montepiedra and Yeh (1997) by extending their proposed methods to the case when several possible models are considered. Suppose that a fraction, f, of the total resources are allocated to the first stage of the experiment. Then the following two-stage procedure will be implemented:

Stage 1: Find the design $ξ_{1}^{∗}$ that maximizes expression (4) over the set Ξ of all probability measures on X. This criterion is applicable assuming that the experimenter uses

A comparative study

We present comparisons under two settings: $β=[0, 13, 13, 13]$ and $β=[0, 214, 414, 814]$ . The former is used when there is non-informative prior, which is a common scenario. The latter provides an example when there is knowledge that the higher order model is more plausible.

We use relative efficiencies to assess robustness of the design proposed under different model scenarios. Using terminology in the literature, we define the relative D-efficiency of a design ξ for estimating under model g_l(x) as $r_{l} (ξ)=$

Expected relative efficiency

From the discussion in the previous section, we tend to emphasize the relative efficiencies of ξ_2s^l if the objective were estimation/identification of the model g_l(x). This is sensible because if g_l(x) were the correct model, then design ξ_2s^l will most likely be the final design used, with the hope that g_l(x) will ultimately be selected and used for estimation. However, there is still some chance that the other two possible designs will be used. Thus, their relative efficiencies should not be

Discussion

This paper provides a strategy when an experiment can be performed in two stages, with no concern about the possible presence of a block effect induced by a restricted within-stage randomization scheme. The idea is that the first-stage design places full emphasis on model identification, and the second-stage design is determined by optimizing the precision of the model parameter estimates.

In 3 A comparative study, 4 Expected relative efficiency, the performance of the proposed strategy was

Acknowledgements

The authors thank two referees for their insightful comments and numerous suggestions which greatly improve the presentation of the paper.

References (14)

A.C. Atkinson et al.
Planning experiments for discriminating between models (with discussion)
J. Roy. Statist. Soc. Ser. B
(1974)
H. Dette
A generalization of D- and D₁-optimal designs in polynomial regression
Ann. Statist
(1990)
H. Dette
On a mixture of the D- and D₁-optimality criterion in polynomial regression
J. Statist. Plann. Inference
(1993)
H. Dette
Discrimination designs for polynomial regression on compact intervals
Ann. Statist
(1994)
H. Dette
Optimal designs for identifying the degree of a polynomial regression
Ann. Statist
(1995)
H. Dette et al.
Constrained D- and D₁-optimal designs for polynomial regression
Ann. Statist
(2000)
H. Dette et al.
Robust designs for polynomial regression by maximizing a minimum of D- and D₁-efficiencies
Ann. Statist
(2001)

There are more references available in the full text version of this article.

Cited by (5)

Sequential sparsity iterative optimal design model for calibration of complex systems with epistemic uncertainty
2016, International Journal for Uncertainty Quantification
On the efficiency of two-stage response-adaptive designs
2013, Statistics in Medicine
On the G-efficiency of the modified arcsin design
2012, Statistics
Optimal designs for estimating the interesting part of a dose-effect curve
2007, Journal of Biopharmaceutical Statistics
Response Surfaces, Mixtures, and Ridge Analyses: Second Edition
2007, Response Surfaces, Mixtures, and Ridge Analyses: Second Edition

View full text

Computational Statistics & Data Analysis

Two-stage designs for identification and estimation of polynomial models

Abstract

Introduction

Section snippets

The two-stage robust design

A comparative study

Expected relative efficiency

Discussion

Acknowledgements

Planning experiments for discriminating between models (with discussion)

J. Roy. Statist. Soc. Ser. B

A generalization of D- and D1-optimal designs in polynomial regression

Ann. Statist

On a mixture of the D- and D1-optimality criterion in polynomial regression

J. Statist. Plann. Inference

Discrimination designs for polynomial regression on compact intervals

Ann. Statist

Optimal designs for identifying the degree of a polynomial regression

Ann. Statist

Constrained D- and D1-optimal designs for polynomial regression

Ann. Statist

Robust designs for polynomial regression by maximizing a minimum of D- and D1-efficiencies

Ann. Statist

Sequential sparsity iterative optimal design model for calibration of complex systems with epistemic uncertainty

On the efficiency of two-stage response-adaptive designs

On the G-efficiency of the modified arcsin design

Optimal designs for estimating the interesting part of a dose-effect curve

Response Surfaces, Mixtures, and Ridge Analyses: Second Edition

A generalization of D- and D₁-optimal designs in polynomial regression

On a mixture of the D- and D₁-optimality criterion in polynomial regression

Constrained D- and D₁-optimal designs for polynomial regression

Robust designs for polynomial regression by maximizing a minimum of D- and D₁-efficiencies