Application of imperialist competitive algorithm to find minimax and standardized maximin optimal designs

https://doi.org/10.1016/j.csda.2016.06.014

Abstract

Finding optimal designs for nonlinear models is complicated because the design criterion depends on the model parameters. If a plausible region for these parameters is available, a minimax optimal design may be used to remove this dependency by minimizing the maximum inefficiency that may arise due to misspecification in the parameters. Minimax optimal designs are often analytically intractable and are notoriously difficult to find, even numerically. A population-based evolutionary algorithm called imperialist competitive algorithm (ICA) is applied to find minimax or nearly minimax D-optimal designs for nonlinear models. The usefulness of the algorithm is also demonstrated by showing it can hybridize with a local search to find optimal designs under a more complicated criterion, such as standardized maximin optimality.

Introduction

A wide class of evolutionary algorithms has been increasingly used to solve hard optimization problems in engineering, bioinformatics, computer science and finance. Of particular interest is the class of nature-inspired algorithms motivated by ideas from biology and the life sciences. Examples of such algorithms frequently used in the statistics literature are simulated annealing (SA) and genetic algorithms (GA). There are more recent and potentially more powerful ones, such as differential evolution (DE), particle swarm optimization (PSO), the imperialist competitive algorithm (ICA) and cuckoo search (CS), that have not yet been well tested on statistical problems. Our interest in this paper is the application of one such algorithm to construct optimal experimental designs. The usefulness of implementing such designs, in terms of cost savings and accurate statistical inference, is detailed in Atkinson (1996). An introduction to this subfield of optimal experimental designs is available in Berger and Wong (2009), and real applications of optimal designs can be found in Berger and Wong (2005).

A common appeal of such algorithms is that they are largely assumption-free, fast, easy to implement and broadly applicable to different types of constrained or unconstrained optimization problems. Consequently, they have good potential to optimize complicated functions with many variables, regardless of whether the objective function is differentiable. A common feature among these algorithms is that they require tuning parameters: when these are well chosen, the algorithm finds the optimum quickly; when they are poorly chosen, the algorithm does not give satisfactory answers. These algorithms do not usually have a firm theoretical basis, such as a proof of convergence to the optimum. However, they have been used successfully in many applied fields to solve real, complicated and high-dimensional optimization problems where traditional formulations or methods fail. Our view is that the lack of a convergence proof should not hinder their use in statistics; for some problems, such as the ones we work with, there is theory for verifying whether a generated design is optimal, and if it is not, theory is available to assess its proximity to the optimum without knowing the optimum itself. We next briefly review selected applications of such algorithms for finding optimal designs in the literature.

Simulated annealing was first proposed by Kirkpatrick et al. (1983), and Meyer and Nachtsheim (1988) appear to be among the first to use SA for design construction, producing exact D-optimal designs for both finite and continuous design spaces. Haines (1987) applied SA to construct exact D-, I-, and G-optimal designs for polynomial regression models. Atkinson (1992) discussed starting values for SA, with a focus on optimal design construction, and recommended segmenting the search to a maximum number of evaluations. Other applications of SA include Zhou (2008), who found exact minimax D-optimal designs on discrete design spaces, and Wilmut and Zhou (2011), who constructed D-optimal minimax two-level fractional factorial designs using a sequential algorithm. Woods (2010) implemented SA to obtain exact optimal designs for binary models under the optimum-in-average criterion. This algorithm is one of the very few that can be shown to converge to the optimum.

GA was proposed by John Holland in the 1970s and has been applied to search for exact optimal designs, mainly for linear models. An early proponent is Montepiedra et al. (1998), who found exact optimal designs for polynomial models; others include Heredia-Langner et al. (2003), Drain et al. (2004) and Mandal et al. (2015). Hamada et al. (2001) used GA to find near-optimal Bayesian experimental designs for linear and nonlinear regression and dichotomous data. A recent review of the application of GA to optimal design problems is given in Lin et al. (2015).

The particle swarm optimization (PSO) algorithm proposed by Eberhart and Kennedy (1995) has emerged as a popular tool for solving real-world optimization problems. In PSO terminology, each possible solution is called a “particle”. Similar to many evolutionary algorithms, PSO is initialized with a population of random particles, called a “swarm”. Each particle flies through the problem space with its own “velocity”, which is updated in every iteration according to the particle’s own best position and the best global position over the swarm. PSO has been used to find several types of optimal designs for different problems. For example, Qiu et al. (2014) applied PSO to find locally D- and c-optimal designs for the compartmental, logistic and double exponential models, and compared PSO’s performance with that of the differential evolution algorithm proposed by Storn and Price (1997). Wong et al. (2015) demonstrated the usefulness of PSO by applying it to several types of optimal design problems for different mixture models defined over a regular or irregular simplex. In addition, Chen et al. (2014) modified PSO to find minimax optimal designs for the logistic and enzyme kinetic models. Such optimal designs are notoriously difficult to find because the design criterion is non-differentiable and involves two layers of optimization.
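The velocity-and-position update described above can be sketched as follows. This is a generic minimization sketch over a box, not the implementation used in any of the cited papers; the inertia and acceleration constants w, c1, c2 are conventional illustrative choices.

```python
import numpy as np

def pso_minimize(f, bounds, n_particles=30, n_iter=200,
                 w=0.7, c1=1.5, c2=1.5, seed=0):
    """Minimize f over a box via the basic PSO update (illustrative sketch)."""
    rng = np.random.default_rng(seed)
    lo, hi = np.asarray(bounds[0], float), np.asarray(bounds[1], float)
    dim = lo.size
    x = rng.uniform(lo, hi, size=(n_particles, dim))   # particle positions
    v = np.zeros_like(x)                               # particle velocities
    pbest = x.copy()                                   # personal best positions
    pbest_val = np.apply_along_axis(f, 1, x)
    g = pbest[pbest_val.argmin()].copy()               # global best position
    for _ in range(n_iter):
        r1, r2 = rng.random((2, n_particles, dim))
        # velocity update: inertia + pull toward personal and global bests
        v = w * v + c1 * r1 * (pbest - x) + c2 * r2 * (g - x)
        x = np.clip(x + v, lo, hi)
        val = np.apply_along_axis(f, 1, x)
        improved = val < pbest_val
        pbest[improved], pbest_val[improved] = x[improved], val[improved]
        g = pbest[pbest_val.argmin()].copy()
    return g, float(pbest_val.min())
```

On a smooth test function such as the 2-D sphere, this sketch converges to near the minimizer within a couple of hundred iterations.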

The main goal of this paper is to investigate the capability of the Imperialist Competitive Algorithm (ICA) for finding optimal designs. Our work appears to be the first to use ICA for a statistical application. ICA is a meta-heuristic evolutionary algorithm inspired by human socio-political processes and proposed in Atashpaz-Gargari and Lucas (2007). In this sense it differs from the above-mentioned nature-inspired algorithms, which are inspired by animal behavior. ICA has been successfully applied in engineering subfields such as industrial, civil, mechanical, electronic, petroleum and computer engineering; see Hosseini and Al Khaled (2014) for a review. Our interest in ICA is in part due to recent reports from the engineering literature suggesting that ICA can outperform some widely used nature-inspired evolutionary algorithms, including PSO. For example, Hamel et al. (2012) compared the performance of ICA and PSO in optimizing well-known test functions, such as De Jong’s, Rastrigin’s and Hartmann’s functions, in nondestructive Eddy-Current Testing (ECT) problems. They reported that when the objective function had five or fewer parameters, ICA and PSO performed almost the same; however, as the number of parameters increased, ICA found the solution faster than PSO and with greater accuracy (see Table 3 of Hamel et al., 2012 for details).

In what follows, we focus on finding minimax types of optimal designs using ICA. These are hard design problems because they involve solving nested multi-level optimization problems, the optimality criterion is not differentiable, and there is no algorithm that we know of that can be shown to converge to such optimal designs in a general nonlinear regression setup. We show how to modify ICA with a perturbed move, coupled with a local search procedure, to find the optimal designs more effectively.

In the next section, we review the statistical setup and theory for finding optimal designs. We present details and implementation information for the ICA in Section 3. In Section 4 we demonstrate how ICA may be applied to find minimax optimal designs for the power logistic model. Section 5 modifies ICA to find standardized maximin D-optimal designs for the log-linear, exponential and enzyme kinetic models. Section 6 provides a discussion.

Section snippets

Background and minimax optimal designs

Throughout we focus on approximate designs, proposed by Kiefer in the late 1950s. His subsequent work and numerous applications of approximate design ideas are now voluminously documented in Kiefer (1985). An approximate design ξ is a probability measure defined on a user-selected design space χ. Let Ξ be the space of all such designs on χ and let ξ be an approximate design with k support points x1, x2, …, xk from χ and corresponding weights w1, …, wk, where wi > 0 and w1 + … + wk = 1. This means that when we have a
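As a concrete illustration of this representation, an approximate design can be stored as a vector of support points with a matching weight vector, and its information matrix assembled as a weighted sum of rank-one terms. The quadratic-regression example below is our own illustration (not taken from the paper); for g(x) = (1, x, x²) on [−1, 1] the D-optimal design is known to put weight 1/3 on each of −1, 0, 1.

```python
import numpy as np

def info_matrix(x, w, grad):
    """M(xi) = sum_i w_i g(x_i) g(x_i)^T for an approximate design xi = (x, w)."""
    G = np.array([grad(xi) for xi in x])      # k x p matrix of regressors
    return G.T @ (w[:, None] * G)

# Illustrative example: quadratic regression g(x) = (1, x, x^2) on [-1, 1].
g = lambda x: np.array([1.0, x, x * x])
x_opt = np.array([-1.0, 0.0, 1.0])
w_opt = np.array([1 / 3, 1 / 3, 1 / 3])
M_opt = info_matrix(x_opt, w_opt, g)
logdet_opt = np.linalg.slogdet(M_opt)[1]      # D-criterion value log|M|
```

A competing design with the same weights but support squeezed to ±0.5 has a strictly smaller log-determinant, consistent with the optimality of the equally weighted design at −1, 0, 1.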

The Imperialist Competitive Algorithm (ICA)

This section describes the basic concepts behind ICA and we show they may be used to search for optimal designs. ICA begins with a random population of solutions called countries. Countries within ICA are analogous to particles in PSO. The population of countries is divided into some sub-populations called empires. Each empire contains one imperialist and some colonies. These empires start to evolve simultaneously by moving all colonies toward the imperialists through an operation called
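The workflow sketched above (a population of countries partitioned into empires, with colonies assimilated toward their imperialists and a colony that beats its imperialist taking over the empire) can be illustrated with a minimal sketch. This is our own simplified illustration, not the authors' implementation: it omits, among other things, the imperialistic-competition step in which the weakest empire gradually loses its colonies.

```python
import numpy as np

def ica_minimize(f, bounds, n_countries=40, n_imp=4, n_iter=150,
                 beta=2.0, seed=0):
    """Minimal ICA sketch: assimilation plus imperialist/colony exchange."""
    rng = np.random.default_rng(seed)
    lo, hi = np.asarray(bounds[0], float), np.asarray(bounds[1], float)
    dim = lo.size
    pop = rng.uniform(lo, hi, size=(n_countries, dim))   # initial countries
    cost = np.apply_along_axis(f, 1, pop)
    order = cost.argsort()
    imps = pop[order[:n_imp]].copy()                     # best countries rule
    imp_cost = cost[order[:n_imp]].copy()
    colonies = pop[order[n_imp:]].copy()
    owner = rng.integers(0, n_imp, size=len(colonies))   # empire membership
    for _ in range(n_iter):
        # assimilation: each colony moves toward its imperialist,
        # with a random per-coordinate step of size up to beta
        step = beta * rng.random((len(colonies), dim))
        colonies = np.clip(colonies + step * (imps[owner] - colonies), lo, hi)
        col_cost = np.apply_along_axis(f, 1, colonies)
        # exchange: a colony that beats its imperialist becomes the imperialist
        for j in range(len(colonies)):
            k = owner[j]
            if col_cost[j] < imp_cost[k]:
                imps[k], colonies[j] = colonies[j].copy(), imps[k].copy()
                imp_cost[k], col_cost[j] = col_cost[j], imp_cost[k]
    b = imp_cost.argmin()
    return imps[b], float(imp_cost[b])
```

On an easy smooth objective the best imperialist steadily improves, since it is only ever replaced by a strictly better colony.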

Minimax D-optimal designs for the power logistic model

The power logistic model is given by f(x, θ) = [1 + exp(−b(x − a))]^(−s), where θ = (a, b). In the minimax approach, we suppose θ belongs to the region of uncertainty Θ = [a_L, a_U] × [b_L, b_U], where a_L, a_U, b_L, b_U are known lower and upper bounds for a and b. This model is sometimes called the 3-parameter logistic model and is more flexible than the simple logistic model, where s = 1. When s ≠ 1, heteroscedasticity and skewness can be accommodated by the power logistic model but not the simple logistic
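Under the usual binary-response reading of this model, the information matrix of a design for θ = (a, b) (with s treated as fixed) can be sketched as below. The gradient expressions are ours, obtained by differentiating the probability formula above, and the whole block should be read as an illustrative sketch rather than the paper's implementation.

```python
import numpy as np

def p_power_logistic(x, a, b, s=1.0):
    """Success probability of the power logistic model."""
    return (1.0 + np.exp(-b * (x - a))) ** (-s)

def info_matrix(x, w, a, b, s=1.0):
    """Fisher information of a design (x, w) for theta = (a, b), assuming a
    binary response with success probability p_power_logistic(x, a, b, s)."""
    M = np.zeros((2, 2))
    for xi, wi in zip(x, w):
        e = np.exp(-b * (xi - a))
        u = 1.0 + e
        p = u ** (-s)
        # gradient of p with respect to (a, b)
        grad = np.array([-s * b * e * u ** (-s - 1),
                          s * (xi - a) * e * u ** (-s - 1)])
        M += wi * np.outer(grad, grad) / (p * (1.0 - p))
    return M
```

For the simple logistic case (s = 1, a = 0, b = 1), a symmetric two-point design with moderate spread has a larger determinant than one with points pushed far into the tails, where responses are nearly deterministic.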

Standardized maximin D-optimal designs

Minimax optimal designs are essentially equivalent to maximin optimal designs. These optimal designs can have very different criterion values depending on the nominal set of parameter values. Accordingly, it is desirable to standardize the criterion and control for the potentially widely varying magnitude of the criterion.

A design ξ* is a standardized maximin D-optimal design (Dette, 1997) if ξ* = arg max_{ξ ∈ Ξ} inf_{θ ∈ Θ} [ (|M(ξ, θ)| / |M(ξ_θ, θ)|)^(1/p) ], where ξ_θ is the locally D-optimal design for the parameter θ.
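Given the information matrices of a candidate design over a finite grid of θ values, together with the log-determinants of the corresponding locally D-optimal designs, the criterion value can be computed as a minimum of D-efficiencies. The sketch below is illustrative: the matrices, the local log-determinants and p = 2 are placeholders, and in practice the inner infimum is itself an optimization over Θ rather than a minimum over a small grid.

```python
import numpy as np

def std_maximin_value(M_grid, local_logdets, p):
    """Standardized maximin D-criterion of one design: the minimum over a
    parameter grid of (|M(xi, theta)| / |M(xi_theta, theta)|)^(1/p),
    computed on the log scale for numerical stability."""
    effs = [np.exp((np.linalg.slogdet(Mt)[1] - ld) / p)
            for Mt, ld in zip(M_grid, local_logdets)]
    return min(effs)

# Placeholder example with p = 2 and two grid points: the design attains
# D-efficiency 1 at the first theta and sqrt(1/2) at the second.
M_grid = [np.diag([2.0, 2.0]), np.diag([1.0, 4.0])]
local_logdets = [np.log(4.0), np.log(8.0)]
value = std_maximin_value(M_grid, local_logdets, p=2)
```

The criterion is thus the worst-case D-efficiency of the design relative to the locally optimal designs, which is what makes it scale-free across θ.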

Discussion

In this paper, we proposed and tested ICA as an alternative evolutionary algorithm for finding minimax and standardized maximin optimal designs for nonlinear models. We applied ICA to generate minimax optimal designs for the power logistic model, and standardized maximin D-optimal designs for the log-linear model, the exponential model and several inhibition models useful in enzyme kinetic studies. These are hard problems because the design criterion is non-differentiable and requires

Acknowledgments

The authors would like to thank three anonymous referees and an Associate Editor for helpful comments and suggestions on an earlier version of the article. The research of Wong reported in this paper was partially supported by grant award R01GM107639 from the National Institute of General Medical Sciences of the National Institutes of Health. The contents of this paper are solely the responsibility of the authors and do not necessarily represent the official views of the National Institutes of Health.

References (62)

  • Atashpaz-Gargari, E., 2008. Imperialist competitive algorithm, ...
  • E. Atashpaz-Gargari et al., Imperialist competitive algorithm: an algorithm for optimization inspired by imperialistic competition
  • A.C. Atkinson, The usefulness of optimum experimental designs, J. R. Stat. Soc. Ser. B Stat. Methodol. (1996)
  • A.C. Atkinson, A segmented algorithm for simulated annealing, Stat. Comput. (1992)
  • A.C. Atkinson et al., Optimum Experimental Designs, with SAS, Vol. 34 (2007)
  • C.L. Atwood, Optimal and efficient designs of experiments, Ann. Math. Statist. (1969)
  • Bendtsen, C., 2012. pso: particle swarm optimization, R package version 1.0.3. URL ...
  • M.P. Berger et al., Minimax D-optimal designs for item response theory models, Psychometrika (2000)
  • M.P.F. Berger et al., An Introduction to Optimal Designs for Social and Biomedical Research (2009)
  • M.P.F. Berger et al., Applied Optimal Designs (2005)
  • B. Bogacka et al., Optimum design of experiments for enzyme inhibition kinetic models, J. Biopharm. Statist. (2011)
  • F. Bretz et al., Practical considerations for optimal designs in clinical dose finding studies, Stat. Med. (2010)
  • Brooke, A., Kendrick, D., Meeraus, A., 1996. GAMS Release 2.25: A User's Guide, GAMS Development Corporation ...
  • R.-B. Chen et al., Minimax optimal designs via particle swarm optimization methods, Stat. Comput. (2014)
  • H. Chernoff, Locally optimal designs for estimating parameters, Ann. Math. Statist. (1953)
  • H. Dette, Designing experiments with respect to standardized optimality criteria, J. R. Stat. Soc. Ser. B Stat. Methodol. (1997)
  • H. Dette et al., Robust and efficient designs for the Michaelis–Menten model, J. Amer. Statist. Assoc. (2003)
  • H. Dette et al., Optimal designs for the EMAX, log-linear and exponential models, Biometrika (2010)
  • H. Dette et al., Locally D-optimal designs for exponential regression models, Statist. Sinica (2006)
  • D. Drain et al., A genetic algorithm hybrid for constructing optimal response surface designs, Qual. Reliab. Eng. Int. (2004)
  • B.P. Duarte et al., A semi-infinite programming based algorithm for finding minimax optimal designs for nonlinear models, Stat. Comput. (2013)