A note on sample complexity of multistage stochastic programs

https://doi.org/10.1016/j.orl.2016.04.005Get rights and content

Abstract

We derive a lower bound for the sample complexity of the Sample Average Approximation method for a certain class of multistage stochastic optimization problems. In previous works, upper bounds for such problems were derived. We show that the dependence of the lower bound with respect to the complexity parameters and the problem’s data are comparable to the upper bound’s estimates. Like previous results, our lower bound presents an additional multiplicative factor showing that it is unavoidable for certain stochastic problems.

Introduction

Consider the following T-stage stochastic programming problem represented in the nested form minx1X1{f(x1)F1(x1)+E|ξ1[infx2X2(x1,ξ2)F2(x2,ξ2)+E|ξ[2][+E|ξ[T1][infxTXT(xT1,ξT)FT(xT,ξT)]]]}, driven by the random data process ξ1,,ξT. Here, xtRnt,t=1,,T, are decisions variables, Ft:Rnt×RdtR are continuous functions and Xt:Rnt1×RdtRnt, t=2,,T, are measurable multifunctions. The (continuous) function F1:Rn1R, the (nonempty) closed set X1 and the vector ξ1 are deterministic. Moreover, ξ[t](ξ1,,ξt) denotes the history (information) available until stage t by the decision maker.

If the (conditional) distribution of ξt (given ξ[t1]) is continuous, problem (1) cannot be addressed directly, except for some trivial cases. In fact, the (conditional) expected value operators are multidimensional integrals on Rdt, that are typically impossible to evaluate with high accuracy even for moderate values of the dimension.

Hence, one usually makes a discretization of the random data of problem (1) building a scenario tree. A classical idea is to construct the tree via Monte Carlo conditional sampling techniques. Given the scenario tree, one solves the SAA problem, that is, problem (1) with the discrete random data. This is the basic idea of the SAA method.

In general, even if we solve the SAA problem exactly, its first-stage optimal decision will not be optimal for the true problem. So, there exists an error that comes from the fact that we are approximating the true stochastic process. Suppose that the true stochastic problem has an optimal solution. One can investigate sufficient conditions on the stage sample sizes N2,,NT in order to guarantee that the following conditions happen (jointly) with probability at least 1α: (i) any first-stage δ-optimal solution of the SAA problem is a first-stage ϵ-optimal solution of the true problem, and (ii) the set of first-stage δ-optimal solutions of the SAA problem is nonempty; where ϵ>0, δ[0,ϵ), and α(0,1) are specified parameters that we refer to as complexity parameters. Let us point out that this notion of complexity (with condition (ii) being implicitly assumed) was proposed and studied in  [3], [6], [8].

In  [4], it was given an explicit definition of the sample complexity of SAA method for instances and classes of T-stage stochastic optimization problems. In the same reference, it was argued that estimates of the sample sizes derived in  [8], [6] are upper bounds estimates for the sample complexity of static and multistage problems, respectively, satisfying some reasonable regularity conditions. In  [4] it was obtained an explicit upper bound’s estimate for the complexity of T-stage problems under relaxed regularity conditions. We will see later that it was important to relax these conditions in order to make a fair comparison between the upper and lower bounds estimates of the sample complexity.

In Section  2, we state the definition of sample complexity for T-stage stochastic problems and some extensions on the complexity’s upper bounds obtained in  [4]. In Section  3, we present a family of T-stage convex stochastic optimization problems where it is possible to derive a lower bound for the sample complexity of each one of these problems. We apply this result to derive our lower bound for the sample complexity of a family of convex T-stage problems. In Section  4, we compare our lower bound with the one derived for multistage financial optimization problems through no-arbitrage reasoning arguments. We also indicate one possible way to extend our results to the class of linear multistage optimization problems. This section is followed by a technical appendix.

Section snippets

Definition of the sample complexity and its upper bound

We follow closely Ref.  [4] where the respective definitions were stated. Consider a scenario tree with T-stages possessing the following node structure: every tth-stage node has Nt+1 successors nodes at stage t+1, for t=1,,T1. Under this assumption, the total number of scenarios in the tree is equal to N=t=2TNt.

We denote the sets of (first-stage) ϵ-optimal solutions, respectively, of the true and the SAA problems as Sϵ{x1X1:f(x1)v+ϵ} and SˆN2,,NTϵ{x1X1:fˆ(x1)vˆ+ϵ}, for ϵ0. The

The main result

Here, we obtain a lower bound for the sample complexity of a class of T-stage stochastic problems that satisfies the previous regularity conditions and the uniformly bounded condition. So, when we compare the derived lower bound with the previous upper bound, we are obtaining estimates that hold for the same class of problems. Observe that in  [6], [7] condition (Mt.3) was assumed in a more restrictive form. In fact, it was assumed that χt(ξt+1)=Lt, for a.e. ξt+1supp(ξt+1) and t=1,,T1. Here,

Further considerations

In this section, we discuss two issues that were pointed by an anonymous referee.

A stream of research on multistage financial stochastic optimization problems has derived some sample complexity’s lower bounds through no-arbitrage reasoning arguments. Here, we give a very brief and incomplete review of this literature. In  [2], it was addressed how the discretization of the random data for multistage financial stochastic programming models, whose state-variables are typically assumed continuous,

Acknowledgments

The author wishes to thank Prof. Alexander Shapiro for useful discussions, helpful comments and support in writing this paper. An anonymous referee made a number of helpful comments which also improved this document. The author also wishes to thank Prof. Alfredo Iusem for helping to prepare a first-version of this document. This work was done while this author was visiting the School of Industrial and Systems Engineering of Georgia Institute of Technology, Atlanta, GA, 30332-0205. This work was

References (8)

There are more references available in the full text version of this article.

Cited by (6)

View full text