A note on sample complexity of multistage stochastic programs

doi:10.1016/j.orl.2016.04.005

Operations Research Letters

Volume 44, Issue 4, July 2016, Pages 430-435

Abstract

We derive a lower bound for the sample complexity of the Sample Average Approximation (SAA) method for a certain class of multistage stochastic optimization problems. Upper bounds for such problems were derived in previous works. We show that the dependence of the lower bound on the complexity parameters and on the problem's data is comparable to that of the upper bound estimates. As in previous results, our lower bound exhibits an additional multiplicative factor, showing that this factor is unavoidable for certain stochastic problems.

Introduction

Consider the following $T$-stage stochastic programming problem represented in the nested form $\min_{x_{1} \in X_{1}} \{ f(x_{1}) ≔ F_{1}(x_{1}) + E_{|ξ_{1}} [ \inf_{x_{2} \in X_{2}(x_{1}, ξ_{2})} F_{2}(x_{2}, ξ_{2}) + E_{|ξ_{[2]}} [ \dots + E_{|ξ_{[T-1]}} [ \inf_{x_{T} \in X_{T}(x_{T-1}, ξ_{T})} F_{T}(x_{T}, ξ_{T}) ] ] ] \}, \quad (1)$ driven by the random data process $ξ_{1}, \dots, ξ_{T}$. Here, $x_{t} \in R^{n_{t}}$, $t = 1, \dots, T$, are decision variables, $F_{t} : R^{n_{t}} \times R^{d_{t}} \to R$ are continuous functions and $X_{t} : R^{n_{t-1}} \times R^{d_{t}} ⇉ R^{n_{t}}$, $t = 2, \dots, T$, are measurable multifunctions. The (continuous) function $F_{1} : R^{n_{1}} \to R$, the (nonempty) closed set $X_{1}$ and the vector $ξ_{1}$ are deterministic. Moreover, $ξ_{[t]} ≔ (ξ_{1}, \dots, ξ_{t})$ denotes the history (information) available to the decision maker up to stage $t$.

If the (conditional) distribution of $ξ_{t}$ (given $ξ_{[t-1]}$) is continuous, problem (1) cannot be addressed directly, except in some trivial cases. Indeed, the (conditional) expected value operators are multidimensional integrals on $R^{d_{t}}$, which are typically impossible to evaluate with high accuracy even for moderate values of the dimension.

Hence, one usually discretizes the random data of problem (1) by building a scenario tree. A classical idea is to construct the tree via Monte Carlo conditional sampling techniques. Given the scenario tree, one solves the SAA problem, that is, problem (1) with the discretized random data. This is the basic idea of the SAA method.
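The tree construction described above can be sketched in a few lines. This is only an illustration under stated assumptions: the conditional sampler `sample_next` (an AR(1)-type recursion with Gaussian noise), the dictionary-based node layout, and the sample sizes are hypothetical choices that do not come from the paper.

```python
import random

def sample_next(history, rng):
    """Hypothetical conditional sampler: draws xi_t given the history xi_[t-1].
    An AR(1)-type recursion is assumed purely for illustration."""
    return 0.5 * history[-1] + rng.gauss(0.0, 1.0)

def build_tree(history, sample_sizes, rng):
    """Build a scenario tree by Monte Carlo conditional sampling.

    history      -- realized values (xi_1, ..., xi_t) leading to this node
    sample_sizes -- remaining stage sample sizes [N_{t+1}, ..., N_T]
    """
    node = {"value": history[-1], "children": []}
    if sample_sizes:
        for _ in range(sample_sizes[0]):
            child_value = sample_next(history, rng)
            child = build_tree(history + [child_value], sample_sizes[1:], rng)
            node["children"].append(child)
    return node

def count_leaves(node):
    """Number of scenarios (root-to-leaf paths) below a node."""
    if not node["children"]:
        return 1
    return sum(count_leaves(child) for child in node["children"])

rng = random.Random(0)
# xi_1 = 0 is deterministic; N_2 = 3 and N_3 = 2 give a three-stage tree.
tree = build_tree([0.0], [3, 2], rng)
print(count_leaves(tree))  # 6
```

Each node draws its successors conditionally on its own history, so sibling subtrees are sampled independently of each other given that history.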

In general, even if we solve the SAA problem exactly, its first-stage optimal decision will not be optimal for the true problem. Hence there is an error stemming from the approximation of the true stochastic process. Suppose that the true stochastic problem has an optimal solution. One can investigate sufficient conditions on the stage sample sizes $N_{2}, \dots, N_{T}$ that guarantee that the following conditions hold jointly with probability at least $1 - α$: (i) any first-stage $δ$-optimal solution of the SAA problem is a first-stage $ϵ$-optimal solution of the true problem, and (ii) the set of first-stage $δ$-optimal solutions of the SAA problem is nonempty. Here $ϵ > 0$, $δ \in [0, ϵ)$, and $α \in (0, 1)$ are specified parameters that we refer to as complexity parameters. Let us point out that this notion of complexity (with condition (ii) being implicitly assumed) was proposed and studied in [3], [6], [8].
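To make conditions (i) and (ii) concrete, the following sketch estimates by simulation how often they hold jointly for a toy static problem. Everything specific here is an assumption made for illustration and is not taken from the paper: the objective $f(x) = E[(x - ξ)^2]$ with $ξ$ standard normal, the finite grid standing in for $X_{1}$, and the chosen values of $ϵ$, $δ$ and the sample sizes.

```python
import random

def true_objective(x, mu=0.0, sigma=1.0):
    # For xi ~ N(mu, sigma^2): E[(x - xi)^2] = (x - mu)^2 + sigma^2.
    return (x - mu) ** 2 + sigma ** 2

def saa_objective(x, sample):
    # Sample average of (x - xi)^2 over the drawn realizations.
    return sum((x - xi) ** 2 for xi in sample) / len(sample)

def eps_optimal(values, eps):
    """Indices of the eps-optimal points, given objective values on a finite grid."""
    best = min(values)
    return {i for i, v in enumerate(values) if v <= best + eps}

def event_frequency(n, eps, delta, reps, grid, rng):
    """Fraction of replications in which the delta-optimal SAA set is
    nonempty and contained in the eps-optimal set of the true problem."""
    true_set = eps_optimal([true_objective(x) for x in grid], eps)
    hits = 0
    for _ in range(reps):
        sample = [rng.gauss(0.0, 1.0) for _ in range(n)]
        saa_set = eps_optimal([saa_objective(x, sample) for x in grid], delta)
        if saa_set and saa_set <= true_set:
            hits += 1
    return hits / reps

grid = [i / 10 for i in range(-10, 11)]  # hypothetical finite feasible set
rng = random.Random(0)
for n in (5, 50, 500):
    print(n, event_frequency(n, eps=0.05, delta=0.0, reps=200, grid=grid, rng=rng))
```

The printed frequencies are Monte Carlo estimates of the probability of the joint event for each sample size; the smallest sample size for which this probability reaches $1 - α$ is what the notion of sample complexity is meant to capture.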

An explicit definition of the sample complexity of the SAA method for instances and classes of $T$-stage stochastic optimization problems was given in [4]. In the same reference, it was argued that the estimates of the sample sizes derived in [8], [6] are upper bound estimates for the sample complexity of static and multistage problems, respectively, satisfying some reasonable regularity conditions. An explicit upper bound estimate for the complexity of $T$-stage problems under relaxed regularity conditions was also obtained in [4]. We will see later that relaxing these conditions was important in order to make a fair comparison between the upper and lower bound estimates of the sample complexity.

In Section 2, we state the definition of sample complexity for $T$-stage stochastic problems and some extensions of the upper bounds on the complexity obtained in [4]. In Section 3, we present a family of $T$-stage convex stochastic optimization problems for which it is possible to derive a lower bound for the sample complexity of each problem in the family. We apply this result to derive our lower bound for the sample complexity of a family of convex $T$-stage problems. In Section 4, we compare our lower bound with the one derived for multistage financial optimization problems through no-arbitrage arguments. We also indicate one possible way to extend our results to the class of linear multistage optimization problems. This section is followed by a technical appendix.

Section snippets

Definition of the sample complexity and its upper bound

We closely follow Ref. [4], where the respective definitions were stated. Consider a scenario tree with $T$ stages possessing the following node structure: every $t$th-stage node has $N_{t+1}$ successor nodes at stage $t + 1$, for $t = 1, \dots, T - 1$. Under this assumption, the total number of scenarios in the tree is equal to $N = \prod_{t=2}^{T} N_{t}$.
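As a quick check of the formula $N = \prod_{t=2}^{T} N_{t}$, the short sketch below computes the number of scenarios and the number of nodes per stage. The stage sample sizes are hypothetical values chosen for illustration.

```python
from math import prod

def scenario_count(stage_sample_sizes):
    """Total number of scenarios N = N_2 * ... * N_T."""
    return prod(stage_sample_sizes)

def node_counts(stage_sample_sizes):
    """Number of nodes at stages 1, ..., T (stage 1 is the single root)."""
    counts = [1]
    for n in stage_sample_sizes:
        counts.append(counts[-1] * n)
    return counts

sizes = [10, 10, 10]  # hypothetical N_2, N_3, N_4 for T = 4
print(scenario_count(sizes))  # 1000
print(node_counts(sizes))     # [1, 10, 100, 1000]
```

The number of scenarios grows multiplicatively in the stage sample sizes, so with equal sizes at every stage it grows exponentially in the number of stages.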

We denote the sets of (first-stage) $ϵ$-optimal solutions of the true and the SAA problems, respectively, as $S^{ϵ} ≔ \{x_{1} \in X_{1} : f(x_{1}) \leq v^{*} + ϵ\}$ and $\hat{S}_{N_{2}, \dots, N_{T}}^{ϵ} ≔ \{x_{1} \in X_{1} : \hat{f}(x_{1}) \leq \hat{v}^{*} + ϵ\}$, for $ϵ \geq 0$. The

The main result

Here, we obtain a lower bound for the sample complexity of a class of $T$-stage stochastic problems that satisfies the previous regularity conditions and the uniformly bounded condition. Thus, when we compare the derived lower bound with the previous upper bound, we obtain estimates that hold for the same class of problems. Observe that in [6], [7] condition (Mt.3) was assumed in a more restrictive form. In fact, it was assumed that $χ_{t}(ξ_{t+1}) = L_{t}$ for a.e. $ξ_{t+1} \in \operatorname{supp}(ξ_{t+1})$ and $t = 1, \dots, T - 1$. Here,

Further considerations

In this section, we discuss two issues that were raised by an anonymous referee.

A stream of research on multistage financial stochastic optimization problems has derived lower bounds on the sample complexity through no-arbitrage arguments. Here, we give a very brief and incomplete review of this literature. Reference [2] addressed how the discretization of the random data for multistage financial stochastic programming models, whose state variables are typically assumed continuous,

Acknowledgments

The author wishes to thank Prof. Alexander Shapiro for useful discussions, helpful comments and support in writing this paper. An anonymous referee made a number of helpful comments which also improved this document. The author also wishes to thank Prof. Alfredo Iusem for helping to prepare a first version of this document. This work was done while the author was visiting the School of Industrial and Systems Engineering of Georgia Institute of Technology, Atlanta, GA, 30332-0205. This work was

References (8)

A. Geyer et al., No-arbitrage conditions, scenario trees, and multi-asset financial optimization, European J. Oper. Res. (2010)
P. Klaassen, Discretized reality and spurious profits in stochastic programming models for asset/liability management, European J. Oper. Res. (1997)
A. Shapiro, On complexity of multistage stochastic programs, Oper. Res. Lett. (2006)
A.J. Kleywegt et al., The sample average approximation method for stochastic discrete optimization, SIAM J. Optim. (2001)

There are more references available in the full text version of this article.
