Weak aggregating algorithm for the distribution-free perishable inventory problem

doi:10.1016/j.orl.2010.09.006

Operations Research Letters

Volume 38, Issue 6, November 2010, Pages 516-521

https://doi.org/10.1016/j.orl.2010.09.006 Get rights and content

Abstract

We formulate the multiperiod, distribution-free perishable inventory problem as a problem of prediction with expert advice and apply an online learning method (the Weak Aggregating Algorithm) to solve it. We show that the asymptotic average performance of this method is as good as that of any time-dependent stocking rule in a given parametric class.

Introduction

In the classical perishable inventory (also called “newsvendor”) problem, a decision-maker must choose the starting inventory level for a product in a future selling period when demand for the product is uncertain. No replenishment is possible during the selling period, and the product is perishable, i.e. it loses part or all of its value at the end of the period. The newsvendor aims to achieve the maximum return by balancing the risk of lost sales because of understocking against that of inventory spoilage because of overstocking.

This problem is a practical inventory control problem faced by many firms whose products are perishable (including newspaper publishers), but it is also a component of other important problems; for example, bank cash management, water reservoir management, and airline revenue management. The problem has a simple solution when the probability distribution of demand is known but becomes significantly more challenging when distributional information is limited, inaccurate, or unavailable. We deal with the case in which there is no prior knowledge of the distribution.

In the distribution-free case, if the newsvendor faces only a single decision period, the problem falls into the category of decision analysis under uncertainty typified by the min-max approach of [10]. If there are multiple selling periods, it is possible to gain information about the demand distribution over time and adapt ordering policies accordingly. (We will use the term multi-period to mean multiple selling periods rather than multiple opportunities to order, an interpretation that has been used in some prior research.) Recent examples of work on the multi-period version include both Bayesian (see [8]) and nonparametric approaches (see [11]). The latter paper contains an up-to-date review of the extensive literature in this area. Other articles proposing non-parametric approaches and/or bound for inventory problems include [1] which proposes distribution-free upper and lower bounds for the order quantity and reorder point in a service-constrained non-perishable inventory system. [7], [3] consider the perishable inventory system where the functional form of demand distribution is known and develop an operational statistics approach to find a decision rule that maximizes the performance uniformly for all possible values of the unknown demand parameters. Finally, [6] considers a sampling-based approach in which it is possible to obtain bounds on the number of samples needed to attain a specified accuracy level.

In this article, we propose a novel approach to the distribution-free, multi-period problem that utilizes recent advances in the theory of prediction and learning with expert advice (see Chapter 2 of [2]). This approach leads to an algorithm with performance guarantees under more general assumptions than those previously achieved (existing non-parametric results require, at least, independence of demands over time). The ‘experts’ in this treatment are passive predictors of the best starting inventory levels over time out of the continuum of possible levels suggested by functions in a given parametric class with a bounded finite-dimensional parameter space; thus, we consider an infinite pool of experts. (The case of a finite number of stocking levels, and hence experts, is a straightforward variant of this.) The algorithm progresses, in essence, by forming successive weighted averages of the expert predictions, where the weights are adjusted according to the success of the experts in previous periods.

We make the following contributions:

(1)
We cast the newsvendor problem as online learning with expert advice and show that the Weak Aggregating Algorithm (WAA) of [5] can be applied to the problem.
(2)
We prove that the performance of the algorithm is asymptotically as good as the performance of the best time-dependent strategy in a given parametric class with a bounded parameter space. Thus, in the setting of the newsvendor problem, we obtain stronger results than the existing analysis of the WAA for a finite collection of experts.
(3)
The performance bound holds in the absence of any statistical assumptions about the demand sequence.

In the next section we provide a formal statement of the WAA. The newsvendor problem, the explicit WAA specialized to the newsvendor problem and its analysis are given in Section 3.

Section snippets

The general weak aggregating algorithm

The Aggregating Algorithm (AA) [12] is a general approach to online learning that involves combining or ‘merging’ advice from a pool of experts (typically finite). The objective is to minimize the losses from a sequence of decisions that must be made in a stochastic environment. The convergence of the AA is moderated with a learning rate parameter that can be adjusted for each particular application but is otherwise constant. The Weak Aggregating Algorithm (WAA) is similar to the AA but uses a

The weak aggregating algorithm for the newsvendor problem

Let $p$ and $c$ be the unit selling price and cost of a product and assume that the value of unsold inventory at the end of the selling period is zero. The case of a positive unit salvage value $s$ reduces to the basic case by redefining $c ≔ c - s$ and $p ≔ p - s$ .

The newsvendor may face a stocking decision indefinitely many times, but the case of a finite horizon, defined by a terminal period $N$ , is also covered by our result. His decision in each selling period $n = 1, 2 \dots$ is $y_{n} \in [0, B]$ , where $B$ is a known upper

Acknowledgement

This work was partially supported by Natural Sciences and Engineering Research Council of Canada grant numbers 261512, 341412, and 388724 and Engineering and Physical Sciences Research Council (UK) grant number EP/F002998/1.

References (12)

L. Chu et al.
Solving operational statistics via a Bayesian analysis
Oper. Res. Letters
(2008)
Yuri Kalnishkan et al.
The weak aggregating algorithm and weak mixability
J. Comput. Syst. Sci.
(2008)
L. Liyanage et al.
A practical inventory control policy using operational statistics
Oper. Res. Letters
(2005)
V. Agrawal et al.
Distribution free bounds for service constrained $(q, r)$ inventory systems
Naval Res. Logist.
(2000)
N. Cesa-Bianchi et al.
Prediction, Learning, and Games
(2006)
G. Hardy et al.
Inequalities
(1967)

There are more references available in the full text version of this article.

Cited by (37)

An extended weak aggregating algorithm for a two dimensional data-driven multi-stage newsvendor problem
2024, Expert Systems with Applications
We investigate a multi-stage newsvendor problem with advance purchase discount (APD) in this paper. At the beginning of a stage, the decision maker (DM) makes the advance ordering decision for all the periods in this stage; at the start of every period within the stage, the DM makes the regular ordering decision. In this problem, the only available information we can observe is the past demands. To solve this problem, we extend the weak aggregating algorithm (WAA) with one decision variable, an online learning approach based on the theory of prediction and learning with expert advice, to a two-dimensional problem that involves advance ordering decisions in stages and regular ordering decisions nested in each stage. The difficulty of the problem lies in transferring learned knowledge of demand information from stage to stage. We design a cross-stage knowledge transfer scheme and obtain online ordering solutions for both advance-order and regular-order. We show that our solutions converge to the optimal solutions asymptotically. In addition, we derive theoretical guarantees for total gains in one stage and cumulative gains for all stages in the planning horizon. Through numerical studies, we find that our solutions are competitive to those offered by the best experts in hindsight. Finally, we do the sensitivity analysis to illustrate the effectiveness of our algorithm under different parameter values.
A location-inventory supply chain network model using two heuristic algorithms for perishable products with fuzzy constraints
2018, Computers and Industrial Engineering
Citation Excerpt :
Nevertheless, the assumption does not always conform to the actual situations, for example, meat, green vegetables, human blood, medicine, flowers, films, alcohol, and gasoline. Some products have perishability, i.e. they lose all or part of their value as time goes by (Levina, Levin, McGill, Nediak, & Vovk, 2010). These products can be divided into two groups: perishable products and decaying products.
Supply chain network is very important to the development of industries. This paper integrates a location-inventory problem into a supply chain network and develops an optimization model for perishable products with fuzzy capacity and carbon emissions constraints. This model is formulated a mixed integer nonlinear programming model. In order to solve this model, hybrid genetic algorithm (HGA) and hybrid harmony search (HHS) are put forward to minimize the total costs. Instances under different situations are calculated using these two algorithms and Lindo (optimization solver). The impacts of some factors such as the number of facilities, intact rates, and demand on the total costs are investigated. The results of numerical experiments demonstrate that the proposed algorithms can effectively deal with problems under different conditions and these two algorithms have their own advantages. Specially, the quality of HHS’s solution is higher than that of HGA’s solution, whereas HGA is faster than HHS.
COMPETITIVE STRATEGIES FOR TWO-PRODUCT, MULTI-PERIOD STATIONARY NEWSVENDOR PROBLEM WITH BUDGET CONSTRAINT
2023, Journal of Industrial and Management Optimization
Solving a Distribution-Free Multi-Period Newsvendor Problem With Advance Purchase Discount via an Online Ordering Solution
2023, SAGE Open
Competitive Online Strategy Based on Improved Exponential Gradient Expert and Aggregating Method
2023, Computational Economics
Weak aggregating specialist algorithm for online portfolio selection
2023, Computational Economics

View all citing articles on Scopus

View full text

Weak aggregating algorithm for the distribution-free perishable inventory problem

Abstract

Introduction

Section snippets

The general weak aggregating algorithm

The weak aggregating algorithm for the newsvendor problem

Acknowledgement

Oper. Res. Letters

J. Comput. Syst. Sci.

Oper. Res. Letters

Distribution free bounds for service constrained (q,r) inventory systems

Naval Res. Logist.

Prediction, Learning, and Games

Inequalities

Distribution free bounds for service constrained $(q, r)$ inventory systems