Elsevier

Operations Research Letters

Volume 38, Issue 6, November 2010, Pages 516-521
Operations Research Letters

Weak aggregating algorithm for the distribution-free perishable inventory problem

https://doi.org/10.1016/j.orl.2010.09.006Get rights and content

Abstract

We formulate the multiperiod, distribution-free perishable inventory problem as a problem of prediction with expert advice and apply an online learning method (the Weak Aggregating Algorithm) to solve it. We show that the asymptotic average performance of this method is as good as that of any time-dependent stocking rule in a given parametric class.

Introduction

In the classical perishable inventory (also called “newsvendor”) problem, a decision-maker must choose the starting inventory level for a product in a future selling period when demand for the product is uncertain. No replenishment is possible during the selling period, and the product is perishable, i.e. it loses part or all of its value at the end of the period. The newsvendor aims to achieve the maximum return by balancing the risk of lost sales because of understocking against that of inventory spoilage because of overstocking.

This problem is a practical inventory control problem faced by many firms whose products are perishable (including newspaper publishers), but it is also a component of other important problems; for example, bank cash management, water reservoir management, and airline revenue management. The problem has a simple solution when the probability distribution of demand is known but becomes significantly more challenging when distributional information is limited, inaccurate, or unavailable. We deal with the case in which there is no prior knowledge of the distribution.

In the distribution-free case, if the newsvendor faces only a single decision period, the problem falls into the category of decision analysis under uncertainty typified by the min-max approach of [10]. If there are multiple selling periods, it is possible to gain information about the demand distribution over time and adapt ordering policies accordingly. (We will use the term multi-period to mean multiple selling periods rather than multiple opportunities to order, an interpretation that has been used in some prior research.) Recent examples of work on the multi-period version include both Bayesian (see [8]) and nonparametric approaches (see [11]). The latter paper contains an up-to-date review of the extensive literature in this area. Other articles proposing non-parametric approaches and/or bound for inventory problems include [1] which proposes distribution-free upper and lower bounds for the order quantity and reorder point in a service-constrained non-perishable inventory system. [7], [3] consider the perishable inventory system where the functional form of demand distribution is known and develop an operational statistics approach to find a decision rule that maximizes the performance uniformly for all possible values of the unknown demand parameters. Finally, [6] considers a sampling-based approach in which it is possible to obtain bounds on the number of samples needed to attain a specified accuracy level.

In this article, we propose a novel approach to the distribution-free, multi-period problem that utilizes recent advances in the theory of prediction and learning with expert advice (see Chapter 2 of [2]). This approach leads to an algorithm with performance guarantees under more general assumptions than those previously achieved (existing non-parametric results require, at least, independence of demands over time). The ‘experts’ in this treatment are passive predictors of the best starting inventory levels over time out of the continuum of possible levels suggested by functions in a given parametric class with a bounded finite-dimensional parameter space; thus, we consider an infinite pool of experts. (The case of a finite number of stocking levels, and hence experts, is a straightforward variant of this.) The algorithm progresses, in essence, by forming successive weighted averages of the expert predictions, where the weights are adjusted according to the success of the experts in previous periods.

We make the following contributions:

  • (1)

    We cast the newsvendor problem as online learning with expert advice and show that the Weak Aggregating Algorithm (WAA) of [5] can be applied to the problem.

  • (2)

    We prove that the performance of the algorithm is asymptotically as good as the performance of the best time-dependent strategy in a given parametric class with a bounded parameter space. Thus, in the setting of the newsvendor problem, we obtain stronger results than the existing analysis of the WAA for a finite collection of experts.

  • (3)

    The performance bound holds in the absence of any statistical assumptions about the demand sequence.

In the next section we provide a formal statement of the WAA. The newsvendor problem, the explicit WAA specialized to the newsvendor problem and its analysis are given in Section 3.

Section snippets

The general weak aggregating algorithm

The Aggregating Algorithm (AA) [12] is a general approach to online learning that involves combining or ‘merging’ advice from a pool of experts (typically finite). The objective is to minimize the losses from a sequence of decisions that must be made in a stochastic environment. The convergence of the AA is moderated with a learning rate parameter that can be adjusted for each particular application but is otherwise constant. The Weak Aggregating Algorithm (WAA) is similar to the AA but uses a

The weak aggregating algorithm for the newsvendor problem

Let p and c be the unit selling price and cost of a product and assume that the value of unsold inventory at the end of the selling period is zero. The case of a positive unit salvage value s reduces to the basic case by redefining ccs and pps.

The newsvendor may face a stocking decision indefinitely many times, but the case of a finite horizon, defined by a terminal period N, is also covered by our result. His decision in each selling period n=1,2 is yn[0,B], where B is a known upper

Acknowledgement

This work was partially supported by Natural Sciences and Engineering Research Council of Canada grant numbers 261512, 341412, and 388724 and Engineering and Physical Sciences Research Council (UK) grant number EP/F002998/1.

References (12)

There are more references available in the full text version of this article.

Cited by (37)

  • A location-inventory supply chain network model using two heuristic algorithms for perishable products with fuzzy constraints

    2018, Computers and Industrial Engineering
    Citation Excerpt :

    Nevertheless, the assumption does not always conform to the actual situations, for example, meat, green vegetables, human blood, medicine, flowers, films, alcohol, and gasoline. Some products have perishability, i.e. they lose all or part of their value as time goes by (Levina, Levin, McGill, Nediak, & Vovk, 2010). These products can be divided into two groups: perishable products and decaying products.

View all citing articles on Scopus
View full text