Accelerating convergence in minisum location problem with p norms

https://doi.org/10.1016/j.cor.2013.05.015Get rights and content

Abstract

This paper presents a procedure for accelerating convergence of the Weiszfeld algorithm in the classical single facility location median problem in which the distances are measured by p-norms. To this end, we combined Steffensen's method, a generic acceleration scheme applied to iterative processes for solving fixed point equations, with the acceleration methods based on the transformation of the Weiszfeld algorithm by a factor which is a function of the parameter p. The convergence of the proposed methodology and the conditions under which it is guaranteed are analyzed. The computational results show that the total number of iterations to meet a given stopping criterion will be reduced with respect to the results obtained in other algorithms proposed in the literature. The running times are either reduced or quite similar with respect to the existing algorithms for which no results of convergence are provided.

Introduction

In the classical single facility minisum location problem, we are given a finite set of points in a real normed space in order to minimize the weighted sum of the distances to these points (existing facilities or demand points). In this case, the use of p-norms to measure distances has been proved to be particularly suitable (see [8], [11], [7]). Hence, given a set of demand points A={a1,,aM}Rn, the formulation of the single facility minisum location problem in Rn with a p-norm to measure distances is stated as follows:minxRnF(x)i=1Mwixaip,where wi0 is the weight associated with the demand point ai for i=1,,M and xaip=(k=1n|xkaik|p)1/pp1.

The restriction on the parameter p to a value greater than or equal to unity ensures that ·p has the properties of a norm. In a practical setting, the factor wi would be proportional to the demand at customer i and the term wixaip would be an estimate of the cost of serving the demand at customer i by the facility x. Thus, the objective function in (1) provides the total service (or distribution) cost. Regarding the existence and uniqueness of a solution for Problem (1), by the convexity of the objective function, we have that this problem has an optimal solution. Moreover, for p>1 the objective function is strictly convex when the demand points are not collinear (see [27]). Thus, in the following, we will assume that demand points are not collinear.

Problem (1) for p1 is a classical model in continuous location theory that has received much attention in the literature. The most popular method for solving this problem is given by a one-point iterative procedure that was first proposed for Euclidean distances (p=2) by Weiszfeld [53] and subsequently translated into English by Weiszfeld and Plastria [54]. This method was rediscovered by Miehle [34] and Cooper [19] approximately 20 years later. Its convergence was subsequently studied by Katz [30] and Kuhn [31], among others. The generalization to p distances with p[1,2] is given in Love et al. [33] and an analysis of its local convergence has been shown in Love et al. [33] and Brimberg and Love [9].

The global convergence result of this algorithm for p norms was proved by Brimberg and Love [10] for p[1,2]. These results were extended to more general functions in Frenk et al. [27], in which the authors used a hyperbolic approximation of the p norm in order to avoid the problem of singularities in the iteration function (for more details, see [4], [5], [15], [14], [31]).

Furthermore, these results have been extended to more general problems, including Banach spaces [24], [43], [44], the sphere [55], regional demand [17], [50], sets as demand facilities using closest Euclidean distances [12], radial distances [16], [20], [29], [35] and using different objective functions [27], [25], [46], [51], among others.

For the case p>2, Brimberg and Love [10] provided an example showing that, in general, the classical Weiszfeld algorithm does not converge. The literature contains several attempts to propose convergent algorithms by modifying the step size of the classical Weiszfeld algorithm, basically consisting of multiplying this step size by a factor, usually called the step size factor (see Section 2 for more details). Brimberg et al. [6] prove the local convergence of the algorithm on the plane for p1 when it is used as a step size factor λ with λ1/(p1), p>1. Uster and Love [49] show that the step-size factors λ=2/p for p[2,3] and λ=2/(p1) for p>3 yield good computational results although they do not prove the convergence of their proposal. Recently, Rodríguez-Chía and Valero-Franco [45] give a step-size factor for this algorithm that depends on each iterate and they provide a proof of the global convergence of this algorithm for p>2.

On the other hand, the solution of the single facility minisum location problem has been used as part of a solution procedure for many location problems. These procedures require the repeated application of the Weiszfeld algorithm, which is quite efficient in most cases. However, there are some cases in which convergence is very slow and may require thousands of iterations. Therefore, reducing the computational effort required for the solution of the single facility minisum location problem will improve the performance of these procedures. Actually, when a sequence or an iterative process is slowly converging, a convergence acceleration process must be used. This consists of transforming the slowly converging sequence into a new one that, under some assumptions, converges faster to the same limit. Since the Weiszfeld algorithm is an iterative procedure that is used to find the solution of a fixed point equation, the use of acceleration techniques for such fixed point algorithms may be a suitable option to improve its efficiency, see Brezinski [3], Burden et al. [13], Drezner [21] and Farnum [26].

The notion of accelerating the convergence of descent methods such as the Weiszfeld procedure requires an investigation of alternate step sizes [18]. A first attempt at accelerating the Weiszfeld procedure can be attributed to Katz [30], who suggested the use of Steffensen's iteration applied to each component for the case p=2. This method is not known to be globally convergent, but it may be used to accelerate the local convergence rate of the Weiszfeld procedure from linear to quadratic. However, the gain in the convergence rate may be offset by the longer computational time of Steffensen's iterations. Drezner [22] applies a factor, λ, to multiply the step size of the Weiszfeld procedure for the case of Euclidean distances (p=2). In the Euclidean case, it is known that convergence will occur for a step size not exceeding double the original (see [42]). Drezner [22] uses an approximation function of F(x) to calculate a λ at each iteration which may take on values considerably greater than 2. The computational results generally appear to indicate only a modest reduction in the number of iterations using this acceleration method as compared with a fixed λ of 2. The net effect on running times is unclear, due to the increased computations associated with the variable step size factor. In addition, Drezner [22] claims that λ=1.8 is a good compromise for a fixed step size in handling different problems for p=2. Frenk et al. [27] recommend the use of a step size factor between 0 and 2 for a generalized problem, after showing that the descent property and convergence are still maintained for all p[1,2]. In Brimberg et al. [6], in addition to prove the local convergence of the iterative sequence with λ=1/(p1) for p>1, different step sizes factors that provided good computational results are suggested, but no convergence results are proved. An interesting outcome of this analysis is that the parameter p decreases to a value of 1 and λ may be increased to + in the limit with local convergence of the sequence. Verkhovsky and Polyakov [52] proposed an accelerated algorithm to solve Problem (1) for p=2 using a feedback factor that can easily be implemented in engineering practices.

Similarly to Katz [30], Drezner [23] deals with a general acceleration method for the Weiszfeld procedure reducing its computational requirements for p=2. This acceleration method applied Aitken's Δ2 process to each component, also known as Steffensen's method when applied to fixed point problems, see Burden et al. [13] for further details.

Under some conditions, this method for the scalar case accelerates the convergence of any linearly convergent sequence. The approach suggested by Drezner [23] and Katz [30] for p=2 applies a different step size factor for the x-coordinate and the y-coordinate. This means that while the other methods extend the sequence of points in the same direction, this approach may be able to follow a converging sequence which forms a curved trajectory. As mentioned above, Drezner [23] and Katz [30] apply Aitken's Δ2 process to each component. However, Nievergelt [36] warned that Aitken's acceleration may fail if applied separately to each coordinate; therefore a more suitable generalization seems necessary.

The natural generalization to the vector case in the well-known Aitken's Δ2 process is called Henrici's transformation. This transformation is given by Sadok [47] that it is also called Steffensen's method for fixed point equations in several variables. Nievergelt [36] points out that Steffensen's acceleration of the iterations of a function of several variables lacked a theoretical foundation. Nievergelt [37] furnishes a partial result and Noda [38], [39], [40], [41] obtains several local convergence results for functions that are contracting maps in a neighborhood of their fixed point. Bellalij [1] proves a convergence acceleration result in the linear case under some natural conditions. Following the ideas presented by Jbilou and Sadok [28] concerning the vector extrapolation method, Le Ferrand [32] establishes the quadratic convergence of Steffensen's iteration. Nievergelt [36] extends this result to divergent sequences that must behave linearly by analyzing Henrici's transformation. Benchiboun [2] provides a convergence acceleration result that generalizes some results given by Nievergelt [36] and Sadok [47].

In this paper, we study the convergence of an acceleration method, based on the vectorial version of Steffensen's method, applied to different modifications of the Weiszfeld algorithms with a p-norm with p1. Note that this paper is the first attempt in the literature to apply the vectorial version of Steffensen's method to the Weiszfeld algorithm, since in previous papers, an independent analysis was considered for each component. For the purposes of this analysis, we also assume that the optimal solution of Problem (1) occurs at non-singular points of the iteration functions. Alternatively, the singularities of the iteration functions may be avoided by introducing a smoothing function such as the hyperbolic approximation of the p norm, see Love et al. [33] and Frenk et al. [27]. For a detailed discussion of the singularities see Brimberg and Love [9], [10].

The paper is organized in six sections. Section 2 provides a general formulation of a modification of the Weiszfeld algorithm by considering a step size factor. In Section 3, Steffensen's acceleration process is described. The conditions that guarantee the convergence of the proposed methodology are studied in Section 4. The computational analysis presented in Section 5 which illustrates the efficiency of the acceleration process applied to each of the modified Weiszfeld algorithms. The paper ends with some concluding remarks.

Section snippets

The acceleration sequence

Since the objective function in Problem (1) is convex, the necessary condition of optimality is also sufficient, i.e., x is an optimal solution if and only if F(x)xk=0k=1,,n.This condition generates an iterative procedure, known as the Weiszfeld algorithm that is given byxkq+1=xkq1Sk(xq)F(xq)xkk=1,,nwhere Sk(xq)=i=1Mwi|xkqaik|p2xqaipp1F(xq)xk=i=1Mwi|xkqaik|p2xqaipp1(xkqaik).

In order to develop an iterative scheme that generates a sequence which converges faster to the

Aitken's and Steffensen's acceleration

This section provides a brief review of several concepts related to linear convergence and Steffensen's acceleration, first for the scalar case and then for the case of vector sequences.

A sequence of real or complex scalars {zq}qN converges linearly to a limit z if and only if, the following two conditions hold [36]:

  • 1.

    There exists an integer KN, such that, zqz for each integer qK.

  • 2.

    There exists a real or complex scalar α such that 0<|α|<1 andlimqzq+1zzqz=α.

If α=0, then the sequence

Convergence analysis

This section analyzes the Steffensen's acceleration applied to the different variations of the Weiszfeld algorithm with alternative fixed step-size factors. Although the convergence analysis can be performed in the n-dimensional case, we have limited our study to the planar case for the sake of simplicity. Therefore, in order to study the convergence of the accelerated sequence in the following subsections we will study some of the properties that are required to obtain convergence results.

In

Computational results

After establishing some convergence properties of the accelerated scheme proposed in this paper, we provide a computational study of the number of iterations and the running times of this approach applied to the Weiszfeld algorithm with different fixed step size factors. We also consider the acceleration schemes by Drezner [23] and Verkhovsky and Polyakov [52]. It is worth mentioning that no convergence properties for these two acceleration schemes are known in the literature and therefore, a

Concluding remarks

In this paper we have addressed the first attempt to provide convergence results for an alternative approach, based on Steffensen's method, to accelerate the sequence generated by different modifications of the classical Weiszfeld algorithm when different fixed step size factors are used. The computational analysis reports that the approach is also useful from practical point of view, since the results are quite similar to or even better than those provided by other acceleration procedures

Acknowledgments

This research was partially supported by “Ministerio de Ciencia e Innovación/FEDER” (research projects MTM2007-67433-C02-02, MTM2010-19576-C02-02 and DE2009-0057), “Fundación Séneca” (research project 08716/PI/08) and Junta de Andalucía/FEDER (Grant number FQM-5849).

References (55)

  • A.M. Rodríguez-Chía et al.

    On solving the planar k-centrum problem with Euclidean distances

    European Journal of Operational Research

    (2010)
  • H. Sadok

    About Henrici's transformation for accelerating vector sequences

    Journal of Computational and Applied Mathematics

    (1990)
  • H. Üster et al.

    The convergence of the Weiszfeld algorithm

    Computers and Mathematics with Applications

    (2000)
  • L. Zhang

    On the convergence of a modified algorithm for the spherical facility location problem

    Operations Research Letters

    (2003)
  • M. Bellalij

    Analysis of Henrici's transformation for singular problems

    Numerical Algorithms

    (2003)
  • J. Brimberg

    The Fermat—Weber location problem revisited

    Mathematical Programming

    (1995)
  • J. Brimberg et al.

    Estimating of the parameters of the weighted lp norm by linear regression

    IIE Transactions

    (1996)
  • J. Brimberg et al.

    A new distance function for modeling travel distances in a transportation network

    Transportation Science

    (1992)
  • J. Brimberg et al.

    Local convergence in a generalized Fermat—Weber problem

    Annals of Operations Research

    (1992)
  • J. Brimberg et al.

    Global convergence of a generalized iterative procedure for the minisum location problem with lp distances

    Operations Research

    (1993)
  • J. Brimberg et al.

    General considerations on the use of the weighted lp norm as an empirical distance measure

    Transportation Science

    (1993)
  • J. Brimberg et al.

    Minisum location with closest Euclidean distances

    Annals of Operations Research

    (2002)
  • R.L. Burden et al.

    Numerical analysis

    (1981)
  • L. Cánovas et al.

    On the convergence of the Weiszfeld algorithm

    Mathematical Programming

    (2002)
  • R. Chandrasekaran et al.

    Open questions concerning Weiszfeld's algorithm for the Fermat—Weber location problem

    Mathematical Programming

    (1989)
  • A.I. Cohen

    Stepsize analysis for descent methods

    Journal of Optimization Theory and Applications

    (1981)
  • L. Cooper

    Location-allocation problems

    Operations Research

    (1963)
  • Cited by (2)

    • A branch-and-price approach for the continuous multifacility monotone ordered median problem

      2023, European Journal of Operational Research
      Citation Excerpt :

      Even those problems that might be considered easy, as for instance the classical Weber problem with Euclidean norms, are most of the times solved with algorithms (as the Weiszfeld algorithm, Weiszfeld, 1937), whose convergence is still unknown (Chandrasekaran & Tamir, 1989). Moreover, most problems studied in continuous location assume that a single facility is to be located, since their multifacility counterparts lead to difficult non-convex problems (Manzour-al Ajdad, Torabi, & Eshghi, 2012; Blanco, 2019; Blanco, ElHaj BenAli, & Puerto, 2014; Brimberg, 1995; Carrizosa et al., 1998; Mallozzi, Puerto, & Rodríguez-Madrena, 2019; Puerto, 2020; Rosing, 1992; Valero-Franco, Rodríguez-Chía, & Espejo, 2013). Motivated by the recent advances on discrete multifacility location problems with ordered median objectives (Deleplanque, Labbé, Ponce, & Puerto, 2020; Espejo, Puerto, & Rodriguez-Chía, 2021; Fernández, Pozo, & Puerto, 2014; Labbé, Ponce, & Puerto, 2017; Marín, Ponce, & Puerto, 2020), and the available results on conic optimization (Blanco et al., 2014; Puerto, 2020), we analyze here a family of difficult continuous multifacility location problems with ordered median objectives and distances induced by a general family of norms.

    View full text