An analysis of the properties of the variants of Newton’s method with third order convergence

https://doi.org/10.1016/j.amc.2006.05.116Get rights and content

Abstract

For the last five years, the variants of the Newton’s method with cubic convergence have become popular iterative methods to find approximate solutions to the roots of non-linear equations. These methods both enjoy cubic convergence at simple roots and do not require the evaluation of second order derivatives. In this paper, we investigate about the relationship between these methods which are in fact based on the approximation of the second order derivative present in the third order limited Taylor expansion. We also prove that they are different forms of the Halley method and are all contractive iterative methods in a common neighbourhood. We extend some of these variants to multivariate cases and prove their respective local cubic convergence from their corresponding linear models.

Introduction

Iterative methods for finding approximate solutions to the single non-linear equations f(x) = 0 that have no solutions in closed form have various applications in physics and chemistry. Newton’s method [1] is often the method of choice for approximating such solutions. The method is of quadratic convergence at a simple root, that is, the number of good digits is doubled at each iteration. The convergence of the iterates and the rate of convergence play an important role in the design of new iterative methods [2]. There are higher order methods that allow for faster convergence. For instance, Halley’s method [3], Householder’s method [4], double convex acceleration of Whittaker’s method [5], super-Newton’s method (Chebyshev) [6], super-Halley’s method or the Chebyshev–Halley’s family of iterative methods [7], [8] has third order convergence, but require the second derivative which is difficult to evaluate and thus time consuming for complicated functions. To avoid the calculation of second derivatives, some variants of Newton’s method with third order convergence at simple root have been developed. In these methods, Newton’s method is used as an intermediate step. These variants are termed as multi-point methods because they involve the calculation of the function or its first order derivative at multi-points. They are easy to implement and only require function and first derivative evaluations. Traub [9] has introduced the mid-point Newton method (MP) as one of these variants. However, during the last five years, there has been a ’Newton boom’. In fact, many variants of Newton’s method have been introduced and developed. By using different approximations to the definite integral in the Newton’s theorem [10], different iterative formulas can be obtained. For instance, Newton’s method can be obtained using a rectangle as approximation in the Newton theorem. Weerakoon and Fernando [10] have used the trapezoid rule instead and developed a new variant which was then termed as the Arithmetic Mean third order Newton method (AM) [11] because the denominator is the arithmetic mean of the derivatives at k iterate and at the Newton iterate. In 2002, Hasanov and al. [12] have developed another variant (VS) from the Newton theorem making use of the Simpson’s rule instead. Nedzhibov [13] has shown that approximating the integral in the Newton theorem using different quadrature formulae yields different iterative functions including those we have mentioned before. Furthermore, Kasturiarachi [2] independently introduced and developed the Leap-frogging Newton method. This method is termed as the Newton–Secant method (NS) [13] because it is a composition of Newton’s and Secant methods. It can also be derived from the discrete modifications of the AM method or the MP method. In 2003, Sormarni and Frontini [14], [15] have studied some of these variants mentioned and their applications to multiple roots. Homeier [16], [17] independently derived one of the variants and extended it to the multivariate case. More recently, in 2005, Homeier [18] have shown that one can modify the Weerakoon–Fernando [10] approach by using Newton’s theorem for the inverse function and derive a new class of cubically convergent Newton-type methods. New variants of Newton’s method can also be obtained using the harmonic and geometric mean instead of the arithmetic mean in the Weerakoon’s variant. In this way, Ozban [11] and Lukic [19] have derived the Harmonic Mean Newton method (HM) and the Geometric Mean third order Newton method (GM) respectively. These variants of Newton’s method have been proved to converge locally with third order of convergence assuming that the starting point is close to the root. However, approximating the Newton theorem using different quadrature formulae does not clearly explain why the variants are third order accurate iterative method. Also, the contractive property of these methods have not been studied. The contraction theorem is a sufficient condition to ensure convergence for the choice of the starting points. In this work, we study and analyze the properties of one of the variants (AM’s method) and explain the origin of the third order accuracy via the Taylor expansion. In Section 4, we shall prove that the method is a contraction in a neighbourhood of the root in which the value of the function is assumed to be small enough. We show that the method is in fact a form of the Halley method while considering a limited Taylor expansion and both methods contract in the same neighbourhood. We also extend AM’s method to multivariate case and prove that it converges locally with cubic convergence from its linear model. In Section 5, we show that the third order accurate variants are in fact different forms of the Halley method in the neighbourhood of the root and their contraction follows from that of the Halley method. We also extend some of the other variants to the multivariate case. In the last section, we conduct a comparative study of these variants via numerical experiments and add concluding remarks.

Section snippets

Iterative methods

We summarize the important methods we have mentioned.

Newton’s method:xk+1N=xk-f(xk)f(xk).We observe that the third order accurate variants have a similar form, namely,xk+1=xk-f(xk)Dm(xk),m=1,2,8,but only differ in the denominator Dm (xk).

Halley’s method:D1(xk)=f(xk)-12f(xk)f(2)(xk)f(xk).Arithmetic mean Newton’s method:D2(xk)=12(f(xk+1N)+f(xk)).Mid-point Newton’s method:D3(xk)=f12(xk+1N+xk).VS variant (Hasanov):D4(xk)=16f(xk+1N)+4fxk+1N+xk2+f(xk).V1 variant (Nedzhibov):D5(xk)=14f(xk+1N

Order of convergence

Definition 1

Order of convergence [19]

If the sequence {xk} tends to a limit x in such a way thatlimxk+1-x(xk-x)q=Cfor some C  0 and q  1, then the order of convergence of the sequence is said to be q, and C is known as the asymptotic error constant.

If q = 1, q = 2 or q = 3, the convergence is said to be linearly, quadratically or cubic, respectively.

Let ek = xk  x be the error in the kth iterate of the method which produces the sequence {xk}. Then, the relationek+1=Cekq+O(ekq+1)=O(ekq)is called the error equation. The value of q is called

Derivation from Taylor series

We pointed out before that the Newton method and the arithmetic mean third order accurate Newton method can be obtained from the Newton theoremf(x)=f(xk)+xkxf(u)duby approximating the definite integral in Eq. (11) by a rectangle and the trapezoid rule respectively [19] . The Newton method can also be derived from a second order accurate limited Taylor series. In this way, we show that the AM method can be derived from a third order accurate limited Taylor expansion. We consider the Taylor

Properties and contraction of the other variants

In this section, we show that the other third order accurate variants are different forms of Halley’s method. We also show that their contractive property follows from the contraction of the Halley method.

Numerical examples

We give an overview of the Model Problems [1], [23] that we shall consider (see Table 2).

We compare the 11 methods in terms of the empirical mean value over a sample of m test values ωi,ωc¯=1mi=1mωi.Also, the empirical variance sc2 for m runs is defined bysc2=1m-1i=1m(ωi-ωc¯)2.All programs are written and run on matlab.

We use notations:

  • MN—Newton’s method;

  • H—Halley’s method;

  • AM—Arithmetic Mean Newton’s method;

  • MP—Mid-point Newton’s method;

  • VS—VS variant;

  • V1—V1 variant;

  • NS—Newton–Secant’s method;

  • HM

Summary and concluding remarks

In this work, we have shown that the third order variants of Newton’s method are third order accurate methods because they actually approximates the second order derivative present in a third order limited Taylor series. We have shown that these variants are in fact different forms of Halley’s method and are all contractive methods in the same neighbourhood. We have also extended some variants to multivariate cases and prove the local cubic convergence from their respective linear models. From

Acknowledgements

The first author is thankful to the Tertiary Education Commission (TEC), Mauritius and University of Mauritius for the partial finance support to his work.

References (25)

  • A.S. Householder

    The Numerical Treatment of a Single Nonlinear Equation

    (1970)
  • M.A. Hernandez

    An acceleration procedure of the Whittaker method by means of convexity

    Zb. Rad. Prirod.-Mat. Fer. Sak. Mat.

    (1990)
  • Cited by (69)

    • A unified model for solving a system of nonlinear equations

      2016, Applied Mathematics and Computation
    • Evaluation of iterative methods for solving nonlinear scalar equations

      2023, Iranian Journal of Numerical Analysis and Optimization
    View all citing articles on Scopus
    View full text