Estimation of Hurst exponent revisited

https://doi.org/10.1016/j.csda.2006.07.033Get rights and content

Abstract

In order to estimate the Hurst exponent of long-range dependent time series numerous estimators such as based e.g. on rescaled range statistic (R/S) or detrended fluctuation analysis (DFA) are traditionally employed. Motivated by empirical behaviour of the bias of R/S estimator, its bias-corrected version is proposed. It has smaller mean squared error than DFA and behaves comparably to wavelet estimator for traces of size as large as 215 drawn from some commonly considered long-range dependent processes. It is also shown that several variants of R/S and DFA estimators are possible depending on the way they are defined and that they differ greatly in their performance.

Introduction

Let Xtt=1 be a real-valued stationary time series such that its covariance function γ(k)CovXt,Xi+k cγ|k|-γ for |k|, where 0<γ<1 and denotes asymptotic equivalence. This is the most important case of long-range dependence (LRD), which in a general situation is defined by the condition k=0|γ(k)|=, and it encompasses two frequently studied processes having this property, namely a fractional Gaussian noise (fGn) and a fractional autoregressive integrated moving average (FARIMA). The phenomenon of long-range dependence is a topic of active research in statistics as well as in many areas of applied sciences e.g. in economics, geophysics and meteorology. We refer to Beran (1994) for a book-length treatment of this subject.

Assuming the above condition of the hyperbolic decay of covariance, it is easily seen that, with Yn=t=1nXt, we haveVarYn2cγ(1-γ)(2-γ)n2-γCγn2H,where H1-γ/2 is traditionally called the Hurst exponent (cf. Hurst (1951)). As the Hurst exponent describes the strength of dependence, its estimation is of a great interest. A process with a larger value of H is more regular and less erratic than a process with a smaller one. Most estimators are based on scaling properties similar to a property following from (1), namely that VarYnkk2HVarYn, for any kN. Thus e.g. regressing a logarithm of some estimator of the variance of the partial sums against the logarithm of its size yields an estimator of 2H. However, as in the case described above, this does not always yield a reliable estimator of H. For a discussion of such approach and a review of long-range dependence see Robinson (1994). A Monte Carlo experiment comparing performance of some such estimators is discussed in Taqqu et al. (1995).

We consider below three of the most popular estimators based on scaling property: R/S estimator based on rescaled adjusted range statistic, DFA estimator pertaining to detrended fluctuation analysis and a wavelet estimator. Important competitors include, among others, the log periodogram estimator (Geweke and Porter-Hudak, 1983) based on an approximate scaling of spectral density and the Whittle (1953) estimator defined as the maximizer of an approximate version of loglikelihood. A semiparametric version of the last proposal called the local Whittle (LW) or Gaussian semiparametric estimator (Robinson, 1995) is proved to be more efficient than the log periodogram regression estimator. Modifications of the LW estimator intended to reduce its bias when the underlying LRD process is contaminated by noise have been proposed and studied by Andrews and Sun (2004), Arteche (2004) and Hurvich et al. (2005). They are shown to have a smaller mean squared error (MSE) than the previous proposals when the noise to signal ratio is considerable (see e.g. Hurvich and Ray, 2003). For recent contributions concerning estimation of Hurst exponent see also Lai (2004) and Stoev et al. (2006).

In Sections 2 and 3 we show that the performance of the studied estimators depends crucially on a way they are constructed. Moreover, as the main contribution of the paper, we indicate that R/S estimator, which is immensely popular among practitioners, but is widely known to be suboptimal can be enhanced by a bias correction to the effect that it outperforms the DFA estimator with respect to the MSE and performs on par with the wavelet estimator. In Section 2 we describe possible variants of R/S estimator, choose one of them and show how its performance can be improved by a bias correction. The bias correction method is based on approximate linearity of the bias of considered R/S estimator and takes advantage of the exact value of the slope of the approximating line.

In Section 3 careful analysis of the DFA estimator is provided. In Section 4 we briefly describe the wavelet estimator and in Section 5 we compare the performance of all three semiparametric estimators and the Whittle estimator for FARIMA and fGn processes with provided sample sizes ranging from 29 to 215. An application to analysis of daily exchange rates listed by the National Bank of Poland is discussed. Section 6 concludes the paper.

Section snippets

R/S method

For a trace Xtt=1n of a time series, consider a partial sum Yk=t=1kXt,1kn, and a sample variance Sn2=(n-1)-1t=1nXt-X¯n2, where X¯n=n-1t=1nXt is a sample mean. Rescaled adjusted range statistic R/S(n) introduced by Hurst (1951) is defined asR/S(n)=1Snmax1knYk-knYn-min1knYk-knYn.Observe that the numerator Rn in (2) can be viewed as a range of partial sums of Xt-X¯n,t=1,,n, or, equivalently, as the sum of the maximal and the minimal distance of the partial sums Yk,k=1,,n from a line

Variants of DFA estimator

The second method to assess the strength of long-range dependence is based on a different measure of fluctuations of partial sums Yi=t=1iXt (Peng et al., 1994). Namely, instead of measuring the maximal deviation in both directions of Yk from the line joining Y0=0 and Yn one considers an average of squared vertical distances of Yi from the LS line SSE¯k=k-1SSEk=1ki=1kYi-aki-bk2,where aki+bk is the LS line fitted to points i,Yi,i=1,,k. Taqqu et al. (1995) proved that ESSE¯kCk2H for the fGn

Wavelet estimator of H

We briefly describe Abry–Veitch estimator of H and refer to Veitch and Abry (1999) for a detailed treatment. As before, let X1,,Xn denote an observable part of a trace of a discrete long-range dependent process such that its covariance function satisfies γ(k)cγk2H-2. We assume additionally that Xtt=1 is a Gaussian process. Denote by d^ik a wavelet coefficient pertaining to a wavelet ψ at resolution i and location k calculated through a pyramidal algorithm initiated by scaling coefficients at

Simulation results

In Tables 2 and 3 values of RMSE for H^R/S, H^dfa and H^d are displayed. H^R/S stands for the bias-corrected H^d,p estimator with i1=1, H^dfa denotes the DFA estimator H^m,s with i1=2 and the choice of the minimal block length of the wavelet estimator H^d is described in the previous section. It turns out that contrary to a common belief R/S estimator, after suitable correction, is the strong competitor to other estimators of the Hurst exponent. Namely, it has consistently lower RMSE than H^dfa

Conclusions

The goal of our paper was to construct a bias correction to the R/S estimator of the Hurst parameter H. The proposed method is based on an approximate linearity (as a function of H) of the bias of one of the possible variants of H^R/S, namely H^d,p defined in Section 2.1. A specific form of the employed bias correction is justified by Theorem 2 and Corollary 1. It turns out that the bias-corrected R/S estimator performs much better than its uncorrected versions and it compares favourably with

References (32)

  • J. Geweke et al.

    The estimation and application of long memory time series model

    J. Time Ser. Anal.

    (1983)
  • L. Giraitis et al.

    Rescaled variance and related tests for long memory in volatility and levels

    J. Econometrics

    (2003)
  • P. Hall et al.

    Semiparametric bootstrap approach to confidence intervals for the Hurst coefficient

    Statist. Inference for Stochastic Processes

    (2001)
  • Henry, M., Robinson, P.M., 1996. Bandwidth choice in Gaussian semiparametric estimation of long range dependence. In:...
  • H.R. Hurst

    Long-term storage in reservoirs

    Trans. Amer. Soc. Civil Eng.

    (1951)
  • C.M. Hurvich et al.

    The local Whittle estimator of long-memory stochastic volatility

    J. Financial Econometrics

    (2003)
  • Cited by (115)

    • A new multifractal-based deep learning model for text mining

      2024, Information Processing and Management
    View all citing articles on Scopus
    View full text