Testing the equality of several covariance functions for functional data: A supremum-norm based test

https://doi.org/10.1016/j.csda.2018.02.002Get rights and content

Highlights

  • A supremum-norm based test for the equality of several covariance functions is proposed.

  • Asymptotic random expression of the test statistic is derived under the null hypothesis.

  • The new test is shown to be root-n consistent under a local alternative.

  • The new test is more powerful than L2-norm type tests for highly correlated data or with some local spikes.

  • Simulations and real data examples show the new test outperforms some competitors.

Abstract

Testing the equality of covariance functions is crucial for solving functional ANOVA problems. Available methods, such as the recently proposed L2-norm based tests work well when functional data are less correlated but are less powerful when functional data are highly correlated or with some local spikes, which are often the cases in real functional data analysis. To overcome this difficulty, a new test for the equality of several covariance functions is proposed. Its test statistic is taken as the supremum value of the sum of the squared differences between the estimated individual covariance functions and the pooled sample covariance function. The asymptotic random expressions of the test statistic under the null hypothesis and under a local alternative are derived and a non-parametric bootstrap method is suggested. The root-n consistency of the proposed test is also obtained. Intensive simulation studies are conducted to demonstrate the finite sample performance of the proposed test. The simulation results show that the proposed test is indeed more powerful than several existing L2-norm based competitors when functional data are highly correlated or with some local spikes. The proposed test is illustrated with three real data examples collected in a wide scope of scientific fields.

Introduction

Functional data arise in a wide scope of scientific fields such as biology, medicine, ergonomics among others. They are collected in a form of curves or images instead of traditional scalar or vector observations. Nowadays, this type of data is widely regarded as a basis element in data analysis. Compared with traditional data,functional data are worth exploring since they contain much more structural information. A lot of useful and effective tools have been developed over the past two decades to extract those information hidden in functional data. The reader is referred to Ramsay and Silverman (2005), Ferraty and Vieu (2006), Horváth and Kokoszka (2012) and Zhang (2013) and references therein for more details.

In recent decades, much attention has been paid to hypothesis testing of the mean functions for functional data. A general and direct testing procedure is the so-called pointwise t-test described by Ramsay and Silverman (2005). However, this pointwise t-test fails to give a global conclusion which is often needed in real data analysis. To overcome this drawback, Zhang et al. (2010) proposed an L2-norm based test whose test statistic is obtained as a squared L2-norm of the mean function differences of two functional samples. For one-way ANOVA problems for functional data, several interesting tests have been proposed in the literature, see the ANOVA test proposed in Cuevas et al. (2004) and the GPF test proposed in Zhang and Liang (2013). Both of their methods are extensions of the classical F-test. When conducting the above mean function testing procedures, one may first need to check the equality of the covariance functions of the functional samples involved, as many mean function testing procedures are based on the assumption that the functional samples involved have a common covariance function. Motivated by the need of testing the equality of the covariance functions of two functional samples, Panaretos et al. (2010) and Fremdt et al. (2013) proposed dimension reduction based approaches to address this issue. However, their methods are based on functional principal component analysis so that their results often strongly depend on the number of selected functional principal components. More discussions about the drawbacks of the dimension reduction based methods can be found in Paparoditis and Sapatinas (2016) and Guo et al. (2016).

Although some recent works have shed some light on how to test the equality of the covariance functions of two functional samples, how to test the equality of the covariance functions of several functional samples receives relatively little attention. Paparoditis and Sapatinas (2016) described a general bootstrap procedure for the problem but only studied a two-sample test. Zhang (2013) proposed an L2-norm based test for the multi-sample equal-covariance function testing problem with the Welch–Satterthwaite χ2-approximation and a random permutation approximation. Guo et al. (2016) further studied the asymptotic power and finite sample performance of the L2-norm based test. However, as shown in Guo et al. (2016), these L2-norm based tests are less powerful when the functional data are highly correlated in the sense that the dependence of discrete measured values in a single observed functional curve is high, which is a common feature of functional samples due to their inherent smoothness. The smoothness of functional data arises from the fact that the discrete measured values in a single functional observation are obtained sequentially over a continuum such as time. In addition, the above L2-norm based tests have limited ability to perceive covariance function differences with some local spikes because the L2-norm is a global measure that tends to obscure local characteristics. To surmount these challenges, in this paper, we develop a supremum-norm based test which is good at detecting the covariance function differences when the functional data are moderately or highly correlated. It is also powerful in detecting the covariance function differences with some local spikes. The asymptotic random expressions of the proposed test statistic under the null hypothesis and a local alternative are derived. A nonparametric bootstrap method is proposed to approximate the associated null distribution. The proposed test is demonstrated via intensive simulation studies and three real data applications.

The rest of the paper is organized as follows. We present the main results in Section 2 and intensive simulations in Section 3. We illustrate the methodologies using three real data examples in Section 4. The technical proofs of the main results are given in Appendix A.

Section snippets

Main results

Throughout this paper, let SP(η,γ) and GP(η,γ) respectively denote a stochastic process and a Gaussian process with mean function η(t) and covariance function γ(s,t) where s and t can be scalars or vectors. Let yi1(t),yi2(t),,yini(t), i=1,2,,k be independent functional samples over a given finite time interval T=[a,b], <a<b<, which satisfy yij(t)=ηi(t)+vij(t),j=1,2,,ni,vi1(t),vi2(t),,vini(t)i.i.d.SP(0,γi);i=1,2,,k,where η1(t),η2(t),,ηk(t) model the unknown group mean functions of the k

Simulation studies

For the ECF testing problem (2), Zhang (2013) proposed an L2-norm based test whose null distribution can be approximated by a naive method, a bias-reduced method, and a random permutation method. The associated L2-norm based tests may be denoted as Lnv2,Lbr2 and Lrp2, respectively for easy reference. In this section, we present some simulation studies, aiming to compare the Tmax-test against Lnv2,Lbr2 and Lrp2 under various simulation configurations. In the simulation studies, we generate k

Applications to three real data examples

In this section, we illustrate the applications of Tmax, together with Lnv2,Lbr2 and Lrp2, to three real data examples. All these tests used the same mean and covariance function estimations based on (3). Throughout this section, the p-values of Lrp2 and Tmax were obtained via 10,000 runs of random permutation or nonparametric bootstrapping. From these three examples, we shall see that Tmax often has higher power than Lnv2,Lbr2 and Lrp2 in detecting the covariance function differences of

Acknowledgments

The work was financially supported by the National University of Singapore Academic Research grant R-155-000-175-114. The first author was also financially supported by the Australian Research Council (ARC) Centre of Excellence for Mathematical and Statistical Frontiers (ACEMS). The authors thank the Editor-in-Chief, the associated editor, and two reviewers for their valuable comments and suggestions which helped improve the paper presentation substantially.

References (24)

  • CuevasA. et al.

    An ANOVA test for functional data

    Comput. Statist. Data Anal.

    (2004)
  • EddelbuettelD. et al.

    RcppArmadillo: Accelerating R with high-performance C++ linear algebra

    Comput. Statist. Data Anal.

    (2014)
  • AnalyticsR. et al.

    foreach: Provides foreach Looping Construct For R

    (2015)
  • BillingsleyP.

    Convergence of Probability Measures

    (2009)
  • EubankR.L.

    Nonparametric regression and spline smoothing

    (1999)
  • FanJ. et al.

    Local Polynomial Modelling and Its Applications

    (1996)
  • FebreroM. et al.

    Outlier detection in functional data by depth measures, with application to identify abnormal NOx levels

    Environmetrics

    (2008)
  • FerratyF. et al.

    Nonparametric Functional Data Analysis: Theory and Practice

    (2006)
  • FremdtS. et al.

    Testing the equality of covariance operators in functional samples

    Scand. J. Stat.

    (2013)
  • GuoJ. et al.

    A Further Study of an L2-norm Based Test for the Equality of Several Covariance Functions

    (2016)
  • GuoJ. et al.

    ECFsup: equal covariance functions testing by L2-norm and sup-norm

    (2017)
  • HorváthL. et al.

    Inference for Functional Data with Applications

    (2012)
  • Cited by (0)

    View full text