Joint segmentation of the wind speed and direction
Introduction
The problem of change-point detection has been studied considerably over the years. We found in the literature sequential or on-line segmentation methods [1] (in which analysis is performed as each new observation is gathered) and off-line methods for which it is considered that all the data are available at the same time and perform a global segmentation. One interesting area of study is the extension of change-point methods in the problem of the joint segmentation of piecewise stationary processes. In this paper we present a non-sequential Bayesian approach for the joint segmentation of wind direction and wind velocity supposed to be piecewise stationary.
The classical change-point problem supposes that the data are normally distributed with constant (but unknown) variance and the mean is suspected to undergo an abrupt change in value. The first works on this problem are due to Page [2] and Hinkley [3] for the detection of one change in the process. When the number of changes is unknown, the problem of change-point detection can be seen as a problem of model selection. In this case, each change configuration is a model and the number of changes is the dimension of the model. A classical solution to solve this problem is the penalized maximum likelihood. Let M be the model set, a model and the empirical penalized criterion:where are the estimated parameters of the model, and a penalty function (in our presentation, we choose log-likelihood of the noise distribution of the model observations t). The first work about such criteria is due to Mallows [4] and Akaike [5]. Both approaches are based on some unbiased estimation of the quadratic risk and aim at choosing a model which minimizes this risk: this is the efficiency point of view. Recent and more detailed works about this kind of criteria can be found in [6]. Another point of view about model selection consists in assuming the existence of a true model of minimal size and aiming at finding it. This is the case of change-point detection where all possible configurations of changes m are in M. The criteria described in [7], [8] have been designed to find the model with a probability tending to 1 when the variance of the process goes to 0: this is the consistency point of view. For those criteria the penalty term is a linear function of the model dimension. In this paper we propose a criterion deduced from the MAP estimate of the posterior distribution of the change instants conditionally to the process parameter estimation of a set of processes. We suppose the configuration of changes to be a sequence of Bernoulli variables [9] and we obtain for the penalty term a linear function of the model dimension. In the case of joint segmentation the penalty term is deduced from the prior law of the change instants. It is composed of parameters that guide the number and the position of changes and parameters that will bring prior information on the joint behavior of the process.
This work is applied to the statistical estimation of horizontal wind vector parameters. For modelling the transport and the diffusion of airborne pollutants, knowledge of the wind characteristics plays a crucial role in the numerical models of air quality [10], [11]. The dispersion of airborne material is mainly due to turbulent diffusion inherent in atmospheric motion. For realistic estimates of dispersion, it is therefore of primary importance to have an accurate description of atmospheric turbulence and wind vector parameters. The wind vector is described, in an Euclidean co-ordinate system, by three typical parameters: the vector of mean wind speed, the vector of mean wind direction and the angular standard deviation. The angular standard deviation is an indicator of atmospheric stability (Pasquill classes [12]). It allows appreciating the atmospheric turbulence taking into account the local topographical and climatic conditions. In our approach the wind velocity is expressed by two components, the wind speed and the wind direction, supposed to be piecewise stationary. The statistical properties of the wind speed are described by a positive linear random variable: the variable follows a log-normal distribution [13]. The wind direction is a circular random variable: it follows a Von Mises distribution [11]. In the preceding works the wind vector parameters were estimated on data supposed to be stationary. In this case the measurements were inaccurate and needed manual human intervention [10].
The problem, then, consists in recovering the instants of change in the recorded wind data. The wind velocity vector is described by an angle and a module. The value of these two parameters is available from a cup anemometer and a wind vane placed at the seafront. For this application we can define prior information on the behavior of the process. First, there is a correlation of the change instants with the tide hours. Second, the modifications of climatic conditions induce simultaneous changes in the two processes [14]. The goal of the joint estimation method we propose in this article is to integrate in the segmentation the prior information we have on the joint behavior of the process.
The paper is organized as follows. Section 2 describes the Bayesian technique. We detail, in this section, the joint estimation method used for estimating the posterior distribution of the change points. The parametric model and the MAP estimate of wind vector parameters are described in Section 3. In Section 4 we present numerical experiments on synthetic and real data.
Section snippets
A Bayesian approach
The problem we address in this paper is the detection of changes in the statistical distribution of a -dimensional process. Let be a non-stationary process with conditional law , and the set of J processes. We assume that the processes are piecewise stationary. Let be the configuration of change points in the process and the configuration of changes. takes the value 0 between two changes and 1 at the change instant. Let
Expression of the contrast function
The first decision taken at the beginning of this work was either to deal with measurements of wind velocity and wind direction (in polar co-ordinate system), assumed to be two different and independent time series, or to deal with the components of the horizontal wind vector (in Cartesian co-ordinate system). One of the advantages of the former is that this is the usual way to record wind data by the traditional sensors (anemometer and vane). Furthermore, this approach is more intuitive and it
Experiments on synthetic data
We show in this experiment the contribution of the joint segmentation for the detection of changes and the estimation of the statistical parameters of the wind vector. The sequence of changes is estimated by minimizing Eq. (20) with a simulated annealing algorithm described in Appendix B. In this experiment we define and as two sequences of 500 samples. There are three changes on at , and three changes on at . The vector of
Conclusion
We present in this article a Bayesian estimation technique for the joint estimation of piecewise stationary processes. This method is applied to the estimation of wind statistical parameters. The MAP estimate we propose is a penalized contrast function. The contrast function is deduced from the circular Von Mises parametric distribution of the wind direction and the log-normal distribution of the wind speed. The penalty term is composed of parameters that guide the number and the position of
Acknowledgements
The authors would like to thank the reviewers for their helpful and constructive comments.
References (17)
Estimating the number of change-points via schwarz criterion
Statist. Probab. Lett.
(1988)Estimators for the standard deviations of lateral, longitudinal and vertical wind components
Atmospheric Environment
(1998)- et al.
The detection of abrupt changes-theory and applications, Information and System Sciences Series
(1993) Estimating the point of change in a continuous process
Biometrika
(1957)Inference about the change point in a sequence of random variables
Biometrika
(1970)Some comments on Cp
Technometrics
(1974)Information theory and an extension of the maximum likelihood principle
- et al.
Gaussian model selection
J. European Math. Soc.
(2001)