The valid regions of Gram–Charlier densities with high-order cumulants

doi:10.1016/j.cam.2021.113945

Journal of Computational and Applied Mathematics

Volume 407, June 2022, 113945

https://doi.org/10.1016/j.cam.2021.113945

Abstract

Based on derivatives of a Gaussian density, the Gram–Charlier series is an infinite expansion. Its truncated form is often used in many fields to approximate probability density functions. Although the expansions are useful, the cumulants (or moments) must lie in a constrained region for the truncated series to be a valid (nonnegative) probability density function. When the truncation order is low (only fourth order), the truncated Gram–Charlier density may fail to approximate an implied probability distribution closely, especially for distributions that are not sufficiently close to a normal distribution. One might increase the order at which the series is truncated until a perfect fit is achieved. However, the existing literature usually truncates the series expansion at the fourth-order term because valid regions become difficult to find beyond that order. This paper shows how the valid region of higher cumulants can be computed numerically by semi-definite programming, which ensures that a series truncated at a cumulant of arbitrary even order represents a valid probability density. We provide examples of valid regions for the sixth- and eighth-order Gram–Charlier densities (i.e., truncated at the sixth and eighth terms). Our analysis proves that valid regions can be broadened with higher-order expansions. Furthermore, we show the impact of higher cumulants on the valid regions.

Introduction

The primary historical purpose of the standard Gram–Charlier (GC hereafter) expansion and the Edgeworth series is to extend the Gaussian distribution when new information, such as moments or cumulants, is given that does not match that of the Gaussian. The GC and Edgeworth expansions were established at the end of the 19th century and the beginning of the 20th century. Since then, these expansions have been commonly used in many fields, from mathematics, physics, and finance to astronomy and oceanography. In finance, this approach stems from the pioneering work of Sargan [1], [2] and Phillips [3], [4], who developed the theory of Edgeworth and GC expansions to derive the approximate distribution function of econometric estimators. Jarrow and Rudd [5] show how an arbitrary distribution can approximate a given stochastic process of an underlying security in terms of a series expansion involving second and higher moments. Jondeau and Rockinger [6] apply the constrained expansion to the estimation of risk-neutral densities. Corrado [7] derives a hidden martingale restriction for option pricing models based on GC expansions of the normal density function. In astronomy, Blinnikov and Moessner [8] draw attention to the form of this expansion and present an algorithm to illustrate the problems arising when fitting spectral line profiles of galaxies, supernovae, or other stars. In oceanography, [9] use the GC series to analyze the capabilities and limitations of simulating the probability density of rough sea surface elevations. In mathematics, [10] introduces a new approximation involving a four-dimensional dynamical system that uses the skewness and kurtosis of the queueing distribution via the GC expansion.

Although convenient, the series expansion has the shortcoming that, once truncated, the GC density is not well defined; i.e., it is not nonnegative for all values of its parameters in the parameter space. Several authors have disregarded this issue, and some have reported parameters that do not yield a proper probability density function. If the nonnegativity of a truncated GC density is not preserved, option prices may be inconsistent. For example, if the function used as a density is negative over an interval $(a, b)$, then a digital option that pays off only when the log return is in that interval will have a negative price. Consequently, a proper probability density function is the only safe choice. For this purpose, different alternatives have been proposed in many comparative studies, depending on the end use of a model, namely: (i) accurate selection of initial values for the maximum likelihood (ML) algorithms [11]; (ii) density function transformations; and (iii) parametric constraints.
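The digital-option example can be illustrated with a minimal Python sketch. It assumes a fourth-order truncated GC function of the form $[1 + \frac{\kappa_3}{3!} He_3(x) + \frac{\kappa_4}{4!} He_4(x)] N(0, 1)$ on the standardized log return, no discounting, and hypothetical parameter values ($\kappa_3 = 0$, $\kappa_4 = 6$) chosen only so that the function is negative on part of the real line. The helper names are illustrative and do not come from the paper.

```python
import math

def hermite_e(n, x):
    """Probabilists' Hermite polynomial He_n(x) via He_{k+1} = x*He_k - k*He_{k-1}."""
    h_prev, h = 1.0, x
    if n == 0:
        return h_prev
    for k in range(1, n):
        h_prev, h = h, x * h - k * h_prev
    return h

def gc4_density(x, k3, k4):
    """Fourth-order truncated GC function [1 + k3/3! He_3 + k4/4! He_4] * N(0, 1)."""
    phi = math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)
    return (1.0 + k3 / 6.0 * hermite_e(3, x) + k4 / 24.0 * hermite_e(4, x)) * phi

def digital_price(a, b, k3, k4, steps=2000):
    """Undiscounted digital price: trapezoidal integral of the GC function over (a, b)."""
    h = (b - a) / steps
    total = 0.5 * (gc4_density(a, k3, k4) + gc4_density(b, k3, k4))
    total += sum(gc4_density(a + i * h, k3, k4) for i in range(1, steps))
    return total * h

# Hypothetical parameters: zero skewness, fourth cumulant 6 (assumed, for illustration).
price = digital_price(1.4, 2.0, k3=0.0, k4=6.0)
print(price)  # negative "price" for the interval (1.4, 2.0)
```

With these values the polynomial factor equals $(x^4 - 6x^2 + 7)/4$, which is negative whenever $3 - \sqrt{2} < x^2 < 3 + \sqrt{2}$, so any digital option whose payoff interval lies inside that band receives a negative value even though the function still integrates to one.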

The first method, estimating GC densities by ML, presents important drawbacks. One is the shape of the likelihood surface, which has flat areas and jumps. These features may lead the maximization algorithms to local optima or even to non-convergence. As an alternative to ML algorithms, [12] propose the method of moments (MM) to estimate the parameters of GC densities and show that this method is straightforward to implement because these parameters are linear functions of the density moments. However, neither ML nor MM algorithms solve the “nonnegativity problem” unless a very large expansion is implemented.
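The linearity mentioned above can be sketched as follows. By the orthogonality of the probabilists' Hermite polynomials under the standard normal weight, a density of the form $[1 + \sum_n \frac{\gamma_n}{n!} He_n(x)] N(0, 1)$ satisfies $E[He_n(X)] = \gamma_n$, so each coefficient can be estimated by a sample average. The function name and the toy sample below are assumptions for illustration; the estimator in [12] may be parameterized differently.

```python
import math

def hermite_e(n, x):
    """Probabilists' Hermite polynomial He_n(x) via He_{k+1} = x*He_k - k*He_{k-1}."""
    h_prev, h = 1.0, x
    if n == 0:
        return h_prev
    for k in range(1, n):
        h_prev, h = h, x * h - k * h_prev
    return h

def gc_coefficients_mm(sample, order):
    """Method-of-moments estimates of gamma_3..gamma_order as sample means of He_n(z)."""
    n = len(sample)
    mean = sum(sample) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in sample) / n)
    z = [(v - mean) / std for v in sample]  # standardize to zero mean, unit variance
    return {k: sum(hermite_e(k, zi) for zi in z) / n for k in range(3, order + 1)}

# Toy two-point sample (hypothetical data): symmetric, so gamma_3 is zero.
coeffs = gc_coefficients_mm([-1.0, 1.0], order=4)
print(coeffs)
```

Because $He_3(z) = z^3 - 3z$ and $He_4(z) = z^4 - 6z^2 + 3$, the estimates of $\gamma_3$ and $\gamma_4$ are the sample skewness and the sample excess kurtosis of the standardized data. Nothing in this estimator keeps the fitted function nonnegative.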

The second method is based on density function transformations. Sargan [1] brought GC and Edgeworth series into semi-nonparametric (SNP hereafter) econometrics. Many related studies have developed both univariate and multivariate distributions based on GC densities, attempting to ensure nonnegativity by exploiting the orthogonality of the Hermite polynomials. Several methods ensure nonnegativity, such as those of [13], [14], who introduced straightforward solutions that define the density in terms of the squared weighted sum of Hermite polynomials. The positive Edgeworth–Sargan distribution, the simplest case, is used in some studies to forecast heavy-tailed densities, such as [15]. For applications in asset pricing, [16] derive the statistical properties of SNP densities, whose advantage is that they are always positive and flexible; that paper also shows how to value financial derivatives with SNP densities. For applications in risk modeling, [17] introduce the SNP approach for modeling Bitcoin risk relative to other parametric distributions and volatility models. Ñíguez et al. [18] show that flexible probability distribution functions, in addition to being able to capture stylized facts of financial returns, can be used to identify pure higher-order effects of investors’ optimizing behavior. Cohen [19] presents an inspiring approach by defining the probability density functions as the square of other complex functions, which are analogous to quantum mechanical wave functions. The advantage of this approach is that the nonnegativity restriction is automatically satisfied.
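A minimal sketch of the squared-sum idea, assuming a density of the form $f(x) = \left(\sum_{i=0}^{m} d_i He_i(x)\right)^2 N(0, 1) / \sum_{i=0}^{m} d_i^2\, i!$. The normalizing constant follows from $\int He_i He_j\, N(0, 1)\, dx = i!\, \delta_{ij}$. The weights `d` are hypothetical, and the exact parameterizations used in [13], [14] may differ.

```python
import math

def hermite_e(n, x):
    """Probabilists' Hermite polynomial He_n(x) via He_{k+1} = x*He_k - k*He_{k-1}."""
    h_prev, h = 1.0, x
    if n == 0:
        return h_prev
    for k in range(1, n):
        h_prev, h = h, x * h - k * h_prev
    return h

def snp_density(x, d):
    """Squared weighted sum of Hermite polynomials times N(0, 1), normalized to unit mass."""
    poly = sum(d_i * hermite_e(i, x) for i, d_i in enumerate(d))
    norm = sum(d_i ** 2 * math.factorial(i) for i, d_i in enumerate(d))
    phi = math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)
    return poly ** 2 * phi / norm

# Hypothetical weights d_0..d_3; any real values (not all zero) give a valid density.
d = [1.0, 0.2, -0.3, 0.1]
grid = [-12.0 + 0.001 * i for i in range(24001)]
values = [snp_density(x, d) for x in grid]
mass = 0.001 * (sum(values) - 0.5 * (values[0] + values[-1]))  # trapezoidal rule
print(min(values) >= 0.0, round(mass, 6))
```

The square makes nonnegativity automatic for every choice of weights. The cost is that the weights no longer map linearly to cumulants, which is one reason the parametric-constraint route is studied separately.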

Parametric constraints that prevent the density approximation from taking negative values amount to characterizing the admissible set of moments or cumulants, since the polynomial factors are determined by the moments or cumulants of the distribution. The problem was first highlighted by Shenton [20], who guarantees nonnegativity by evaluating the efficiency of moment estimation for the GC densities. Since conditions on the coefficients of a quartic polynomial that ensure the absence of real roots were known from the dialytic method of Sylvester, Barton and Dennis [21] extend the work of Shenton and propose a set of polynomial inequalities in the skewness and kurtosis. The kurtosis-skewness pairs on the boundary of the positive definite and unimodal regions can then be obtained by solving these inequalities numerically. Draper and Tierney [22] re-examine the results given by Barton and Dennis and correct a minor error in their plot of the unimodal region for the Edgeworth series. In [6], the nonnegativity problem is tackled through parametric restrictions. Later, Kwon [23] gives an elegant analytical expression for the polynomial equations that determine the positive definite and unimodal regions of the GC densities, and is the first to establish a direct relationship between the skewness and kurtosis. In Kwon’s paper, the polynomial equations are solved explicitly to express the standardized skewness on the two boundaries analytically as functions of the standardized kurtosis. This enables the skewness corresponding to any kurtosis on the two boundaries to be computed exactly and analytically, and avoids the reliance on tabulated values that require further interpolation. However, these solutions are not always the best option, since nonnegative regions are not easy to define beyond the simpler cases.
The applications of these GC densities in finance, mainly for asset or option pricing, have not usually considered expansions beyond the fourth-order term (i.e., expansions defined in terms of a pair of moments, usually skewness and kurtosis). “Unfortunately, [6] only determined those restrictions for $n = 4$, because it becomes exceedingly difficult to find them for higher $n$”, as [16] point out. Our goal in this paper is to fill this gap.

In this paper, the regions of higher moments or cumulants that guarantee positive definiteness are determined by semi-definite programming (SDP hereafter). In theory, the domain of moments and cumulants of any order that keeps the truncated GC density nonnegative can be computed by SDP. According to the SDP results, adding higher-order terms to the GC density significantly widens the range of admissible cumulants, which allows a closer fit to the cumulants or moments observed in the market. The second contribution of this paper is that we gauge and analyze the impact of higher cumulants on skewness and kurtosis. We use SDP to calculate all possible values of skewness and kurtosis when both higher-order cumulants are fixed at all possible values. We also identify a shortcoming of GC densities: when one of the cumulants is fixed at an extreme value (minimum or maximum), SDP confines the other cumulants to a quite narrow region. Finally, given the length of this paper, we focus on the theoretical extension of the truncated GC series and leave applications to asset pricing and risk measurement to future research.
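The link between nonnegativity and SDP can be sketched with a standard fact about univariate polynomials: a polynomial of even degree $N$ is nonnegative on the whole real line exactly when it is a sum of squares, which is equivalent to the existence of a positive semi-definite Gram matrix. The paper's own formulation is in its Appendix A, which is not reproduced here, so the symbols $p_k$, $z(x)$ and $Q$ below are illustrative assumptions. Here $\gamma_n$ denotes the coefficient of $He_n(x)$ in the truncated series.

$$
p(x) = 1 + \sum_{n=3}^{N} \frac{\gamma_n}{n!} He_n(x) = \sum_{k=0}^{N} p_k x^k \ge 0 \quad \forall x \in \mathbb{R}
\;\Longleftrightarrow\;
\exists\, Q \succeq 0 : \; p(x) = z(x)^{\top} Q\, z(x), \quad z(x) = (1, x, \dots, x^{N/2})^{\top},
$$

$$
p_k = \sum_{i + j = k} Q_{ij}, \qquad k = 0, 1, \dots, N, \quad 0 \le i, j \le N/2 .
$$

Each $p_k$ is an affine function of the coefficients $\gamma_3, \dots, \gamma_N$, so the admissible coefficients are described by linear equality constraints together with the constraint $Q \succeq 0$. Testing whether a given point is admissible, or optimizing a linear function such as the largest admissible $\gamma_3$ for a fixed $\gamma_4$, is therefore a semi-definite program.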

The remainder of this paper is organized as follows. In Section 2, we provide some necessary mathematical foundations of the GC densities. In Section 3, we review the positive region and skewness of the truncated GC expansion in terms of the first two moments. The impact of higher cumulants on the positive definite region is discussed in Section 4, together with the convex optimization, SDP, outlined in Appendix A. Three specific cases are described in Section 4 to show the range of higher cumulants. In Section 5, we emphasize the analysis of the impact of higher cumulants on the positive definite region. Concluding remarks are offered in Section 6.

Section snippets

Mathematical foundation of Gram–Charlier density

First, we introduce the notation and mathematical preliminaries that will be used later in the paper. A general formula relating any two probability density functions is derived and discussed in this section. Generalizations of the GC-type expansion and the Edgeworth-type expansion are presented. Under some restrictions, these generalized expansions reduce to the classical Edgeworth and GC series. Finally, we review a special kind of convex optimization problem, SDP and its dual

The valid region and the skewness of fourth-order GC density

One main advantage of the GC expansion is that skewness and kurtosis appear directly as parameters. In addition, the GC expansion can be shown to be a convenient and quick method of estimating the risk-neutral density. Nevertheless, there is a fundamental difficulty with these methods if the series is truncated. Whether the truncated series Eq. (11) is nonnegative for all $x$ is an important question.

To solve this problem, [6], for example, focused on the fourth-order GC density, $P_{2}(x) = \left[1 + \frac{1}{3!} \kappa_{3} He_{3}(x) + \frac{1}{4!} \kappa_{4} He_{4}(x)\right] N(0, 1)$
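The fourth-order question can be explored with a brute-force grid check, sketched below. This is neither the analytical boundary of [6] or [23] nor SDP. It only tests the polynomial factor $1 + \frac{\kappa_3}{3!} He_3(x) + \frac{\kappa_4}{4!} He_4(x)$ on a finite grid, so a very narrow negative region could in principle be missed. The function names, grid limits and tolerances are assumptions for illustration.

```python
def hermite_e(n, x):
    """Probabilists' Hermite polynomial He_n(x) via He_{k+1} = x*He_k - k*He_{k-1}."""
    h_prev, h = 1.0, x
    if n == 0:
        return h_prev
    for k in range(1, n):
        h_prev, h = h, x * h - k * h_prev
    return h

def is_valid_gc4(k3, k4, x_max=10.0, steps=4000):
    """True if 1 + k3/3! He_3 + k4/4! He_4 >= 0 at every grid point in [-x_max, x_max]."""
    for i in range(steps + 1):
        x = -x_max + 2.0 * x_max * i / steps
        if 1.0 + k3 / 6.0 * hermite_e(3, x) + k4 / 24.0 * hermite_e(4, x) < 0.0:
            return False
    return True

def max_valid_skewness(k4, upper=2.0, tol=1e-4):
    """Bisect for the largest k3 >= 0 that passes the grid check at a fixed k4."""
    lo, hi = 0.0, upper
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if is_valid_gc4(mid, k4):
            lo = mid
        else:
            hi = mid
    return lo

print(is_valid_gc4(0.0, 2.0), is_valid_gc4(0.0, 5.0), is_valid_gc4(0.0, -1.0))
print(max_valid_skewness(2.0))
```

Bisection is justified because the set of valid $(\kappa_3, \kappa_4)$ pairs is convex (an intersection of half-planes, one per $x$) and symmetric in $\kappa_3$, so for a fixed valid $\kappa_4$ the admissible skewness values form an interval centered at zero.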

The impact of higher-order coefficients/cumulants on the valid region

After defining the probabilists' (statistical) Hermite polynomial $He_{n}(x) = (-1)^{n} e^{x^{2}/2} \frac{d^{n}}{dx^{n}} e^{-x^{2}/2},$ we rewrite Eq. (8) and obtain $P_{2}(x) = \left[1 + \frac{1}{3!} \kappa_{3} He_{3}(x) + \frac{1}{4!} \kappa_{4} He_{4}(x) + \frac{1}{5!} \kappa_{5} He_{5}(x) + \frac{1}{6!} (\kappa_{6} + 10 \kappa_{3}^{2}) He_{6}(x) + \dots\right] N(0, 1),$ which is the simplest type and is used in many studies, for example, [6]. If the series is truncated, $P_{2}(x) \sim \left[1 + \frac{\gamma_{3}}{3!} He_{3}(x) + \frac{\gamma_{4}}{4!} He_{4}(x) + \dots + \frac{\gamma_{N}}{N!} He_{N}(x)\right] N(0, 1),$ where $\gamma_{i}$ refers to the coefficient of $He_{i}(x)$ and $N$ can be any even integer. Each of the high-order terms shapes both tails of the normal density. $P_{2}(x)$
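For reference, the Hermite polynomials that enter the expansion up to sixth order follow from the definition above, or equivalently from the recurrence $He_{n+1}(x) = x\, He_{n}(x) - n\, He_{n-1}(x)$ with $He_{0}(x) = 1$ and $He_{1}(x) = x$:

$$
\begin{aligned}
He_{3}(x) &= x^{3} - 3x, \\
He_{4}(x) &= x^{4} - 6x^{2} + 3, \\
He_{5}(x) &= x^{5} - 10x^{3} + 15x, \\
He_{6}(x) &= x^{6} - 15x^{4} + 45x^{2} - 15 .
\end{aligned}
$$

Even-order terms are symmetric in $x$ and odd-order terms are antisymmetric, so with $N$ even the leading term $\frac{\gamma_{N}}{N!} x^{N}$ dominates in both tails. A necessary condition for nonnegativity is therefore $\gamma_{N} \ge 0$.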

Analysis

For a full description, Fig. 7 shows three ($\gamma_{4}$, $\gamma_{3}$)-regions in the same figure. The purple, orange and blue lines refer to the fourth-order, sixth-order, and eighth-order GC densities. We find that the range of kurtosis is also broadened and now includes negative values. Heuristically, Case 2 and Case 3 suggest that these higher-order terms in GC densities are needed if one would like to obtain a broader range of cumulants to improve the fit to market data. It is reasonable that the

Conclusion

The use of GC densities presents a shortcoming because empirical applications require that the truncated GC expansion is positive (i.e., well-defined) in the whole parameter space. Different solutions have been proposed in the literature to solve this problem, but they have not provided the exact positive parameter region when higher-order terms are added.

In this paper, we derive and establish the cumulant/coefficient constrained region where the GC series truncated at an arbitrary even order

Acknowledgments

We would like to acknowledge the helpful and thought-provoking comments from two anonymous referees. Jin E. Zhang has been supported by an establishment grant from the University of Otago. Wei Lin gratefully acknowledges the University of Otago Doctoral Scholarship and the Scientific Research Foundation for Scholars of Hangzhou Normal University under Grant 4085C50220204089. Wei Lin is also supported by Zhejiang Provincial Natural Science Foundation of China (

References (28)

Jarrow, Robert, et al. Approximate option valuation for arbitrary stochastic processes. J. Financ. Econ. (1982)
Jondeau, Eric, et al. Gram–Charlier densities. J. Econom. Dynam. Control (2001)
Del Brio, Esther B., et al. Gram-Charlier densities: Maximum likelihood versus the method of moments. Insurance Math. Econom. (2012)
Jimenez, Ines, et al. Risk quantification and validation for Bitcoin. Oper. Res. Lett. (2020)
Schlögl, Erik. Option pricing where the underlying assets follow a Gram-Charlier density of arbitrary order. J. Econom. Dynam. Control (2013)
Merton, Robert C. Option pricing when underlying stock returns are discontinuous. J. Financ. Econ. (1976)
Sargan, John D. Gram-Charlier approximations applied to t ratios of k-class estimators. Econometrica (1975)
Sargan, John D. Econometric estimators and the Edgeworth approximation. Econometrica (1976)
Phillips, Peter C.B. A general theorem in the theory of asymptotic expansions as approximations to the finite sample distributions of econometric estimators. Econometrica (1977)
Phillips, Peter C.B. Edgeworth and saddlepoint approximations in the first-order noncircular autoregression. Biometrika (1978)
Corrado, Charles. The hidden martingale restriction in Gram-Charlier option prices. J. Futures Mark. (2007)
Blinnikov, Sergei, et al. Expansions for nearly Gaussian distributions. Astron. Astrophys. Suppl. Ser. (1998)
Zapevalov, Alexander S., et al. Simulating of the probability density of sea surface elevations using the Gram-Charlier series. Oceanology (2011)
Pender, Jamol. Gram-Charlier expansion for time varying multiserver queues with abandonment. SIAM J. Appl. Math. (2014)


^☆: ACKNOWLEDGMENTS: We would like to acknowledge the helpful and thought-provoking comments from two anonymous referees. Jin E. Zhang has been supported by an establishment grant from the University of Otago. Wei Lin gratefully acknowledges the University of Otago Doctoral Scholarship and the Scientific Research Foundation for Scholars of Hangzhou Normal University under Grant 4085C50220204089. Wei Lin is also supported by Zhejiang Provincial Natural Science Foundation of China (LQ22A010003). We declare that we have no relevant or material financial interests that relate to the research described in this paper. All remaining errors are ours.
