Scale and shape mixtures of matrix variate extended skew normal distributions

https://doi.org/10.1016/j.jmva.2020.104649Get rights and content

Abstract

In this paper, we propose a matrix extension of the scale and shape mixtures of multivariate skew normal distributions and present some particular cases of this new class. We also present several formal properties of this class, such as the marginal distributions, the moment generating function, the distribution of linear and quadratic forms, and the selection and stochastic representations. In addition, we introduce the matrix variate tail conditional expectation measure and derive this risk measure for the scale and shape mixtures of matrix variate extended skew normal distributions. We present an efficient EM-type algorithm for the computation of maximum likelihood estimates of parameters in some special cases of the proposed class. Finally, we conduct a small simulation study and fit various special cases of the new class to a real dataset.

Introduction

Matrix variate distributions have a central role in multivariate analysis, and have been useful in many fields such as psychology, physics and economics. This type of distributions have proven useful for describing and modeling repeated measurements in multivariate variables. In this context, the observations are not necessarily independent. As an example, consider n blood tests for k individuals where in each test, p variables are measured. In this situation, a random matrix X of size p×n is then observed for each individual. In most cases, however, the focus has been focused on matrix variate elliptical distributions, particularly on the matrix variate normal distribution. In the books of Fang and Zhang [11], Gupta and Nagar [15] and Gupta et al. [17] we can find a comprehensive collection of the most important results on matrix variate elliptical distributions.

The matrix variate normal distribution is one of the most well-known matrix variate elliptical distribution. For example, its quadratic forms are used in the analysis of the multivariate linear models. A random matrix X of size p×n is said to have a matrix variate normal distribution with parameters M, Σ and Ψ and is denoted by XNp×n(M,ΨΣ), if its probability density function (pdf) is given by ϕp×n(X;M,ΨΣ)=(2π)np2|Ψ|p2|Σ|n2etr{12Ψ1(XM)Σ1(XM)},where etr{A}=exp{tr(A)}. Here, M is a matrix of size p×n corresponding to the mean of X, while Σ and Ψn×n are positive definite matrices of size p×p and n×n, respectively, such that ΨΣ is the covariance matrix of vec(X), where vec() denotes the vectorization operator.

Matrix variate elliptical distributions, however, cannot take into account the skewness present in the data, which can lead to incorrect conclusions in our analysis. For this reason, in recent years there has been a growing interest in building more flexible matrix variate distributions so that they can adjust the skewness of the data. For instance, a first skew version of the matrix variate normal distribution is introduced by Chen and Gupta [6]. They proposed properties of matrix variate skew normal distribution. A few years later, Domínguez-Molina et al. [10] introduced the matrix variate closed skew normal distribution and gave two constructions for it; and Harrar and Gupta [18] generalized the results of Chen and Gupta [6] and provided a stochastic representation for the matrix variate skew normal distribution. More recently, Zheng et al. [24] studied the inverse problem of matrix variate skew normal distributions; Zheng et al. [25] obtained moments and quadratic forms of matrix variate skew normal distributions by using the moment generating function of the matrix variate closed skew normal distribution. In the same way, Ning and Gupta [21] introduced the matrix variate extended skew normal (ESN) distribution based on the result of Harrar and Gupta [18]. According to these authors, a random matrix X of size p×n is said to have a ESN distribution, denoted by XESNp×n(M,ΨΣ,Ω,λ,δ), if its pdf is given by fESN(X;M,ΨΣ,Ω,λ,δ)=1Φn(δ;Ω+λλΨ)ϕp×n(X;M,ΨΣ)Φn(δ+(XM)Σ12λ;Ω),where M is a p×n location matrix, Σ is a p×p positive definite matrix, Ω and Ψ are n×n positive definite matrices, ϕp×n(.;M,ΨΣ) is the pdf of Np×n(M,ΨΣ) distribution, λRp, δRn and Φn(.;Ω) is the cumulative distribution function (cdf) of the n-variate normal distribution with mean vector and covariance matrix 0 and Ω, respectively. The moment generating function (mgf) of ESNp×n(M,ΨΣ,Ω,λ,δ) is MESN(T;M,ΨΣ,Ω,λ,δ)=etr{TM+12ΣTΨT}Φn(δ;Ω+λλΨ)Φn(δ+ΨTΣ12λ;Ω+λλΨ),where TRp×n.

On the other hand, there is relatively little work for mixtures of matrix variate distributions. For example, Díaz-García and Gutiérrez-Jáimez [8] proposed the class of scale mixture of matrix variate Kotz-type distributions; Doğru et al. [9] considered finite mixtures of matrix variate t distributions; Gallaugher and McNicholas [13] used some finite mixtures of skewed matrix variate distributions for clustering; Gallaugher and McNicholas [12], [14] developed a total of four skewed matrix variate distributions based on a matrix normal variance-mean mixture. In same way, Gupta and Varga [16] proposed the scale mixture of matrix variate normal (SMN) distributions and provided its pdf as follows: g(X)=SJϕp×n(X;M,ΨL(s)Σ)dJ(s),where ϕp×n(;M,ΨL(s)Σ) is the pdf of Np×n(M,ΨL(s)Σ), L is a weight function and J is a distribution function on (0,+]. We denote it by SMNp×n(M,ΨΣ;L,J).

The main purpose of this paper is to develop the theory of matrix variate flexible distributions by defining the scale and shape mixtures of ESN distributions (hereinafter SSMESN), presenting some particular cases and studying main properties of them. The remainder of this paper is laid out as follows. In Section 2, we give the definition of the SSMESN distributions and propose some subclasses such as scale mixtures of matrix variate ESN (hereinafter SMESN) distributions. Some basic properties are studied in Section 3. In Section 4, we discuss the different representations of the SSMESN distributions. The matrix variate tail conditional expectation (TCE) measure is defined and, also, it is derived for the SSMESN family in Section 5. In Section 6, we obtain maximum likelihood estimator of some special cases of the SSMESN distributions by using the EM algorithm. In Section 7, a small simulation study is presented for evaluating the performance of the maximum likelihood estimators. Finally, some matrix variate distributions in the SSMESN family are fitted to a real dataset in Section 8.

Section snippets

Scale and shape mixtures of matrix variate ESN distributions

In this section, we develop the SSMESN family of matrix variate distributions defined as follows.

Definition 1

Suppose that θ and ω are two random variables with joint distribution Q(θ0,ω0) and marginal distributions H(θ0) and G(ω0), respectively. A random matrix Y of size p×n is said to have a scale and shape mixtures of matrix variate extended skew normal distributions with location matrix MRp×n, a p×p positive definite matrix Σ, two n×n positive definite matrices Ω and Ψ and two vectors λRp and δRn, if

Some basic properties

In this section, we derive the distributions of linear and quadratic forms of SSMESN random matrices. We also study here the marginal distributions of SSMESN random matrices.

Selection and stochastic representations

In this section, we discuss the selection and stochastic representations of a SSMESN distributions. We start by adopting the concept of selection distribution presented in Arellano-Valle et al. [3]. For instance, as was considered by Ning and Gupta [21], the matrix variate ESN distribution becomes a selection distribution. In fact, let xNn(0,Ω) and XNp×n(0,ΨIp) and suppose that they are independent. Consider the selection random matrix defined by Y=dX|(x<Xλ+δ). By Arellano-Valle et al. [3],

Matrix variate tail conditional expectation

In this section, assuming the existence of the required moments, we introduce the matrix variate tail conditional expectation (TCE) as a new risk measure and obtain it in an explicit closed form expression for the scale and shape mixtures of matrix variate extended skew normal distributions.

Maximum likelihood estimation

In this section, we use the EM algorithm, which was proposed by Dempster et al. [7], to find the maximum likelihood estimator (mle) of M, Σ and λ in SSMESN distributions when δ=0 and Ψ=Ω=In.

A simulation study

To evaluate the performance of the estimators considered in the previous section for varying samples sizes, we consider the matrix variate skew-t-normal distribution and generate random samples of size N=50,75 and 100, with 300 replications from the STNp×n(M,Σ,λ,ν,τ), where p=2, n=3, M=M11M12M13M21M22M23=10.50211.5,Σ=σ11σ12σ21σ22=0.75112.25, λ=(λ1,λ2)=(1,1) and ν=τ=2.

To generate a random sample Y1,,YN from YSTNp×n(M,Σ,λ,ν,τ), we consider the selection representation of the matrix variate

An example

In this section, we fit some distributions of the SSMESN family, namely, the matrix variate normal, t, skew normal, skew t and skew-t-normal distributions, to Dow-Jones dividends dataset. This dataset consists of sum of dividends of Dow-Jones Industrial Common Stocks (DJ dividends) and the Dow-Jones divisor (DJ divisor), which was adjusted to correct the price index for stock splits and stock dividends, for quarters from 1920–1934; see for more Rappaport and White [22]. The data are presented

Conclusions

In the paper, we have presented the definition of scale and shape mixtures of matrix variate extended skew normal distributions. In this definition, we considered the matrix variate extended skew normal distribution which has been introduced by Ning and Gupta [21]. Our definition, with n=1, leads to the definition of a generalization of scale and shape mixtures of multivariate skew normal distributions that is introduced by Arellano-Valle et al. [4]. We obtained the probability density

Acknowledgments

We thank the Editor, Associate Editor and three anonymous referees for the valuable suggestions and comments that improved this paper.

References (25)

  • DempsterA.P. et al.

    Maximum likelihood from incomplete data via the EM algorithm

    J. R. Stat. Soc. Ser. B Stat. Methodol.

    (1977)
  • DoğruF.Z. et al.

    Finite mixtures of matrix variate t distributions

    Gazi Univ. J. Sci.

    (2016)
  • View full text