Abstract
This paper develops a quantile hidden semi-Markov regression to jointly estimate multiple quantiles for the analysis of multivariate time series. The approach is based upon the Multivariate Asymmetric Laplace (MAL) distribution, which allows to model the quantiles of all univariate conditional distributions of a multivariate response simultaneously, incorporating the correlation structure among the outcomes. Unobserved serial heterogeneity across observations is modeled by introducing regime-dependent parameters that evolve according to a latent finite-state semi-Markov chain. Exploiting the hierarchical representation of the MAL, inference is carried out using an efficient Expectation-Maximization algorithm based on closed form updates for all model parameters, without parametric assumptions about the states’ sojourn distributions. The validity of the proposed methodology is analyzed both by a simulation study and through the empirical analysis of air pollutant concentrations in a small Italian city.






Similar content being viewed by others
Notes
The pdf of a GIG(p, a, b) distribution is defined as \(f_{GIG}(x; p,a,b)= \frac{\left( \frac{a}{b}\right) ^{p/2}}{2K_{p}(\sqrt{ab})} x^{p-1} e^{-\frac{1}{2}\left( ax +bx^{-1} \right) }\), with \(a>0\), \(b>0\) and \(p \in {{{\mathcal {R}}}}\).
References
Adam, T., Langrock, R., Weiß, C.H.: Penalized estimation of flexible hidden Markov models for time series of counts. Metron 77(2), 87–104 (2019)
Akaike, H.: Information theory and an extension of the maximum likelihood principle, in ‘Selected papers of Hirotugu Akaike’, Springer, pp. 199–213, (1998)
Barbu, V. S. and Limnios, N.: Semi-Markov chains and hidden semi-Markov models toward applications: their use in reliability and DNA analysis, Vol. 191, Springer Science & Business Media (2009)
Bartolucci, F., Farcomeni, A. and Pennoni, F.: Latent Markov models for longitudinal data, CRC Press, (2012)
Bassani, C., Vichi, F., Esposito, G., Montagnoli, M., Giusto, M., Ianniello, A.: Nitrogen dioxide reductions from satellite and surface observations during COVID-19 mitigation in Rome (Italy). Environmental Science and Pollution Research 28(18), 22981–23004 (2021)
Bernardi, M., Gayraud, G., Petrella, L., et al.: Bayesian tail risk interdependence using quantile regression. Bayesian Analysis 10(3), 553–603 (2015)
Biernacki, C., Celeux, G., Govaert, G.: Assessing a mixture model for clustering with the integrated completed likelihood. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(7), 719–725 (2000)
Browne, R.P., McNicholas, P.D.: A mixture of generalized hyperbolic distributions. Canadian Journal of Statistics 43(2), 176–198 (2015)
Bulla, J., Bulla, I.: Stylized facts of financial time series and hidden semi-Markov models. Computational Statistics & Data Analysis 51(4), 2192–2209 (2006)
Bulla, J., Bulla, I., Nenadić, O.: hsmm-an R package for analyzing hidden semi-Markov models. Computational Statistics & Data Analysis 54(3), 611–619 (2010)
Cappé, O., Moulines, E. and Rydén, T. (2006), Inference in hidden Markov models, Springer Science & Business Media
Charlier, I., Paindaveine, D., Saracco, J.: Multiple-output quantile regression through optimal quantization. Scandinavian Journal of Statistics 47(1), 250–278 (2020)
Chavas, J.-P.: On multivariate quantile regression analysis. Statistical Methods & Applications 27(3), 365–384 (2018)
Dannemann, J., Holzmann, H., Leister, A.: Semiparametric hidden Markov models: identifiability and estimation. Wiley Interdisciplinary Reviews: Computational Statistics 6(6), 418–425 (2014)
Dempster, A. P., Laird, NM and Rubin, DB.: ‘Maximum likelihood from incomplete data via the EM algorithm’, Journal of the Royal Statistical Society Series B (Methodological) pp. 1–38 (1977)
El Ghouch, A., Genton, M.G.: Local polynomial quantile regression with parametric features. J.Am. Stat. Assoc. 104(488), 1416–1429 (2009)
Engle, R.F., Manganelli, S.: CAViaR: conditional autoregressive value at risk by regression quantiles. J. Bus. Econ. Stat. 22(4), 367–381 (2004)
Ephraim, Y., Merhav, N.: Hidden Markov processes. IEEE Trans. Inf. Theor. 48(6), 1518–1569 (2002)
Farcomeni, A.: Quantile regression for longitudinal data based on latent Markov subject-specific parameters. Stat. Comput. 22(1), 141–152 (2012)
Geraci, M.: Modelling and estimation of nonlinear quantile regression with clustered data. Comput. Stat. Data Anal. 136, 30–46 (2019)
Guédon, Y.: Estimating hidden semi-Markov chains from discrete sequences. J. Comput. Graph. Stat. 12(3), 604–639 (2003)
Hamilton, J. D.:‘A new approach to the economic analysis of nonstationary time series and the business cycle’, Econometrica: Journal of the Econometric Society pp. 357–384, (1989)
Holzmann, H., Munk, A., Gneiting, T.: Identifiability of finite mixtures of elliptical distributions. Scandinav. J. Stat. 33(4), 753–763 (2006)
Hubert, L., Arabie, P.: Comparing partitions. J. Classif. 2(1), 193–218 (1985)
Koenker, R.: Quantile regression. Cambridge University Press (2005)
Koenker, R. and Bassett, G.:‘Regression Quantiles’, Econometrica: Journal of the Econometric Society 46(1), 33–50, (1978)
Koenker, R., Chernozhukov, V., He, X. and Peng, L.: Handbook of quantile regression, CRC press, (2017)
Kong, L. and Mizera, I.: ‘Quantile tomography: using quantiles with multivariate data’, Statistica Sinica pp. 1589–1610, (2012)
Kotz, S., Kozubowski, T. and Podgorski, K.:The Laplace distribution and generalizations: a revisit with applications to communications, economics, engineering, and finance, Springer Science & Business Media, (2012)
Langrock, R., Kneib, T., Sohn, A., DeRuiter, S.L.: Nonparametric inference in hidden Markov models using P-splines. Biometrics 71(2), 520–528 (2015)
Langrock, R., Zucchini, W.: Hidden Markov models with arbitrary state dwell-time distributions. Comput. Stat. Data Anal. 55(1), 715–724 (2011)
Leroux, B.G.: Maximum-likelihood estimation for hidden Markov models. Stochastic Process. Appl. 40(1), 127–143 (1992)
Levinson, S.E., Rabiner, L.R., Sondhi, M.M.: An introduction to the application of the theory of probabilistic functions of a Markov process to automatic speech recognition. Bell Syst. Tech. J. 62(4), 1035–1074 (1983)
Luo, Y., Lian, H., Tian, M.: Bayesian quantile regression for longitudinal data models. J. Stat. Comput. Simul. 82(11), 1635–1649 (2012)
MacDonald, I. L. and Zucchini, W.:Hidden Markov and other models for discrete-valued time series, Vol. 110, CRC Press (1997)
Marino, M.F., Tzavidis, N., Alfò, M.: Mixed hidden Markov quantile regression models for longitudinal data with possibly incomplete sequences. Stat. Methods Med. Res 27(7), 2231–2246 (2018)
Maruotti, A.: Mixed hidden Markov models for longitudinal data: an overview. Int. Stat. Rev. 79(3), 427–454 (2011)
Maruotti, A., Bulla, J., Lagona, F., Picone, M. and Martella, F.:‘Dynamic mixtures of factor analyzers to characterize multivariate air pollutant exposures’, The Annals of Applied Statistics pp. 1617–1648,(2017)
Maruotti, A., Petrella, L., Sposito, L.: Hidden semi-Markov-switching quantile regression for time series. Compu. Stati. Data Anal. 159, 107208 (2021)
Maruotti, A., Punzo, A.: Initialization of hidden Markov and semi-Markov models: a critical evaluation of several strategies. Int. Stat. Rev. 89(3), 447–480 (2021)
Maruotti, A., Punzo, A., Bagnato, L.: Hidden Markov and semi-Markov models with multivariate leptokurtic-normal components for robust modeling of daily returns series. J. Financ. Econom. 17(1), 91–117 (2019)
Merlo, L., Petrella, L., Raponi, V.: Forecasting VaR and ES using a joint quantile regression and its implications in portfolio allocation. J. Bank. Finan. 133, 106248 (2021)
Merlo, L., Petrella, L., Salvati, N., Tzavidis, N.: Marginal M-quantile regression for multivariate dependent data. Comput. Stat. Data Anal. 173, 107500 (2022)
Merlo, L., Petrella, L., Tzavidis, N.: ‘Quantile mixed hidden Markov models for multivariate longitudinal data: an application to children’s Strengths and Difficulties Questionnaire scores’, Journal of the Royal Statistical Society. Ser. C Appl. Stat. 71(2), 417–448 (2022)
O’Connell, J., Højsgaard, S., et al.: Hidden semi Markov models for multiple observation sequences: the mhsmm package for R. J. Stat. Softw. 39(4), 1–22 (2011)
Petrella, L., Raponi, V.: Joint estimation of conditional quantiles in multivariate linear regression models with an application to financial distress. J. Multivar. Anal. 173, 70–84 (2019)
Pohle, J., Adam, T., Beumer, L.T.: Flexible estimation of the state dwell-time distribution in hidden semi-Markov models. Comput. Stat. Data Anal. 172, 107479 (2022)
Pohle, J., Langrock, R., van Beest, F.M., Schmidt, N.M.: Selecting the number of states in hidden Markov models: pragmatic solutions illustrated using animal movement. J. Agric Biol. Environ. Stat. 22(3), 270–293 (2017)
Putaud, J.-P., Pozzoli, L., Pisoni, E., Martins Dos Santos, S., Lagler, F., Lanzani, G., Dal Santo, U., Colette, A.: Impacts of the COVID-19 lockdown on air pollution at regional and urban background sites in northern Italy. Atmosp. Chem. Phys. 21(10), 7597–7609 (2021)
Sansom, J. and Thomson, P. (2001), ‘Fitting hidden semi-Markov models to breakpoint rainfall data’, Journal of Applied Probability 38(A), 142–157
Schwarz, G., et al.: Estimating the dimension of a model. Anna. Stat. 6(2), 461–464 (1978)
Serfling, R.: Quantile functions for multivariate analysis: approaches and applications. Statist. Neerlandica 56(2), 214–232 (2002)
Stolfi, P., Bernardi, M. and Petrella, L.: ‘The sparse method of simulated quantiles: An application to portfolio optimization’, Statistica Neerlandica (2018),
Visser, I., Raijmakers, M.E., Molenaar, P.C.: Confidence intervals for hidden Markov model parameters. Br. J. Math. Statist. Psychol. 53(2), 317–327 (2000)
Ye, W., Zhu, Y., Wu, Y. and Miao, B.: ‘Markov regime-switching quantile regression models and financial contagion detection’, Insurance: Mathematics and Economics 67, 21–26,(2016)
Yu, K., Moyeed, R.A.: Bayesian quantile regression. Stat. Probab. Lett. 54(4), 437–447 (2001)
Yu, K., Zhang, J.: A three-parameter asymmetric Laplace distribution and its extension. Commun. Statist. Theory Methods 34(9–10), 1867–1879 (2005)
Yu, S.-Z.: Hidden Semi-Markov models: theory, algorithms and applications. Morgan Kaufmann (2015)
Zucchini, W., MacDonald, I. L. and Langrock, R.: Hidden Markov models for time series: an introduction using R, Chapman and Hall CRC (2016)
Acknowledgements
We would like to warmly thank the Associate Editor and two anonymous reviewers for their thoughtful comments and efforts towards improving our manuscript. This work was partially supported by the Finance Market Fund, Norway, project number 309218; “Statistical modelling and inference for (high-dimensional) financial data”.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Appendix
Appendix
Proof of Proposition 1
Firstly, we note that to ensure identifiability of the MAL density in (5) it suffices to apply Proposition 1 of Petrella and Raponi (2019). Secondly, for a general HSMM, identifiability has been proven up to label switching (Leroux 1992). Thus, to ensure identifiability, all one needs to prove is the identifiability of the marginal mixtures (Dannemann et al. 2014), which in our case are represented by the finite mixtures of MAL distributions. Based on the work of Holzmann et al. (2006), Browne and McNicholas (2015) prove identifiability of finite mixtures of multivariate generalized hyperbolic distributions. Since the MAL in (5) is a limiting case of the multivariate generalized hyperbolic distribution (see (3) and (4) in Browne and McNicholas (2015) with \(\lambda = 1, \psi = 2\) and \(\chi \rightarrow 0\)), model identifiability follows by applying Corollary 2 of Browne and McNicholas (2015).
Proof of Proposition 2
The E-step of the EM algorithm considers the conditional expectation of the complete log-likelihood function in (8) given the observed data and the current parameter estimates \(\hat{{\varvec{\Phi }}}^{(r-1)}_{\varvec{\tau }}\). At first, we recall that under the constraints imposed on \({\varvec{\tilde{\xi }}}\) and \(\varvec{\Lambda }\), the representation in (7) implies that:
This means that the joint density function of \(\mathbf{Y}\) and \({\tilde{C}}\) is:
By substituting (36) in (8) and taking the conditional expectation of the logarithm of (8), we obtain the expected complete log-likelihood function in (13).
To compute the conditional expectation of \({\tilde{c}}_{tk}\) and \({\tilde{z}}_{tk}\) in (13), \({\tilde{C}}\) is treated as an additional latent variable. Using the joint distribution of \(\mathbf{Y}\) and \({\tilde{C}}\) derived in (36) and the MAL density of \(\mathbf{Y}\) given in (5), we have that:
which corresponds to a Generalized Inverse Gaussian (GIG) distribution with parameters \(\nu , {2+{\tilde{d}}}, \tilde{m_i}\), i.e.Footnote 1
Then, it follows that
and
Denoting the two conditional expectations in (39) and (40) by \(\hat{{\tilde{c}}}\) and \(\hat{{\tilde{z}}}\) respectively, concludes the proof.
Proof of Proposition 3
Imposing the first order conditions on (13) with respect to each component of the set \({\varvec{\Phi }}_{\varvec{\tau }}\), gives the update estimates in (16), (17), (18) and (19). However, there is not closed formula solution to update the elements of the scale matrix \(\mathbf{D}_j\); hence, the M-step update requires using numerical optimization techniques to maximize (13). A considerable disadvantage of this procedure is the necessary high computational effort which could be very time-consuming. For this reason, we utilize a simpler estimator for the scale parameters \(\delta _{jk}, k = 1,\dots ,p\), which follows directly from the fact that all marginals of the MAL distribution are univariate AL distributions (see Yu and Zhang 2005):
where \(\hat{{\mu }}_{tjk}\) is the k-th element of the vector \(\hat{\varvec{\mu }}_{tj}\).
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Merlo, L., Maruotti, A., Petrella, L. et al. Quantile hidden semi-Markov models for multivariate time series. Stat Comput 32, 61 (2022). https://doi.org/10.1007/s11222-022-10130-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11222-022-10130-1