Abstract
The extension of exploratory use of real principal component analysis (RPCA) to complex data tables clears the deficiency existing in complex principal component analysis (CPCA), a method mainly developed in the statistical framework, but lacking effective interpretation tools. While often used in climatology, oceanography, and signal analysis among others, its use raises understanding difficulties, due to an intrinsic indeterminacy, which gets more tortuous its use. In this paper, a real framework in which CPCA may be embedded is proposed in order to solve its intricacies. This is obtained through the RPCA of a particular real table, derived from the complex at hand, whose double eigenvalues correspond to eigenplanes, that are proved to be both holomorphic and isoclinic. Relations existing between the two analyses lead to fix the intrinsic CPCA indetermination through a second RPCA, that optimizes the complex principal components inner structure. As a spin-off, appropriate interpretation aids derive, associated with statistics describing the structure of the clouds of units associated with the complex variables, which may get meaningful the issued graphical results. Eventually, CPCA is applied to a small wind speeds data table, to show both its use and the effectiveness of its interpretation aids, allowing an easier understanding of the CPCA’s abilities in the exploratory framework.










Similar content being viewed by others
Data Availability Statements
The original data had been downloaded from Internet as quoted; thus, they are public: their transformations are fully described. An R (R Core Team, 2013) package to run CPCA including these data is currently under development.
References
Autonne, L. (1913). Sur les matrices hypohermitiennes et les unitaires. Comptes rendus des séances hebdomadaires de l? Académie des sciences de Paris, 156, 858–860.
Camiz, S., & Creta, S. (2015). Principal component analysis of complex data and application to climatology. In F Mola C Conversano (Eds.) Cladag 2015 ? Book of Abstracts (pp. 428–431). Cagliari: CUEC Editrice.
Carroll, J.D. (1968). Generalization of canonical correlation analysis to three or more sets of variables. In Proceedings of the 76th annual convention of the American Psychological Association, (Vol. 3 pp. 227–228).
De Iaco, S., Posa, D., & Palma, M. (2013). Complex-valued random fields for vectorial data: Estimating and modeling aspects. Mathematical Geosciences, 45(5), 557–573.
De Iaco, S., & Posa, D. (2016). Wind velocity prediction through complex kriging: Formalism and computational aspects. Environmental and ecological statistics, 23(1), 115–139.
Dmitriev, E.A., & Myasnikov, V.V. (2018). Comparative study of description algorithms for complex-valued gradient fields of digital images using linear dimensionality reduction methods. Computer Optics, 42(5), 822–828.
Eckart, C., & Young, G. (1936). The approximation of one matrix by another of lower rank. Psychometrika, 1(3), 211–218.
Elliott, M.A., Walter, G.A., Swift, A., Vandenborne, K., Schotland, J.C., & Leigh, J.S. (1999). Spectral quantitation by principal component analysis using complex singular value decomposition. Magnetic Resonance in Medicine, 41(3), 450–455. An Official Journal of the International Society for Magnetic Resonance in Medicine.
Eriksson, J., & Koivunen, V. (2006). Complex random vectors and ICA models: Identifiability, uniqueness, and separability. IEEE Transactions on Information theory, 52(3), 1017–1029.
Grzebyk, M., & Wackernagel, H. (1994). Multivariate analysis and spatial/temporal scales: Real and complex models. In Proceedings of the XVIIth International Biometrics Conference, (Vol. 1, Citeseer pp. 19–33).
Gupta, R.P. (1972). Principal components analysis in the complex case. Metrika, 19(1), 150–155.
Halliwell, L.J. (2015). Complex random variables. In Casualty Actuarial Society E-Forum, Fall.
Hanson, B., Klink, K., Matsuura, K., Roberson, S.M., & Willmott, C.J. (1992). Vector correlation: Review, exposition, and geographic application. Annals of the Association of American Geographers, 82(1), 103–116.
Hellings, C., Gögler, P, & Utschick, W. (2015). Composite real principal component analysis of complex signals. In 23rd European Signal Processing Conference (EUSIPCO), 2015, IEEE (pp. 2216–2220).
Hellings, C., & Utschick, W. (2015). Block-skew-circulant matrices in complex-valued signal processing. IEEE Transactions on Signal Processing, 63(8), 2093–2107.
Horel, J.D. (1984). Complex principal component analysis: Theory and examples. Journal of Climate and Applied Meteorology, 23(12), 1660–1673.
Hotelling, H. (1933). Analysis of a complex of statistical variables into principal components. Journal of educational psychology, 24(6), 417.
Klink, K., & Willmott, C.J. (1989). Principal components of the surface wind field in the united states: A comparison of analyses based upon wind velocity, direction, and speed. International Journal of Climatology, 9(3), 293–308.
Lebart, L., Piron, M., & Morineau, A. (2006). Statistique exploratoire multidimensionnelle: visualisation et inférences en fouilles de données. Paris: Dunod.
Legler, D.M. (1983). Empirical orthogonal function analysis of wind vectors over the tropical pacific region. Bulletin of the American Meteorological Society, 64(3), 234–241.
Marshall, A.W., Olkin, I., & Arnold, B. (2011). Inequalities: Theory of majorization and its applications. Springer Science & Business Media.
Ollila, E. (2008). On the circularity of a complex random variable. IEEE Signal Processing Letters, 15, 841–844.
Pearson, K. (1901). On lines and planes of closest fit to systems of points in space. The London, Edinburgh, and Dublin philosophical magazine and journal of science, 2(11), [LIII], 559–572.
Picinbono, B. (1994). On circularity. IEEE Transactions on signal processing, 42(12), 3473–3482.
Posa, D. (2020). Parametric families for complex valued covariance functions: Some results, an overview and critical aspects. Spatial Statistics, 39, 1–20. https://doi.org/10.1016/j.spasta.2020.100473.
Posa, D. (2021). Models for the difference of continuous covariance functions. Stochastic Environmental Research and Risk Assessment, pp 1–18. https://doi.org/10.1007/s00477-020-01947-1.
Preisendörfer, R.W., & Mobley, C.D. (1988). Principal component analysis in meteorology and oceanography Vol. 425. Amsterdam: Elsevier.
Qiu, L., Zhang, Y., & Li, C-K (2005). Unitarily invariant metrics on the grassmann space. SIAM journal on matrix analysis and applications, 27 (2), 507–531.
R Core Team. (2013). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. http://www.R-project.org/.
Rizza, G.B. (2001). On the geometry of a pair of oriented planes. Rivista di Matematica dell’Università di Parma, 4(6), 217–228.
Scharnhorst, K. (2001). Angles in complex vector spaces. Acta Appl. Math., 69, 95–103.
Schäcke, K. (2004). On the kronecker product. Master’s Thesis, University of Waterloo.
Schreier, P.J., & Scharf, L.L. (2003). Second-order analysis of improper complex random vectors and processes. IEEE Transactions on Signal Processing, 51(3), 714–725.
Schreier, P.J., & Scharf, L.L. (2010). Statistical signal processing of complex-valued data: the theory of improper and noncircular signals. Cambridge (UK): Cambridge University Press.
Stewart, G. (1993). On the early history of the singular value decomposition. SIAM Review, 35(4), 551–566.
Trevitt, C., & MacKisack, M. (1989). Australian bureau of metheorology data. http://www.statsci.org/data/timeseri.html.
Wallace, J.M., & Dickinson, R.E. (1972). Empirical orthogonal representation of time series in the frequency domain. part I: Theoretical considerations. Journal of Applied Meteorology, 11(6), 887–892.
Wong, Y.-C. (1977). Linear geometry in euclidean 4-space. Southeast Asian Mathematical Society, 1.
Funding
For this work, the second author was partially supported by the Erasmus Plus agreement between Sapienza Università di Roma and the Université des Sciences et Technologies de Lille and by his Sapienza research grant 2016.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
The authors declare no competing interests.
Additional information
Ethical Conduct
No experimentation on living creatures had been carried out.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Denimal, JJ., Camiz, S. Complex Principal Component Analysis: Theory and Geometrical Aspects. J Classif 39, 376–408 (2022). https://doi.org/10.1007/s00357-022-09412-0
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00357-022-09412-0