Skip to main content
Log in

Complex Principal Component Analysis: Theory and Geometrical Aspects

  • Published:
Journal of Classification Aims and scope Submit manuscript

Abstract

The extension of exploratory use of real principal component analysis (RPCA) to complex data tables clears the deficiency existing in complex principal component analysis (CPCA), a method mainly developed in the statistical framework, but lacking effective interpretation tools. While often used in climatology, oceanography, and signal analysis among others, its use raises understanding difficulties, due to an intrinsic indeterminacy, which gets more tortuous its use. In this paper, a real framework in which CPCA may be embedded is proposed in order to solve its intricacies. This is obtained through the RPCA of a particular real table, derived from the complex at hand, whose double eigenvalues correspond to eigenplanes, that are proved to be both holomorphic and isoclinic. Relations existing between the two analyses lead to fix the intrinsic CPCA indetermination through a second RPCA, that optimizes the complex principal components inner structure. As a spin-off, appropriate interpretation aids derive, associated with statistics describing the structure of the clouds of units associated with the complex variables, which may get meaningful the issued graphical results. Eventually, CPCA is applied to a small wind speeds data table, to show both its use and the effectiveness of its interpretation aids, allowing an easier understanding of the CPCA’s abilities in the exploratory framework.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Data Availability Statements

The original data had been downloaded from Internet as quoted; thus, they are public: their transformations are fully described. An R (R Core Team, 2013) package to run CPCA including these data is currently under development.

References

  • Autonne, L. (1913). Sur les matrices hypohermitiennes et les unitaires. Comptes rendus des séances hebdomadaires de l? Académie des sciences de Paris, 156, 858–860.

    MATH  Google Scholar 

  • Camiz, S., & Creta, S. (2015). Principal component analysis of complex data and application to climatology. In F Mola C Conversano (Eds.) Cladag 2015 ? Book of Abstracts (pp. 428–431). Cagliari: CUEC Editrice.

  • Carroll, J.D. (1968). Generalization of canonical correlation analysis to three or more sets of variables. In Proceedings of the 76th annual convention of the American Psychological Association, (Vol. 3 pp. 227–228).

  • De Iaco, S., Posa, D., & Palma, M. (2013). Complex-valued random fields for vectorial data: Estimating and modeling aspects. Mathematical Geosciences, 45(5), 557–573.

    Article  MathSciNet  Google Scholar 

  • De Iaco, S., & Posa, D. (2016). Wind velocity prediction through complex kriging: Formalism and computational aspects. Environmental and ecological statistics, 23(1), 115–139.

    Article  MathSciNet  Google Scholar 

  • Dmitriev, E.A., & Myasnikov, V.V. (2018). Comparative study of description algorithms for complex-valued gradient fields of digital images using linear dimensionality reduction methods. Computer Optics, 42(5), 822–828.

    Article  Google Scholar 

  • Eckart, C., & Young, G. (1936). The approximation of one matrix by another of lower rank. Psychometrika, 1(3), 211–218.

    Article  Google Scholar 

  • Elliott, M.A., Walter, G.A., Swift, A., Vandenborne, K., Schotland, J.C., & Leigh, J.S. (1999). Spectral quantitation by principal component analysis using complex singular value decomposition. Magnetic Resonance in Medicine, 41(3), 450–455. An Official Journal of the International Society for Magnetic Resonance in Medicine.

    Article  Google Scholar 

  • Eriksson, J., & Koivunen, V. (2006). Complex random vectors and ICA models: Identifiability, uniqueness, and separability. IEEE Transactions on Information theory, 52(3), 1017–1029.

    Article  MathSciNet  Google Scholar 

  • Grzebyk, M., & Wackernagel, H. (1994). Multivariate analysis and spatial/temporal scales: Real and complex models. In Proceedings of the XVIIth International Biometrics Conference, (Vol. 1, Citeseer pp. 19–33).

  • Gupta, R.P. (1972). Principal components analysis in the complex case. Metrika, 19(1), 150–155.

    Article  MathSciNet  Google Scholar 

  • Halliwell, L.J. (2015). Complex random variables. In Casualty Actuarial Society E-Forum, Fall.

  • Hanson, B., Klink, K., Matsuura, K., Roberson, S.M., & Willmott, C.J. (1992). Vector correlation: Review, exposition, and geographic application. Annals of the Association of American Geographers, 82(1), 103–116.

    Article  Google Scholar 

  • Hellings, C., Gögler, P, & Utschick, W. (2015). Composite real principal component analysis of complex signals. In 23rd European Signal Processing Conference (EUSIPCO), 2015, IEEE (pp. 2216–2220).

  • Hellings, C., & Utschick, W. (2015). Block-skew-circulant matrices in complex-valued signal processing. IEEE Transactions on Signal Processing, 63(8), 2093–2107.

    Article  MathSciNet  Google Scholar 

  • Horel, J.D. (1984). Complex principal component analysis: Theory and examples. Journal of Climate and Applied Meteorology, 23(12), 1660–1673.

    Article  Google Scholar 

  • Hotelling, H. (1933). Analysis of a complex of statistical variables into principal components. Journal of educational psychology, 24(6), 417.

    Article  Google Scholar 

  • Klink, K., & Willmott, C.J. (1989). Principal components of the surface wind field in the united states: A comparison of analyses based upon wind velocity, direction, and speed. International Journal of Climatology, 9(3), 293–308.

    Article  Google Scholar 

  • Lebart, L., Piron, M., & Morineau, A. (2006). Statistique exploratoire multidimensionnelle: visualisation et inférences en fouilles de données. Paris: Dunod.

    Google Scholar 

  • Legler, D.M. (1983). Empirical orthogonal function analysis of wind vectors over the tropical pacific region. Bulletin of the American Meteorological Society, 64(3), 234–241.

    Article  Google Scholar 

  • Marshall, A.W., Olkin, I., & Arnold, B. (2011). Inequalities: Theory of majorization and its applications. Springer Science & Business Media.

  • Ollila, E. (2008). On the circularity of a complex random variable. IEEE Signal Processing Letters, 15, 841–844.

    Article  Google Scholar 

  • Pearson, K. (1901). On lines and planes of closest fit to systems of points in space. The London, Edinburgh, and Dublin philosophical magazine and journal of science, 2(11), [LIII], 559–572.

    Article  Google Scholar 

  • Picinbono, B. (1994). On circularity. IEEE Transactions on signal processing, 42(12), 3473–3482.

    Article  Google Scholar 

  • Posa, D. (2020). Parametric families for complex valued covariance functions: Some results, an overview and critical aspects. Spatial Statistics, 39, 1–20. https://doi.org/10.1016/j.spasta.2020.100473.

    Article  MathSciNet  Google Scholar 

  • Posa, D. (2021). Models for the difference of continuous covariance functions. Stochastic Environmental Research and Risk Assessment, pp 1–18. https://doi.org/10.1007/s00477-020-01947-1.

  • Preisendörfer, R.W., & Mobley, C.D. (1988). Principal component analysis in meteorology and oceanography Vol. 425. Amsterdam: Elsevier.

    Google Scholar 

  • Qiu, L., Zhang, Y., & Li, C-K (2005). Unitarily invariant metrics on the grassmann space. SIAM journal on matrix analysis and applications, 27 (2), 507–531.

    Article  MathSciNet  Google Scholar 

  • R Core Team. (2013). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. http://www.R-project.org/.

  • Rizza, G.B. (2001). On the geometry of a pair of oriented planes. Rivista di Matematica dell’Università di Parma, 4(6), 217–228.

    MathSciNet  MATH  Google Scholar 

  • Scharnhorst, K. (2001). Angles in complex vector spaces. Acta Appl. Math., 69, 95–103.

    Article  MathSciNet  Google Scholar 

  • Schäcke, K. (2004). On the kronecker product. Master’s Thesis, University of Waterloo.

  • Schreier, P.J., & Scharf, L.L. (2003). Second-order analysis of improper complex random vectors and processes. IEEE Transactions on Signal Processing, 51(3), 714–725.

    Article  MathSciNet  Google Scholar 

  • Schreier, P.J., & Scharf, L.L. (2010). Statistical signal processing of complex-valued data: the theory of improper and noncircular signals. Cambridge (UK): Cambridge University Press.

    Book  Google Scholar 

  • Stewart, G. (1993). On the early history of the singular value decomposition. SIAM Review, 35(4), 551–566.

    Article  MathSciNet  Google Scholar 

  • Trevitt, C., & MacKisack, M. (1989). Australian bureau of metheorology data. http://www.statsci.org/data/timeseri.html.

  • Wallace, J.M., & Dickinson, R.E. (1972). Empirical orthogonal representation of time series in the frequency domain. part I: Theoretical considerations. Journal of Applied Meteorology, 11(6), 887–892.

    Article  Google Scholar 

  • Wong, Y.-C. (1977). Linear geometry in euclidean 4-space. Southeast Asian Mathematical Society, 1.

Download references

Funding

For this work, the second author was partially supported by the Erasmus Plus agreement between Sapienza Università di Roma and the Université des Sciences et Technologies de Lille and by his Sapienza research grant 2016.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jean-Jacques Denimal.

Ethics declarations

Conflict of Interest

The authors declare no competing interests.

Additional information

Ethical Conduct

No experimentation on living creatures had been carried out.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Denimal, JJ., Camiz, S. Complex Principal Component Analysis: Theory and Geometrical Aspects. J Classif 39, 376–408 (2022). https://doi.org/10.1007/s00357-022-09412-0

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00357-022-09412-0

Keywords