Abstract
Data analysis is often affected by different types of errors as: measurement errors, computation errors, imprecision related to the method adopted for estimating the data. The methods which have been proposed for treating errors in the data, may also be applied to different kinds of data that in real life are of interval type. The uncertainty in the data, which is strictly connected to the above errors, may be treated by considering, rather than a single value for each data, the interval of values in which it may fall: the interval data. The purpose of the present paper is to introduce methods for analyzing the interdependence and dependence among interval-valued variables. Statistical units described by interval-valued variables can be assumed as a special case of Symbolic Object (SO). In Symbolic Data Analysis (SDA), these data are represented as boxes. Accordingly, the purpose of the present work is the extension of the Principal Component Analysis to obtain a visualization of such boxes, on a lower dimensional space. Furthermore, a new method for fitting an interval simple linear regression equation is developed. With difference to other approaches proposed in the literature that work on scalar recoding of the intervals using classical tools of analysis, we make extensively use of the interval algebra tools combined with some optimization techniques.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
ALEFELD, G. and HERZERBERGER, J. (1983): Introduction to Interval computation. Academic Press, New York.
BILLARD, L. and DIDAY, E. (2000): Regression Analysis for Interval-Valued Data. In: H.-H. Bock and E. Diday (Eds.): Data Analysis, Classification and Related Methods. Springer, Berlin, 123–124.
BILLARD, L. and DIDAY, E. (2002): Symbolic regression Analysis. In: Proceedings IFCS. Krzysztof Jajuga et al (Eds.): Data Analysis, Classification and Clustering Methods Heidelberg. Springer-Verlag.
CANAL, L. and PEREIRA, M. (1998): In: Proceedings of the NTTS’98 Seminar: Towards statistical indices for numeroid data. Sorrento, Italy.
CAZES, P. CHOUAKRIA, A. DIDAY, E. and SCHEKTMAN, Y. (1997): Extension de l’analyse en composantes principales des donnes de type intervalle. Revue de Statistique Applique, XIV, 3, 5–24.
DEIF, A.S. (1991a): The Interval Eigenvalue Problem. ZAMM, 71, 1.61–64. Akademic-Verlag Berlin.
DEIF, A.S. (1991b): Singular Values of an Interval Matrix. Linear Algebra and its Applications, 151, 125–133.
GIOIA, F. (2001): Statistical Methods for Interval Variables, Ph.D. thesis. Dep. of Mathematics and Statistics-University Federico II Naples, in Italian.
GIOIA, F. and LAURO, C.N. (2005a): Basic Statistical Methods for Interval Data. Statistica Applicata, In press.
GIOIA, F. and LAURO, C.N. (2005b): Principal Component Analysis on Interval Data. Computational Statistics, special issue on: “Statistical Analysis of Interval Data”. In press.
LAURO, C.N. and PALUMBO, F. (2000): Principal component analysis of interval data: A symbolic data analysis approach. Computational Statistics, 15,1, 73–87.
MARINO, M. and PALUMBO, F. (2002): Interval arithmetic for the evaluation of imprecise data effects in least squares linear regression. Statistica Applicata, 14(3), 277–291.
MOORE, R.E. (1966): Interval Analysis. Prentice Hall, Englewood Cliffs, NJ.
PALUMBO, F. and LAURO, C.N. (2003): In: New developments in Psychometrics, Yanai H. et al. (Eds.): A PCA for interval valued data based on midpoints and radii. Psychometric Society, Springer-Verlag, Tokyo.
SEIF, N.P. HASHEM, S. and DEIF, A.S. (1992): Bounding the Eigenvectors for Symmetric Interval Matrices. ZAMM, 72, 233–236.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin · Heidelberg
About this paper
Cite this paper
Lauro, C., Gioia, F. (2006). Dependence and Interdependence Analysis for Interval-Valued Variables. In: Batagelj, V., Bock, HH., Ferligoj, A., Žiberna, A. (eds) Data Science and Classification. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg . https://doi.org/10.1007/3-540-34416-0_19
Download citation
DOI: https://doi.org/10.1007/3-540-34416-0_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34415-5
Online ISBN: 978-3-540-34416-2
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)