A Partitioning Method for Mixed Feature-Type Symbolic Data Using a Squared Euclidean Distance

Cardoso Rodrigues de Souza, Renata Maria; de Assis Tenorio de Carvalho, Francisco; Pizzato, Daniel F.

doi:10.1007/978-3-540-69912-5_20

A Partitioning Method for Mixed Feature-Type Symbolic Data Using a Squared Euclidean Distance

Renata Maria Cardoso Rodrigues de Souza¹,
Francisco de Assis Tenorio de Carvalho¹ &
Daniel F. Pizzato¹

Conference paper

668 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4314))

Abstract

A partitioning cluster method for mixed feature-type symbolic data is presented. This method needs a previous pre-processing step to transform Boolean symbolic data into modal symbolic data. The presented dynamic clustering algorithm has then as input a set of vectors of modal symbolic data (weight distributions) and furnishes a partition and a prototype to each class by optimizing an adequacy criterion based on a suitable squared Euclidean distance. To show the usefulness of this method, examples with synthetic symbolic data sets and applications with real symbolic data sets are considered.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bock, H.H.: Clustering algorithms and kohonen maps for symbolic data. J. Jpn. Soc. Comp. Statistic 15, 1–13 (2002), (Proc. ICNCB, Osaka, 203-215)
Google Scholar
Bock, H.H., Diday, E.: Analysis of Symbolic Data, Exploratory methods for extracting statistical information from complex data. Springer, Heidelberg (2000)
Google Scholar
Bobou, A., Ribeyre, F.: Mercury in the food web: accumulation and transfer mechanisms. In: Sigrel, A., Sigrel, H. (eds.) Metal Ions in Biological Systems, pp. 289–319. M. Dekker, New York (1998)
Google Scholar
Chavent, M., Lechevallier, Y.: Dynamical Clustering Algorithm of Interval Data: Optimization of an Adequacy Criterion Based on Hausdorff Distance. In: Sokolowski, A., Bock, H.-H. (eds.) Classification, Clustering and Data Analysis, pp. 53–59. Springer, Heidelberg (2002)
Google Scholar
Chavent, M., et al.: Trois nouvelles méthodes de classification automatique de données symboliques de type intervalle. Revue de Statistique Appliquée LI(4), 5–29 (2003)
Google Scholar
De Carvalho, F.A.T.: Histograms In Symbolic Data Analysis. Annals of Operations Research 55, 229–322 (1995)
Article Google Scholar
De Carvalho, F.A.T., Verde, R., Lechevallier, Y.: A dynamical clustering of symbolic objcts based on a context dependent proximity measure. In: Proceedings of the IX International Symposium on Applied Stochastic Models and Data analysis, Lisboa, Universidade de Lisboa, pp. 237–242 (1999)
Google Scholar
De Carvalho, F.A.T., et al.: Adaptive Hausdorff distances and dynamic clustering of symbolic data. Pattern Recognition Letters 27(3), 167–179 (2006)
Article Google Scholar
Diday, E., Brito, P.: Symbolic Cluster Analysis. In: Opitz, O. (ed.) Conceptual and Numerical Analysis of Data, pp. 45–84. Springer, Heidelberg (1989)
Google Scholar
Diday, E., Simon, J.J.: Clustering Analysis. In: Fu, K.S. (ed.) Digital Pattern Recognition, pp. 47–94. Springer, Heidelberg (1976)
Google Scholar
El-Sonbaty, Y., Ismail, M.A.: Fuzzy Clustering for Symbolic Data. IEEE Transactions on Fuzzy Systems 6, 195–204 (1998)
Article Google Scholar
Everitt, B.: Cluster Analysis. Halsted, New York (2001)
Google Scholar
Gordon, A.D.: Classification. Chapman and Hall/CRC, Boca Raton (1999)
MATH Google Scholar
Gordon, A.D.: An Iteractive Relocation Algorithm for Classifying Symbolic Data. In: Gaul, W., et al. (eds.) Data Analysis: Scientific Modeling and Practical Application, pp. 17–23. Springer, Heidelberg (2000)
Google Scholar
Hubert, L., Arabie, P.: Comparing Partitions. Journal of Classification 2, 193–218 (1985)
Article Google Scholar
Jain, A.K., Murty, M.N., Flynn, P.J.: Data Clustering: A Review. ACM Computing Surveys 31(3), 264–323 (1999)
Article Google Scholar
Ralambondrainy, H.: A conceptual version of the k-means algorithm. Pattern Recognition Letters 16, 1147–1157 (1995)
Article Google Scholar
Souza, R.M.C.R., De Carvalho, F.A.T.: Clustering of interval data based on city-block distances. Pattern Recognition Letters 25(3), 353–365 (2004)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Centro de Informatica - CIn / UFPE, Av. Prof. Luiz Freire, s/n - Cidade Universitaria, CEP: 50740-540 - Recife - PE, Brasil
Renata Maria Cardoso Rodrigues de Souza, Francisco de Assis Tenorio de Carvalho & Daniel F. Pizzato

Authors

Renata Maria Cardoso Rodrigues de Souza
View author publications
You can also search for this author in PubMed Google Scholar
Francisco de Assis Tenorio de Carvalho
View author publications
You can also search for this author in PubMed Google Scholar
Daniel F. Pizzato
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Christian Freksa Michael Kohlhase Kerstin Schill

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cardoso Rodrigues de Souza, R.M., de Assis Tenorio de Carvalho, F., Pizzato, D.F. (2007). A Partitioning Method for Mixed Feature-Type Symbolic Data Using a Squared Euclidean Distance. In: Freksa, C., Kohlhase, M., Schill, K. (eds) KI 2006: Advances in Artificial Intelligence. KI 2006. Lecture Notes in Computer Science(), vol 4314. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69912-5_20

Download citation

DOI: https://doi.org/10.1007/978-3-540-69912-5_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69911-8
Online ISBN: 978-3-540-69912-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics