Skip to main content

A Partitioning Method for Mixed Feature-Type Symbolic Data Using a Squared Euclidean Distance

  • Conference paper
  • 668 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4314))

Abstract

A partitioning cluster method for mixed feature-type symbolic data is presented. This method needs a previous pre-processing step to transform Boolean symbolic data into modal symbolic data. The presented dynamic clustering algorithm has then as input a set of vectors of modal symbolic data (weight distributions) and furnishes a partition and a prototype to each class by optimizing an adequacy criterion based on a suitable squared Euclidean distance. To show the usefulness of this method, examples with synthetic symbolic data sets and applications with real symbolic data sets are considered.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bock, H.H.: Clustering algorithms and kohonen maps for symbolic data. J. Jpn. Soc. Comp. Statistic 15, 1–13 (2002), (Proc. ICNCB, Osaka, 203-215)

    Google Scholar 

  2. Bock, H.H., Diday, E.: Analysis of Symbolic Data, Exploratory methods for extracting statistical information from complex data. Springer, Heidelberg (2000)

    Google Scholar 

  3. Bobou, A., Ribeyre, F.: Mercury in the food web: accumulation and transfer mechanisms. In: Sigrel, A., Sigrel, H. (eds.) Metal Ions in Biological Systems, pp. 289–319. M. Dekker, New York (1998)

    Google Scholar 

  4. Chavent, M., Lechevallier, Y.: Dynamical Clustering Algorithm of Interval Data: Optimization of an Adequacy Criterion Based on Hausdorff Distance. In: Sokolowski, A., Bock, H.-H. (eds.) Classification, Clustering and Data Analysis, pp. 53–59. Springer, Heidelberg (2002)

    Google Scholar 

  5. Chavent, M., et al.: Trois nouvelles méthodes de classification automatique de données symboliques de type intervalle. Revue de Statistique Appliquée LI(4), 5–29 (2003)

    Google Scholar 

  6. De Carvalho, F.A.T.: Histograms In Symbolic Data Analysis. Annals of Operations Research 55, 229–322 (1995)

    Article  Google Scholar 

  7. De Carvalho, F.A.T., Verde, R., Lechevallier, Y.: A dynamical clustering of symbolic objcts based on a context dependent proximity measure. In: Proceedings of the IX International Symposium on Applied Stochastic Models and Data analysis, Lisboa, Universidade de Lisboa, pp. 237–242 (1999)

    Google Scholar 

  8. De Carvalho, F.A.T., et al.: Adaptive Hausdorff distances and dynamic clustering of symbolic data. Pattern Recognition Letters 27(3), 167–179 (2006)

    Article  Google Scholar 

  9. Diday, E., Brito, P.: Symbolic Cluster Analysis. In: Opitz, O. (ed.) Conceptual and Numerical Analysis of Data, pp. 45–84. Springer, Heidelberg (1989)

    Google Scholar 

  10. Diday, E., Simon, J.J.: Clustering Analysis. In: Fu, K.S. (ed.) Digital Pattern Recognition, pp. 47–94. Springer, Heidelberg (1976)

    Google Scholar 

  11. El-Sonbaty, Y., Ismail, M.A.: Fuzzy Clustering for Symbolic Data. IEEE Transactions on Fuzzy Systems 6, 195–204 (1998)

    Article  Google Scholar 

  12. Everitt, B.: Cluster Analysis. Halsted, New York (2001)

    Google Scholar 

  13. Gordon, A.D.: Classification. Chapman and Hall/CRC, Boca Raton (1999)

    MATH  Google Scholar 

  14. Gordon, A.D.: An Iteractive Relocation Algorithm for Classifying Symbolic Data. In: Gaul, W., et al. (eds.) Data Analysis: Scientific Modeling and Practical Application, pp. 17–23. Springer, Heidelberg (2000)

    Google Scholar 

  15. Hubert, L., Arabie, P.: Comparing Partitions. Journal of Classification 2, 193–218 (1985)

    Article  Google Scholar 

  16. Jain, A.K., Murty, M.N., Flynn, P.J.: Data Clustering: A Review. ACM Computing Surveys 31(3), 264–323 (1999)

    Article  Google Scholar 

  17. Ralambondrainy, H.: A conceptual version of the k-means algorithm. Pattern Recognition Letters 16, 1147–1157 (1995)

    Article  Google Scholar 

  18. Souza, R.M.C.R., De Carvalho, F.A.T.: Clustering of interval data based on city-block distances. Pattern Recognition Letters 25(3), 353–365 (2004)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Christian Freksa Michael Kohlhase Kerstin Schill

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Cardoso Rodrigues de Souza, R.M., de Assis Tenorio de Carvalho, F., Pizzato, D.F. (2007). A Partitioning Method for Mixed Feature-Type Symbolic Data Using a Squared Euclidean Distance. In: Freksa, C., Kohlhase, M., Schill, K. (eds) KI 2006: Advances in Artificial Intelligence. KI 2006. Lecture Notes in Computer Science(), vol 4314. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69912-5_20

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-69912-5_20

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-69911-8

  • Online ISBN: 978-3-540-69912-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics