Abstract
The use of real-valued distances between bit vectors is customary in clustering applications. However, there is another, rarely used, kind of distances on bit vector spaces: the autometrized Boolean-valued distances, taking values in the same Boolean algebra, instead of ℝ. In this paper we use the topological concept of closed ball to define density in regions of the bit vector space and then introduce two algorithms to compare these different sorts of distances. A few, initial experiments using public databases, are consistent with the hypothesis that Boolean distances can yield a better classification, but more experiments are necessary to confirm it.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Jain, A.K., Murty, M.P., Flynn, P.J.: Data clustering: A review. ACM Computing Surveys 31(3), 264–322 (1999)
Theodoridis, S., Koutroumbas, K.: Pattern Recognition. Academic Press, San Diego (1998)
Zhang, B., Srihari, S.N.: Properties of binary vector dissimilarity measures. Technical report, State University of New York at Buffalo (2003), http://ftp.cedar.buffalo.edu/papers/articles/CVPRIP03_propbina.pdf
Blumenthal, L.M.: Boolean geometry. I. Rend. Circ. Mat. Palermo II Ser. 1, 343–360 (1952)
Swamy, K.L.N.: A general theory of autometrized algebras. Mathematischen Annalen 157, 65–74 (1964)
Krasner, M.: Nombres semi-reels et espaces ultrametriques. Compte Rendus d’Academie des Sciences de Paris. Tome II 219, 433 (1944)
Ramal, R., Toulouse, G., Virasoro, M.: Ultrametricity for physicists. Reviews of Modern Physics 58(3), 765–788 (1986)
Zomorodian, A.J.: Topology for Computing. Cambridge University Press, Cambridge (2005)
Sikorski, R.: Boolean Algebras. Springer, Heidelberg (1969)
Monk, J.D.: A brief introduction to Boolean algebras (2004), http://www.colorado.edu/math/courses/monkd
Ramon, J.: Clustering and Instance Based Learning in First Order Logic. PhD thesis, Katholieke Universiteit Leuven, Belgium, Department of Computer Science (2002)
Sneath, P.H.A., Sokal, R.R.: Numerical Taxonomy: the principles and pratice of numerical classification. W.H. Freeman and Company, San Francisco (1974)
Moraglio, A., Poli, R.: Topological interpretation of crossover. In: Deb, K., et al. (eds.) GECCO 2004. LNCS, vol. 3102, pp. 1377–1388. Springer, Heidelberg (2004)
Blake, C., Merz, C.: UCI repository of machine learning databases. University of California, Irvine, Dept. of Information and Computer Sciences (1998)
Mali, K., Mitra, S.: Clustering and its validation in a symbolic framework. Pattern Recognition Letters 24(14), 2367–2376 (2003)
Guha, S., Rastogi, R., Shim, K.: ROCK: A robust clustering algorithm for categorical attributes. Information Systems 25(5), 345–366 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
González, C.G., Bonventi, W., Rodrigues, A.L.V. (2008). Density of Closed Balls in Real-Valued and Autometrized Boolean Spaces for Clustering Applications. In: Zaverucha, G., da Costa, A.L. (eds) Advances in Artificial Intelligence - SBIA 2008. SBIA 2008. Lecture Notes in Computer Science(), vol 5249. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88190-2_7
Download citation
DOI: https://doi.org/10.1007/978-3-540-88190-2_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88189-6
Online ISBN: 978-3-540-88190-2
eBook Packages: Computer ScienceComputer Science (R0)