Abstract
Clustering methods provide an useful tool to tackle the problem of exploring large-dimensional data. However many common approaches suffer from being applied in high-dimensional spaces. Building on a dissimilarity-based representation of data, we propose a dimensionality reduction technique which preserves the clustering structure of the data. The technique is designed for cases in which data dimensionality is large compared to the number of available observations. In these cases, we represent data in the space of soft D-ranks, by applying the concept of fuzzy ranking. A clustering procedure is then applied. Experimental results show that the method is able to retain the necessary information, while considerably reducing dimensionality.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Dunn, J.C.: A fuzzy relative of the ISODATA process and its use in detecting compact wellseparated clusters. Journal of Cybernetics 3, 32–57 (1974)
Bezdek, J.C.: Pattern recognition with fuzzy objective function algorithms. Plenum, New York (1981)
Aggarwal, C.C., Yu, P.S.: Redefining clustering for high-dimensional applications. IEEE Transactions on Knowledge and Data Engineering 14, 210–225 (2002)
Beyer, K., Goldstein, J., Ramakrishnan, R., Shaft, U.: When is nearest neighbor meaningful? In: Beeri, C., Bruneman, P. (eds.) ICDT 1999, vol. 1540, pp. 217–235. Springer, Heidelberg (1998)
Shawe-Taylor, J., Cristianini, N.: Kernel Methods for Pattern Analysis. Cambridge University Press, Cambridge (2004)
Pȩkalska, E., Paclík, P., Duin, R.P.W.: A generalized kernel approach to dissimilarity-based classification. Journal of Machine Learning Research 2, 175–211 (2001)
Masulli, F., Rovetta, S.: Fuzzy variations in the training of vector quantizers. In: Proceedings of the 2003 International Workshop on Fuzzy Logics, Napoli, Italy (2003)
Bortolan, G., Degani, R.: A review of some methods for ranking fuzzy sets. Fuzzy Sets and Systems 15, 1–19 (1985)
Wang, W., Kerre, E.: Reasonable properties for the ordering of fuzzy quantities (I). Fuzzy Sets and Systems 118, 375–385 (2001)
Wang, W., Kerre, E.: Reasonable properties for the ordering of fuzzy quantities (II). Fuzzy Sets and Systems 118, 386–405 (2001)
Kaufman, L., Rousseeuw, P.J.: Finding Groups in Data. John Wiley & Sons, New York (1990)
Ihaka, R., Gentleman, R.: R: A language for data analysis and graphics. Journal of Computational and Graphical Statistics 5, 299–314 (1996)
Golub, T., Slonim, D., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J., Coller, H., Loh, M., Downing, J., Caligiuri, M., Bloomfield, C., Lander, E.: Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring. Science 286, 531–537 (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Rovetta, S., Masulli, F., Filippone, M. (2006). Soft Rank Clustering. In: Apolloni, B., Marinaro, M., Nicosia, G., Tagliaferri, R. (eds) Neural Nets. WIRN NAIS 2005 2005. Lecture Notes in Computer Science, vol 3931. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11731177_29
Download citation
DOI: https://doi.org/10.1007/11731177_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33183-4
Online ISBN: 978-3-540-33184-1
eBook Packages: Computer ScienceComputer Science (R0)