Abstract
Distance-based techniques for mining uncertain data need a way to evaluate the distance between uncertain objects. An uncertain object can be described by uncertain numerical or categorical attributes. However, many uncertain data mining algorithms mainly address the distance between uncertain numerical objects. This paper presents an efficient method for evaluating the distance between uncertain categorical objects and applies it to nearest-neighbor classification. Experiments on datasets derived from UCI datasets and on the plant dataset of “Three Parallel Rivers of Yunnan Protected Areas” verify that the method is efficient.
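To make the setting concrete, here is a minimal sketch of one way such a distance could be computed. The abstract does not state the paper's actual measure, so everything here is an assumption for illustration: each uncertain categorical attribute is modeled as a probability distribution over its categories, attributes are treated as independent, and the distance is the expected number of mismatching attributes (an expected Hamming distance). The names `expected_distance` and `nearest_neighbor_label` are hypothetical and do not come from the paper.

```python
from typing import Dict, List, Tuple

# Assumed representation (not from the paper): an uncertain categorical
# attribute is a mapping from category to probability, summing to 1.
Distribution = Dict[str, float]
UncertainObject = List[Distribution]


def mismatch_probability(p: Distribution, q: Distribution) -> float:
    """Probability that independent draws from p and q give different categories."""
    match = sum(prob * q.get(category, 0.0) for category, prob in p.items())
    return 1.0 - match


def expected_distance(a: UncertainObject, b: UncertainObject) -> float:
    """Expected number of attributes on which a and b disagree."""
    if len(a) != len(b):
        raise ValueError("objects must have the same number of attributes")
    return sum(mismatch_probability(p, q) for p, q in zip(a, b))


def nearest_neighbor_label(
    query: UncertainObject,
    training: List[Tuple[UncertainObject, str]],
) -> str:
    """Return the label of the training object closest to the query."""
    _, label = min(training, key=lambda pair: expected_distance(query, pair[0]))
    return label


# Hypothetical toy data: two attributes (color, growth form).
query = [{"red": 0.5, "blue": 0.5}, {"tree": 1.0}]
training = [
    ([{"red": 1.0}, {"shrub": 1.0}], "class_a"),
    ([{"blue": 0.8, "red": 0.2}, {"tree": 1.0}], "class_b"),
]
predicted = nearest_neighbor_label(query, training)
```

In this toy example the query differs from the first training object in 1.5 attributes on expectation and from the second in 0.5, so the second object's label is returned. A certain (non-probabilistic) object is the special case in which each distribution puts probability 1 on a single category, where the measure reduces to the ordinary Hamming distance.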
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chen, H., Wang, L., Liu, W., Xiao, Q. (2010). Evaluating the Distance between Two Uncertain Categorical Objects. In: Cao, L., Zhong, J., Feng, Y. (eds) Advanced Data Mining and Applications. ADMA 2010. Lecture Notes in Computer Science, vol 6441. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17313-4_12
DOI: https://doi.org/10.1007/978-3-642-17313-4_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17312-7
Online ISBN: 978-3-642-17313-4
eBook Packages: Computer Science (R0)