ABSTRACT
The basic idea of traditional density estimation is to model the overall point density analytically as the sum of influence functions of data points. However, traditional density estimation techniques only consider the location of a point. Supervised density estimation techniques, on the other hand, additionally consider a variable of interest that is associated with a point. Density in supervised density estimation is measured as the product of an influence function with the variable of interest. Based on this novel idea, a supervised density-based clustering named SCDE is introduced and discussed in detail. The SCDE algorithm forms clusters by associating data points with supervised density attractors which represent maxima and minima of a supervised density function.
- Clifford, S. 1993. A model for the hydrological and climatic behavior of water on mars. Journal of Geophysical Research, Vol. 98, No. E6, 1993, 10973--11016.Google ScholarCross Ref
- Eick C., Vaezian B., Jiang, D. and Wang, J. 2006. Discovery of Interesting Regions in Spatial Datasets Using Supervised Clustering. In Proceedings of the 10th European Conference on Principles and Practice of Knowledge Discovery in Databases, (Berlin, Germany, September 2006).Google Scholar
- Ester, M., Kriegel, H., Sander, J., and Xu, X. 1996. A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In Proceedings of 2nd International Conference on Knowledge Discovery and Data Mining (Portland, Oregon, August 1996). 226--231.Google Scholar
- Hinneburg, A. and Keim, D. A. 1998. An Efficient Approach to Clustering in Large Multimedia Databases with Noise. In Proceedings of the 4th International Conference on Knowledge Discovery and Data Mining, (New York City, August 1998). 58--65.Google Scholar
- Hinneburg, A. and Gabriel, H., 2007. Denclue 2.0: Fast Clustering based on Kernel Density Estimation. In Proceedings of the 7th International Symposium on Intelligent Data Analysis (Ljubljana, Slovenoja, September 2007). Google ScholarDigital Library
- Kaufman, L. and Rousseeuw, P. J. 2000. Finding groups in data: An introduction to cluster analysis, John Wiley and Sons, New Jersey, USA, 2000.Google Scholar
- Kulldorff, M. 1997. A spatial scan statistic, Communications in Statistics: Theory and Methods, Vol. 26, No.6, 1997, 1481--1496.Google ScholarCross Ref
- Levine, N. 2007. CrimeStat III: A Spatial Statistics Program for the Analysis of Crime Incident Locations (v 3.1), Ned Levine & Associates, Houston, TX, and the National Institute of Justice, Washington, DC., March 2007.Google Scholar
- Murray, A. T. and Estivill-Castro, V. 1998. Cluster discovery techniques for exploratory spatial data analysis, International Journal of Geographical Information Science, Vol. 12, No. 5, 1998, 431--443.Google ScholarCross Ref
- Sander, J., Ester, M., Kriegel, H. P., and Xu, X., 1998. Density-Based Clustering in Spatial Databases: The Algorithm GDBSCAN and its Applications. Data Mining and Knowledge Discovery, Kluwer Academic Publishers, Vol. 2, No. 2, 1998, 169--194. Google ScholarDigital Library
- Silverman, B. 1986. Density Estimation for Statistics and Data Analysis. Chapman & Hall, London, UK, 1986.Google ScholarCross Ref
- Tay, S. C., Hsu, W., and Lim, K. H. 2003. Spatial data mining: Clustering of hot spots and pattern recognition. In International Geoscience & Remote Sensing Symposium, (Toulouse France, July 2003).Google Scholar
- Williams, G. J. 1999. Evolutionary hot spots data mining -- an architecture for exploring for interesting discoveries. In Proceedings of the 3rd Pacific-Asia Conference on Methodologies for Knowledge Discovery and Data Mining, (London, UK, 1999). 184--193. Google ScholarDigital Library
Index Terms
- On supervised density estimation techniques and their application to spatial data mining
Recommendations
Mixture density estimation with group membership functions
The mixture density model has been extensively studied in the field of statistical pattern recognition. And the EM algorithm has been well known as a convenient and efficient tool to iteratively compute the maximum likelihood estimates of mixture model ...
Ensemble Gaussian mixture models for probability density estimation
Estimation of probability density functions (PDF) is a fundamental concept in statistics. This paper proposes an ensemble learning approach for density estimation using Gaussian mixture models (GMM). Ensemble learning is closely related to model ...
Density estimation for spherical data using nonparametric mixtures
AbstractNonparametric density estimation is studied for spherical data that may arise in many scientific and practical fields. In particular, nonparametric mixture models based on likelihood maximization are used. A nonparametric mixture has component ...
Comments