poster

On supervised density estimation techniques and their application to spatial data mining

Authors:
Dan Jiang, University of Houston, Houston, TX
Christoph F. Eick, University of Houston, Houston, TX
Chun-sheng Chen, University of Houston, Houston, TX
GIS '07: Proceedings of the 15th annual ACM international symposium on Advances in geographic information systems, November 2007, Article No.: 65, Pages 1–4
https://doi.org/10.1145/1341012.1341089

Published: 07 November 2007

ABSTRACT

Traditional density estimation models the overall point density analytically as the sum of the influence functions of the data points, and it considers only the location of each point. Supervised density estimation techniques additionally consider a variable of interest that is associated with each point: density is measured as the product of an influence function and the variable of interest. Based on this novel idea, a supervised density-based clustering algorithm named SCDE is introduced and discussed in detail. SCDE forms clusters by associating data points with supervised density attractors, which represent the maxima and minima of a supervised density function.
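The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' SCDE implementation: the Gaussian influence function, the bandwidth `sigma`, the fixed-step gradient search, and the rule of climbing where the density is non-negative and descending where it is negative are all assumptions made for illustration, and every function name below is hypothetical.

```python
import math


def supervised_density(x, points, values, sigma=1.0):
    """Sum over all points of (variable of interest) * (influence at x).

    Assumption: a Gaussian influence function with bandwidth sigma.
    """
    total = 0.0
    for p, z in zip(points, values):
        d2 = sum((a - b) ** 2 for a, b in zip(x, p))
        total += z * math.exp(-d2 / (2.0 * sigma ** 2))
    return total


def density_gradient(x, points, values, sigma=1.0):
    """Gradient of supervised_density with respect to x."""
    grad = [0.0] * len(x)
    for p, z in zip(points, values):
        d2 = sum((a - b) ** 2 for a, b in zip(x, p))
        w = z * math.exp(-d2 / (2.0 * sigma ** 2)) / sigma ** 2
        for k in range(len(x)):
            grad[k] += w * (p[k] - x[k])
    return grad


def find_attractor(start, points, values, sigma=1.0, step=0.1,
                   max_iter=1000, tol=1e-8):
    """Follow the gradient from start to a nearby density extremum.

    Assumption: climb toward a maximum when the density at the start
    is non-negative, otherwise descend toward a minimum.
    """
    x = list(start)
    direction = 1.0 if supervised_density(x, points, values, sigma) >= 0 else -1.0
    for _ in range(max_iter):
        g = density_gradient(x, points, values, sigma)
        new_x = [xi + direction * step * gi for xi, gi in zip(x, g)]
        moved = math.sqrt(sum((a - b) ** 2 for a, b in zip(new_x, x)))
        x = new_x
        if moved < tol:
            break
    return x


# Hypothetical toy data: two points with a positive variable of interest
# and two points with a negative one, far apart from each other.
points = [(0.0, 0.0), (0.2, 0.0), (5.0, 5.0), (5.2, 5.0)]
values = [1.0, 1.0, -1.0, -1.0]

hot = find_attractor(points[0], points, values)
cold = find_attractor(points[2], points, values)
```

In this toy setting, points whose searches end at the same attractor would be placed in the same cluster, so the two positive points share a maximum and the two negative points share a minimum. A real implementation would also need a rule for merging nearly identical attractors and for handling points in regions where the density is close to zero.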

Index Terms

1. Information systems
  1. Information systems applications
    1. Data mining

Published in
GIS '07: Proceedings of the 15th annual ACM international symposium on Advances in geographic information systems
November 2007
439 pages
ISBN: 9781595939142
DOI: 10.1145/1341012
General Chairs: Hanan Samet (University of Maryland), Cyrus Shahabi (University of Southern California)
Program Chair: Markus Schneider (University of Florida)
Copyright © 2007 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
density estimation
density-based clustering
hot spot discovery
spatial data mining
Qualifiers
- poster
Acceptance Rates
Overall Acceptance Rate: 220 of 1,116 submissions, 20%
