Abstract
This paper presents DisClus, a distributed grid-density based satellite data clustering technique that can detect clusters of arbitrary shapes and sizes over high-resolution, multi-spectral satellite datasets. The quality of the clusters is further enhanced by a partitioning-based method that reassigns border pixels to the most relevant clusters. Experimental results are presented to establish the superiority of the technique in terms of scale-up, speedup, and cluster quality.
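The abstract names two ingredients: a grid-density clustering step and a partitioning-style reassignment of border pixels. The following is a minimal single-machine sketch of that general idea, not the DisClus algorithm itself: the abstract does not specify the grid construction, the density criterion, the reassignment rule, or how the work is distributed. The function name `grid_density_cluster` and the parameters `cell_size` and `min_density` are hypothetical, and nearest-centroid assignment is an assumed stand-in for the partitioning-based method.

```python
from collections import defaultdict, deque
from itertools import product
from math import dist


def grid_density_cluster(pixels, cell_size, min_density):
    """Toy grid-density clustering with centroid-based border reassignment.

    pixels: list of equal-length numeric tuples (one spectral vector per pixel).
    Returns one cluster label per pixel (-1 only if no dense cell exists).
    All parameter names are illustrative assumptions, not taken from the paper.
    """
    if not pixels:
        return []
    dims = len(pixels[0])

    # 1. Map every pixel to the grid cell that contains it.
    cells = defaultdict(list)
    for idx, p in enumerate(pixels):
        cells[tuple(int(v // cell_size) for v in p)].append(idx)

    # 2. A cell counts as dense when it holds at least min_density pixels.
    dense = {c for c, idxs in cells.items() if len(idxs) >= min_density}

    # 3. Merge neighbouring dense cells into clusters (breadth-first search).
    offsets = [o for o in product((-1, 0, 1), repeat=dims) if any(o)]
    cell_label = {}
    next_label = 0
    for start in sorted(dense):
        if start in cell_label:
            continue
        cell_label[start] = next_label
        queue = deque([start])
        while queue:
            cur = queue.popleft()
            for off in offsets:
                nb = tuple(c + o for c, o in zip(cur, off))
                if nb in dense and nb not in cell_label:
                    cell_label[nb] = next_label
                    queue.append(nb)
        next_label += 1

    # 4. Label pixels in dense cells and collect them per cluster.
    labels = [-1] * len(pixels)
    members = defaultdict(list)
    for cell, lab in cell_label.items():
        for idx in cells[cell]:
            labels[idx] = lab
            members[lab].append(pixels[idx])

    # 5. Assumed partitioning step: send each remaining (border) pixel
    #    to the cluster whose centroid is nearest in Euclidean distance.
    centroids = {
        lab: tuple(sum(axis) / len(pts) for axis in zip(*pts))
        for lab, pts in members.items()
    }
    for idx, p in enumerate(pixels):
        if labels[idx] == -1 and centroids:
            labels[idx] = min(centroids, key=lambda lab: dist(p, centroids[lab]))
    return labels
```

Calling `grid_density_cluster(pixels, cell_size=5, min_density=3)` on two-band pixel vectors groups the pixels that fall in connected dense cells and then attaches sparse-cell pixels to the closest group. In a distributed setting each node could run steps 1 to 3 on its own partition of the image, but how DisClus merges the partial results is not described in the abstract.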
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sarmah, S., Bhattacharyya, D.K. (2010). DisClus: A Distributed Clustering Technique over High Resolution Satellite Data. In: Kant, K., Pemmaraju, S.V., Sivalingam, K.M., Wu, J. (eds) Distributed Computing and Networking. ICDCN 2010. Lecture Notes in Computer Science, vol 5935. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11322-2_35
DOI: https://doi.org/10.1007/978-3-642-11322-2_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-11321-5
Online ISBN: 978-3-642-11322-2
eBook Packages: Computer Science, Computer Science (R0)