Abstract
Uncertain data exist in many application fields, and there are numerous recent efforts in processing uncertain data to get more reliable results, especially uncertainty processing in clustering method. However, it is one of the urgent challenges to discover clusters with specific shape features. So we present a clustering method for data with uncertainties, and it is called multivariate wavelet principal component analysis-improved clustering using references and density (MWPCA-ICURD), which utilizes feature extraction and density-based clustering. To cluster uncertain data with specific shape original features, the original features are extracted by MWPCA method, which combines digital wavelet decomposition and principal component analysis organically. Then, a density-based clustering method ICURD is explored to discover specific shape clusters. Experimental results illustrate its validation and feasibility.









Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Abbreviations
- CURD:
-
Clustering using references and density
- ICURD:
-
Improved clustering using references and density
- WSNs:
-
Wireless sensor networks
- DBSCAN:
-
Density-based spatial clustering of applications with noise
- MWPCA:
-
Multivariate wavelet principal component analysis
- PDF:
-
Probability density function
- PCA:
-
Principal component analysis
- MSPCA:
-
Multiscale wavelet principal component analysis
- MSPCA-DBSCAN:
-
Multiscale wavelet principal component analysis-density-based spatial clustering of applications with noise
References
Luo QH, Peng Y, Peng XY, Saddik AE (2014) Uncertain data clustering-based distance estimation in wireless sensor networks. Sensors 14(4):6584–6605
Luo QH, Yan XZ, Li JB, Peng Y (2014) DDEUDSC: dynamic distance estimation using uncertain data stream clustering in mobile wireless sensor networks. Measurement 55:423–433
Ren SJ (2006) Interval number-based uncertain data mining and its applications, ZheJiang University, Ph.D. thesis, HangZhou, China
Qiu ZP (1994) The analysis method of static responses for uncertain parameter structural and the problem of eigenvalue, Jilin University, Ph.D. thesis, Jilin, pp 24–25
Wang T, Wu JY, Xu H, Zhu JH (2013) A cross unequal clustering routing algorithm for sensor network. Meas Sci Rev 13(4):200–205
Hichem F (2002) SyMP: an efficient clustering approach to identify clusters of arbitrary shapes in large data sets. In: Proceedings of KDD’2002, pp 507–512. doi:10.1145/775047.775121
Ester M, Kriegel HP, Sander J, Xu X (1996) A density based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the 2nd international conference on knowledge discovery and data mining, pp 226–231
Kriegel HP, Martin P (2005) Density-based clustering of uncertain data. In: Proceedings of KDD 2005, pp 672–677. doi:10.1145/1081870.1081955
Xu HJ, Li GH (2008) Density-based probabilistic clustering of uncertain data. In: Proceedings of international conference on computer science and software engineering, pp 474–477. doi:10.1109/CSSE.2008.968
Dirk H, Peter BV, Ralf D, Clemens U (2007) Error-aware density-based clustering of imprecise measurement values. In: Proceedings of seventh IEEE international conference on data mining (ICDM 2007), pp 471–476. doi:10.1109/ICDMW.2007.88
Apinya T, Songrit M (2010) U-DBSCAN: a density-based clustering algorithm for uncertain objects. In: Proceedings of 25th international conference on data engineering (ICDE 2010), pp 136–143
Feizollahi MJ, Modarres M, Peter BV (2012) The robust deviation redundancy allocation problem with interval component reliabilities. IEEE Trans Reliab 16:957–965
Lu CC (2015) Robust data envelopment analysis approaches for evaluating algorithmic performance. Comput Ind Eng 81:78–89
Yan XZ, Luo QH, Tang YM, Wang JQ, Yu RQ (2014) MSP-DBSCAN: an uncertain data clustering algorithm using multiscale wavelet transform PCA. J Comput Inf Syst 9:9919–9926
Elise M, Caroline T, Ca H, Patrick D (2010) Multivariate denoising methods combining wavelets and principal component analysis for mass spectrometry data. Proteomics 10:2564–2572
Mina A, Nathalie C, Poggi JM (2006) Multivariate denoising using wavelets and principal component analysis. Comput Stat Data Anal 50:2381–2398
Shuai MA, Wang TJ, Tang SW et al (2003) A fast clustering algorithm based on reference and density. J Softw 14:1089–1095
Gullo F, Giovanni P, and Andrea T (2008) Clustering uncertain data via k-medoids. In: Proceedings of 2nd international conference on scalable uncertainty management (SUM 2008), pp 229–242. doi:10.1007/978-3-540-87993-0_19
Luo QH, Peng Y, Peng XY (2013) Multiple dimensional uncertain data stream clustering algorithm. Chin J Sci Instrum 34(6):1330–1338
Wang T, Zhang YH, Cui Y, Zhou YC (2014) A novel protocol of energy-constrained sensor network for emergency monitoring. Int J Sens Netw 15(3):171–182
Acknowledgments
The research presented in this paper is supported by National Natural Science Foundation of China (61102038), Natural Science Foundation of Shandong Province of China (ZR2015FM027), Space support technology fund Projects (2014-HT-HGD5), Research Fund of Harbin institute of technology (WeiHai) [HIT (WH) 201306, HIT (WH) 201307], Guangxi Key Laboratory of Automatic Detecting Technology and Instruments (YQ14205,YQ15203), Natural Scientific Research Innovation Foundation in Harbin Institute of Technology (HIT.NSRIF.2015122) and State Key Laboratory of Geo-information Engineering (SKLGIE2014-M-2-4).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
The authors declare no conflict of interest.
Rights and permissions
About this article
Cite this article
Luo, Q., Peng, Y., Li, J. et al. MWPCA-ICURD: density-based clustering method discovering specific shape original features. Neural Comput & Applic 28, 2545–2556 (2017). https://doi.org/10.1007/s00521-016-2208-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-016-2208-9