Abstract
Over the last five decades, clustering has established itself as a primary unsupervised learning technique. In most major data mining projects clustering can serve as a first step in understanding the available data. Clustering is used for creating meaningful profiles of entities in an application. It can also be used to compress the dataset into more manageable granules. The initial methods of crisp clustering objects represented using numeric attributes have evolved to address the demands of the real-world. These extensions include the use of soft computing techniques such as fuzzy and rough set theory, the use of centroids and medoids for computational efficiency, modes to accommodate categorical attributes, dynamic and stream clustering for managing continuous accumulation of data, and meta-clustering for correlating parallel clustering processes. This paper uses applications in engineering, web usage, retail, finance, and social networks to illustrate some of the recent advances in clustering and their role in improved profiling, as well as augmenting prediction, classification, association mining, dimensionality reduction, and optimization tasks.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Ammar, A., Elouedi, Z., Lingras, P.: Decremental possibilistic k-modes. In: SCAI, pp. 15–24 (2013)
Ammar, A., Elouedi, Z., Lingras, P.: Incremental rough possibilistic k-modes. In: Ramanna, S., Lingras, P., Sombattheera, C., Krishna, A. (eds.) MIWAI 2013. LNCS (LNAI), vol. 8271, pp. 13–24. Springer, Heidelberg (2013). doi:10.1007/978-3-642-44949-9_2
Black, F., Scholes, M.: The pricing of options and corporate liabilities. J. Polit. Econ. 81, 637–654 (1973)
Dunn, J.C.: A fuzzy relative of the isodata process and its use in detecting compact well-separated clusters. J. Cybern. 3, 32–57 (1973)
Eagle, N.: The reality mining data (2010). http://eprom.mit.edu/data/RealityMining_ReadMe.pdf
Hartigan, J.A., Wong, M.A.: Algorithm as 136: a k-means clustering algorithm. J. R. Stat. Soc. Ser. C (Appl. Stat.) 28(1), 100–108 (1979). http://www.jstor.org/stable/2346830
Janusz, A., Ślezak, D.: Rough set methods for attribute clustering and selection. Appl. Artif. Intell. 28(3), 220–242 (2014)
Joshi, M., Lingras, P.: Evolutionary and iterative crisp and rough clustering ii: experiments. In: Chaudhury, S., Mitra, S., Murthy, C.A., Sastry, P.S., Pal, S.K. (eds.) PReMI 2009. LNCS, vol. 5909, pp. 621–627. Springer, Heidelberg (2009). doi:10.1007/978-3-642-11164-8_101
Lingras, P.: Unsupervised rough set classification using gas. J. Intell. Inf. Syst. 16(3), 215–228 (2001)
Lingras, P., Elagamy, A., Ammar, A., Elouedi, Z.: Iterative meta-clustering through granular hierarchy of supermarket customers and products. Inf. Sci. 257, 14–31 (2013)
Lingras, P., Haider, F.: Rough ensemble clustering. In: Intelligent Data Analysis, Special Issue on Business Analytics in Finance and Industry (2014)
Lingras, P., Haider, F., Triff, M.: Granular meta-clustering based on hierarchical, network, and temporal connections. Granular Comput. 1(1), 71–92 (2016)
Lingras, P., Triff, M.: Fuzzy and crisp recursive profiling of online reviewers and businesses. IEEE Trans. Fuzzy Syst. 23(4), 1242–1258 (2015)
Lingras, P., West, C.: Interval set clustering of web users with rough k-means. J. Intell. Inf. Syst. 23(1), 5–16 (2004)
Mitra, S.: An evolutionary rough partitive clustering. Pattern Recogn. Lett. 25(12), 1439–1449 (2004)
Peters, G.: Some refinements of rough k-means clustering. Pattern Recogn. 39(8), 1481–1491 (2006)
Peters, G., Crespo, F., Lingras, P., Weber, R.: Soft clustering-fuzzy and rough approaches and their extensions and derivatives. Int. J. Approximate Reasoning 54(2), 307–322 (2013)
Peters, G., Weber, R., Nowatzke, R.: Dynamic rough clustering and its applications. Appl. Soft Comput. 12(10), 3193–3207 (2012)
Sharma, S.C., Werner, A.: Improved method of grouping provincewide permanent traffic counters. Transp. Res. Rec. 815, 13–18 (1981)
Silva, J.A., Faria, E.R., Barros, R.C., Hruschka, E.R., de Carvalho, A.C., Gama, J.: Data stream clustering: a survey. ACM Comput. Surv. (CSUR) 46(1), 13 (2013)
Slezak, D.: Rough sets and few-objects-many-attributes problem: the case study of analysis of gene expression data sets. In: Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007, pp. 437–442. IEEE (2007)
Ślęzak, D., Kowalski, M.: Intelligent data granulation on load: improving infobright’s knowledge grid. In: Lee, Y., Kim, T., Fang, W., Ślęzak, D. (eds.) FGIT 2009. LNCS, vol. 5899, pp. 12–25. Springer, Heidelberg (2009). doi:10.1007/978-3-642-10509-8_3
Ślezak, D., Synak, P., Wojna, A., Wróblewski, J.: Two database related interpretations of rough approximations: data organization and query execution. Fundamenta Informaticae 127(1–4), 445–459 (2013)
Yao, Y., Lingras, P., Wang, R., Miao, D.: Interval set cluster analysis: a re-formulation. In: Sakai, H., Chakraborty, M.K., Hassanien, A.E., Ślęzak, D., Zhu, W. (eds.) RSFDGrC 2009. LNCS (LNAI), vol. 5908, pp. 398–405. Springer, Heidelberg (2009). doi:10.1007/978-3-642-10646-0_48
Yu, H., Su, T., Zeng, X.: A three-way decisions clustering algorithm for incomplete data. In: Miao, D., Pedrycz, W., Ślȩzak, D., Peters, G., Hu, Q., Wang, R. (eds.) RSKT 2014. LNCS (LNAI), vol. 8818, pp. 765–776. Springer, Heidelberg (2014). doi:10.1007/978-3-319-11740-9_70
Yu, H., Zhang, C., Wang, G.: A tree-based incremental overlapping clustering method using the three-way decision theory. Knowl.-Based Syst. 91, 189–203 (2016)
Zhang, P., Joshi, M., Lingras, P.: Use of stability and seasonality analysis for optimal inventory prediction models. J. Intell. Syst. 20(2), 147–166 (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Lingras, P., Triff, M. (2016). Advances in Rough and Soft Clustering: Meta-Clustering, Dynamic Clustering, Data-Stream Clustering. In: Flores, V., et al. Rough Sets. IJCRS 2016. Lecture Notes in Computer Science(), vol 9920. Springer, Cham. https://doi.org/10.1007/978-3-319-47160-0_1
Download citation
DOI: https://doi.org/10.1007/978-3-319-47160-0_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-47159-4
Online ISBN: 978-3-319-47160-0
eBook Packages: Computer ScienceComputer Science (R0)