Abstract
This paper uses fuzzy clustering to extend a previous prioritized sampling proposal. In many big data problems, modeling an individual object such as a large engineering plant can be a tedious process requiring up to a month of analysis. A solution is to model as many representative objects as possible to represent the entire population. A new object can then use a model (or combination of models) from previously analyzed objects that best matches its characteristics. Since the modeling process can continue indefinitely adding models over time, we prioritize the sampling based on the ability of objects to represent as many characteristics as possible. The approach is demonstrated with a large set of weather stations to create a ranked sample based on hourly and monthly variations of important weather parameters, such as temperature, solar radiation, wind speed, and humidity. The weather patterns are represented using a combination of average and rough patterns to capture the essence of the distribution. The weather stations are grouped using Fuzzy C-Means and the objects with the largest fuzzy memberships are used as the representatives of each cluster. The weather stations representing a combination of different clustering schemes are then ranked based on the number of weather patterns they represent.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Kluwer Academic Publishers, Norwell (1981)
Dunn, J.C.: A fuzzy relative of the isodata process and its use in detecting compact well-separated clusters. Cybern. Syst. 3(3), 32–57 (1973)
Lingras, P., Butz, C.J.: Rough support vector regression. Eur. J. Oper. Res. 206(2), 445–455 (2010)
Triff, M., Pavlovski, I., Liu, Z., Morgan, L.A., Lingras, P.: Clustering ensemble for prioritized sampling based on average and rough patterns. In: Proceedings of 23rd International Symposium on Methodologies for Intelligent Systems, Warsaw, Poland p. 530 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Triff, M., Pavlovski, I., Liu, Z., Morgan, LA., Lingras, P. (2018). Fuzzy Clustering Ensemble for Prioritized Sampling Based on Average and Rough Patterns. In: Mouhoub, M., Sadaoui, S., Ait Mohamed, O., Ali, M. (eds) Recent Trends and Future Technology in Applied Intelligence. IEA/AIE 2018. Lecture Notes in Computer Science(), vol 10868. Springer, Cham. https://doi.org/10.1007/978-3-319-92058-0_63
Download citation
DOI: https://doi.org/10.1007/978-3-319-92058-0_63
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-92057-3
Online ISBN: 978-3-319-92058-0
eBook Packages: Computer ScienceComputer Science (R0)