Abstract
System identification is an abductive task that is affected by several kinds of modeling assumptions and measurement errors. Therefore, instead of optimizing parameter values within a single behavior model, system identification is supported by multi-model reasoning strategies. The objective of this work is to develop a data mining algorithm that combines principal component analysis and k-means clustering to better understand spaces of candidate models. One goal is to improve views of model-space topologies. The presence of clusters of models that share the same characteristics, and thereby define model classes, is an example of useful topological information. Distance metrics add knowledge about cluster dissimilarity. Engineers are thus better able to make decisions for system identification and for downstream tasks such as further measurement, preventative maintenance and structural replacement.
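The abstract does not give the algorithm itself, so the following is only a minimal sketch of the general idea: project candidate models onto their leading principal components, cluster them with k-means, and use the distance between cluster centroids as a dissimilarity measure. The function names (`pca_project`, `kmeans`), the synthetic three-parameter candidate models, and the farthest-point initialization are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np

def pca_project(X, n_components=2):
    """Project the rows of X onto the leading principal components."""
    Xc = X - X.mean(axis=0)                      # center each parameter
    # SVD of the centered data: rows of Vt are the principal directions
    _, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    explained = S**2 / np.sum(S**2)              # variance ratio per component
    return Xc @ Vt[:n_components].T, explained[:n_components]

def kmeans(Z, k, n_iter=100):
    """Plain Lloyd-style k-means with farthest-point initialization
    (an assumption, chosen here so the sketch is deterministic)."""
    centers = [Z[0]]
    for _ in range(1, k):
        # distance of every point to its nearest already-chosen center
        d = np.min([np.linalg.norm(Z - c, axis=1) for c in centers], axis=0)
        centers.append(Z[d.argmax()])
    centers = np.array(centers)
    for _ in range(n_iter):
        # assign each point to its nearest center
        dist = np.linalg.norm(Z[:, None, :] - centers[None, :, :], axis=2)
        labels = dist.argmin(axis=1)
        # recompute centers; keep the old center if a cluster is empty
        new_centers = np.array([
            Z[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels, centers

# Hypothetical candidate models: 40 models, 3 parameters each,
# drawn from two well-separated groups (synthetic data, not from the paper).
rng = np.random.default_rng(0)
group_a = rng.normal([1.0, 1.0, 1.0], 0.1, size=(20, 3))
group_b = rng.normal([5.0, 5.0, 5.0], 0.1, size=(20, 3))
X = np.vstack([group_a, group_b])

Z, explained = pca_project(X, n_components=2)    # reduced model space
labels, centers = kmeans(Z, k=2)                 # model classes
centroid_distance = np.linalg.norm(centers[0] - centers[1])

print("explained variance ratio:", explained)
print("cluster sizes:", np.bincount(labels))
print("distance between cluster centroids:", centroid_distance)
```

In this sketch, the cluster labels play the role of model classes and the centroid distance is one possible dissimilarity measure; other metrics or another number of retained components could be substituted.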
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Saitta, S., Raphael, B., Smith, I.F.C. (2006). Combining Two Data Mining Methods for System Identification. In: Smith, I.F.C. (eds) Intelligent Computing in Engineering and Architecture. EG-ICE 2006. Lecture Notes in Computer Science, vol 4200. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11888598_54
DOI: https://doi.org/10.1007/11888598_54
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-46246-0
Online ISBN: 978-3-540-46247-7
eBook Packages: Computer Science, Computer Science (R0)