Abstract
Dimensionality is an obstacle for many potentially powerful machine learning techniques. Widely approved and otherwise elegant methodologies exhibit relatively high complexity. This limits their applicability to real world applications. Friedman’s Multivariate Adaptive Regression Splines (MARS) is a function approximator that produces continuous models of multi-dimensional functions using recursive partitioning and multidimensional spline curves that are automatically adapted to the data. Despite this technique’s many strengths, it, too, suffers from the dimensionality problem. Each additional dimension of a hyperplane requires the addition of one dimension to the approximation model, and an increase in the time and space required to compute and store the splines. Rough set theory can reduce dataset dimensionality as a preprocessing step to training a learning system. This paper investigates the applicability of the Rough Set Attribute Reduction (RSAR) technique to MARS in an effort to simplify the models produced by the latter and decrease their complexity. The paper describes the techniques in question and discusses how RSAR can be integrated with MARS. The integrated system is tested by modelling the impact of pollution on communities of several species of river algae. These experimental results help draw conclusions on the relative success of the integration effort.
Preview
Unable to display preview. Download preview PDF.
References
Bartels, R., Beatty, J. and Barsky, B. Splines for Use in Computer Graphics and Geometric Modelling. Morgan Kaufmann (1987).
Chouchoulas, A. and Shen, Q. Rough Set-Aided Rule Induction for Plant Monitoring. Proceedings of the 1998 International Joint Conference on Information Science (JCIS'98), 2 (1998) 316–319.
ERUDIT, European Network for Fuzzy Logic and Uncertainty Modelling in Information Technology. Protecting Rivers and Streams by Monitoring Chemical Concentrations and Algae Communities (Third International Competition) http://www.erudit.de/erudit/activities/ic-99/problem.htm (1999).
Foley, J. D., van Dam, A., Feiner, S. K., Hughes, J. F. and Philips, R. L. Introduction to Computer Graphics. Addison-Wesley (1990).
Friedman, J. H. Multivariate Adaptive Regression Splines. Annals of Statistics, 19 (1991) 1–67.
Pawlak, Z. Rough Sets: Theoretical Aspects of Reasoning About Data. Kluwer Academic Publishers, Dordrecht (1991).
Shen, Q. and Chouchoulas, A. A modular approach to generating fuzzy rules with reduced attributes for the monitoring of complex systems. Engineering Applications of Artificial Intelligence 13 (2000) 263–278.
Shen, Q. and Chouchoulas, A. Combining Rough Sets and Data-Driven Fuzzy Learning. Pattern Recognition, 32 (1999) 2073–2076.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chouchoulas, A., Shen, Q. (2001). Rough Set-Based Dimensionality Reduction for Multivariate Adaptive Regression Splines. In: Ziarko, W., Yao, Y. (eds) Rough Sets and Current Trends in Computing. RSCTC 2000. Lecture Notes in Computer Science(), vol 2005. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45554-X_17
Download citation
DOI: https://doi.org/10.1007/3-540-45554-X_17
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43074-2
Online ISBN: 978-3-540-45554-7
eBook Packages: Springer Book Archive