ABSTRACT
The paper presents an adaptive GP boosting ensemble method forthe classification of distributed homogeneous streaming data that comes from multiple locations. The approach is able to handle concept drift via change detection by employing a change detection strategy, based on self-similarity of the ensemble behavior, and measured by its fractal dimension. It is efficient since each nodeof the network works with its local streaming data, and communicate only the local model computed with the otherpeer-nodes. Furthermore, once the ensemble has been built, it isused to predict the class membership of new streams of data until concept drift is detected. Only in such a case the algorithm is executed to generate a new set of classifiers to update the current ensemble. Experimental results on a synthetic and reallife data set showed the validity of the approach in maintaining an accurate and up-to-date GP ensemble.
Index Terms
- StreamGP: tracking evolving GP ensembles in distributed data streams using fractal dimension
Recommendations
Improving cooperative GP ensemble with clustering and pruning for pattern classification
GECCO '06: Proceedings of the 8th annual conference on Genetic and evolutionary computationA boosting algorithm based on cellular genetic programming to build an ensemble of predictors is proposed. The method evolves a population of trees for a fixed number of rounds and, after each round, it chooses the predictors to include into the ...
Enhancing the classification accuracy by scatter-search-based ensemble approach
Data-mining algorithms have been used in many classification problems. Among them, the decision tree (DT), back-propagation network (BPN), and support vector machine (SVM) are popular and can be applied to various areas. Nevertheless, different problems ...
Genetic programming in classifying large-scale data: an ensemble method
Special issue: Soft computing data miningThis study demonstrated potential of genetic programming (GP) as a base classifier algorithm in building ensembles in the context of large-scale data classification. An ensemble built upon base classifiers that were trained with GP was found to ...
Comments