The Power of Sampling and Stacking for the PaKDD-2007 Cross-Selling Problem

doi:10.4018/jdwm.2008040104

Paulo J.L. Adeodato, Germano C. Vasconcelos, Adrian L. Arnaud, Rodrigo C.L.V. Cunha, Domingos S.M.P. Monteiro, Rosalvo F.O. Neto

Source Title: International Journal of Data Warehousing and Mining (IJDWM), 4(2)

Adeodato, P. J., Vasconcelos, G. C., Arnaud, A. L., Cunha, R. C., Monteiro, D. S., & Neto, R. F. (2008). The Power of Sampling and Stacking for the PaKDD-2007 Cross-Selling Problem. International Journal of Data Warehousing and Mining (IJDWM), 4(2), 22-31. http://doi.org/10.4018/jdwm.2008040104


Abstract

This article presents an efficient solution to the PAKDD-2007 Competition cross-selling problem. The solution is based on a thorough approach that involves the creation of new input variables, efficient data preparation and transformation, an adequate data sampling strategy, and a combination of two of the most robust modeling techniques. Because the target class contains very few examples, model robustness was pursued in two ways: taking the median score of the 11 models developed with an adapted version of 11-fold cross-validation, and combining two robust techniques, the MLP neural network and the n-tuple classifier, via stacking. Despite the problem's complexity, performance on the prediction data set (unlabeled samples), measured through KS2 and ROC curves, proved very effective, and the solution finished as first runner-up in the competition.
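
The two robustness ideas named in the abstract, a per-example median over several models' scores and evaluation by a Kolmogorov-Smirnov style separation measure, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `median_ensemble` and `ks_statistic` are hypothetical, the sketch assumes that "KS2" refers to the two-sample Kolmogorov-Smirnov statistic between the score distributions of the two classes, and the stacking of the MLP and n-tuple classifier is not shown.

```python
from statistics import median


def median_ensemble(scores_per_model):
    """Combine models by taking, for each example, the median score across models.

    scores_per_model: one list of scores per model, all of the same length
    (e.g. 11 lists, one per cross-validation model; an assumption here).
    """
    return [median(example_scores) for example_scores in zip(*scores_per_model)]


def ks_statistic(scores, labels):
    """Largest gap between the cumulative score distributions of the two classes.

    labels are 1 for the target class and 0 otherwise.
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("both classes must be present")

    ranked = sorted(zip(scores, labels))
    cum_pos = cum_neg = 0
    best = 0.0
    i = 0
    while i < len(ranked):
        j = i
        # Advance over all examples sharing the same score so ties are
        # counted together before the gap is measured.
        while j < len(ranked) and ranked[j][0] == ranked[i][0]:
            if ranked[j][1] == 1:
                cum_pos += 1
            else:
                cum_neg += 1
            j += 1
        best = max(best, abs(cum_pos / n_pos - cum_neg / n_neg))
        i = j
    return best


if __name__ == "__main__":
    # Toy scores from three hypothetical models for three examples.
    combined = median_ensemble([
        [0.1, 0.8, 0.3],
        [0.2, 0.9, 0.4],
        [0.9, 0.7, 0.2],
    ])
    print(combined)
    print(ks_statistic([0.1, 0.2, 0.8, 0.9], [0, 0, 1, 1]))
```

Using the median rather than the mean makes the combined score less sensitive to a single model that behaves badly on a given example, which is one plausible reason for preferring it when the target class is scarce.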
