A non-random data sampling method for classification model assessment | IEEE Conference Publication | IEEE Xplore

A non-random data sampling method for classification model assessment


Abstract:

Data sampling is a critical factor for building and evaluating the quality of classifiers, such as neural networks. Traditional techniques, such as k-fold cross validatio...Show More

Abstract:

Data sampling is a critical factor for building and evaluating the quality of classifiers, such as neural networks. Traditional techniques, such as k-fold cross validation, exhibit limitations when dealing with small data sets. This paper introduces an alternative method that splits the data into training and testing partitions, which have similar statistical characteristics. This method is compared with a traditional technique, using a relatively small dataset and several neural network classifiers. Results suggest that this new technique can reduce variability of predictive accuracies and provide consistent results across different classification models.
Date of Conference: 26-26 August 2004
Date Added to IEEE Xplore: 20 September 2004
Print ISBN:0-7695-2128-2
Print ISSN: 1051-4651
Conference Location: Cambridge, UK

Contact IEEE to Subscribe

References

References is not available for this document.