Abstract:
Data sampling is a critical factor for building and evaluating the quality of classifiers, such as neural networks. Traditional techniques, such as k-fold cross validatio...Show MoreMetadata
Abstract:
Data sampling is a critical factor for building and evaluating the quality of classifiers, such as neural networks. Traditional techniques, such as k-fold cross validation, exhibit limitations when dealing with small data sets. This paper introduces an alternative method that splits the data into training and testing partitions, which have similar statistical characteristics. This method is compared with a traditional technique, using a relatively small dataset and several neural network classifiers. Results suggest that this new technique can reduce variability of predictive accuracies and provide consistent results across different classification models.
Published in: Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004.
Date of Conference: 26-26 August 2004
Date Added to IEEE Xplore: 20 September 2004
Print ISBN:0-7695-2128-2
Print ISSN: 1051-4651