Loading [a11y]/accessibility-menu.js
On machine learning, ROC analysis, and statistical tests of significance | IEEE Conference Publication | IEEE Xplore

On machine learning, ROC analysis, and statistical tests of significance


Abstract:

Receiver operating characteristic (ROC) analysis is being used with greater frequency as an evaluation methodology in machine learning and pattern recognition. Researcher...Show More

Abstract:

Receiver operating characteristic (ROC) analysis is being used with greater frequency as an evaluation methodology in machine learning and pattern recognition. Researchers have used ANOVA to determine if the results from such analysis are statistically significant. Yet, in the medical decision making community, the prevailing method is LABMRMC. Although this latter method uses ANOVA, before doing so, it applies the Jackknife method to account for case-sample variance. To determine whether these two tests make the same decisions regarding statistical significance, we conducted a Monte Carlo simulation using several problems derived from Gaussian distributions, three machine-learning algorithms, ROC analysis, ANOVA, and LABMRMC. Results suggest that the decisions these tests make are not the same, even for simple problems. Furthermore, the larger issue is that since ANOVA does not account for case-sample variance, one cannot generalize experimental results to the population from which the data were drawn.
Date of Conference: 11-15 August 2002
Date Added to IEEE Xplore: 10 December 2002
Print ISBN:0-7695-1695-X
Print ISSN: 1051-4651
Conference Location: Quebec City, QC, Canada

Contact IEEE to Subscribe

References

References is not available for this document.