Using Classifier diversity to handle label noise | IEEE Conference Publication | IEEE Xplore

Using Classifier diversity to handle label noise


Abstract:

It is widely known in the machine learning community that class noise can be (and often is) detrimental to inducing a model of the data. Many current approaches use a sin...Show More

Abstract:

It is widely known in the machine learning community that class noise can be (and often is) detrimental to inducing a model of the data. Many current approaches use a single, often biased, measurement to determine if an instance is noisy. A biased measure may work well on certain data sets, but it can also be less effective on a broader set of data sets. In this paper, we conduct a large empirical evaluation of noise handling techniques; examining 12 noise handling techniques on a set of 54 data sets and 5 learning algorithms. The chosen set of noise handling techniques includes biased and ensembled approaches. Included in the investigation is the proposed noise identification using classifier diversity (NICD). NICD lessens the bias of the noise measure by selecting a diverse set of classifiers to determine which instances are noisy. We examine NICD as a technique for filtering, instance weighting, and selecting the base classifiers of a voting ensemble. We find that lessening the bias of the noise handling techniques significantly improves performance over a broad set of data sets.
Date of Conference: 12-17 July 2015
Date Added to IEEE Xplore: 01 October 2015
ISBN Information:

ISSN Information:

Conference Location: Killarney, Ireland

Contact IEEE to Subscribe

References

References is not available for this document.