Abstract
Active learning traditionally focuses on labeling the most informative instances for some well defined learning tasks with known class labels, and a labeler is provided to label each queried instance. In an extreme case, the whole active learning task may start without any available information about the tasks, for instance, no labeled data are available at the initial stage and the labeler is incapable of providing the ground truth to each queried instance. In this paper, we propose an active class discovery method for the case where no randomly labeled instances exist to kick-off the learning circle and the labeler only has weak knowledge to answer whether a pair of instances belong to the same class or not. To roughly identify the classes in the data, a Minimum Spanning Tree based query strategy is employed to discover a number of classes from unlabeled data. Experiments and comparisons demonstrate superior performance of the proposed method for class discovery tasks.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Pelleg, D., Moore, A.: Active learning for anomaly and rare category detection. In: Neural Information Processing System (2004)
He, J., Carbonell, J.: Nearest-neighbor-based active learning for rare category detection. In: Neural Information Processing System (2007)
Vatturi, P., Wong, W.K.: Category detection using hierarchical mean shift. In: SIGKDD, pp. 847–856 (2009)
Acknowledgements
This work is supported by the Australian Research Council (ARC) through Discovery Project Grant DP130100364.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Fu, Y., Gao, J., Zhu, X. (2015). Active Class Discovery by Querying Pairwise Label Homogeneity. In: Zhang, C., et al. Data Science. ICDS 2015. Lecture Notes in Computer Science(), vol 9208. Springer, Cham. https://doi.org/10.1007/978-3-319-24474-7_19
Download citation
DOI: https://doi.org/10.1007/978-3-319-24474-7_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24473-0
Online ISBN: 978-3-319-24474-7
eBook Packages: Computer ScienceComputer Science (R0)