Abstract
Multi-instance multi-label learning (MIML) introduced by Zhou and Zhang is a comparatively new framework in machine learning with two special characteristics: Firstly, each instance is represented by a set of feature vectors (a bag of instances), and secondly, bags of instances may belong to many classes (a Multi-Label). Thus, an MIML classifier receives a bag of instances and produces a Multi-Label. For classifier training, the training set is also of this MIML structure. Labeling a data set is always cost-intensive, especially in an MIMIL framework. In order to reduce the labeling costs it is important to restructure the annotation process in such a way that the most informative examples are labeled in the beginning, and less or non-informative data more to the end of the annotation phase. Active learning is a possible approach to tackle this kind of problems in this work we focus on the MIMLSVM algorithm in combination with the k-Medoids clustering algorithm to transform the Multi-Instance to a Single-Instance representation. For the clustering distance measure we consider variants of the Hausdorff distance, namely Median- and Average-Based Hausdorff distance. Finally, active learning strategies derived from the single-instance scenario have been investigated in the MIML setting and evaluated on a benchmark data set.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Boutell, M. R., Luo, J., Shen, X., & Christopher, M. B. (2004). Learning multi-label scene classification. Pattern Recognition, 37, 1757–1771.
Brinker, K. (2005). On active learning in multi-label classification. In Data and information analysis to knowledge engineering. Proceedings of the 29th Annual Conference of the Gesellschaft für Klassifikation e.V. (pp. 206–213).
Dietterich, T. G., Lathrop, R. H., & Lozano-Perez, T. (1997). Solving the multiple-instance problem with axis-parallel rectangles. Artificial Intelligence, 89, 31–71.
Fu, J., & Yin, J. (2011). Bag-level active multi-instance learning. In Eighth International Conference on Fuzzy Systems and Knowledge Discovery (pp. 1307–1311).
Li, X., Wang, L., & Sung, E. (2004). Multi-label SVM active learning for image classification. In IEEE 2004 International Conference on Image Processing (pp. 2207–2210).
Mccallum, A. K. (1999). Multi-label text classification with a mixture model trained by EM. In AAAI 99 Workshop on Text Learning.
Schwenker, F., & Trentin, E. (2014). Pattern classification and clustering: A review of partially supervised learning approaches. Pattern Recognition Letters, 37, 3–14.
Settles, B. (2009). Active learning literature survey. Computer Sciences Technical Report, 1648.
Settles, B., Craven, M., & Soumya, R. (2008). Multiple-instance active learning. In Neural Information Processing Systems (pp. 1289–1296).
Wang, J., & Zucker, J. D. (2000). Solving the multiple-instance problem: A lazy learning approach. In Proceedings of the 17th ICML (pp. 1119–1126).
Yang, B., Sun, J. T., Wang, T., & Chen, Z. (2009). Effective multi-label active learning for text classification. In 15th ACM SIGKDD International Conference on KDDM (pp. 917–926).
Zhang, M. L., & Zhou, Z. H. (2009). Multi-instance clustering with applications to multi-instance prediction. Applied Intelligence, 31, 47–68.
Zhou, Z., & Zhang, M. (2007). Multi-instance multi-label learning with application to scene classification. In NIPS 19 (pp. 1609–1616).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Retz, R., Schwenker, F. (2016). Active Multi-Instance Multi-Label Learning. In: Wilhelm, A., Kestler, H. (eds) Analysis of Large and Complex Data. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-25226-1_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-25226-1_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25224-7
Online ISBN: 978-3-319-25226-1
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)