ABSTRACT
The weakly supervised Multiple-Instance Learning (MIL) problem has been successfully applied in information retrieval tasks. Two related issues might affect the performance of MIL algorithms: how to cope with label ambiguities and how to deal with non-discriminative components, and we propose COmpact MultiPle-Instance LEarning (COMPILE) to consider them simultaneously. To treat label ambiguities, COMPILE seeks ground-truth positive instances in positive bags. By using weakly supervised information to learn data's short binary representations, COMPILE enhances discrimination via strengthening discriminative components and suppressing non-discriminative ones. We adapt block coordinate descent to optimize COMPILE efficiently. Experiments on text categorization empirically show: 1) COMPILE unifies disambiguation and data preprocessing successfully; 2) it generates short binary representations efficiently to enhance discrimination at significantly reduced storage cost.
- J. Amores. 2013. Multiple instance classification: Review, taxonomy and comparative study. Artif. Intell. Vol. 201 (2013), 81--105. Google ScholarDigital Library
- S. Andrews, I. Tsochantaridis, and T. Hofmann. 2003. Support vector machines for multiple-instance learning NIPS. 577--584. Google ScholarDigital Library
- J.F. Bonnans, J.C. Gilbert, C. Lemaréchal, and C.A. Sagastizábal. 2006. Numerical optimization: theoretical and practical aspects. Springer Science & Business Media. Google ScholarDigital Library
- J. Chai, H. Chen, L. Huang, and F. Shang. 2014. Maximum margin multiple-instance feature weighting. Pattern Recog., Vol. 47, 6 (2014), 2091--2103. Google ScholarDigital Library
- R.O. Duda, P.E. Hart, and D.G. Stork. 2012. Pattern classification. John Wiley & Sons.Google Scholar
- Y. Gong and S. Lazebnik. 2011. Iterative quantization: A procrustean approach to learning binary codes CVPR. 817--824. Google ScholarDigital Library
- Wu-Jun Li and Dit-Yan Yeung. 2010. MILD: Multiple-Instance Learning via Disambiguation. IEEE Trans. Knowl. Data Eng. Vol. 22, 1 (2010), 76--89. Google ScholarDigital Library
- F. Shen, C. Shen, W. Liu, and H.T. Shen. 2015. Supervised discrete hashing. In CVPR. 37--45.Google Scholar
- X. Shen, W. Liu, I. Tsang, F. Shen, and Q.S. Sun. 2017. Compressed K-Means for Large-Scale Clustering. In AAAI. 2527--2533.Google Scholar
- J. Wang and J.D. Zucker. 2000. Solving the Multiple-Instance Problem: A Lazy Learning Approach ICML. 1119--1126. Google ScholarDigital Library
- J. Wu, Z. Hong, S. Pan, X. Zhu, Z. Cai, and C. Zhang. 2014. Exploring features for complicated objects: Cross-view feature selection for multi-instance learning. In CIKM. 1699--1708. Google ScholarDigital Library
- M.L. Zhang and Z.H. Zhou. 2006. Adapting RBF neural networks to multi-instance learning. Neural Processing Letters Vol. 23, 1 (2006), 1--26. Google ScholarDigital Library
- Q. Zhang, S.A. Goldman, et almbox.. 2001. EM-DD: An improved multiple-instance learning technique NIPS. 1073--1080. Google ScholarDigital Library
- Y. Zhou, Z. Jorgensen, and M. Inge. 2007. Combating good word attacks on statistical spam filters with multiple instance learning ICTAI. 298--305. Google ScholarDigital Library
- Z.H. Zhou, K. Jiang, and M. Li. 2005. Multi-instance learning based web mining. Appl. Intell., Vol. 22, 2 (2005), 135--147. Google ScholarDigital Library
- Z.H. Zhou, Y.Y. Sun, and Y.F. Li. 2009. Multi-instance learning by treating instances as non-iid samples ICML. 1249--1256. Google ScholarDigital Library
Recommendations
Multiple instance learning with bag dissimilarities
Multiple instance learning (MIL) is concerned with learning from sets (bags) of objects (instances), where the individual instance labels are ambiguous. In this setting, supervised learning cannot be applied directly. Often, specialized MIL methods ...
Graph-based multiple-instance learning for object-based image retrieval
MIR '08: Proceedings of the 1st ACM international conference on Multimedia information retrievalWe study in this paper the problem of using multiple-instance semi-supervised learning to solve object-based image retrieval problem, in which the user is only interested in a portion of the image, and the rest of the image is considered as irrelevant. ...
Multiple-instance learning with pairwise instance similarity
Abstract Multiple-Instance Learning MIL has attracted much attention of the machine learning community in recent years and many real-world applications have been successfully formulated as MIL problems. Over the past few years, several Instance ...
Comments