Abstract
Unlike the traditional supervised learning, multiple-instance learning (MIL) deals with learning from bags of instances rather than individual instances. Over the last couple of years, some researchers have attempted to solve the MIL problem from the perspective of instance selection. The basic idea is selecting some instance prototypes from the training bags and then converting MIL to single-instance learning using these prototypes. However, a bag is composed of one or more instances, which often leads to high computational complexity for instance selection. In this paper, we propose a simple and general instance reduction method to speed up the instance selection process for various instance selection-based MIL (ISMIL) algorithms. We call it pairwise-similarity-based instance reduction for multiple-instance learning (MIPSIR), which is based on the pairwise similarity between instances in a bag. Instead of the original training bag, we use a pair of instances with the highest or lowest similarity value depending on the bag label within this bag for instance selection. We have applied our method to four effective ISMIL algorithms. The evaluation on three benchmark datasets demonstrates that the MIPSIR method can significantly improve the efficiency of an ISMIL algorithm while maintaining or even improving its generalization capability.
Similar content being viewed by others
Notes
The source code of DD-SVM is available at http://www.cs.olemiss.edu/~ychen/ddsvm.html.
The source code of MILES is available at http://www.cs.olemiss.edu/~ychen/MILES.html.
The source code of MILD is available at http://www.cs.sjtu.edu.cn/~liwujun/.
This software package is available at http://www.csie.ntu.edu.tw/~cjlin/libsvm/.
These datasets are available at http://www.cs.columbia.edu/~andrews/mil/datasets.html.
References
Dietterich TG, Lathrop RH, Lozano-Pérez T (1997) Solving the multiple instance problem with axis-parallel rectangles. Artif Intell 89(1–2):31–71
Maron O, Lozano-Pérez T (1998) A framework for multiple-instance learning. In: Proceedings of advances in neural information processing systems, vol 10. MIT Press, Cambridge, pp 570–576
Maron O, Ratan AL (1998) Multiple-instance learning for natural scene classification. In: Proceedings of the 15th international conference on machine learning. Morgan Kaufmann, San Francisco, pp 341–349
Fung G, Dundar M, Krishnapuram B, Rao RB (2007) Multiple instance learning for computer aided diagnosis. In: Proceedings of advances in neural information processing systems, vol 19. MIT Press, Cambridge, pp 425–432
Raykar VC, Krishnapuram B, Bi J, Dundar M, Rao RB (2008) Bayesian multiple instance learning: automatic feature selection and inductive transfer. In: Proceedings of the 25th international conference on machine learning. ACM, New York, pp 808–815
Yang C, Lozano-Pérez T (2000) Image database retrieval with multiple-instance learning techniques. In: Proceedings of the 16th international conference on data engineering. IEEE, Washington, pp 233–243
Zhang Q, Goldman SA, Yu W, Fritts JE (2002) Content-based image retrieval using multiple-instance learning. In: Proceedings of the 19th international conference on machine learning. Morgan Kaufmann, San Francisco, pp 682–689
Rahmani R, Goldman SA, Zhang H, Cholleti SR, Fritts JE (2008) Localized content-based image retrieval. IEEE Trans Pattern Anal Mach Intell 30(11):1902–1912
Zha ZJ, Hua XS, Mei T, Wang J, Qi GJ, Wang Z (2008) Joint multi-label multi-instance learning for image classification. In: Proceedings of the 21st conference on computer vision and pattern recognition. IEEE, Washington, pp 1–8
Ali S, Shah M (2010) Human action recognition in videos using kinematic features and multiple instance learning. IEEE Trans Pattern Anal Mach Intell 32(2):288–303
Viola PA, Platt JC, Zhang C (2006) Multiple instance boosting for object detection. In: Proceedings of advances in neural information processing systems, vol 18. MIT Press, Cambridge, pp 1417–1424
Dollár P, Babenko B, Belongie S, Perona P, Tu Z (2008) Multiple component learning for object detection. In: Proceedings of the 10th European conference on computer vision. Springer, Berlin, pp 211–224
Babenko B, Yang MH, Belongie S (2009) Visual tracking with online multiple instance learning. In: Proceedings of the 22nd conference on computer vision and pattern recognition. IEEE, Washington, pp 983–990
Babenko B, Yang MH, Belongie S (2011b) Robust object tracking with online multiple instance learning. IEEE Trans Pattern Anal Mach Intell 33(8):1619–1632
Ning J, Shi W, Yang S, Yanne P (2013) Visual tracking based on distribution fields and online weighted multiple instance learning. Image Vis Comput 31(11):853–863
Chen Y, Wang JZ (2004) Image categorization by learning and reasoning with regions. J Mach Learn Res 5:913–939
Chen Y, Bi J, Wang JZ (2006) MILES: Multiple-instance learning via embedded instance selection. IEEE Trans Pattern Anal Mach Intell 28(12):1931–1947
Li WJ, Yeung DY (2010) MILD: Multiple-instance learning via disambiguation. IEEE Trans Knowl Data Eng 22(1):76–89
Fu Z, Robles-Kelly A, Zhou J (2011) MILIS: Multiple instance learning with instance selection. IEEE Trans Pattern Anal Mach Intell 33(5):958–977
Wang X-Z, Lu S-X, Zhai J-H (2008) Fast fuzzy multicategory SVM based on support vector domain description. Int J Pattern Recognit Artif Intell 22(01):109–120
Chen W-J, Shao Y-H, Hong N (2013) Laplacian smooth twin support vector machine for semi-supervised classification. Int J Mach Learn Cybern. doi:10.1007/s13042-013-0183-3
Zhang Q, Goldman SA (2002) EM-DD: An improved multiple-instance learning technique. In: Proceedings of advances in neural information processing systems, vol 14. MIT Press, Cambridge, pp 1073–1080
Settles B, Craven M, Ray S (2008) Multiple-instance active learning. In: Proceedings of advances in neural information processing systems, vol 20. Curran Associates Inc, New York, pp 1289–1296
Babenko B, Verma N, Dollár P, Belongie S (2011a) Multiple instance learning with manifold bags. In: Proceedings of the 28th international conference on machine learning. Omnipress, New York, pp 81–88
Antić B, Ommer B (2013) Robust multiple-instance learning with superbags. In: Proceedings of the 11th Asian conference on computer vision. Springer, Berlin, pp 242–255
Ramon J., De Raedt L. (2000) Multi instance neural networks. In: Proceedings of the 17th international conference on machine learning, workshop on attribute-value and relational learning
Zhang M-L, Zhou Z-H (2004) Improve multi-instance neural networks through feature selection. Neural Process Lett 19(1):1–10
Wang J, Zucker JD (2000) Solving the multiple-instance problem: a lazy learning approach. In: Proceedings of the 17th international conference on machine learning. Morgan Kaufmann, San Francisco, pp 1119–1126
Gärtner T, Flach PA, Kowalczyk A, Smola AJ (2002) Multi-instance kernels. In: Proceedings of the 19th international conference on machine learning. Morgan Kaufmann, San Francisco, pp 179–186
Andrews S, Tsochantaridis I, Hofmann T (2003) Support vector machines for multiple-instance learning. In: Proceedings of advances in neural information processing systems, vol 15. MIT Press, Cambridge, pp 561–568
Bergeron C, Moore G, Zaretzki J, Breneman CM, Bennett KP (2012) Fast bundle algorithm for multiple-instance learning. IEEE Trans Pattern Anal Mach Intell 34(6):1068–1079
Minhas FA, Ben-Hur A (2012) Multiple instance learning of Calmodulin binding sites. Bioinformatics 28(18):i416–i422
Li Y, Tax DMJ, Duin RPW, Loog M (2013) Multiple-instance learning as a classifier combining problem. Pattern Recognit 46(3):865–874
Nguyen DT, Nguyen CD, Hargraves R, Kurgan LA, Cios KJ (2013) mi-DS: Multiple-instance learning algorithm. IEEE Trans Cybern 43(1):143–154
Wang X-Z, Dong L-C, Yan J-H (2012) Maximum ambiguity-based sample selection in fuzzy decision tree induction. IEEE Trans Knowl Data Eng 24(8):1491–1505
Hamidzadeh J, Monsefi R, Sadoghi Yazdi H (2014) Large symmetric margin instance selection algorithm. Int J Mach Learn Cybern. doi:10.1007/s13042-014-0239-z
Foulds J, Frank E (2008) Revisiting multiple-instance learning via embedded instance selection. In: Proceedings of the 21st Australasian joint conference on artificial intelligence. Springer, Berlin, pp 300–310
Cheplygina V, Tax DMJ, Loog M (2013) Combining instance information to classify bags. In: Proceedings of the 11th international workshop on multiple classifier systems. Springer, Berlin, pp 13–24
Acknowledgments
This research has been supported by the National Natural Science Foundation of China under the Grant Nos. 61173087 and 61370162. The authors would like to thank the anonymous reviewers for their valuable suggestions.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Yuan, L., Liu, J., Tang, X. et al. Pairwise-similarity-based instance reduction for efficient instance selection in multiple-instance learning. Int. J. Mach. Learn. & Cyber. 6, 83–93 (2015). https://doi.org/10.1007/s13042-014-0248-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13042-014-0248-y