Skip to main content
Log in

Pairwise-similarity-based instance reduction for efficient instance selection in multiple-instance learning

  • Original Article
  • Published:
International Journal of Machine Learning and Cybernetics Aims and scope Submit manuscript

Abstract

Unlike the traditional supervised learning, multiple-instance learning (MIL) deals with learning from bags of instances rather than individual instances. Over the last couple of years, some researchers have attempted to solve the MIL problem from the perspective of instance selection. The basic idea is selecting some instance prototypes from the training bags and then converting MIL to single-instance learning using these prototypes. However, a bag is composed of one or more instances, which often leads to high computational complexity for instance selection. In this paper, we propose a simple and general instance reduction method to speed up the instance selection process for various instance selection-based MIL (ISMIL) algorithms. We call it pairwise-similarity-based instance reduction for multiple-instance learning (MIPSIR), which is based on the pairwise similarity between instances in a bag. Instead of the original training bag, we use a pair of instances with the highest or lowest similarity value depending on the bag label within this bag for instance selection. We have applied our method to four effective ISMIL algorithms. The evaluation on three benchmark datasets demonstrates that the MIPSIR method can significantly improve the efficiency of an ISMIL algorithm while maintaining or even improving its generalization capability.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Notes

  1. The source code of DD-SVM is available at http://www.cs.olemiss.edu/~ychen/ddsvm.html.

  2. The source code of MILES is available at http://www.cs.olemiss.edu/~ychen/MILES.html.

  3. The source code of MILD is available at http://www.cs.sjtu.edu.cn/~liwujun/.

  4. This software package is available at http://www.csie.ntu.edu.tw/~cjlin/libsvm/.

  5. These datasets are available at http://www.cs.columbia.edu/~andrews/mil/datasets.html.

References

  1. Dietterich TG, Lathrop RH, Lozano-Pérez T (1997) Solving the multiple instance problem with axis-parallel rectangles. Artif Intell 89(1–2):31–71

    Article  MATH  Google Scholar 

  2. Maron O, Lozano-Pérez T (1998) A framework for multiple-instance learning. In: Proceedings of advances in neural information processing systems, vol 10. MIT Press, Cambridge, pp 570–576

  3. Maron O, Ratan AL (1998) Multiple-instance learning for natural scene classification. In: Proceedings of the 15th international conference on machine learning. Morgan Kaufmann, San Francisco, pp 341–349

  4. Fung G, Dundar M, Krishnapuram B, Rao RB (2007) Multiple instance learning for computer aided diagnosis. In: Proceedings of advances in neural information processing systems, vol 19. MIT Press, Cambridge, pp 425–432

  5. Raykar VC, Krishnapuram B, Bi J, Dundar M, Rao RB (2008) Bayesian multiple instance learning: automatic feature selection and inductive transfer. In: Proceedings of the 25th international conference on machine learning. ACM, New York, pp 808–815

  6. Yang C, Lozano-Pérez T (2000) Image database retrieval with multiple-instance learning techniques. In: Proceedings of the 16th international conference on data engineering. IEEE, Washington, pp 233–243

  7. Zhang Q, Goldman SA, Yu W, Fritts JE (2002) Content-based image retrieval using multiple-instance learning. In: Proceedings of the 19th international conference on machine learning. Morgan Kaufmann, San Francisco, pp 682–689

  8. Rahmani R, Goldman SA, Zhang H, Cholleti SR, Fritts JE (2008) Localized content-based image retrieval. IEEE Trans Pattern Anal Mach Intell 30(11):1902–1912

    Article  Google Scholar 

  9. Zha ZJ, Hua XS, Mei T, Wang J, Qi GJ, Wang Z (2008) Joint multi-label multi-instance learning for image classification. In: Proceedings of the 21st conference on computer vision and pattern recognition. IEEE, Washington, pp 1–8

  10. Ali S, Shah M (2010) Human action recognition in videos using kinematic features and multiple instance learning. IEEE Trans Pattern Anal Mach Intell 32(2):288–303

    Article  Google Scholar 

  11. Viola PA, Platt JC, Zhang C (2006) Multiple instance boosting for object detection. In: Proceedings of advances in neural information processing systems, vol 18. MIT Press, Cambridge, pp 1417–1424

  12. Dollár P, Babenko B, Belongie S, Perona P, Tu Z (2008) Multiple component learning for object detection. In: Proceedings of the 10th European conference on computer vision. Springer, Berlin, pp 211–224

  13. Babenko B, Yang MH, Belongie S (2009) Visual tracking with online multiple instance learning. In: Proceedings of the 22nd conference on computer vision and pattern recognition. IEEE, Washington, pp 983–990

  14. Babenko B, Yang MH, Belongie S (2011b) Robust object tracking with online multiple instance learning. IEEE Trans Pattern Anal Mach Intell 33(8):1619–1632

    Article  Google Scholar 

  15. Ning J, Shi W, Yang S, Yanne P (2013) Visual tracking based on distribution fields and online weighted multiple instance learning. Image Vis Comput 31(11):853–863

    Article  Google Scholar 

  16. Chen Y, Wang JZ (2004) Image categorization by learning and reasoning with regions. J Mach Learn Res 5:913–939

    Google Scholar 

  17. Chen Y, Bi J, Wang JZ (2006) MILES: Multiple-instance learning via embedded instance selection. IEEE Trans Pattern Anal Mach Intell 28(12):1931–1947

    Article  Google Scholar 

  18. Li WJ, Yeung DY (2010) MILD: Multiple-instance learning via disambiguation. IEEE Trans Knowl Data Eng 22(1):76–89

    Article  Google Scholar 

  19. Fu Z, Robles-Kelly A, Zhou J (2011) MILIS: Multiple instance learning with instance selection. IEEE Trans Pattern Anal Mach Intell 33(5):958–977

    Article  Google Scholar 

  20. Wang X-Z, Lu S-X, Zhai J-H (2008) Fast fuzzy multicategory SVM based on support vector domain description. Int J Pattern Recognit Artif Intell 22(01):109–120

    Article  Google Scholar 

  21. Chen W-J, Shao Y-H, Hong N (2013) Laplacian smooth twin support vector machine for semi-supervised classification. Int J Mach Learn Cybern. doi:10.1007/s13042-013-0183-3

    Google Scholar 

  22. Zhang Q, Goldman SA (2002) EM-DD: An improved multiple-instance learning technique. In: Proceedings of advances in neural information processing systems, vol 14. MIT Press, Cambridge, pp 1073–1080

  23. Settles B, Craven M, Ray S (2008) Multiple-instance active learning. In: Proceedings of advances in neural information processing systems, vol 20. Curran Associates Inc, New York, pp 1289–1296

  24. Babenko B, Verma N, Dollár P, Belongie S (2011a) Multiple instance learning with manifold bags. In: Proceedings of the 28th international conference on machine learning. Omnipress, New York, pp 81–88

  25. Antić B, Ommer B (2013) Robust multiple-instance learning with superbags. In: Proceedings of the 11th Asian conference on computer vision. Springer, Berlin, pp 242–255

  26. Ramon J., De Raedt L. (2000) Multi instance neural networks. In: Proceedings of the 17th international conference on machine learning, workshop on attribute-value and relational learning

  27. Zhang M-L, Zhou Z-H (2004) Improve multi-instance neural networks through feature selection. Neural Process Lett 19(1):1–10

    Article  MATH  Google Scholar 

  28. Wang J, Zucker JD (2000) Solving the multiple-instance problem: a lazy learning approach. In: Proceedings of the 17th international conference on machine learning. Morgan Kaufmann, San Francisco, pp 1119–1126

  29. Gärtner T, Flach PA, Kowalczyk A, Smola AJ (2002) Multi-instance kernels. In: Proceedings of the 19th international conference on machine learning. Morgan Kaufmann, San Francisco, pp 179–186

  30. Andrews S, Tsochantaridis I, Hofmann T (2003) Support vector machines for multiple-instance learning. In: Proceedings of advances in neural information processing systems, vol 15. MIT Press, Cambridge, pp 561–568

  31. Bergeron C, Moore G, Zaretzki J, Breneman CM, Bennett KP (2012) Fast bundle algorithm for multiple-instance learning. IEEE Trans Pattern Anal Mach Intell 34(6):1068–1079

    Article  Google Scholar 

  32. Minhas FA, Ben-Hur A (2012) Multiple instance learning of Calmodulin binding sites. Bioinformatics 28(18):i416–i422

    Article  Google Scholar 

  33. Li Y, Tax DMJ, Duin RPW, Loog M (2013) Multiple-instance learning as a classifier combining problem. Pattern Recognit 46(3):865–874

    Article  Google Scholar 

  34. Nguyen DT, Nguyen CD, Hargraves R, Kurgan LA, Cios KJ (2013) mi-DS: Multiple-instance learning algorithm. IEEE Trans Cybern 43(1):143–154

    Article  Google Scholar 

  35. Wang X-Z, Dong L-C, Yan J-H (2012) Maximum ambiguity-based sample selection in fuzzy decision tree induction. IEEE Trans Knowl Data Eng 24(8):1491–1505

    Article  Google Scholar 

  36. Hamidzadeh J, Monsefi R, Sadoghi Yazdi H (2014) Large symmetric margin instance selection algorithm. Int J Mach Learn Cybern. doi:10.1007/s13042-014-0239-z

    Google Scholar 

  37. Foulds J, Frank E (2008) Revisiting multiple-instance learning via embedded instance selection. In: Proceedings of the 21st Australasian joint conference on artificial intelligence. Springer, Berlin, pp 300–310

  38. Cheplygina V, Tax DMJ, Loog M (2013) Combining instance information to classify bags. In: Proceedings of the 11th international workshop on multiple classifier systems. Springer, Berlin, pp 13–24

Download references

Acknowledgments

This research has been supported by the National Natural Science Foundation of China under the Grant Nos. 61173087 and 61370162. The authors would like to thank the anonymous reviewers for their valuable suggestions.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Liming Yuan.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yuan, L., Liu, J., Tang, X. et al. Pairwise-similarity-based instance reduction for efficient instance selection in multiple-instance learning. Int. J. Mach. Learn. & Cyber. 6, 83–93 (2015). https://doi.org/10.1007/s13042-014-0248-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13042-014-0248-y

Keywords

Navigation