Abstract
Confounded information is an objective fact when using multi-instance learning (MIL) to classify bags of instances, which may be inherited by MIL embedding methods and lead to questionable bag label prediction. To respond to this problem, we propose the multi-instance embedding learning with deconfounded instance-level prediction algorithm. Unlike traditional embedding-based strategies, we design a deconfounded optimization goal to maximize the distinction between instances in positive and negative bags. In addition, we present and use bag-level embedding with feature distillation to reduce the MIL classification task to a single-instance learning problem. Under the theoretical analysis, the embedding cohesiveness and feature magnitude metrics are developed to explain the benefits of the proposed deconfounded technique in MIL settings. Extensive experiments on thirty-four data sets demonstrate that our proposed method has the best overall performance over other state-of-the-art MIL methods. This strategy, in particular, has a substantial advantage on web data sets. Source codes are available at https://github.com/InkiInki/MEDI.






Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Angelidis, S., Lapata, M.: Multiple instance learning networks for fine-grained sentiment analysis. Tran. Assoc. Comput. Linguist. 6, 17–31 (2018). https://doi.org/10.1162/tacl_a_00002
Chen, Y.X., Bi, J.B., Wang, J.Z.: MILES: multiple-instance learning via embedded instance selection. IEEE Trans. Pattern Anal. Mach. Intell. 28(12), 1931–1947 (2006). https://doi.org/10.1109/TPAMI.2006.248
Dietterich, T.G., Lathrop, R.H., Lozano-Pérez, T.: Solving the multiple instance problem with axis-parallel rectangles. Artif. Intell. 89(1–2), 31–71 (1997). https://doi.org/10.1016/S0004-3702(96)00034-3
Fu, Z.Y., Robles-Kelly, A., Zhou, J.: MILIS: multiple instance learning with instance selection. IEEE Trans. Pattern Anal. Mach. Intell. 33(5), 958–977 (2011). https://doi.org/10.1109/TPAMI.2010.155
Hong, R.C., Wang, M., Gao, Y., et al.: Image annotation by multiple-instance learning with discriminative feature mapping and selection. IEEE Trans. Cybern. 44(5), 669–680 (2014). https://doi.org/10.1109/TCYB.2013.2265601
Ilse, M., Tomczak, J., Welling, M.: Attention-based deep multiple instance learning. In: International Conference on Machine Learning, pp. 2127–2136 (2018)
Li, S., Liu, F., Jiao, L.C.: Self-training multi-sequence learning with transformer for weakly supervised video anomaly detection, pp 1–9 (2022)
Lin, T.C., Xu, H.T., Yang, C.Q. et al.: Interventional multi-instance learning with deconfounded instance-level prediction. In: AAAI Conference on Artificial Intelligence, pp. 1–9 (2022) https://doi.org/10.48550/arXiv.2204.09204
Lin, Y., Zhang, H.G.: Regularized instance embedding for deep multi-instance learning. Appl. Sci. 10(1), 64–77 (2020). https://doi.org/10.3390/app10010064
Shi, X.S., Xing, F.Y., Xie, Y.P. et al.: Loss-based attention for deep multiple instance learning. In: AAAI Conference on Artificial Intelligence, pp. 5742–5749 (2020). https://doi.org/10.1609/aaai.v34i04.6030
Sİvrikaya, Ö.E., Yüksekgönül, M., Baydoğan, M.G.: Learning prototypes for multiple instance learning. Turk. J. Electr. Eng. Comput. Sci. 29(7), 2901–2919 (2021)
Tarragó, D.S., Cornelis, C., Bello, R., et al.: A multi-instance learning wrapper based on the Rocchio classifier for web index recommendation. Knowl. Based Syst. 59, 173–181 (2014). https://doi.org/10.1016/j.knosys.2014.01.008
Tian, Y., Pang, G.S., Chen, Y.H., et al.: Weakly-supervised video anomaly detection with robust temporal feature magnitude learning. In: IEEE/CVF International Conference on Computer Vision, pp. 4975–4986 (2021)
Wei, X.S., Zhou, Z.H.: An empirical study on image bag generators for multi-instance learning. Mach. Learn. 105(2), 155–198 (2016). https://doi.org/10.1007/s10994-016-5560-1
Wei, X.S., Wu, J.X., Zhou, Z.H.: Scalable multi-instance learning. In: IEEE International Conference on Data Mining, pp. 1037–1042 (2014) https://doi.org/10.1109/ICDM.2014.16
Wei, X.S., Wu, J.X., Zhou, Z.H.: Scalable algorithms for multi-instance learning. IEEE Trans. Neural Netw. Learn. Syst. 28(4), 975–987 (2017). https://doi.org/10.1109/TNNLS.2016.2519102
Wu, J., Pan, S.R., Zhu, X.Q., et al.: Multi-instance learning with discriminative bag mapping. IEEE Trans. Knowl. Data Eng. 30(6), 1065–1080 (2018). https://doi.org/10.1109/TKDE.2017.2788430
Xu, B.C., Ting, K.M., Zhou, Z.H.: Isolation set-kernel and its application to multi-instance learning. In: ACM SIGKDD International Conference on Knowledge Discovery & Data Mining July, pp. 941–949 (2019). https://doi.org/10.1145/3292500.3330830
Yang, M., Zhang, Y.X., Wang, X.Z. et al.: Multi-instance ensemble learning with discriminative bags. IEEE Trans. Syst. Man Cybern. Syst. 5456–5467 (2021). https://doi.org/10.1109/TSMC.2021.3125040
Yang, M., Tang, W.T., Min, F.: Multi-instance multi-label learning based on parallel attention and local label manifold correlation. In: International Conference on Data Science and Advanced Analytics, pp. 1–10 (2022a)
Yang, M., Zeng, W.X., Min, F.: Multi-instance embedding learning through high-level instance selection. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 122–133 (2022b). https://doi.org/10.1007/978-3-031-05936-0_10
Yang, M., Zhang, Y.X., Ye, M., et al.: Attention-to-embedding framework for multi-instance learning. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 109–121 (2022c). https://doi.org/10.1007/978-3-031-05936-0_9
Yang, M., Zhang, Y.X., Zhou, Z. et al.: Multi-embedding space set-kernel and its application to multi-instance learning. Neurocomputing 512, 1–14 (2022d)
Zhang, H.R., Meng, Y.D., Zhao, Y.T. et al.: DTFD-MIL: Double-tier feature distillation multiple instance learning for histopathology whole slide image classification. In: Computer Vision and Pattern Recognition, pp. 18,802–18,812 (2022). https://doi.org/10.48550/arXiv.2203.12081
Zhang, M.L., Zhou, Z.H.: Multi-instance clustering with applications to multi-instance prediction. Appl. Intell. 31(1), 47–68 (2009). https://doi.org/10.1007/s10489-007-0111-x
Zhang, T., Jin, H.: Optimal margin distribution machine for multi-instance learning. In: International Conference on International Joint Conferences on Artificial Intelligence, pp. 2383–2389 (2021)
Zhang, W.J., Liu, L., Li, J.Y.: Robust multi-instance learning with stable instances, pp 1682–1689 (2020). https://doi.org/10.3233/FAIA200280. arXiv:1902.05066
Zhou, Z.H., Jiang, K., Li, M.: Multi-instance learning based web mining. Appl. Intell. 22, 135–147 (2005). https://doi.org/10.1007/s10489-005-5602-z
Zhou, Z.H., Sun, Y.Y., Li, Y.F.: Multi-instance learning by treating instances as non-I.I.D. samples. In: International Conference on Machine Learning, pp. 1249–1256 (2009). https://doi.org/10.1145/1553374.1553534
Acknowledgements
This work was supported in part by the National Key R &D Program of China (2018YFE0203900), National Natural Science Foundation of China (61773093), Sichuan Science and Technology Program (2020YFG0476), Important Science and Technology Innovation Projects in Chengdu (2018-YF08-00039-GX), and Open Project of Zhejiang Key Laboratory of Marine Big Data Mining and Application (OBDMA202102).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhang, YX., Yang, M., Zhou, Z. et al. Multi-instance embedding learning with deconfounded instance-level prediction. Int J Data Sci Anal 16, 391–401 (2023). https://doi.org/10.1007/s41060-022-00372-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41060-022-00372-7