Leveraging Reproduction-Error Representations for Multi-Instance Classification

Kauschke, Sebastian; Mühlhäuser, Max; Fürnkranz, Johannes

doi:10.1007/978-3-030-01771-2_6

Sebastian Kauschke^17,18,
Max Mühlhäuser¹⁸ &
Johannes Fürnkranz¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11198))

Included in the following conference series:

International Conference on Discovery Science

828 Accesses
1 Citations

Abstract

Multi-instance learning deals with the problem of classifying bags of instances, when only the labels of the bags are known for learning, and the instances themselves have no labels. In this work, we propose a method that trains autoencoders for the instances in each class, and recodes each instance into a representation that captures the reproduction error for this instance. The idea behind this approach is that an autoencoder trained on only instances of a single class is unable to reproduce examples from another class properly, which is then reflected in the encoding. The transformed instances are then piped into a propositional classifier that decides the latent instance label. In a second classification layer, the bag label is decided based on the output of the propositional classifier on all the instances in the bag. We show that this reproduction-error encoding creates an advantage compared to the classification of non-encoded data, and that further research into this direction could be beneficial for the cause of multi-instance learning.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
http://yann.lecun.com/exdb/mnist/.

References

Amores, J.: Multiple instance classification: review taxonomy and comparative study. Artif. Intell. 201, 81–105 (2013)
Article MathSciNet Google Scholar
Andrews, S., Tsochantaridis, I., Hofmann, T.: Support vector machines for multiple-instance learning. In: Advances in Neural Information Processing Systems - NIPS’03, pp. 561–568 (2003)
Google Scholar
Bunescu, R.C., Mooney, R.J.: Multiple instance learning for sparse positive bags. In: Proceedings of the 24th International Conference on Machine Learning, pp. 105–112 (2007)
Google Scholar
Demšar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)
MathSciNet MATH Google Scholar
Dietterich, T.G., Lathrop, R.H., Lozano-Pérez, T.: Solving the multiple instance problem with axis-parallel rectangles. Artif. Intell. 89(1–2), 31–71 (1997)
Article Google Scholar
Feng, S., Xiong, W., Li, B., Lang, C., Huang, X.: Hierarchical sparse representation based multi-instance semi-supervised learning with application to image categorization. Signal Process. 94, 595–607 (2014)
Article Google Scholar
Foulds, J., Frank, E.: A review of multi-instance learning assumptions. In: Knowledge Engineering Review, vol. 25, pp. 1–25. Cambridge University Press, Cambridge (2010)
Google Scholar
Frank, E., Xu, X.: Applying propositional learning algorithms to multi-instance data. (Working paper 06/03). Technical report, University of Waikato, Department of Computer Science (2003)
Google Scholar
Kauschke, S., Fürnkranz, J., Janssen, F.: Predicting cargo train failures: a machine learning approach for a lightweight prototype. In: Proceedings of the 19th International Conference on Discovery Science - DS’16, pp. 151–166 (2016)
Chapter Google Scholar
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: CoRR (2014). http://arxiv.org/abs/1412.6980
Liu, M., Zhang, J., Adeli, E., Shen, D.: Landmark-based deep multi-instance learning for brain disease diagnosis. Med. Image Anal. 43, 157–168 (2018)
Article Google Scholar
Sipos, R., Fradkin, D., Moerchen, F., Wang, Z.: Log-based predictive maintenance. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD’14, pp. 1867–1876 (2014)
Google Scholar
Sutskever, I., Martens, J., Dahl, G., Hinton, G.: On the importance of initialization and momentum in deep learning. In: Proceedings of the International Conference on Machine Learning - ICML’13, pp. 1139–1147 (2013)
Google Scholar
Vincent, P., Larochelle, H., Manzagol, P.-A.: Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion. J. Mach. Learn. Res. 11, 3371–3408 (2010)
Google Scholar
Wang, J., Zucker, J.D.: Solving multiple-instance problem: a lazy learning approach. In: Proceedings of the 17th International Conference on Machine Learning - ICML’00, pp. 1119–1125 (2000)
Google Scholar
Wang, Y., Yao, H., Zhao, S.: Auto-encoder based dimensionality reduction. Neurocomputing 184, 232–242 (2016)
Article Google Scholar
Weidmann, N., Frank, E., Pfahringer, B.: A two-level learning method for generalized multi-instance problems. In: Lavrač, N., Gamberger, D., Blockeel, H., Todorovski, L. (eds.) ECML 2003. LNCS (LNAI), vol. 2837, pp. 468–479. Springer, Heidelberg (2003). https://doi.org/10.1007/978-3-540-39857-8_42
Chapter Google Scholar
Wu, J., Yu, Y., Huang, C., Yu, K.: Deep multiple instance learning for image classification and auto-annotation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition - CVPR’15, pp. 3460–3469. IEEE (2015)
Google Scholar
Yan, Z., Zhan, Y., Zhang, S., Metaxas, D., Zhou, X.S.: Multi-instance multi-stage deep learning for medical image recognition. In: Deep Learning for Medical Image Analysis, pp. 83–104. Academic Press (2017)
Google Scholar
Zhou, Z.H., Sun, Y.Y., Li, Y.F.: Multi-instance learning by treating instances as non-i.i.d. samples. In: Proceedings of the 26th International Conference on Machine Learning - ICML’09, pp. 1249–1256. ACM (2009)
Google Scholar

Download references

Acknowledgements

This work has been sponsored by the German Federal Ministry of Education and Research (BMBF) Software Campus project Effiziente Modellierungstechniken für Predictive Maintenance [01IS17050]. We also gratefully acknowledge the use of the Lichtenberg high performance computer of the TU Darmstadt for our experiments.

Author information

Authors and Affiliations

Knowledge Engineering Group, TU Darmstadt, Darmstadt, Germany
Sebastian Kauschke & Johannes Fürnkranz
Telecooperation Group, TU Darmstadt, Darmstadt, Germany
Sebastian Kauschke & Max Mühlhäuser

Authors

Sebastian Kauschke
View author publications
You can also search for this author in PubMed Google Scholar
Max Mühlhäuser
View author publications
You can also search for this author in PubMed Google Scholar
Johannes Fürnkranz
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sebastian Kauschke .

Editor information

Editors and Affiliations

Goldsmiths University of London, London, UK
Larisa Soldatova
Eindhoven University of Technology, Eindhoven, The Netherlands
Joaquin Vanschoren
University of Cyprus, Nicosia, Cyprus
George Papadopoulos
Università degli Studi di Bari Aldo Moro, Bari, Italy
Michelangelo Ceci

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kauschke, S., Mühlhäuser, M., Fürnkranz, J. (2018). Leveraging Reproduction-Error Representations for Multi-Instance Classification. In: Soldatova, L., Vanschoren, J., Papadopoulos, G., Ceci, M. (eds) Discovery Science. DS 2018. Lecture Notes in Computer Science(), vol 11198. Springer, Cham. https://doi.org/10.1007/978-3-030-01771-2_6

Download citation

DOI: https://doi.org/10.1007/978-3-030-01771-2_6
Published: 07 October 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01770-5
Online ISBN: 978-3-030-01771-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics