Abstract
Self-supervised models provide results on par with or superior to those of their fully supervised counterparts, yet it is unclear what information about images they contain. As a result, a visual probing framework was recently introduced to probe image representations for interesting visual features. While visual probing provides information about semantic knowledge, complexity, and consistency, it does not directly and exhaustively explain which visual features push self-supervised image representations away and which are neutral. In this paper, we fill this gap by proposing a method that removes a particular visual feature from the image and analyzes how such a distortion influences the representation. Our key findings emphasize that discrepancies in features such as lines and forms push self-supervised representations away more than changes in brightness, color, shape, and especially texture. Our work is complementary to visual probing and provides a more direct explanation of the mechanisms behind the contrastive loss.
Supported by grant no. POIR.04.04.00-00-14DE/18-00, carried out within the Team-Net program of the Foundation for Polish Science and co-financed by the European Union under the European Regional Development Fund, and by the Priority Research Area Digiworld under the program Excellence Initiative – Research University at the Jagiellonian University in Kraków.
Notes
1. We use the following implementations of the self-supervised methods: https://github.com/{google-research/simclr, yaox12/BYOL-PyTorch, facebookresearch/swav, facebookresearch/moco}. We use the ResNet-50 (1x) variant for each self-supervised method.
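For illustration only, below is a minimal sketch of the kind of analysis the abstract describes: remove a single visual feature from an image and measure how far the embedding of the distorted image drifts from that of the original. The torchvision-pretrained ResNet-50 stand-in, the grayscale and blur distortions (removing color and texture, respectively), and the cosine distance are assumptions made for this sketch; the paper's experiments rely on the self-supervised checkpoints from the repositories listed above.

import torch
import torch.nn.functional as F
from torchvision import models, transforms
from PIL import Image

# Stand-in backbone (assumption): a torchvision ResNet-50 with the classification
# head replaced by Identity, so it returns 2048-d embeddings like the encoders above.
backbone = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V2)
backbone.fc = torch.nn.Identity()
backbone.eval()

preprocess = transforms.Compose([
    transforms.Resize(256),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

# Illustrative feature removals: grayscaling removes color, heavy blurring removes texture.
remove_color = transforms.Grayscale(num_output_channels=3)
remove_texture = transforms.GaussianBlur(kernel_size=21, sigma=5.0)

def embedding_drift(image: Image.Image, distortion) -> float:
    """Cosine distance between embeddings of the original and the distorted image."""
    with torch.no_grad():
        z_orig = backbone(preprocess(image).unsqueeze(0))
        z_dist = backbone(preprocess(distortion(image)).unsqueeze(0))
    return 1.0 - F.cosine_similarity(z_orig, z_dist).item()

img = Image.open("example.jpg").convert("RGB")  # hypothetical input image
print("color removed:  ", embedding_drift(img, remove_color))
print("texture removed:", embedding_drift(img, remove_texture))

Comparing such drift values across many images and distortions is one way to rank which feature removals push representations away most; the exact distortions and distance used in the paper may differ.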
About this paper
Cite this paper
Zieliński, B., Górszczak, M. (2021). What Pushes Self-supervised Image Representations Away?. In: Mantoro, T., Lee, M., Ayu, M.A., Wong, K.W., Hidayanto, A.N. (eds) Neural Information Processing. ICONIP 2021. Communications in Computer and Information Science, vol 1516. Springer, Cham. https://doi.org/10.1007/978-3-030-92307-5_60
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-92306-8
Online ISBN: 978-3-030-92307-5