Evaluating CNNs on the Gestalt Principle of Closure

Ehrensperger, Gregor; Stabinger, Sebastian; Sánchez, Antonio Rodríguez

doi:10.1007/978-3-030-30487-4_23

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11727))

Included in the following conference series:

International Conference on Artificial Neural Networks

3234 Accesses

Abstract

Deep convolutional neural networks (CNNs) are widely known for their outstanding performance in classification and regression tasks over high-dimensional data. This made them a popular and powerful tool for a large variety of applications in industry and academia. Recent publications show that seemingly easy classification tasks (for humans) can be very challenging for state of the art CNNs. An attempt to describe how humans perceive visual elements is given by the Gestalt principles. In this paper we evaluate AlexNet and GoogLeNet regarding their performance on classifying the correctness of the well known Kanizsa triangles and triangles where sections of the edges were removed. Both types heavily rely on the Gestalt principle of closure. Therefore we created various datasets containing valid as well as invalid variants of the described triangles. Our findings suggest that perceiving objects by utilizing the principle of closure is very challenging for the applied network architectures but they appear to adapt to the effect of closure.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Mixed Evidence for Gestalt Grouping in Deep Neural Networks

Article Open access 10 July 2023

Inception and ResNet: Same Training, Same Features

Neural Networks Trained on Natural Scenes Exhibit Gestalt Closure

Article Open access 09 April 2021

Notes

1.
In our case: Inception v3.
2.
30.000 training, 10.000 validation and 10.000 test images; dimension: $256 \times 256$ px.
3.
To maximize the visual error, the offset is applied in the direction of the connecting line of the other two vertices.
4.
All CNNs were trained using NVIDIA DIGITS https://developer.nvidia.com/digits with the Torch backend and def. settings: fixed learning rate = 0.01, solver = SGD.
5.
While the original MNIST dataset contains 60.000 training images and 10.000 validation images, we moved 10.000 training images to a test image set, and deleted 20.000 of the training images while not changing the distribution of the images among the classes. We did this to be comparable with our own datasets.
6.
Trained multiple times; lowest results are displayed.
7.
In further experiments we also worked with a set where only one of the three vertices was rotated. AlexNet needed at least 109 epochs to be able to classify 95% correctly, while GoogLeNet needed 6 epochs.

References

Kim, B., Reif, E., Wattenberg, M., Bengio, S.: Do neural networks show gestalt phenomena? An exploration of the law of closure. arXiv preprint (2019)
Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems (2012)
Google Scholar
LeCun, Y., et al.: Backpropagation applied to handwritten zip code recognition. Neural Comput. 1(4), 541–551 (1989). https://doi.org/10.1162/neco.1989.1.4.541
Article Google Scholar
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998). https://doi.org/10.1109/5.726791
Article Google Scholar
Stabinger, S., Rodríguez-Sánchez, A., Piater, J.: 25 years of CNNs: can we compare to human abstraction capabilities? In: Villa, A.E.P., Masulli, P., Pons Rivero, A.J. (eds.) ICANN 2016. LNCS, vol. 9887, pp. 380–387. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-44781-0_45
Chapter Google Scholar
Stabinger, S., Rodríguez-Sánchez, A.: Evaluation of deep learning on an abstract image classification dataset. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2767–2772 (2017). https://doi.org/10.1109/iccvw.2017.325
Szegedy, C., et al.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015). https://doi.org/10.1109/cvpr.2015.7298594
Wertheimer, M.: Laws of organization in perceptual forms. In: A Source Book of Gestalt Psychology, pp. 71–88, London (1938). https://doi.org/10.1037/11496-005

Download references

Acknowledgment

We want to thank the anonymous reviewers for their constructive suggestions and helpful comments.

Author information

Authors and Affiliations

University of Innsbruck, Technikerstraße 21a, 6020, Innsbruck, Austria
Gregor Ehrensperger, Sebastian Stabinger & Antonio Rodríguez Sánchez

Authors

Gregor Ehrensperger
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian Stabinger
View author publications
You can also search for this author in PubMed Google Scholar
Antonio Rodríguez Sánchez
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gregor Ehrensperger .

Editor information

Editors and Affiliations

Helmholtz Zentrum München - Deutsches Forschungszentrum für Gesundheit und Umwelt (GmbH), Neuherberg, Germany
Igor V. Tetko
Institute of Computer Science, Czech Academy of Sciences, Prague 8, Czech Republic
Věra Kůrková
Helmholtz Zentrum München - Deutsches Forschungszentrum für Gesundheit und Umwelt (GmbH), Neuherberg, Germany
Pavel Karpov
Helmholtz Zentrum München - Deutsches Forschungszentrum für Gesundheit und Umwelt (GmbH), Neuherberg, Germany
Fabian Theis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ehrensperger, G., Stabinger, S., Sánchez, A.R. (2019). Evaluating CNNs on the Gestalt Principle of Closure. In: Tetko, I., Kůrková, V., Karpov, P., Theis, F. (eds) Artificial Neural Networks and Machine Learning – ICANN 2019: Theoretical Neural Computation. ICANN 2019. Lecture Notes in Computer Science(), vol 11727. Springer, Cham. https://doi.org/10.1007/978-3-030-30487-4_23

Download citation

DOI: https://doi.org/10.1007/978-3-030-30487-4_23
Published: 09 September 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-30486-7
Online ISBN: 978-3-030-30487-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics