SimLoss: Class Similarities in Cross Entropy

Kobs, Konstantin; Steininger, Michael; Zehe, Albin; Lautenschlager, Florian; Hotho, Andreas

doi:10.1007/978-3-030-59491-6_41

Konstantin Kobs¹³,
Michael Steininger¹³,
Albin Zehe¹³,
Florian Lautenschlager¹³ &
…
Andreas Hotho¹³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12117))

Included in the following conference series:

International Symposium on Methodologies for Intelligent Systems

1271 Accesses

Abstract

One common loss function in neural network classification tasks is Categorical Cross Entropy (CCE), which punishes all misclassifications equally. However, classes often have an inherent structure. For instance, classifying an image of a rose as “violet” is better than as “truck”. We introduce SimLoss, a drop-in replacement for CCE that incorporates class similarities along with two techniques to construct such matrices from task-specific knowledge. We test SimLoss on Age Estimation and Image Classification and find that it brings significant improvements over CCE on several metrics. SimLoss therefore allows for explicit modeling of background knowledge by simply exchanging the loss function, while keeping the neural network architecture the same. Code and additional resources are available at https://github.com/konstantinkobs/SimLoss

Roses are red, violets are blue,

both are somehow similar, but the classifier has no clue.

(Common proverb)

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Negative Log Likelihood Ratio Loss for Deep Neural Network Classification

Tackling Algorithmic Bias in Neural-Network Classifiers using Wasserstein-2 Regularization

Article 27 April 2022

Self-Supervised Classification Network

References

Cesa-Bianchi, N., Gentile, C., Zaniboni, L.: Incremental algorithms for hierarchical classification. J. Mach. Learn. Res. 7, 31–54 (2006)
MathSciNet MATH Google Scholar
Frome, A., Corrado, G.S., Shlens, J., Bengio, S., Dean, J., Mikolov, T., et al.: Devise: a deep visual-semantic embedding model. In: NIPS (2013)
Google Scholar
Fu, Y., Huang, T.S.: Human age estimation with regression on discriminative aging manifold. IEEE Trans. Multimed. 10(4), 578–584 (2008)
Article Google Scholar
Guo, G., Mu, G., Fu, Y., Huang, T.S.: Human age estimation using bio-inspired features. In: CVPR. IEEE (2009)
Google Scholar
Izbicki, M., Papalexakis, E.E., Tsotras, V.J.: Exploiting the earth’s spherical geometry to geolocate images. In: Brefeld, U., Fromont, E., Hotho, A., Knobbe, A., Maathuis, M., Robardet, C. (eds.) ECML PKDD 2019, vol. 11907, pp. 3–19. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-46147-8_1
Chapter Google Scholar
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
Krizhevsky, A., Hinton, G.: Learning multiple layers of features from tiny images. Technical report, Citeseer (2009)
Google Scholar
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P., et al.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
Article Google Scholar
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)
Morgan, N., Bourlard, H.: Generalization and parameter estimation in feedforward nets: some experiments. In: NIPS (1990)
Google Scholar
Niu, Z., Zhou, M., Wang, L., Gao, X., Hua, G.: Ordinal regression with multiple output CNN for age estimation. In: CVPR (2016)
Google Scholar
Norouzi, M., et al.: Zero-shot learning by convex combination of semantic embeddings. arXiv preprint arXiv:1312.5650 (2013)
Sukhbaatar, S., Bruna, J., Paluri, M., Bourdev, L., Fergus, R.: Training convolutional networks with noisy labels. arXiv preprint arXiv:1406.2080 (2014)
Wu, C., Tygert, M., LeCun, Y.: Hierarchical loss for classification. arXiv preprint arXiv:1709.01062 (2017)
Zhang, Z., Song, Y., Qi, H.: Age progression/regression by conditional adversarial autoencoder. In: CVPR (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

Julius-Maximilians University Würzburg, Würzburg, Germany
Konstantin Kobs, Michael Steininger, Albin Zehe, Florian Lautenschlager & Andreas Hotho

Authors

Konstantin Kobs
View author publications
You can also search for this author in PubMed Google Scholar
Michael Steininger
View author publications
You can also search for this author in PubMed Google Scholar
Albin Zehe
View author publications
You can also search for this author in PubMed Google Scholar
Florian Lautenschlager
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Hotho
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Konstantin Kobs , Michael Steininger , Albin Zehe , Florian Lautenschlager or Andreas Hotho .

Editor information

Editors and Affiliations

Graz University of Technology, Graz, Austria
Denis Helic
University of Klagenfurt, Klagenfurt, Austria
Gerhard Leitner
Graz University of Technology, Graz, Austria
Martin Stettinger
Graz University of Technology, Graz, Austria
Alexander Felfernig
University of North Carolina at Charlotte, Charlotte, NC, USA
Zbigniew W. Raś

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kobs, K., Steininger, M., Zehe, A., Lautenschlager, F., Hotho, A. (2020). SimLoss: Class Similarities in Cross Entropy. In: Helic, D., Leitner, G., Stettinger, M., Felfernig, A., Raś, Z.W. (eds) Foundations of Intelligent Systems. ISMIS 2020. Lecture Notes in Computer Science(), vol 12117. Springer, Cham. https://doi.org/10.1007/978-3-030-59491-6_41

Download citation

DOI: https://doi.org/10.1007/978-3-030-59491-6_41
Published: 17 September 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-59490-9
Online ISBN: 978-3-030-59491-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

SimLoss: Class Similarities in Cross Entropy

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Negative Log Likelihood Ratio Loss for Deep Neural Network Classification

Tackling Algorithmic Bias in Neural-Network Classifiers using Wasserstein-2 Regularization

Self-Supervised Classification Network

References

Author information

Authors and Affiliations

Corresponding authors

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

SimLoss: Class Similarities in Cross Entropy

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Negative Log Likelihood Ratio Loss for Deep Neural Network Classification

Tackling Algorithmic Bias in Neural-Network Classifiers using Wasserstein-2 Regularization

Self-Supervised Classification Network

References

Author information

Authors and Affiliations

Corresponding authors

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation