Improving discrimination ability of convolutional neural networks by hybrid learning

Kim, In-Jung; Choi, Changbeom; Lee, Sang-Heon

doi:10.1007/s10032-015-0256-9

Improving discrimination ability of convolutional neural networks by hybrid learning

Original Paper
Published: 14 November 2015

Volume 19, pages 1–9, (2016)
Cite this article

International Journal on Document Analysis and Recognition (IJDAR) Aims and scope Submit manuscript

In-Jung Kim¹,
Changbeom Choi² &
Sang-Heon Lee³

846 Accesses
Explore all metrics

Abstract

The discrimination of similar patterns is important because they are the major sources of the classification error. This paper proposes a novel method to improve the discrimination ability of convolutional neural networks (CNNs) by hybrid learning. The proposed method embeds a collection of discriminators as well as a recognizer in a shared CNN. By visualizing contrastive class saliency, we show that learning with embedded discriminators leads the shared CNN to detect and catch the differences among similar classes. Also proposed is a hybrid learning algorithm that learns recognition and discrimination together. The proposed method learns recognition focusing on the differences among similar classes, and thereby improves the discrimination ability of the CNN. Unlike conventional discrimination methods, the proposed method does not require predefined sets of similar classes or additional step to integrate its result with that of the recognizer. In experiments on two handwritten Hangul databases SERI95a and PE92, the proposed method reduced classification error from 2.56 to 2.33, and from 4.04 to 3.66 % respectively. These improvement lead to relative error reduction rates of 8.97 % on SERI95a, and 9.42 % on PE92. Our best results update the state-of-the-art performance which were 4.04 % on SERI95a and 7.08 % on PE92.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Convolutional Neural Network with Discriminant Criterion for Input of Each Neuron in Output Layer

Amended Convolutional Neural Network with Global Average Pooling for Image Classification

Convolutional neural networks for pattern classifying based on parameterized predefined sequence of image filters

Article 13 May 2024

Notes

SERI95a is also known as KU-1.

References

Kim, I.-J., Kim, J.: Pairwise discrimination based on a stroke importance measure. Pattern Recognit. 35(10), 2259–2266 (2002)
Article MATH Google Scholar
Leung, K.C., Leung, C.H.: Recognition of handwritten Chinese characters by critical region analysis. Pattern Recognit. 43(3), 949–961 (2010)
Article MATH Google Scholar
Xu, B., Huang, K., Liu, C.L.: Similar characters recognition by critical region selection based on average symmetric uncertainty. In: Proceedings of 12th ICFHR, Kolkata, India, pp. 527–532 (2010)
Ryu, S.-J., Kim, I.-J.: Discrimination of similar characters using nonlinear normalization based on regional importance measure. Int. J. Doc. Anal. Recognit. (IJDAR) 17(1), 79–89 (2014)
Article Google Scholar
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
Article Google Scholar
Kim, I.-J., Xie, X.: Handwritten Hangul recognition using deep convolutional neural networks. Int. J. Doc. Anal. Recognit. 18(1), 1–13 (2014)
Article Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition, arXiv:1409.1556 (2014)
Yin, F., Wang, Q.F., Zhang, X.X.Y., Liu, C.L.: ICDAR 2013 Chinese handwriting recognition competition. http://www.nlpr.ia.ac.cn/events/CHRcompetition2013/competition/ICDAR%202013%20CHR%20competition.pdf
Kim, D.-I., Kim, S.-Y., Lee, S.-W.: Design and construction of a large-set off-line handwritten hangul character image database KU-1. In: Proceedings of $9{{\rm th}}$ National Conference on Korean Language Information Processing, pp. 152–159 (1997) (in Korean)
Kim, D.H., Bang, S.Y.: An overview of hangul handwritten image database PE92. In: Proceedings of 4${{\rm th}}$ National Conference on Korean Language Information Processing, pp. 152–159 (1992) (in Korean)
Kim, I.-J., Kim, J.: Pairwise discrimination based on a stroke importance measure. Pattern Recognit. 35(10), 2259–2266 (2002)
Goodfellow, I. et al.: Maxout networks, arXiv:1302.4389 (2013)
Lin, M., Chen, Q., Yan, S.: Network in network. In: Proceedings of ICLR (2014)
Szegedy, C. et al.: Going deeper with convolutions, arXiv:1409.4842 (2014)
Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift, arXiv:1502.03167 (2015)
Hinton, G., Vinyals, O., Dean, J.: Distilling the knowledge in a neural network, arXiv:1503.02531 (2015)
Girshick, R. et al.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), IEEE (2014)
Socher, R. et al.: Convolutional-recursive deep learning for 3d object classification. Advances in Neural Information Processing Systems (2012)
Zeiler, M., Fergus, R.: Visualizing and understanding convolutional networks. Compu Vis-ECCV 2014, 818–833 (2014)
Google Scholar
Simonyan, K., Vedaldi, A., Zisserman, A.: Deep inside convolutional networks: visualising image classification models and saliency maps, arXiv:1312.6034 (2013)
Sato, A., Yamada, K.: Generalized learning vector quantization. Adv. Neural Inf. Process. Syst. 15(8), 423–429 (1996)
Google Scholar
Caruana, R.: Multitask learning. Mach. Learn. 28(1), 41–75 (1997)
Article MathSciNet Google Scholar
Bengio, Y. et al.: Curriculum learning. In: Proceedings of 26th Annual International Conference on Machine Learning. ACM, pp. 41–48 (2009)
Simard, P., Steinkraus, D., Platt, J.: Best practices for convolutional neural networks applied to visual document analysis, null. IEEE, p. 958 (2003)
Srivastava, N., et al.: Dropout: a simple way to prevent neural networks from overfitting. J Mach. Learn. Res. 15(1), 1929–1958 (2014)
MathSciNet MATH Google Scholar

Download references

Acknowledgments

This work was supported by the DGIST R&D Program of the Ministry of Science, ICT and Future Planning (15-IT-03).

Author information

Authors and Affiliations

School of CSEE, Handong Global University, Pohang, Korea
In-Jung Kim
School of Creative Convergence Education, Handong Global University, Pohang, Korea
Changbeom Choi
Department of IoT and Robotics Convergence Research, DGIST, Daegu, Korea
Sang-Heon Lee

Authors

In-Jung Kim
View author publications
You can also search for this author inPubMed Google Scholar
Changbeom Choi
View author publications
You can also search for this author inPubMed Google Scholar
Sang-Heon Lee
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to In-Jung Kim.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kim, IJ., Choi, C. & Lee, SH. Improving discrimination ability of convolutional neural networks by hybrid learning. IJDAR 19, 1–9 (2016). https://doi.org/10.1007/s10032-015-0256-9

Download citation

Received: 11 July 2015
Revised: 13 September 2015
Accepted: 28 October 2015
Published: 14 November 2015
Issue Date: March 2016
DOI: https://doi.org/10.1007/s10032-015-0256-9

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Improving discrimination ability of convolutional neural networks by hybrid learning

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Convolutional Neural Network with Discriminant Criterion for Input of Each Neuron in Output Layer

Amended Convolutional Neural Network with Global Average Pooling for Image Classification

Convolutional neural networks for pattern classifying based on parameterized predefined sequence of image filters

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now