Style-Neutralized Pattern Classification Based on Adversarially Trained Upgraded U-Net

Jiang, Haochuan; Huang, Kaizhu; Zhang, Rui; Hussain, Amir

doi:10.1007/s12559-019-09660-0

Style-Neutralized Pattern Classification Based on Adversarially Trained Upgraded U-Net

Published: 07 September 2019

Volume 13, pages 845–858, (2021)
Cite this article

Cognitive Computation Aims and scope Submit manuscript

Haochuan Jiang ORCID: orcid.org/0000-0002-8727-4121¹,
Kaizhu Huang¹,
Rui Zhang² &
…
Amir Hussain³

367 Accesses
5 Citations
Explore all metrics

Abstract

Traditional machine learning approaches usually hold the assumption that data for model training and in real applications are created following the identical and independent distribution (i.i.d.). However, several relevant research topics have demonstrated that such condition may not always describe the real scenarios. One particular case is that the patterns are equipped with diverse and changeable style information. In this paper, a novel classification framework named Style Neutralization Generative Adversarial Classifier (SN-GAC), based on an upgraded U-Net architecture, and trained adversarially with the Generative Adversarial Network (GAN) framework, is introduced to accomplish the classification in such disparate and inconsistent data information case. The generative model in SN-GAC neutralizes style information from the original style-discriminative patterns (style-source) by building the mapping function from them to their style-free counterparts (corresponding standard examples, standard-target). A well-learned generator in the SN-GAC framework is capable of producing the targeted style-neutralized data (generated-target), satisfying the i.i.d. condition. Additionally, SN-GAC is trained adversarially, where an independent discriminator is used to surveil and supervise the training progress of the above-mentioned generator by distinguishing between the real and the generated. Simultaneously, an auxiliary classifier is also embedded in the discriminator to assign the correct class label of both the real and generated data. This process proves effective to aid the generator to produce high-quality human-readable style-neutralized patterns. It will then be further fine-tuned for the sake of promoting the final classification performance. Extensive experiments have adequately demonstrated the effectiveness of the proposed SN-GAC framework: it outperforms several relevant state-of-the-art baselines on two empirical data sets in the non-i.i.d. data classification task.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Style Neutralization Generative Adversarial Classifier

Style-Agnostic Reinforcement Learning

An unsupervised font style transfer model based on generative adversarial networks

Article 15 December 2021

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Notes

The neural network–based SN-GAC can be readily extended to classification with large class numbers.
Although the proposed SN-GAC model is evaluated only with dataset specifying groups of style patterns, it can also be applied in a more generalized way for any style-inconsistent classification situation.
Such style inconsistency can be found when data are created by multiple sources, while each source is generating examples with a special kind of style information. The stylistic tendency caused differs from different sources.
The source code of the SN-GAC model can be referred to via the online Github service: https://github.com/falconjhc/SN-GAC
Paired input is not evaluated for conventional baselines in the “Experiments” section since style-neutralization cannot be achieved with traditional approaches.
As suggested in [16], G is trained once after D-C is learned for five times to guarantee the best Wasserstein distance estimation at the current training progress.
Although there exists a style shift between the cursive characters in testing and the isolated examples in training for a specific writer, as will be demonstrated in the“??” section, the writing style seems to be similar since they are written by the identical individual.
The experiment setting as well as all the experimental results except the proposed SN-GAC model is referred to [2, 3].
The “Heiti” font.

References

Jiang H, Huang K, Zhang R. Field support vector regression. Proceedings of the International Conference on Neural Information Processing. Cham: Springer; 2017.
Huang K, Jiang H, Zhang X. Field support vector machines. Proceedings of the 1st International Conference on Internet of Things and Machine Learning. ACM; 2017.
Huang K, Jiang H, Zhang X. Field support vector machines. IEEE Transactions on Emerging Topics in Computational Intelligence 2017;1.6:454–63.
Article Google Scholar
Zhang X-Y, Huang K, Liu C-L. Pattern field classification with style normalized transformation. Twenty-Second International Joint Conference on Artificial Intelligence; 2011.
Liu Z-Y, Qiao H, Yang X, Hoi SCH. Graph matching by simplified convex-concave relaxation procedure. Int J Comput Vis. 2014;109(3).
Liu Z-Y, Qiao H. GNCCP-Graduated nonconvexityand concavity procedure. IEEE Trans Pattern Anal Mach Intell. 2013;36(6).
Gourier N, Hall D, Crowley JL. Estimating face orientation from robust detection of salient facial features. ICPR International Workshop on Visual Observation of Deictic Gestures; 2004.
Liu C-L, Yin F, Wang D-H, Wang Q-F. CASIA online and offline Chinese handwriting databases. 2011 International Conference on Document Analysis and Recognition. IEEE; 2011.
Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y. 2014. Generative adversarial nets. Advances in Neural Information Processing Systems.
Isola P, Zhu J-Y, Zhou T, Efros AA. Image-to-image translation with conditional adversarial networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition; 2017.
Ronneberger O, Fischer P, Brox T. U-net: convolutional networks for biomedical image segmentation. Proceedings of the International Conference on Medical Image Computing and Computer-assisted Intervention. Cham: Springer; 2015.
Lvmin Z, Ji Y, Lin X, Liu C. Style transfer for anime sketches with enhanced residual u-net and auxiliary classifier gan. 4th IAPR Asian Conference on Pattern Recognition. IEEE; 2017.
Odena A, Olah C, Shlens J. Conditional image synthesis with auxiliary classifier gans. Proceedings of the 34th International Conference on Machine Learning; 2017.
Mirza M, Osindero S. Conditional generative adversarial nets. arXiv:1411.1784.
Salimans T, Goodfellow I, Zaremba W, Cheung V, Radford A, Chen X. Improved techniques for training gans. Advances in Neural Information Processing Systems. 2016.
Gulrajani I, Ahmed F, Arjovsky M, Dumoulin V, Courville AC. Improved training of wasserstein gans. Advances in Neural Information Processing Systems. 2017.
Yoshida Y, Miyato T. 2017. Spectral norm regularization for improving the generalizability of deep learning. arXiv:1705.10941.
Miyato T, Kataoka T, Koyama M, Yoshida Y. Spectral normalization for generative adversarial networks. arXiv:1802.05957. 2018.
Antoniou A, Storkey A, Edwards H. Data augmentation generative adversarial networks. arXiv:1711.04340. 2017.
Shrivastava A, Pfister T, Tuzel O, Susskind J, Wang W, Webb R. Learning from simulated and unsupervised images through adversarial training. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition; 2017.
Wang T-C, Liu M-Y, Zhu J-Y, Tao A, Kautz J, Catanzaro B. High-resolution image synthesis and semantic manipulation with conditional gans. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition; 2018.
Taigman Y, Polyak A, Wolf L. Unsupervised cross-domain image generation. arXiv:1611.02200. 2016.
Zhu J-Y, Park T, Isola P, Efros AA. Unpaired image-to-image translation using cycle-consistent adversarial networks. Proceedings of the IEEE International Conference on Computer Vision; 2017.
Jiang H, Yang G, Huang K, Zhang R. W-Net: one-shot arbitrary-style chinese character generation with deep neural networks. Proceedings of the International Conference on Neural Information Processing; 2018.
Evgeniou T, Pontil M. Regularized multi-task learning. Proceedings of the tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM; 2004.
Sarkar P, Nagy G. Style consistent classification of isogenous patterns. IEEE Trans Pattern Anal Mach Intell 2005;27:1.
Article Google Scholar
Tenenbaum JB, Freeman WT. Separating style and content with bilinear models. Neural Comput 2000;12: 6.
Article Google Scholar
Zhong G, Huang K. 2018. Semi-supervised learning: background, applications and future directions. Nova Science Publishers Inc.
Hsu C-W, Lin C-J. A comparison of methods for multiclass support vector machines. IEEE Trans Neural Netw 2002;2:13.
Google Scholar
He K, et al. Identity mappings in deep residual networks. European conference on computer vision. Cham: Springer; 2016.
Google Scholar
Huang K, Hussain A, Wang Q, Zhang R. Deep learning: fundamentals, theory, and applications. Springer; 2019. ISBN-13: 978-3540794516.
Jiang Y, Lian Z, Tang Y, Xiao J. 2017. DCFOnt: an end-to-end deep chinese font generation system. SIGGRAPH Asia 2017 Technical Briefs. ACM.
Johnson M, Schuster M, Le QV, Krikun M, Wu Y, Chen Z, Thorat N, et al. 2017. Google’s multilingual neural machine translation system: enabling zero-shot translation. Transactions of the Association for Computational Linguistics 5.
Tian Y. 2017. Zi2Zi-Tensorflow. https://kaonashi-tyc.github.io/2017/04/06/zi2zi.html.
Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R. Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res 2014;1:15.
MathSciNet MATH Google Scholar
Cortes C, Vapnik V. Support-vector networks. Mach Learn 1995;20:3.
MATH Google Scholar
Cate H, Dalvi F, Hussain Z. Deepface: face generation using deep learning. Proceedings of the IEEE conference on computer vision and pattern recognition; 2014.
Krizhevsky A, Sutskever I, Hinton GE. Imagenet classification with deep convolutional neural networks. Advances in neural information processing systems; 2012.
Jing X-Y, Wong H-S, Zhang D. Face recognition based on 2D Fisherface approach. Pattern Recogn 2006; 4:39.
MATH Google Scholar
Kimura F, Takashina K, Tsuruoka S, Miyake Y. 1987. Modified quadratic discriminant functions and the application to Chinese character recognition. IEEE Transactions on Pattern Analysis & Machine Intelligence 1.
Abadi M, Barham P, Chen J, Chen Z, Davis A, Dean J, Devin M, et al. Tensorflow: a system for large-scale machine learning. 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16); 2016.
Wall ME, Rechtsteiner A, Rocha LM. Singular value decomposition and principal component analysis. A practical approach to microarray data analysis. Boston: Springer; 2003.
Boser BE, Guyon IM, Vapnik VN. A training algorithm for optimal margin classifiers. Proceedings of the Fifth Annual Workshop on Computational Learning Theory. ACM; 1992.
Internet Archive. GB 2312-1980: information technology—Chinese ideogram coded character set for information interchange (basic set). https://archive.org/details/GB2312-1980/page/n17.

Download references

Acknowledgments

Acknowledgment goes to Ms. Zijun CUI who offered assistance in designing several of the illustrations in this paper.

Funding

The work reported here was partially supported by the following: National Natural Science Foundation of China under grant no. 61876155; Natural Science Fund for Colleges and Universities in Jiangsu Province under grant no. 17KJD520010; Suzhou Science and Technology Program under grant no. SYG2-01712, SZS201613; Jiangsu University Natural Science Research Programme under grant no. 17KJB-520041; Key Program Special Fund in XJTLU (KSF-A-01).

Author information

Authors and Affiliations

Department of EEE, Xi’an Jiaotong - Liverpool University, SIP, Suzhou, 215123, Jiangsu, People’s Republic of China
Haochuan Jiang & Kaizhu Huang
Department of MS, Xi’an Jiaotong - Liverpool University, SIP, Suzhou, 215123, Jiangsu, People’s Republic of China
Rui Zhang
Cyber and Cognitive Big Data Lab, Edinburgh Napier University, Scotland, UK
Amir Hussain

Authors

Haochuan Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Kaizhu Huang
View author publications
You can also search for this author in PubMed Google Scholar
Rui Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Amir Hussain
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kaizhu Huang.

Ethics declarations

Conflict of interests

The authors declare that they have no conflict of interest.

Ethical Approval

This article does not contain any studies with human participants performed by any of the authors.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Jiang, H., Huang, K., Zhang, R. et al. Style-Neutralized Pattern Classification Based on Adversarially Trained Upgraded U-Net. Cogn Comput 13, 845–858 (2021). https://doi.org/10.1007/s12559-019-09660-0

Download citation

Received: 27 February 2019
Accepted: 10 June 2019
Published: 07 September 2019
Issue Date: July 2021
DOI: https://doi.org/10.1007/s12559-019-09660-0

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Style-Neutralized Pattern Classification Based on Adversarially Trained Upgraded U-Net

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Style Neutralization Generative Adversarial Classifier

Style-Agnostic Reinforcement Learning

An unsupervised font style transfer model based on generative adversarial networks

Notes

References

Acknowledgments

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interests

Ethical Approval

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Style-Neutralized Pattern Classification Based on Adversarially Trained Upgraded U-Net

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Style Neutralization Generative Adversarial Classifier

Style-Agnostic Reinforcement Learning

An unsupervised font style transfer model based on generative adversarial networks

Explore related subjects

Notes

References

Acknowledgments

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interests

Ethical Approval

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation