How transfer learning is used in generative models for image classification: improved accuracy

Ebrahimzadeh, Danial; Sharif, Sarah; Banad, Yaser

doi:10.1007/s11760-024-03673-5

Original Paper
Published: 09 December 2024

Volume 19, article number 103 (2025)

Signal, Image and Video Processing

Danial Ebrahimzadeh¹,
Sarah Sharif¹ &
Yaser Banad¹

Abstract

Recent breakthroughs in generative neural networks have enabled transformative capabilities, particularly the generation of novel data such as images. Integrating these models with transfer learning, an increasingly popular technique for effective feature extraction, promises to enhance overall performance. This paper explores the use of generative models in conjunction with transfer learning methods for feature extraction, with a specific focus on image classification tasks. Our investigation examines the effectiveness of using generative models alongside pre-trained models as feature extractors for image classification. To the best of our knowledge, it is the first to link transfer learning and generative models for a discriminative task. The proposed approach is evaluated on two distinct datasets, using specific metrics to gauge the model's performance. The results show an improvement of nearly 10% from integrating generative models, underscoring their potential for higher accuracy in image classification. These findings indicate gains in image classification accuracy over conventional Artificial Neural Network (ANN) models.

Data availability

All data and codes used in this study are available upon reasonable request. Researchers interested in accessing the data and materials should contact the corresponding author for further information.

Acknowledgements

The authors have no acknowledgments to report.

Funding

The authors declare that no financial contributions or grants were received for this study.

Author information

Authors and Affiliations

School of Electrical and Computer Engineering, University of Oklahoma, Norman, OK, 73019, USA
Danial Ebrahimzadeh, Sarah Sharif & Yaser Banad

Contributions

Conceptualization, D.E. and Y.M.B.; methodology, Y.M.B., D.E., and S.S.; software, D.E.; validation, D.E. and Y.M.B.; formal analysis, Y.M.B. and D.E.; resources, S.S., D.E., and Y.M.B.; data curation, Y.M.B. and D.E.; writing (original draft preparation), D.E.; writing (review and editing), D.E. and Y.M.B.; supervision, Y.M.B. and S.S.; project administration, Y.M.B. and S.S.; funding acquisition, Y.M.B. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Yaser Banad.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Consent for publication

We have obtained consent for the publication of this manuscript from all individuals and organizations mentioned in the study, where applicable.

Ethical approval and consent to participate

This research did not involve human participants or animals.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file 1 (pdf 715 KB)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Ebrahimzadeh, D., Sharif, S. & Banad, Y. How transfer learning is used in generative models for image classification: improved accuracy. SIViP 19, 103 (2025). https://doi.org/10.1007/s11760-024-03673-5

Received: 23 January 2024
Revised: 22 September 2024
Accepted: 24 September 2024
Published: 09 December 2024
DOI: https://doi.org/10.1007/s11760-024-03673-5
