Skip to main content
Log in

NucNormZSL: nuclear norm-based domain adaptation in zero-shot learning

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

The ability of human beings to recognize novel concepts has attracted significant attention in the research community. Zero-shot learning, also known as zero-data learning, seeks to build models that can recognize novel class instances even without “seeing” them during their training; some description of novel classes is, however, required. In this work, we pose zero-shot learning as a dictionary learning problem to learn the projection functions from feature to semantic space as dictionaries in the source and target domains. To get a robust projection mapping in the source domain, we introduce nuclear norm to achieve low-rank solutions. Further, this low-ranked dictionary is used as a regularizer in the target domain so that the knowledge contained in the source dictionary is utilized in the target domain. In our experiments, source domain contains the seen class images, their ground truths and attribute representations while corresponding data for unseen class are contained in the target domain. We also use label propagation as an alternative to the nearest neighbor search in the semantic space for class-label assignment. Our proposed model, NucNormZSL, achieves state-of-the-art results for the Large Attribute (LAD) dataset and remains fairly competitive with existing approaches on Animals with Attributes-2 (AWA2), Caltech-UCSD Birds (CUB) and SUN datasets in the conventional setting and generalized setting.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

References

  1. Akata Z, Perronnin F, Harchaoui Z, Schmid C (2015) Label-embedding for image classification. IEEE transLabel-embedding for image classification. IEEE transactions on pattern analysis and machine intelligence 38(7):1425-1438

  2. Alom MZ, Taha TM, Yakopcic C, Westberg S, Sidike P, Nasrin MS, Hasan M, Van Essen BC, Awwal AA, Asari VK (2019) A state-of-the-art survey on deep learning theory and architectures. Electronics 8(3):292

    Article  Google Scholar 

  3. Bhatia N et al (2010) Survey of nearest neighbor techniques. arXiv:1007.0085

  4. Cao W, Zhou C, Wu Y, Ming Z, Xu Z, Zhang J (2020) Research progress of zero-shot learning beyond computer vision. In: International conference on algorithms and architectures for parallel processing, Springer, pp 538–551

  5. Changpinyo S, Chao WL, Gong B, Sha F (2016) Synthesized classifiers for zero-shot learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5327–5336

  6. Chao WL, Changpinyo S, Gong B, Sha F (2016) An empirical study and analysis of generalized zero-shot learning for object recognition in the wild. In: European conference on computer vision, Springer, pp 52–68

  7. Elsken T, Staffler B, Metzen JH, Hutter F (2020) Meta-learning of neural architectures for few-shot learning. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 12365–12375

  8. Farhadi A, Endres I, Hoiem D, Forsyth D (2009) Describing objects by their attributes. In: 2009 IEEE conference on computer vision and pattern recognition, IEEE, pp 1778–1785

  9. Fazel M, Hindi H, Boyd SP (2001) A rank minimization heuristic with application to minimum order system approximation. In: Proceedings of the 2001 American control conference (Cat. No. 01CH37148), IEEE, vol 6, pp 4734–4739

  10. Fu Y, Hospedales TM, Xiang T, Fu Z, Gong S (2014) Transductive multi-view embedding for zero-shot recognition and annotation. In: European conference on computer vision, Springer, pp 584–599

  11. Fu Y, Hospedales TM, Xiang T, Gong S (2015) Transductive multi-view zero-shot learning. IEEE Trans Pattern Anal Mach Intell 37(11):2332–2345

    Article  Google Scholar 

  12. Fu Y, Xiang T, Jiang YG, Xue X, Sigal L, Gong S (2018) Recent advances in zero-shot recognition: toward data-efficient understanding of visual content. IEEE Signal Process Mag 35(1):112–125

    Article  Google Scholar 

  13. Gopalan R, Li R, Chellappa R (2011) Domain adaptation for object recognition: an unsupervised approach. In: 2011 international conference on computer vision, IEEE, pp 999–1006

  14. Hu Z, Nie F, Tian L, Wang R, Li X (2018) A comprehensive survey for low rank regularization. arXiv:1808.04521

  15. Huang H, Wang C, Yu PS, Wang CD (2019) Generative dual adversarial network for generalized zero-shot learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 801–810

  16. Huynh D, Elhamifar E (2020) Fine-grained generalized zero-shot learning via dense attribute-based attention. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 4483–4493

  17. Jayaraman D, Grauman K (2014) Zero-shot recognition with unreliable attributes. In: Advances in neural information processing systems, pp 3464–3472

  18. Ji Z, Cui B, Yu Y, Pang Y, Zhang Z (2021) Zero-shot classification with unseen prototype learning. In: Neural computing and applications, pp 1–11

  19. Jiang H, Wang R, Shan S, Chen X (2019) Transferable contrastive network for generalized zero-shot learning. In: Proceedings of the IEEE international conference on computer vision, pp 9765–9774

  20. Kadam S, Vaidya V (2018) Review and analysis of zero, one and few shot learning approaches. In: International conference on intelligent systems design and applications, Springer, pp 100–112

  21. Kodirov E, Xiang T, Fu Z, Gong S (2015) Unsupervised domain adaptation for zero-shot learning. In: Proceedings of the IEEE international conference on computer vision, pp 2452–2460

  22. Kumar Verma V, Arora G, Mishra A, Rai P (2018) Generalized zero-shot learning via synthesized examples. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4281–4289

  23. Lampert CH, Nickisch H, Harmeling S (2013) Attribute-based classification for zero-shot visual object categorization. IEEE Trans Pattern Anal Mach Intell 36(3):453–465

    Article  Google Scholar 

  24. Latif S, Rana R, Khalifa S, Jurdak R, Qadir J, Schuller BW (2020) Deep representation learning in speech processing: challenges, recent advances, and future trends. arXiv:2001.00378

  25. Li X, Zhang D, Ye M, Li X, Dou Q, Lv Q (2020) Bidirectional generative transductive zero-shot learning. In: Neural computing and applications, pp 1–14

  26. Li Y, Wang D, Hu H, Lin Y, Zhuang Y (2017) Zero-shot recognition using dual visual-semantic mapping paths. arXiv:1703.05002

  27. Liu J, Kuipers B, Savarese S (2011) Recognizing human actions by attributes. In: CVPR 2011, IEEE, pp 3337–3344

  28. Newman M (2005) Power laws, pareto distributions and Zipf’s law. Contemp Phys 46(5):323–351. https://doi.org/10.1080/00107510500052444

  29. Pan SJ, Yang Q (2010) A survey on transfer learning. IEEE Trans Knowl Data Eng 22(10):1345–1359

    Article  Google Scholar 

  30. Parikh D, Grauman K (2011) Relative attributes. In: 2011 International conference on computer vision, IEEE, pp 503–510

  31. Rahman S, Khan S, Porikli F (2018) A unified approach for conventional zero-shot, generalized zero-shot, and few-shot learning. IEEE Trans Image Process 27(11):5652–5667

    Article  MathSciNet  Google Scholar 

  32. Recht B, Fazel M, Parrilo PA (2010) Guaranteed minimum-rank solutions of linear matrix equations via nuclear norm minimization. SIAM Rev 52(3):471–501

    Article  MathSciNet  Google Scholar 

  33. Ribani R, Marengoni M (2019) A survey of transfer learning for convolutional neural networks. In: 2019 32nd SIBGRAPI conference on graphics, patterns and images tutorials (SIBGRAPI-T), IEEE, pp 47–57

  34. Ruder S, Peters ME, Swayamdipta S, Wolf T (2019) Transfer learning in natural language processing. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: tutorials, pp 15–18

  35. Schonfeld E, Ebrahimi S, Sinha S, Darrell T, Akata Z (2019) Generalized zero-and few-shot learning via aligned variational autoencoders. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 8247–8255

  36. Sen PC, Hajra M, Ghosh M (2020) Supervised classification algorithms in machine learning: a survey and review. In: Emerging technology in modelling and graphics, Springer, pp 99–111

  37. Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556

  38. Soni AN (2018) Application and analysis of transfer learning-survey. Int J Sci Res Eng Dev 1(2):272–278

    Google Scholar 

  39. Tan C, Sun F, Kong T, Zhang W, Yang C, Liu C (2018) A survey on deep transfer learning. In: International conference on artificial neural networks, Springer, pp 270–279

  40. van Wieringen WN (2015) Lecture notes on ridge regression. arXiv:1509.09169

  41. Wang Q, Chen K (2017) Zero-shot visual recognition via bidirectional latent embedding. Int J Comput Vis 124(3):356–383

    Article  MathSciNet  Google Scholar 

  42. Wang W, Zheng VW, Yu H, Miao C (2019) A survey of zero-shot learning: settings, methods, and applications. ACM Trans Intell Syst Technol (TIST) 10(2):1–37

    Google Scholar 

  43. Wang Y, Yao Q, Kwok JT, Ni LM (2020) Generalizing from a few examples: a survey on few-shot learning. ACM Comput Surv (CSUR) 53(3):1–34

    Article  Google Scholar 

  44. Wu X, Sahoo D, Hoi SC (2020) Recent advances in deep learning for object detection. Neurocomputing 396:39–64

    Article  Google Scholar 

  45. Xian Y, Schiele B, Akata Z (2017) Zero-shot learning-the good, the bad and the ugly. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4582–4591

  46. Xian Y, Lampert CH, Schiele B, Akata Z (2018) Zero-shot learning—a comprehensive evaluation of the good, the bad and the ugly. IEEE Trans Pattern Anal Mach Intell 41(9):2251–2265

    Article  Google Scholar 

  47. Xiao J, Hays J, Ehinger KA, Oliva A, Torralba A (2010) Sun database: large-scale scene recognition from abbey to zoo. In: 2010 IEEE computer society conference on computer vision and pattern recognition, IEEE, pp 3485–3492

  48. Xiao J, Hays J, Ehinger KA, Oliva A, Torralba A (2010) Sun database: large-scale scene recognition from abbey to zoo. In: 2010 IEEE computer society conference on computer vision and pattern recognition, IEEE, pp 3485–3492

  49. Xu X, Hospedales T, Gong S (2017) Transductive zero-shot action recognition by word-vector embedding. Int J Comput Vis 123(3):309–333

    Article  MathSciNet  Google Scholar 

  50. Xu X, Hospedales T, Gong S (2017) Transductive zero-shot action recognition by word-vector embedding. Int J Comput Vis 123(3):309–333

    Article  MathSciNet  Google Scholar 

  51. Xu X, Shen F, Yang Y, Zhang D, Tao Shen H, Song J (2017) Matrix tri-factorization with manifold regularizations for zero-shot learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3798–3807

  52. Ye M, Guo Y (2017) Zero-shot classification with discriminative semantic representation learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7140–7148

  53. Zha Z, Wen B, Zhang J, Zhou J, Zhu C (2019) A comparative study for the nuclear norms minimization methods. In: 2019 IEEE international conference on image processing (ICIP), IEEE, pp 2050–2054

  54. Zhang J, Li W, Ogunbona P, Xu D (2019) Recent advances in transfer learning for cross-dataset visual recognition: a problem-oriented perspective. ACM Comput Surv (CSUR) 52(1):1–38

    Article  Google Scholar 

  55. Zhang L, Xiang T, Gong S (2017) Learning a deep embedding model for zero-shot learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2021–2030

  56. Zhao B, Fu Y, Liang R, Wu J, Wang Y, Wang Y (2019) A large-scale attribute dataset for zero-shot learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops

  57. Zhu P, Wang H, Saligrama V (2019) Generalized zero-shot recognition based on visually semantic embedding. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2995–3003

  58. Zhu X, Ghahramani Z (2002) Learning from labeled and unlabeled data with label propagation

  59. Zhuang F, Qi Z, Duan K, Xi D, Zhu Y, Zhu H, Xiong H, He Q (2020) A comprehensive survey on transfer learning. In: Proceedings of the IEEE

Download references

Funding

We declare that our work is funded by Indian Institute of Information Technology Allahabad.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Upendra Pratap Singh.

Ethics declarations

Conflict of interest

All authors certify that they have no affiliations with or involvement in any organization or entity with any financial interest or non-financial interest in the subject matter or materials discussed in this manuscript.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Singh, U.P., Singh, K.P. & Thakur, M. NucNormZSL: nuclear norm-based domain adaptation in zero-shot learning. Neural Comput & Applic 34, 2353–2374 (2022). https://doi.org/10.1007/s00521-021-06461-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-021-06461-1

Keywords

Navigation