Abstract
For the zero-shot learning, the rational description and utilization of attributes are valid approaches to build the bridge between training classes and testing classes. In order to improve the descriptive ability of attributes and further construct the appropriate mapping between attributes and features, a novel zero-shot learning method based on weighted reconstruction of hybrid attribute groups (WRHAG) is proposed. First, original semantic attributes are divided into groups by using the hierarchical clustering, and grouped semantic attributes are further enhanced by the broad learning. The semantic attribute groups and enhanced attribute groups together constitute hybrid attribute groups, which effectively improve the attribute description ability. Then, the mutual mapping between attributes and features is obtained by constructing a weighted autoencoder, in which the structured sparse L21 norm and attribute group coefficients are adopted to choose the discriminative attributes and consider the differences between attribute groups. Finally, the zero-shot classification is achieved by calculating the similarity between features of testing sample and predicted class features in the feature space. Comparative experiments on typical CUB dataset demonstrate that the proposed WRHAG model yields better performance in zero-shot image classification.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Xian, Y., Lampert, C.H., Schiele, B., Akata, Z.: Zero-shot learning - a comprehensive evaluation of the good, the bad and the ugly. IEEE Trans. Pattern Anal. Mach. Intell. 41(9), 2251–2265 (2019)
Xie, G., Zhang, Z., Xiong, H., Shao, L., Li, X.: Towards zero-shot learning: a brief review and an attention-based embedding network. IEEE Trans. Circuits Syst. Video Technol. 33(3), 1181–1197 (2023)
Yang, H., et al.: Iterative class prototype calibration for transductive zero-shot learning. IEEE Trans. Circuits Syst. Video Technol. 33(3), 1236–1246 (2023)
Reed, S., Akata, Z., Lee, H., Schiele, B.: Learning deep representations of fine-grained visual descriptions. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 49–58. IEEE, Las Vegas (2016)
Yu, Y., Ji, Z., Guo, G., Zhang, Z.: Zero-shot learning via latent space encoding. IEEE Trans. Cybern. 49(10), 3755–3766 (2019)
Han, H., Jain, A.K., Wang, F., Shan, S., Chen, X.: Heterogeneous face attribute estimation: a deep multi-task learning approach. IEEE Trans. Pattern Anal. Mach. Intell. 40(11), 2597–2609 (2018)
Jayaraman, D., Sha, F., Grauman, K.: Decorrelating semantic visual attributes by resisting the urge to share. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1629–1636. IEEE, Columbus (2014)
Wang, X., Li, Q., Gong, P., Cheng, Y.: Zero-shot learning based on multitask extended attribute groups. IEEE Trans. Syst. Man Cybern. Syst. 51(3), 2003–2011 (2021)
Zhang, J., Wang, X., Cheng, Y.: Broad attribute prediction model with enhanced attribute and feature. IEEE Access 7, 124606–124620 (2019)
Akata, Z., Reed, S., Walter, D., Lee, H., Schiele, B.: Evaluation of output embeddings for fine-grained image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2927–2936. IEEE, Boston (2015)
Kodirov, E., Xiang, T., Gong, S.: Semantic autoencoder for zero-shot learning. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3174–3183. IEEE, Honolulu (2017)
Ranzato, M., Boureau, Y. L., Le, C.Y.: Sparse feature learning for deep belief networks. In: Proceedings of 20th International Conference on Neural Information Processing Systems, pp. 1185–1192. Curran Associates Inc., Vancouver (2009)
Kim, S., Xing, E. P.: Tree-guided group lasso for multi-task regression with structured sparsity. In: Proceedings of 27th International Conference on International Conference on Machine Learning, pp. 543–550. Omnipress, Haifa (2010)
Bartels, R.H., Stewart, G.W.: Solution of the matrix equation ax + xb = c [f4]. Commun. ACM 15(9), 820–826 (1972)
Wah, C., Branson, S., Perona, P., Belongie, S.: Multiclass recognition and part localization with humans in the loop. In: Proceedings of 2011 International Conference on Computer Vision, pp. 2524–2531. IEEE, Barcelona (2011)
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., et al.: Going deeper with convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9. IEEE, Boston (2015)
Lampert, C. H., Nickisch, H., Harmeling, S.: Learning to detect unseen object classes by between-class attribute transfer. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 951–958. IEEE, Miami (2009)
Acknowledgements
This work was supported by the Natural Science Foundation of Jiangsu Higher Education Institutions of China under Grant 21KJB520005; the Jiangsu Normal University Foundation under Grant 21XSRS001; the Natural Science Foundation of Jiangsu Province under Grant BK20200632; and the National Natural Science Foundation of China under Grant 41902273.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Zhang, J., Li, R., Yu, N., Liu, J., Kong, Y. (2023). Zero-Shot Learning Based on Weighted Reconstruction of Hybrid Attribute Groups. In: Huang, DS., Premaratne, P., Jin, B., Qu, B., Jo, KH., Hussain, A. (eds) Advanced Intelligent Computing Technology and Applications. ICIC 2023. Lecture Notes in Computer Science(), vol 14089. Springer, Singapore. https://doi.org/10.1007/978-981-99-4752-2_21
Download citation
DOI: https://doi.org/10.1007/978-981-99-4752-2_21
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-4751-5
Online ISBN: 978-981-99-4752-2
eBook Packages: Computer ScienceComputer Science (R0)