Graph-based variational auto-encoder for generalized zero-shot learning

Published: 03 May 2021


Zero-shot learning has been a prominent research topic in both the vision and language areas. Recently, generative methods, which synthesize samples of unseen categories via generative models, have emerged as a new trend in zero-shot learning. However, the lack of fine-grained information in the synthesized samples makes it difficult to improve classification accuracy. It is also time-consuming and inefficient to synthesize samples and then use them to train classifiers. To address these issues, we propose a novel Graph-based Variational Auto-Encoder for zero-shot learning. Specifically, we adopt a knowledge graph to model the explicit inter-class relationships, and design a fully graph-convolutional auto-encoder framework that generates classifiers from the distribution of the class-level semantic features on individual nodes. The encoder learns the latent representations of individual nodes, and the decoder generates the classifiers from these latent representations. In contrast to sample synthesis, our proposed method directly generates classifiers from the distribution of the class-level semantic features for both seen and unseen categories, which is more straightforward, accurate and computationally efficient. We conduct extensive experiments and evaluate our method on the widely used large-scale ImageNet-21K dataset. Experimental results validate the efficacy of the proposed approach.
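
The encoder-decoder idea described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract gives no layer sizes, normalization, or loss, so everything here is an assumption, including the symmetric adjacency normalization, the single ReLU hidden layer, the reparameterization step, the KL term, and all names (normalize_adjacency, graph_conv, generate_classifiers, params). The weights are random and untrained; the sketch only shows how per-class classifiers could be decoded from latent node representations over a class graph.

```python
import numpy as np

def normalize_adjacency(adj):
    """Symmetrically normalize A + I (an assumed, commonly used choice)."""
    a_hat = adj + np.eye(adj.shape[0])
    deg_inv_sqrt = 1.0 / np.sqrt(a_hat.sum(axis=1))
    return a_hat * deg_inv_sqrt[:, None] * deg_inv_sqrt[None, :]

def graph_conv(a_norm, x, w):
    """One linear graph-convolution step: propagate over the graph, then project."""
    return a_norm @ x @ w

def generate_classifiers(adj, semantics, params, rng):
    """Encode class-level semantic features into latent codes, then decode classifiers."""
    a_norm = normalize_adjacency(adj)
    # Encoder: hidden layer with ReLU, then mean and log-variance per class node.
    hidden = np.maximum(graph_conv(a_norm, semantics, params["w_enc"]), 0.0)
    mu = graph_conv(a_norm, hidden, params["w_mu"])
    log_var = graph_conv(a_norm, hidden, params["w_log_var"])
    # Reparameterization: sample a latent code for every node.
    z = mu + np.exp(0.5 * log_var) * rng.standard_normal(mu.shape)
    # Decoder: map latent codes to one classifier weight vector per class.
    classifiers = graph_conv(a_norm, z, params["w_dec"])
    return classifiers, mu, log_var

def kl_divergence(mu, log_var):
    """Mean KL divergence between N(mu, sigma^2) and N(0, I) over nodes."""
    return -0.5 * np.mean(np.sum(1.0 + log_var - mu**2 - np.exp(log_var), axis=1))

# Hypothetical toy setup: 4 classes connected in a chain, 5-d semantic
# features, 8-d hidden layer, 3-d latent space, 6-d image features.
rng = np.random.default_rng(0)
adj = np.array([[0, 1, 0, 0],
                [1, 0, 1, 0],
                [0, 1, 0, 1],
                [0, 0, 1, 0]], dtype=float)
semantics = rng.standard_normal((4, 5))
params = {
    "w_enc": 0.1 * rng.standard_normal((5, 8)),
    "w_mu": 0.1 * rng.standard_normal((8, 3)),
    "w_log_var": 0.1 * rng.standard_normal((8, 3)),
    "w_dec": 0.1 * rng.standard_normal((3, 6)),
}

classifiers, mu, log_var = generate_classifiers(adj, semantics, params, rng)
kl = kl_divergence(mu, log_var)

# Score two image feature vectors against every class, seen or unseen alike.
image_features = rng.standard_normal((2, 6))
scores = image_features @ classifiers.T
predicted = scores.argmax(axis=1)
```

In this sketch, every class node receives a classifier in a single forward pass, so unseen classes are handled the same way as seen ones and no synthetic samples need to be generated. In a trained model, one would presumably fit the weights on seen classes only, for example by matching decoded classifiers to classifiers learned from seen-class images while penalizing the KL term; that training objective is likewise an assumption here.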


    Author Tags

    1. generalized zero-shot learning
    2. graph-based variational autoencoder
    3. large-scale dataset


    Funding Sources

    • National Natural Science Foundation of China
    • Dongguan Songshan Lake Introduction Program of Leading Innovative and Entrepreneurial Talents
    • Fundamental Research Funds for the Central Universities
    • Sichuan Science and Technology Program, China


