research-article

Category-Stitch Learning for Union Domain Generalization

Published: 05 January 2023

Abstract

Domain generalization aims to generalize a network trained on multiple domains to unknown but related domains. Previous works build relationships across domains under the assumption that different domains share the same classes. However, in realistic scenarios, a change of domain is always accompanied by a change of categories, which makes it difficult to collect sufficient aligned categories across domains. With this in mind, this article introduces union domain generalization (UDG), a new domain generalization scenario in which the label space varies across domains and the categories in unknown domains belong to the union of the categories of all given domains. The absence of categories in given domains is the main obstacle to aligning different domain distributions and obtaining domain-invariant information. To address this problem, we propose category-stitch learning (CSL), which jointly learns the domain-invariant information and completes the missing categories in all domains through an improved variational autoencoder and generators. Domain-invariant information extraction and sample generation promote each other, leading to better generalizability. Additionally, we decouple category and domain information and propose explicitly regularizing the semantic information through the classification loss on transferred samples. Our method can thus break through the category limit and generate samples of the missing categories in each domain. Extensive experiments and visualizations on the MNIST, VLCS, PACS, Office-Home, and DomainNet datasets demonstrate the effectiveness of the proposed method.
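
To make the UDG setting concrete, the following minimal sketch computes the union label space and the categories missing from each given domain. The domain names and label sets are hypothetical, chosen only for illustration; they are not taken from the article's experiments, and the sketch shows only the problem setup, not the CSL method itself.

```python
# Hypothetical source domains and their label sets (illustrative only).
source_labels = {
    "photo": {"dog", "horse", "house"},
    "sketch": {"dog", "guitar"},
    "cartoon": {"horse", "guitar", "person"},
}


def union_label_space(labels_by_domain):
    """Return the union of the categories of all given domains."""
    space = set()
    for labels in labels_by_domain.values():
        space |= labels
    return space


def missing_categories(labels_by_domain):
    """For each domain, return the categories of the union it lacks."""
    space = union_label_space(labels_by_domain)
    return {
        domain: space - labels
        for domain, labels in labels_by_domain.items()
    }


# Under UDG, categories in an unknown domain are assumed to lie in this set.
label_space = union_label_space(source_labels)

# These per-domain gaps are the "missing categories" the abstract refers to.
missing = missing_categories(source_labels)
```

In this toy setup no single domain covers the full label space, which is the situation the abstract describes as the main obstacle to aligning domain distributions.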


    • Published in

      ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 19, Issue 1
      January 2023
      505 pages
      ISSN:1551-6857
      EISSN:1551-6865
      DOI:10.1145/3572858
      • Editor:
      • Abdulmotaleb El Saddik

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 5 January 2023
      • Online AM: 17 March 2022
      • Accepted: 6 March 2022
      • Revised: 3 March 2022
      • Received: 18 June 2021


      Qualifiers

      • research-article
      • Refereed
