Abstract
Unsupervised domain adaptation aims to improve the performance of model generalization on unlabeled target domain by utilizing given labeled training samples from source domain. The optimal transport theory has been widely used to reduce domain discrepancy. However, most existing optimal transport-based approaches inevitably have the pair-wise mismatching problem during alignment of feature distributions. Besides, most current studies tend to focus on the improvement of prediction accuracy while ignoring the uncertainty estimation of noisy training samples, which is particularly important for the learning of transferrable model. To alleviate these issues, we propose a framework, Uncertainty-guided Joint Unbalanced Optimal Transport (UJUOT), which employs a feature uncertainty estimation (FUE) mechanism and an unbalanced optimal transport strategy. FUE encodes uncertainty by modeling each image embedding as a Gaussian distribution, improving representation space with better inter-class separability and intra-class compactness. It not only makes it easier for the domain alignment, but also lets the model more robust to noisy data. In addition, to reduce negative transfer, we design a novel unbalanced optimal transport (UOT) strategy to achieve precise pair-wise matching, which fully utilizes discriminative class-aware information to learn mass adaptively in order to determine best transport plan. To our best knowledge, this is a pioneering work of introducing data uncertainty to unsupervised domain adaptation. Extensive experiments on various standard datasets prove that our proposal can significantly improve transfer performance, outperforming state-of-the-art methods in many aspects.






Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability
The experimental datasets are open source and can be obtained through references.
References
Li M, Zhai Y-M, Luo Y-W, Ge P-F, Ren C-X Enhanced transport distance for unsupervised domain adaptation In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 13936–13944 (2020)
Lee C-Y, Batra T, Baig MH, Ulbricht D Sliced wasserstein discrepancy for unsupervised domain adaptation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 10285–10295 (2019)
Tzeng E, Hoffman J, Zhang N, Saenko K, Darrell T Deep domain confusion: maximizing for domain invariance. arXiv preprint arXiv:1412.3474 (2014)
Sun B, Saenko K Deep coral: Correlation alignment for deep domain adaptation. In: European Conference on Computer Vision, pp. 443–450 (2016) Springer
Zhuang F, Cheng X, Luo P, Pan SJ, He Q (2015) Supervised representation learning: transfer learning with deep autoencoders. In: Twenty-Fourth International Joint Conference on Artificial Intelligence
Ganin Y, Ustinova E, Ajakan H, Germain P, Larochelle H, Laviolette F, Marchand M, Lempitsky V (2016) Domain-adversarial training of neural networks. J Mach Learn Res 17(1):2030–2096
Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2014) Generative adversarial nets. arXiv:1406.2661 (preprint)
Hoffman J, Tzeng E, Park T, Zhu J-Y, Isola P, Saenko K, Efros A, Darrell T (2018) Cycada: Cycle-consistent adversarial domain adaptation. In: International Conference on Machine Learning, PMLR pp 1989–1998
Courty N, Flamary R, Tuia D (2014) Domain adaptation with regularized optimal transport. In: Joint European Conference on Machine Learning and Knowledge Discovery in Databases, Springer pp 274–289
Yan Y, Li W, Wu H, Min H, Tan M, Wu Q (2018) Semi-supervised optimal transport for heterogeneous domain adaptation. IJCAI 7:2969–2975
Courty N, Flamary R, Habrard A, Rakotomamonjy A (2017) Joint distribution optimal transportation for domain adaptation. arXiv preprint arXiv:1705.08848
Courty N, Flamary R, Tuia D, Rakotomamonjy A (2016) Optimal transport for domain adaptation. IEEE Trans Pattern Anal Mach Intell 39(9):1853–1865
Pan SJ, Yang Q (2009) A survey on transfer learning. IEEE Trans Knowl Data Eng 22(10):1345–1359
Ben-David S, Blitzer J, Crammer K, Pereira F et al (2007) Analysis of representations for domain adaptation. Adv Neural Inf Process Syst 19:137
Chen Z, Chen C, Jin X, Liu Y, Cheng Z (2020) Deep joint two-stream wasserstein auto-encoder and selective attention alignment for unsupervised domain adaptation. Neural Comput Appl 32(11):7489–7502
Cheng Z, Chen C, Chen Z, Fang K, Jin X (2021) Robust and high-order correlation alignment for unsupervised domain adaptation. Neural Comput Appl 33(12):6891–6903
Long M, Zhu H, Wang J, Jordan MI (2017) Deep transfer learning with joint adaptation networks. In: International Conference on Machine Learning, PMLR pp 2208–2217
Long M, Cao Y, Wang J, Jordan M (2015) Learning transferable features with deep adaptation networks. In: International Conference on Machine Learning, pp 97–105 PMLR
Kang G, Jiang L, Yang Y, Hauptmann AG (2019) Contrastive adaptation network for unsupervised domain adaptation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 4893–4902
Pan Y, Yao T, Li Y, Wang Y, Ngo C-W, Mei T (2019) Transferrable prototypical networks for unsupervised domain adaptation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 2239–2247
Snell J, Swersky K, Zemel RS (2017) Prototypical networks for few-shot learning. arXiv preprint arXiv:1703.05175
Wang J, Feng W, Chen Y, Yu H, Huang M, Yu PS (2018) Visual domain adaptation with manifold embedded distribution alignment. In: Proceedings of the 26th ACM International Conference on Multimedia, pp 402–410
Wang J, Chen Y, Feng W, Yu H, Huang M, Yang Q (2020) Transfer learning with dynamic distribution adaptation. ACM Transactions Intell Syst Technol (TIST) 11(1):1–25
Tzeng E, Hoffman J, Saenko K, Darrell T (2017) Adversarial discriminative domain adaptation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 7167–7176
Bousmalis K, Silberman N, Dohan D, Erhan D, Krishnan D (2017) Unsupervised pixel-level domain adaptation with generative adversarial networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 3722–3731
Sankaranarayanan S, Balaji Y, Castillo CD, Chellappa R (2018) Generate to adapt: aligning domains using generative adversarial networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 8503–8512
Murez Z, Kolouri S, Kriegman D, Ramamoorthi R, Kim K (2018) Image to image translation for domain adaptation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 4500–4509
Zhang H, Cisse M, Dauphin YN, Lopez-Paz D (2017) mixup: beyond empirical risk minimization. arXiv preprint arXiv:1710.09412
Wu Y, Inkpen D, El-Roby A (2020) Dual mixup regularized learning for adversarial domain adaptation. In: European Conference on Computer Vision, Springer pp 540–555
Monge G Mémoire sur la théorie des déblais et des remblais. Histoire de l’Académie Royale des Sciences de Paris (1781)
Villani C (2009) Optimal transport: old and new. Springer, New York
Perrot M, Courty N, Flamary R, Habrard A (2016) Mapping estimation for discrete optimal transport. Adv Neural Inf Process Syst 29:4197–4205
Damodaran BB, Kellenberger B, Flamary R, Tuia D, Courty N (2018) Deepjdot: deep joint distribution optimal transport for unsupervised domain adaptation. In: Proceedings of the European Conference on Computer Vision (ECCV), pp 447–463
Xu R, Liu P, Wang L, Chen C, Wang J (2020) Reliable weighted optimal transport for unsupervised domain adaptation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 4394–4403
Fatras K, Séjourné T, Flamary R, Courty N (2021) Unbalanced minibatch optimal transport; applications to domain adaptation. In: International Conference on Machine Learning, pp 3186–3197 PMLR
Gal Y, Ghahramani Z (2016) Dropout as a bayesian approximation: Representing model uncertainty in deep learning. In: International Conference on Machine Learning, pp 1050–1059 PMLR
Kendall A, Gal Y (2017) What uncertainties do we need in bayesian deep learning for computer vision? arXiv preprint arXiv:1703.04977
Chang J, Lan Z, Cheng C, Wei Y (2020) Data uncertainty learning in face recognition. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 5710–5719
Yu T, Li D, Yang Y, Hospedales TM, Xiang T (2019) Robust person re-identification by modelling feature uncertainty. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp 552–561
Isobe S, Arai S (2017) Deep convolutional encoder-decoder network with model uncertainty for semantic segmentation. In: 2017 IEEE International Conference on INnovations in Intelligent SysTems and Applications (INISTA), pp 365–370 IEEE
Lee J, Lee G (2020) Model uncertainty for unsupervised domain adaptation. In: 2020 IEEE International Conference on Image Processing (ICIP), pp 1841–1845 IEEE
Chen C, Chen Z, Jiang B, Jin X (2019) Joint domain alignment and discriminative feature learning for unsupervised deep domain adaptation. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol 33, pp 3296–3303
Angenent S, Haker S, Tannenbaum A (2003) Minimizing flows for the Monge-Kantorovich problem. SIAM J Math Anal 35(1):61–97
Dieci L, Walsh JD III (2019) The boundary method for semi-discrete optimal transport partitions and wasserstein distance computation. J Comput Appl Math 353:318–344
Zhan F, Yu Y, Cui K, Zhang G, Lu S, Pan J, Zhang C, Ma F, Xie X, Miao C (2021) Unbalanced feature transport for exemplar-based image translation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 15028–15038
Chizat L, Peyré G, Schmitzer B, Vialard F-X (2018) Scaling algorithms for unbalanced optimal transport problems. Math Comput 87(314):2563–2609
Lu W, Chen Y, Wang J, Qin X (2021) Cross-domain activity recognition via substructural optimal transport. Neurocomputing 454:65–75
Bhatia R, Jain T, Lim Y (2019) On the bures-wasserstein distance between positive definite matrices. Expo Math 37(2):165–191
Chapelle O, Zien A (2005) Semi-supervised classification by low density separation. In: International Workshop on Artificial Intelligence and Statistics, pp 57–64 PMLR
Pham K, Le K, Ho N, Pham T, Bui H (2020) On unbalanced optimal transport: an analysis of sinkhorn algorithm. In: International Conference on Machine Learning, pp 7673–7682 PMLR
Cuturi M (2013) Sinkhorn distances: lightspeed computation of optimal transport. Adv Neural Inf Process Syst 26:2292–2300
Netzer Y, Wang T, Coates A, Bissacco A, Wu B, Ng AY (2011) Reading digits in natural images with unsupervised feature learning
Hull JJ (1994) A database for handwritten text recognition research. IEEE Trans Pattern Anal Mach Intell 16(5):550–554
LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324
Saenko K, Kulis B, Fritz M, Darrell T (2010) Adapting visual category models to new domains. In: European Conference on Computer Vision, Springer pp 213–226
Venkateswara H, Eusebio J, Chakraborty S, Panchanathan S (2017) Deep hashing network for unsupervised domain adaptation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 5018–5027
Peng X, Usman B, Kaushik N, Wang D, Hoffman J, Saenko K (2018) Visda: a synthetic-to-real benchmark for visual domain adaptation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp 2021–2026
Saito K, Watanabe K, Ushiku Y, Harada T (2018) Maximum classifier discrepancy for unsupervised domain adaptation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 3723–3732 (2018)
Chen X, Wang S, Long M, Wang J (2019) Transferability vs. discriminability: batch spectral penalization for adversarial domain adaptation. In: International Conference on Machine Learning, pp 1081–1090 PMLR
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 770–778
van der Maaten L, Hinton G (2008) Visualizing data using t-sne. J Mach Learn Res 9(11)
Acknowledgements
This work is supported by the National Natural Science Foundation of China (No.61975048).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Dan, J., Jin, T., Chi, H. et al. Uncertainty-guided joint unbalanced optimal transport for unsupervised domain adaptation. Neural Comput & Applic 35, 5351–5367 (2023). https://doi.org/10.1007/s00521-022-07976-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-022-07976-x