Domain Adaptation for Learning from Label Proportions Using Domain-Adversarial Neural Network

Li, Xintian; Culotta, Aron

doi:10.1007/s42979-023-02090-8

Domain Adaptation for Learning from Label Proportions Using Domain-Adversarial Neural Network

Original Research
Published: 12 August 2023

Volume 4, article number 615, (2023)
Cite this article

SN Computer Science Aims and scope Submit manuscript

80 Accesses
Explore all metrics

Abstract

Learning from Label Proportions (LLP) is a machine learning problem where the training data are composed of bags of instances, and only the class label proportions for each bag are given. In some domains, we can directly obtain label distributions; for example, one can use census statistics and social media user information grouped by location to build a classifier for user demographics. However, label proportions are unavailable in many domains, such as product review sites. The solution is to modify the model fit on data from where label proportion are available domains (the source domain) to apply to a domain where the label distributions are not available (target domain). Such problems can be regarded as the unsupervised domain adaptation problems in an LLP setting. The goal of this paper is to introduce domain adaptation methods to the original LLP solutions such that the proposed model can classify instances from a new domain. We propose a model combining domain-adversarial neural network (DANN) and label regularization, which can be fit on the source-domain bags and predict labels for target-domain instances. This approach requires only label proportions in the source domain. Our experiments on both synthetic tasks and sentiment classification tasks indicate a noticeable improvement in accuracy as compared to using LLP without domain adaptation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A survey of transfer learning

Article Open access 28 May 2016

A survey on semi-supervised learning

Article Open access 15 November 2019

Density-based weighting for imbalanced regression

Article Open access 07 July 2021

Data availability

No datasets were generated during this study.

References

Ardehaly EM, Culotta A. Inferring latent attributes of twitter users with label regularization. In: Proceedings of the 2015 conference of the north american chapter of the association for computational linguistics: Human language technologies, 2015. p. 185–195.
Ardehaly EM, Culotta A. Domain adaptation for learning from label proportions using self-training. In: IJCAI, 2016. p. 3670–3676.
Ardehaly EM, Culotta A. Co-training for demographic classification using deep learning from label proportions. In: 2017 IEEE International Conference on Data Mining Workshops (ICDMW), IEEE 2017. p. 1017–1024.
Baktashmotlagh M, Harandi MT, Lovell BC, Salzmann M. Unsupervised domain adaptation by domain invariant projection. In: Proceedings of the IEEE International Conference on Computer Vision, 2013. p. 769–776.
Ben-David S, Blitzer J, Crammer K, Kulesza A, Pereira F, Vaughan JW. A theory of learning from different domains. Mach Learn. 2010;79(1):151–75.
Article MathSciNet MATH Google Scholar
Blitzer J, Dredze M, Pereira F. Biographies, bollywood, boom-boxes and blenders: Domain adaptation for sentiment classification. In: Proceedings of the 45th annual meeting of the association of computational linguistics, 2007. p. 440–447.
Borgwardt KM, Gretton A, Rasch MJ, Kriegel HP, Schölkopf B, Smola AJ. Integrating structured biological data by kernel maximum mean discrepancy. Bioinformatics. 2006;22(14):e49–57.
Article Google Scholar
Bortsova G, Dubost F, Ørting S, Katramados I, Hogeweg L, Thomsen L, Wille M, de Bruijne M. Deep learning from label proportions for emphysema quantification. In: Medical Image Computing and Computer Assisted Intervention–MICCAI 2018: 21st International Conference, Granada, Spain, September 16-20, 2018, Proceedings, Part II 11, Springer; 2018. p. 768–776.
Chen T, Yu FX, Chen J, Cui Y, Chen YY, Chang SF. Object-based visual sentiment concept analysis and application. In: Proceedings of the 22nd ACM international conference on Multimedia, 2014; p. 367–376.
Fernando B, Habrard A, Sebban M, Tuytelaars T. Unsupervised visual domain adaptation using subspace alignment. In: Proceedings of the IEEE international conference on computer vision, 2013. p. 2960–2967.
Ganin Y, Ustinova E, Ajakan H, Germain P, Larochelle H, Laviolette F, Marchand M, Lempitsky V. Domain-adversarial training of neural networks. J Mach Learn Res. 2016;17(1):2030–96.
MathSciNet MATH Google Scholar
Gong B, Grauman K, Sha F. Connecting the dots with landmarks: Discriminatively learning domain-invariant features for unsupervised domain adaptation. In: International Conference on Machine Learning, PMLR 2013. p. 222–230.
Goodfellow IJ, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y. Generative adversarial networks. 2014. arXiv preprint arXiv:1406.2661.
Gopalan R, Li R, Chellappa R. Domain adaptation for object recognition: An unsupervised approach. In: 2011 international conference on computer vision, IEEE 2011. p. 999–1006.
Hong W, Wang Z, Yang M, Yuan J. Conditional generative adversarial network for structured domain adaptation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018. p. 1335–1344.
Huang J, Gretton A, Borgwardt K, Schölkopf B, Smola A. Correcting sample selection bias by unlabeled data. Adv Neural Inf Process Syst. 2006;19:601–8.
Google Scholar
Lai KT, Yu FX, Chen MS, Chang SF. Video event detection by inferring temporal instance labels. In: Proceedings of the ieee conference on computer vision and pattern recognition, 2014. p. 2243–2250.
Liu J, Wang B, Qi Z, Tian Y, Shi Y. Learning from label proportions with generative adversarial networks. 2019. arXiv preprint arXiv:1909.02180
Mann GS, McCallum A. Generalized expectation criteria for semi-supervised learning with weakly labeled data. J Mach Learn Res. 2010;11(2):955–84.
Pan SJ, Tsang IW, Kwok JT, Yang Q. Domain adaptation via transfer component analysis. IEEE Trans Neural Netw. 2010;22(2):199–210.
Article Google Scholar
Patrini G, Nock R, Rivera P, Caetano T. (almost) no label no cry. In: Advances in Neural Information Processing Systems, 2014. p. 190–198.
Quadrianto N, Smola AJ, Caetano TS, Le QV. Estimating labels from label proportions. J Mach Learn Res. 2009;10:2349–74.
Rueping S. Svm classifier estimation from group probabilities. In: ICML 2010.
Shi Y, Liu J, Wang B, Qi Z, Tian Y. Deep learning from label proportions with labeled samples. Neural Netw. 2020;128:73–81.
Article Google Scholar
Sun T, Sheldon D, O’Connor B. A probabilistic approach for learning with label proportions applied to the us presidential election. In: 2017 IEEE International Conference on Data Mining (ICDM), IEEE 2017. p. 445–454.
Tzeng E, Hoffman J, Saenko K, Darrell T. Adversarial discriminative domain adaptation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, 2017. p. 7167–7176.
Yelp: Yelp dataset 2019. https://www.kaggle.com/yelp-dataset/yelp-dataset/version/9. [Online; Accessed 26 June 2021].
Yu FX, Choromanski K, Kumar S, Jebara T, Chang SF. On learning from label proportions. 2014. arXiv preprint arXiv:1402.5902.

Download references

Author information

Authors and Affiliations

Department of Computer Science, Tulane University, New Orleans, LA, 70118, USA
Xintian Li & Aron Culotta

Authors

Xintian Li
View author publications
You can also search for this author in PubMed Google Scholar
Aron Culotta
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xintian Li.

Ethics declarations

Conflict of interest

This work was supported by National Science Foundation (1618244), (1927407), (1917112), (2133960).

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Li, X., Culotta, A. Domain Adaptation for Learning from Label Proportions Using Domain-Adversarial Neural Network. SN COMPUT. SCI. 4, 615 (2023). https://doi.org/10.1007/s42979-023-02090-8

Download citation

Received: 10 February 2022
Accepted: 26 June 2023
Published: 12 August 2023
DOI: https://doi.org/10.1007/s42979-023-02090-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Domain Adaptation for Learning from Label Proportions Using Domain-Adversarial Neural Network

Abstract

Access this article

Similar content being viewed by others

A survey of transfer learning

A survey on semi-supervised learning

Density-based weighting for imbalanced regression

Data availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Domain Adaptation for Learning from Label Proportions Using Domain-Adversarial Neural Network

Abstract

Access this article

Similar content being viewed by others

A survey of transfer learning

A survey on semi-supervised learning

Density-based weighting for imbalanced regression

Data availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation