Research Article · CIKM Conference Proceedings
DOI: 10.1145/3340531.3411971

Deep Generative Positive-Unlabeled Learning under Selection Bias

Published: 19 October 2020

Abstract

Learning in the positive-unlabeled (PU) setting is prevalent in real-world applications. Many previous works depend on the Selected Completely At Random (SCAR) assumption to utilize unlabeled data, but SCAR often fails to hold in practice because of selection bias in label observations. This paper presents the first generative PU learning model that does not rely on the SCAR assumption. Specifically, we derive a PU risk function without the SCAR assumption, and we generate a set of virtual PU examples to train the classifier. Although our PU risk function is more general, it requires PU instances that do not exist in the observations. We therefore introduce VAE-PU, a variant of the variational autoencoder that separates two latent variables, one generating features and the other generating observation indicators. The separated latent information enables the model to generate virtual PU instances. We evaluate VAE-PU on benchmark datasets with and without the SCAR assumption. The results indicate that VAE-PU is superior when selection bias exists and remains competitive under the SCAR assumption. Because it models the selection bias explicitly, VAE-PU is also effective when only a few positive-labeled instances are available.
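For context on the risk function the paper generalizes: the standard SCAR-based baseline is the non-negative PU (nnPU) risk estimator of Kiryo et al., which rewrites the negative-class risk in terms of labeled positives and unlabeled data and clips it at zero. The sketch below is that well-known baseline, not the paper's bias-aware risk; the function names and the toy data are illustrative assumptions.

```python
import numpy as np

def sigmoid_loss(z):
    """Surrogate loss l(z) in (0, 1), where z = y * g(x) is the margin."""
    return 1.0 / (1.0 + np.exp(z))

def nnpu_risk(scores_pos, scores_unl, pi):
    """Non-negative PU risk estimate under SCAR (Kiryo et al., 2017).

    scores_pos: classifier outputs g(x) on labeled positives
    scores_unl: classifier outputs g(x) on unlabeled data
    pi: class prior P(y = +1)
    """
    # Positives treated as positives: estimate of E_p[l(g(x))]
    r_p_plus = sigmoid_loss(scores_pos).mean()
    # Positives treated as negatives: estimate of E_p[l(-g(x))]
    r_p_minus = sigmoid_loss(-scores_pos).mean()
    # Unlabeled data treated as negatives: estimate of E_u[l(-g(x))]
    r_u_minus = sigmoid_loss(-scores_unl).mean()
    # Negative-class risk, clipped at zero to prevent it going negative
    r_neg = max(0.0, r_u_minus - pi * r_p_minus)
    return pi * r_p_plus + r_neg

# Toy usage: positives score high; unlabeled data mixes both classes.
rng = np.random.default_rng(0)
pos = rng.normal(2.0, 1.0, 100)
unl = np.concatenate([rng.normal(2.0, 1.0, 50), rng.normal(-2.0, 1.0, 50)])
risk = nnpu_risk(pos, unl, pi=0.5)
```

The decomposition relies on SCAR, i.e. that labeled positives are an unbiased sample of all positives; the paper's contribution is a risk function (and a generative model supplying the required virtual PU instances) that drops exactly this assumption.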

Supplementary Material

MP4 File (3340531.3411971.mp4)
This presentation introduces the CIKM 2020 full research paper, Deep Generative Positive-Unlabeled Learning under Selection Bias. In this paper, we propose a generative positive-unlabeled (PU) learning method, VAE-PU, that does not require the selected completely at random (SCAR) assumption. To this end, we derive the risk function without the SCAR assumption and design a deep generative model that virtually generates the PU instances. Experimental results indicate that VAE-PU is superior when selection bias exists and remains competitive under the SCAR assumption. A generative approach is a natural fit here, because generating the missing PU instances is what makes learning from such biased observations possible.



Published In

CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management
October 2020
3619 pages
ISBN:9781450368599
DOI:10.1145/3340531
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States


Author Tags

  1. positive-unlabeled learning
  2. selection bias
  3. variational autoencoders

Qualifiers

  • Research-article

Conference

CIKM '20

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Article Metrics

  • Downloads (last 12 months): 83
  • Downloads (last 6 weeks): 7
Reflects downloads up to 13 Feb 2025


Cited By

  • (2024) Positive and Unlabeled Learning with Controlled Probability Boundary Fence. Proceedings of the 41st International Conference on Machine Learning, 27641-27652. DOI: 10.5555/3692070.3693176. Online publication date: 21-Jul-2024.
  • (2023) Beyond Myopia. Proceedings of the 37th International Conference on Neural Information Processing Systems, 67589-67602. DOI: 10.5555/3666122.3669077. Online publication date: 10-Dec-2023.
  • (2023) GradPU. Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence and Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence and Thirteenth Symposium on Educational Advances in Artificial Intelligence, 7296-7303. DOI: 10.1609/aaai.v37i6.25889. Online publication date: 7-Feb-2023.
  • (2023) Conditional Generative Positive and Unlabeled Learning. Expert Systems with Applications, 120046. DOI: 10.1016/j.eswa.2023.120046. Online publication date: Apr-2023.
  • (2021) Asymmetric Loss for Positive-Unlabeled Learning. 2021 IEEE International Conference on Multimedia and Expo (ICME), 1-6. DOI: 10.1109/ICME51207.2021.9428350. Online publication date: 5-Jul-2021.
