ABSTRACT
Deep neural networks for medical imaging require large high-quality labelled data, a huge bottleneck for resource poor settings. Given the privacy requirements of medical data, institutes are un-willing to share data, causing an hindrance in resource poor settings. In the present paper, (Camaraderie: Content-based Knowledge Transfer for Medical Image Labelling using Supervised Autoencoders in a Decentralized Setting) we propose to use Discrete Classifier Supervised Autoencoder (DC-SAE) to generate low-dimensional representations of a few annotated images at the Donor client and transfer both the DC-SAE's encoder part and the latent space representations to the Recipient client without sharing raw data. We then pass the unlabelled images of the Recipient Client through this encoder to obtain their latent space representation. In a supervised setting, using latent space representation of Donor client's labelled images, we accurately annotate images of Recipient client. Camaraderie demonstrates that DC-SAE outperforms Recipient end label accuracy beyond classical VAE based classification and anomaly detection based VAE. Thus, given a limited amount of labelled data in a decentralized privacy preserving scenario, one can transfer latent space representation across clients to annotate large number of unlabelled images with high accuracy.
- Muhammad EH Chowdhury, Tawsifur Rahman, Amith Khandakar, Rashid Mazhar, Muhammad Abdul Kadir, Zaid Bin Mahbub, Khandakar Reajul Islam, Muhammad Salman Khan, Atif Iqbal, Nasser Al Emadi, et al. 2020. Can AI help in screening viral and COVID-19 pneumonia? Ieee Access 8 (2020), 132665--132676.Google ScholarCross Ref
- Xintao Duan, Jingjing Liu, and En Zhang. 2019. Efficient image encryption and compression based on a VAE generative model. Journal of Real-Time Image Processing 16 (2019), 765--773.Google ScholarDigital Library
- Daniel S Kermany, Michael Goldbaum,Wenjia Cai, Carolina CS Valentim, Huiying Liang, Sally L Baxter, Alex McKeown, Ge Yang, Xiaokang Wu, Fangbing Yan, et al. 2018. Identifying medical diagnoses and treatable diseases by image-based deep learning. cell 172, 5 (2018), 1122--1131.Google Scholar
- Diederik P Kingma, Max Welling, et al. 2019. An introduction to variational autoencoders. Foundations and Trends® in Machine Learning 12, 4 (2019), 307--392.Google Scholar
- Yann LeCun, Yoshua Bengio, and Geoffrey Hinton. 2015. Deep learning. nature 521, 7553 (2015), 436--444.Google Scholar
- Ruhan Liu, Xiangning Wang, Qiang Wu, Ling Dai, Xi Fang, Tao Yan, Jaemin Son, Shiqi Tang, Jiang Li, Zijian Gao, et al. 2022. Deepdrid: Diabetic retinopathy- grading and image quality estimation challenge. Patterns 3, 6 (2022), 100512.Google ScholarCross Ref
- W Nicholson Price and I Glenn Cohen. 2019. Privacy in the age of medical big data. Nature medicine 25, 1 (2019), 37--43.Google Scholar
- Iqbal H Sarker. 2021. Deep learning: a comprehensive overview on techniques, taxonomy, applications and research directions. SN Computer Science 2, 6 (2021), 420.Google ScholarDigital Library
- Ramzi Snoussi and Habib Youssef. 2023. VAE-Based Latent Representations Learning for Botnet Detection in IoT Networks. Journal of Network and Systems Management 31, 1 (2023), 4.Google ScholarDigital Library
- Yang Zhi-Han. 2022. Training Latent Variable Models with Auto-encoding Variational Bayes: A Tutorial. arXiv preprint arXiv:2208.07818 (2022).Google Scholar
Recommendations
An Approach for Predicting Protein-Protein Interactions using Supervised Autoencoders
AbstractIdentifying protein-protein interactions (PPIs) represents a challenging research problem in computational biology. Even though machine learning methods have significantly advanced the research in this field, current approaches struggle to ...
Set Labelling using Multi-label Classification
iiWAS2018: Proceedings of the 20th International Conference on Information Integration and Web-based Applications & ServicesWe propose the task of set labelling. Starting from some examples members of a set, set labelling tries to infer the most appropriate labels for the given set. For this work, we consider sets of words. We illustrate the task and a possible solution with ...
Sentiment labeling for extending initial labeled data to improve semi-supervised sentiment classification
Semi-supervised framework which exploits unsupervised approach (JST) is proposed.Self-training suffers from incorrectly labeling problem with insufficient data.Confidently predicted instances are labeled and used as training data by JST.Self-training ...
Comments