ABSTRACT
In this paper we propose to use generative adversarial network (GAN) for respiratory sound data augmentation. We present a GAN based approach that requires moderate amount of time and computing resources and capable to greatly increase performance of lung sound classification tasks. We also present a conditioned version of GAN, which is flexible and outperforms competitor augmentation methods. As a result, the GAN based augmentation method is able to boost RNN classifier performance by 10-15
- Antreas Antoniou, Amos Storkey, and Harrison Edwards. 2017. Data augmentation generative adversarial networks. arXiv preprint arXiv:1711.04340(2017).Google Scholar
- Titus Josef Brinker, Achim Hekler, Jochen Sven Utikal, Niels Grabe, Dirk Schadendorf, Joachim Klode, Carola Berking, Theresa Steeb, Alexander H Enk, and Christof von Kalle. 2018. Skin cancer classification using convolutional neural networks: systematic review. Journal of medical Internet research 20, 10 (2018), e11936.Google ScholarCross Ref
- Bert Brunekreef and Stephen T Holgate. 2002. Air pollution and health. The lancet 360, 9341 (2002), 1233–1242.Google Scholar
- Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, and Pieter Abbeel. 2016. Infogan: Interpretable representation learning by information maximizing generative adversarial nets. In Advances in neural information processing systems. 2172–2180.Google Scholar
- Kyunghyun Cho, Bart Van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078(2014).Google Scholar
- Djork-Arné Clevert, Thomas Unterthiner, and Sepp Hochreiter. 2015. Fast and accurate deep network learning by exponential linear units (elus). arXiv preprint arXiv:1511.07289(2015).Google Scholar
- Ian Goodfellow, Yoshua Bengio, Aaron Courville, and Yoshua Bengio. 2016. Deep learning. Vol. 1. MIT press Cambridge.Google Scholar
- Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. In Advances in neural information processing systems. 2672–2680.Google Scholar
- Ishaan Gulrajani, Faruk Ahmed, Martin Arjovsky, Vincent Dumoulin, and Aaron C Courville. 2017. Improved training of wasserstein gans. In Advances in neural information processing systems. 5767–5777.Google Scholar
- Changhee Han, Hideaki Hayashi, Leonardo Rundo, Ryosuke Araki, Wataru Shimoda, Shinichi Muramatsu, Yujiro Furukawa, Giancarlo Mauri, and Hideki Nakayama. 2018. GAN-based synthetic brain MR image generation. In 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018). IEEE, 734–738.Google ScholarCross Ref
- Martin Heusel, Hubert Ramsauer, Thomas Unterthiner, Bernhard Nessler, and Sepp Hochreiter. 2017. Gans trained by a two time-scale update rule converge to a local nash equilibrium. In Advances in neural information processing systems. 6626–6637.Google Scholar
- Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980(2014).Google Scholar
- Kirill Kochetov, Evgeny Putin, Svyatoslav Azizov, Ilya Skorobogatov, and Andrey Filchenkov. 2017. Wheeze detection using convolutional neural networks. In EPIA Conference on Artificial Intelligence. Springer, 162–173.Google ScholarDigital Library
- Kirill Kochetov, Evgeny Putin, Maksim Balashov, Andrey Filchenkov, and Anatoly Shalyto. 2018. Noise masking recurrent neural network for respiratory sound classification. In International Conference on Artificial Neural Networks. Springer, 208–217.Google ScholarCross Ref
- Tuomas Kynkäänniemi, Tero Karras, Samuli Laine, Jaakko Lehtinen, and Timo Aila. 2019. Improved precision and recall metric for assessing generative models. In Advances in Neural Information Processing Systems. 3927–3936.Google Scholar
- Yi Ma, Xinzi Xu, Qing Yu, Yuhang Zhang, Yongfu Li, Jian Zhao, and Guoxing Wang. 2019. LungBRN: A Smart Digital Stethoscope for Detecting Respiratory Disease Using bi-ResNet Deep Learning Algorithm. In 2019 IEEE Biomedical Circuits and Systems Conference (BioCAS). IEEE, 1–4.Google ScholarCross Ref
- Polina Mamoshina, Kirill Kochetov, Franco Cortese, Anna Kovalchuk, Alexander Aliper, Evgeny Putin, Morten Scheibye-Knudsen, Charles R Cantor, Neil M Skjodt, Olga Kovalchuk, 2019. Blood biochemistry analysis to detect smoking status and quantify accelerated aging in smokers. Scientific reports 9, 1 (2019), 1–10.Google Scholar
- Xudong Mao, Qing Li, Haoran Xie, Raymond YK Lau, Zhen Wang, and Stephen Paul Smolley. 2017. Least squares generative adversarial networks. In Proceedings of the IEEE international conference on computer vision. 2794–2802.Google ScholarCross Ref
- Brian McFee, Eric J Humphrey, and Juan Pablo Bello. 2015. A software framework for musical data augmentation.. In ISMIR, Vol. 2015. 248–254.Google Scholar
- Puja Mehta, Daniel F McAuley, Michael Brown, Emilie Sanchez, Rachel S Tattersall, Jessica J Manson, HLH Across Speciality Collaboration, 2020. COVID-19: consider cytokine storm syndromes and immunosuppression. Lancet (London, England) 395, 10229 (2020), 1033.Google Scholar
- Rajkumar Palaniappan, Kenneth Sundaraj, and Sebastian Sundaraj. 2014. A comparative study of the svm and k-nn machine learning algorithms for the diagnosis of respiratory pathologies using pulmonary acoustic signals. BMC bioinformatics 15, 1 (2014), 223.Google ScholarCross Ref
- Daniel S Park, William Chan, Yu Zhang, Chung-Cheng Chiu, Barret Zoph, Ekin D Cubuk, and Quoc V Le. 2019. Specaugment: A simple data augmentation method for automatic speech recognition. arXiv preprint arXiv:1904.08779(2019).Google Scholar
- Evgeny Putin, Arip Asadulaev, Yan Ivanenkov, Vladimir Aladinskiy, Benjamin Sanchez-Lengeling, Alán Aspuru-Guzik, and Alex Zhavoronkov. 2018. Reinforced adversarial neural computer for de novo molecular design. Journal of chemical information and modeling 58, 6 (2018), 1194–1204.Google ScholarCross Ref
- Evgeny Putin, Arip Asadulaev, Quentin Vanhaelen, Yan Ivanenkov, Anastasia V Aladinskaya, Alex Aliper, and Alex Zhavoronkov. 2018. Adversarial threshold neural computer for molecular de novo design. Molecular pharmaceutics 15, 10 (2018), 4386–4397.Google Scholar
- Daniele Ravì, Charence Wong, Fani Deligianni, Melissa Berthelot, Javier Andreu-Perez, Benny Lo, and Guang-Zhong Yang. 2016. Deep learning for health informatics. IEEE journal of biomedical and health informatics 21, 1(2016), 4–21.Google Scholar
- BM Rocha, D Filos, L Mendes, I Vogiatzis, E Perantoni, E Kaimakamis, P Natsiavas, A Oliveira, C Jácome, A Marques, 2017. A respiratory sound database for the development of automated classification. In International Conference on Biomedical and Health Informatics. Springer, 33–37.Google Scholar
- Justin Salamon and Juan Pablo Bello. 2017. Deep convolutional neural networks and data augmentation for environmental sound classification. IEEE Signal Processing Letters 24, 3 (2017), 279–283.Google ScholarCross Ref
- Tim Salimans, Ian Goodfellow, Wojciech Zaremba, Vicki Cheung, Alec Radford, and Xi Chen. 2016. Improved techniques for training gans. Advances in neural information processing systems 29 (2016), 2234–2242.Google Scholar
- Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. 2014. Dropout: a simple way to prevent neural networks from overfitting. The journal of machine learning research 15, 1 (2014), 1929–1958.Google ScholarDigital Library
- Bing Xu, Naiyan Wang, Tianqi Chen, and Mu Li. 2015. Empirical evaluation of rectified activations in convolutional network. arXiv preprint arXiv:1505.00853(2015).Google Scholar
- Fang Zheng, Guoliang Zhang, and Zhanjiang Song. 2001. Comparison of different implementations of MFCC. Journal of Computer science and Technology 16, 6 (2001), 582–589.Google ScholarDigital Library
- Generative Adversarial Networks for Respiratory Sound Augmentation
Recommendations
Graph Convolutional Network Based Generative Adversarial Networks for the Algorithm Selection Problem in Classification
CCRIS '20: Proceedings of the 2020 1st International Conference on Control, Robotics and Intelligent SystemIn this work, we address the algorithm selection problem for classification via meta-learning and generative adversarial networks. We focus on the dataset representation question. The matrix representation of classification dataset is not sensitive to ...
EnvGAN: a GAN-based augmentation to improve environmental sound classification
AbstractSeveral deep learning algorithms have emerged for the automatic classification of environmental sounds. However, the non-availability of adequate labeled data for training limits the performance of these algorithms. Data augmentation is an ...
Unregularized Auto-Encoder with Generative Adversarial Networks for Image Generation
MM '18: Proceedings of the 26th ACM international conference on MultimediaWith the development of deep neural networks, recent years have witnessed the increasing research interest on generative models. Specificly, Variational Auto-Encoders (VAE) and Generative Adversarial Networks (GAN) have achieved impressive results in ...
Comments