research-article

Listen2Cough: Leveraging End-to-End Deep Learning Cough Detection Model to Enhance Lung Health Assessment Using Passively Sensed Audio

Published: 30 March 2021

Abstract

The prevalence of ubiquitous computing creates new opportunities for lung health monitoring and assessment. In the past few years, cough detection from passively sensed audio signals has been studied extensively. However, how well a cough detection model generalizes to external datasets, especially in real-world deployments, remains questionable and has not been adequately explored. Beyond detecting coughs, researchers have examined how cough sounds can be used to assess lung health. However, because it is difficult to collect both cough sounds and ground-truth lung health conditions, previous studies have been hindered by limited datasets. In this paper, we propose Listen2Cough to address these gaps. We first build an end-to-end deep learning architecture using public cough sound datasets to detect coughs within raw audio recordings. We employ a pre-trained MobileNet and integrate a number of augmentation techniques to improve the generalizability of our model. Without additional fine-tuning, our model achieves an F1 score of 0.948 when tested against a new clean dataset and 0.884 on another in-the-wild noisy dataset, an average advantage of 5.8% and 8.4%, respectively, over the best baseline model. Then, to mitigate the issue of limited lung health data, we propose to transform the cough detection task into lung health assessment tasks so that the rich cough data can be leveraged. Our hypothesis is that these tasks extract and use similar effective representations from cough sounds. We embed the cough detection model into a multi-instance learning framework with an attention mechanism and further tune the model for lung health assessment tasks. Our final model achieves an F1 score of 0.912 on healthy vs. unhealthy, 0.870 on obstructive vs. non-obstructive, and 0.813 on COPD vs. asthma classification, outperforming the baseline by 10.7%, 6.3%, and 3.7%, respectively. Moreover, the weight values in the attention layer can be used to identify important coughs that are highly correlated with lung health, which could provide interpretability for expert diagnosis in the future.
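
The attention-based multi-instance pooling described in the abstract can be sketched as follows. This is a minimal illustration of one common formulation (softmax attention over tanh-scored instance embeddings), not the architecture used in the paper. The function name `attention_mil_pool`, the parameters `V` and `w`, the dimensions, and the random inputs are all assumptions made for this example. One plausible reading is that each instance is the embedding of a single detected cough and a bag groups the coughs of one subject.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_mil_pool(instances, V, w):
    """Pool a bag of instance embeddings into one bag embedding.

    instances: list of n embeddings, each a list of d floats
    V:         hypothetical projection matrix, h rows of d floats
    w:         hypothetical scoring vector of h floats
    Returns (bag_embedding, attention_weights).
    """
    scores = []
    for emb in instances:
        # Hidden representation: tanh(V @ emb)
        hidden = [math.tanh(sum(v * x for v, x in zip(row, emb))) for row in V]
        # Scalar attention score: w . hidden
        scores.append(sum(wi * hi for wi, hi in zip(w, hidden)))
    weights = softmax(scores)
    dim = len(instances[0])
    # Bag embedding: attention-weighted sum of the instance embeddings
    bag = [sum(a * emb[j] for a, emb in zip(weights, instances)) for j in range(dim)]
    return bag, weights


# Toy data: 5 "coughs" with 8-dimensional embeddings (random placeholders).
rng = random.Random(0)
n_coughs, embed_dim, hidden_dim = 5, 8, 4
cough_embeddings = [[rng.gauss(0, 1) for _ in range(embed_dim)] for _ in range(n_coughs)]
V = [[rng.gauss(0, 1) for _ in range(embed_dim)] for _ in range(hidden_dim)]
w = [rng.gauss(0, 1) for _ in range(hidden_dim)]

bag_embedding, attention = attention_mil_pool(cough_embeddings, V, w)
# Index of the instance that received the largest attention weight.
most_important = max(range(n_coughs), key=lambda i: attention[i])
```

The attention weights sum to one across the bag, so the index `most_important` points at the instance that contributed most to the bag embedding. In a trained model, `V` and `w` would be learned jointly with the classifier rather than drawn at random.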

      Published In

      Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies  Volume 5, Issue 1
      March 2021
      1272 pages
      EISSN:2474-9567
      DOI:10.1145/3459088
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Author Tags

      1. Cough detection
      2. Lung health assessment
      3. Multi-instance learning

      Qualifiers

      • Research-article
      • Research
      • Refereed
