A Convolutional Neural Network for Real Time Classification, Identification, and Labelling of Vocal Cord and Tracheal Using Laryngoscopy and Bronchoscopy Video

Matava, Clyde; Pankiv, Evelina; Raisbeck, Sam; Caldeira, Monica; Alam, Fahad

doi:10.1007/s10916-019-1481-4

Image & Signal Processing
Published: 02 January 2020

Volume 44, article number 44, (2020)
Journal of Medical Systems

Clyde Matava ORCID: orcid.org/0000-0002-9502-0981^1,2,3,
Evelina Pankiv^1,3,
Sam Raisbeck^1,2,
Monica Caldeira^1,2 &
…
Fahad Alam^2,3,4

1473 Accesses
37 Citations
6 Altmetric

Abstract

Background

The use of artificial intelligence, including machine learning, is increasing in medicine, particularly for the prediction of patient outcomes. Machine learning may also be able to enhance and augment clinical procedures in anesthesia, such as airway management. In this study, we sought to develop a machine learning algorithm that could classify vocal cords and tracheal airway anatomy in real time during video laryngoscopy or bronchoscopy, and to compare the performance of three novel convolutional networks for detecting vocal cords and tracheal rings.

Methods

Following institutional approval, we used a clinical dataset of 775 video laryngoscopy and bronchoscopy videos. The dataset was divided into two categories, for training and testing. We used three convolutional neural networks (CNNs): ResNet, Inception and MobileNet. Backpropagation and a mean squared error loss function were used to assess accuracy as well as minimize bias and variance. Following training, we assessed transferability using the generalization error of the CNN, sensitivity and specificity, average confidence error, outliers, overall confidence percentage, and frames per second for live video feeds. After training was complete, 22 models using 0 to 25,000 steps were generated and compared.
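The evaluation metrics named above can be illustrated with a minimal Python sketch. It uses the standard textbook definitions of sensitivity, specificity, and mean squared error; the abstract does not state how the authors computed them, so the `ConfusionCounts` container, the function names, and the example counts are all hypothetical and are not taken from the study.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class ConfusionCounts:
    """Frame-level counts for one class (hypothetical container, not from the paper)."""
    tp: int  # structure present and detected
    fp: int  # structure absent but detected
    tn: int  # structure absent and not detected
    fn: int  # structure present but missed


def sensitivity(c: ConfusionCounts) -> float:
    """True positive rate: TP / (TP + FN)."""
    return c.tp / (c.tp + c.fn)


def specificity(c: ConfusionCounts) -> float:
    """True negative rate: TN / (TN + FP)."""
    return c.tn / (c.tn + c.fp)


def mean_squared_error(predicted: Sequence[float], target: Sequence[float]) -> float:
    """Average of squared differences between predictions and targets."""
    if len(predicted) != len(target) or len(predicted) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((p - t) ** 2 for p, t in zip(predicted, target)) / len(predicted)


# Illustrative counts only; they are NOT results from the study.
example = ConfusionCounts(tp=90, fp=5, tn=95, fn=10)
print(f"sensitivity = {sensitivity(example):.3f}")
print(f"specificity = {specificity(example):.3f}")
print(f"MSE = {mean_squared_error([1.0, 0.0], [0.0, 0.0]):.3f}")
```

Keeping the four counts together in one record makes it harder to mix up false positives and false negatives when both metrics are reported side by side.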

Results

The overall confidence of classification for the ResNet, Inception and MobileNet CNNs was 0.84, 0.78, and 0.64 for vocal cords, respectively, and 0.69, 0.72, and 0.54 for tracheal rings, respectively. Transfer learning following additional training improved the accuracy of ResNet and Inception for identifying the vocal cords (with a confidence of 0.96 and 0.93, respectively). The two best-performing CNNs, ResNet and Inception, achieved a specificity of 0.985 and 0.971, respectively, and a sensitivity of 0.865 and 0.892, respectively. Inception was able to process the live video feeds at 10 FPS, while ResNet processed them at 5 FPS. Both passed a feasibility test of identifying vocal cords and tracheal rings in a video feed.
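To put the reported frame rates and confidence values in context, the short Python sketch below converts frames per second into an average time per frame and picks the model with the highest reported confidence for each structure. The figures are copied from the Results paragraph; the dictionary layout and function names are hypothetical, the latency conversion assumes frames are processed one after another, and no frame rate is reported for MobileNet, so it is left as `None`.

```python
from typing import Dict, Optional

# Values as reported in the Results paragraph (before additional training).
# MobileNet's frame rate is not reported in the abstract, hence None.
REPORTED: Dict[str, Dict[str, Optional[float]]] = {
    "ResNet": {"vocal_cords": 0.84, "tracheal_rings": 0.69, "fps": 5.0},
    "Inception": {"vocal_cords": 0.78, "tracheal_rings": 0.72, "fps": 10.0},
    "MobileNet": {"vocal_cords": 0.64, "tracheal_rings": 0.54, "fps": None},
}


def frame_latency_ms(fps: float) -> float:
    """Average time per frame in milliseconds, assuming sequential processing."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return 1000.0 / fps


def best_model(structure: str) -> str:
    """Name of the model with the highest reported confidence for a structure."""
    return max(REPORTED, key=lambda name: REPORTED[name][structure])


for name, values in REPORTED.items():
    fps = values["fps"]
    latency = f"{frame_latency_ms(fps):.0f} ms/frame" if fps is not None else "not reported"
    print(f"{name}: {latency}")

print("Highest vocal cord confidence:", best_model("vocal_cords"))
print("Highest tracheal ring confidence:", best_model("tracheal_rings"))
```

Under these assumptions, the faster of the two reported frame rates corresponds to half the per-frame time of the slower one, which is the trade-off against the confidence figures that the comparison highlights.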

Conclusions

We report the development and evaluation of a CNN that can identify and classify airway anatomy in real time. This neural network demonstrates high performance. The availability of artificial intelligence may improve airway management and bronchoscopy by helping to identify key anatomy in real time, thus potentially improving performance and outcomes during these procedures. Further, this technology may theoretically be extended to the settings of airway pathology or airway management in the hands of experienced providers. The researchers in this study are exploring the performance of this neural network in clinical trials.

Funding

This study was funded by departmental funds.

Author information

Authors and Affiliations

Department of Anesthesia and Pain Medicine, The Hospital for Sick Children, Toronto 555 University Avenue, Toronto, ON, M5G 1X8, Canada
Clyde Matava, Evelina Pankiv, Sam Raisbeck & Monica Caldeira
Collaborative Human Immersive Interactive (CHISIL) Laboratory, The Hospital for Sick Children Toronto and Sunnybrook Health Sciences, Toronto, Ontario, Canada
Clyde Matava, Sam Raisbeck, Monica Caldeira & Fahad Alam
Department of Anesthesia, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
Clyde Matava, Evelina Pankiv & Fahad Alam
Department of Anesthesia, Sunnybrook Health Sciences, Toronto, Ontario, Canada
Fahad Alam

Corresponding author

Correspondence to Clyde Matava.

Ethics declarations

Conflict of Interest

Clyde Matava declares that he has no conflict of interest. Evelina Pankiv declares that she has no conflict of interest. Sam Raisbeck declares that he has no conflict of interest. Monica Caldeira declares that she has no conflict of interest. Fahad Alam declares that he has no conflict of interest.

Ethical Approval

All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

What is already known?

• Failure to classify vocal cord and tracheal ring anatomy during laryngoscopy and bronchoscopy can lead to adverse events.

• Machine learning and artificial intelligence may be able to play a role in real-time clinical decision making.

What is new?

• We report the successful development and testing of a convolutional neural network that can classify, identify, and label vocal cords and tracheal rings from live video.

This article is part of the Topical Collection on Image & Signal Processing

Electronic supplementary material

ESM 1

(MP4 41816 kb)

About this article

Cite this article

Matava, C., Pankiv, E., Raisbeck, S. et al. A Convolutional Neural Network for Real Time Classification, Identification, and Labelling of Vocal Cord and Tracheal Using Laryngoscopy and Bronchoscopy Video. J Med Syst 44, 44 (2020). https://doi.org/10.1007/s10916-019-1481-4

Received: 24 July 2019
Accepted: 11 October 2019
Published: 02 January 2020
DOI: https://doi.org/10.1007/s10916-019-1481-4
