Towards an Efficient Way of Building Annotated Medical Image Collections for Big Data Studies

Gur, Yaniv; Moradi, Mehdi; Bulu, Hakan; Guo, Yufan; Compas, Colin; Syeda-Mahmood, Tanveer

doi:10.1007/978-3-319-67534-3_10

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 10552)

Abstract

Annotating large collections of medical images is essential for building robust image analysis pipelines for applications such as disease detection. This process requires expert input, which is costly and time-consuming. Semiautomatic labeling and expert sourcing can speed up the process of building such collections. In this work we report innovations in both of these areas. First, we have developed an algorithm inspired by active learning and self-training that significantly reduces the number of annotated training images needed for a classifier to reach a given level of accuracy. It is an iterative process of labeling, training a classifier, and testing that starts from a small set of labeled images and is complemented by human labeling of difficult test cases at each iteration. Second, we have built a platform for large-scale management and indexing of data and users, as well as for creating and assigning tasks such as labeling and contouring for big data medical imaging studies. The platform is web-based and provides tooling for both researchers and annotators within a simple dynamic user interface. It also streamlines iterative training and labeling in algorithms such as the active learning/self-training method described here. In this paper, we demonstrate that the combination of the platform and the proposed algorithm significantly reduces the workload involved in building a large collection of labeled cardiac echo images.
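
The iterative procedure outlined in the abstract (train on the labeled set, test on the unlabeled pool, keep confident predictions, and send difficult cases to a human) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the nearest-centroid classifier, the margin-based confidence score, the `confident` threshold, the per-round `budget`, and the `oracle` callback standing in for the expert annotator are all hypothetical choices made for the example.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Vector = Sequence[float]
Sample = Tuple[Vector, int]


def train_centroids(samples: List[Sample]) -> Dict[int, List[float]]:
    """Toy classifier (an assumption): one mean feature vector per label."""
    sums: Dict[int, List[float]] = {}
    counts: Dict[int, int] = {}
    for x, y in samples:
        acc = sums.setdefault(y, [0.0] * len(x))
        for i, value in enumerate(x):
            acc[i] += value
        counts[y] = counts.get(y, 0) + 1
    return {y: [v / counts[y] for v in acc] for y, acc in sums.items()}


def predict(model: Dict[int, List[float]], x: Vector) -> Tuple[int, float]:
    """Return (label, margin). The margin is the distance gap between the
    two nearest centroids, used here as a crude confidence score."""
    dists = sorted(
        (sum((a - b) ** 2 for a, b in zip(x, c)) ** 0.5, y)
        for y, c in model.items()
    )
    margin = dists[1][0] - dists[0][0] if len(dists) > 1 else float("inf")
    return dists[0][1], margin


def build_labels(
    seed: List[Sample],
    pool: List[Vector],
    oracle: Callable[[Vector], int],
    confident: float = 1.0,
    budget: int = 5,
) -> Tuple[List[Sample], int]:
    """Label `pool` iteratively; return (labeled samples, human queries)."""
    if budget < 1:
        raise ValueError("budget must be at least 1 so every round makes progress")
    labeled = list(seed)
    unlabeled = list(pool)
    queries = 0
    while unlabeled:
        model = train_centroids(labeled)  # retrain on everything labeled so far
        # Score the pool and visit the least confident items first.
        scored = sorted(((predict(model, x), x) for x in unlabeled),
                        key=lambda item: item[0][1])
        unlabeled = []
        asked = 0
        for (label, margin), x in scored:
            if margin >= confident:
                labeled.append((x, label))      # self-training: trust the model
            elif asked < budget:
                labeled.append((x, oracle(x)))  # active learning: ask the human
                asked += 1
            else:
                unlabeled.append(x)             # defer to the next round
        queries += asked
    return labeled, queries


# Toy usage: 1-D "features"; the oracle plays the role of the expert.
seed = [((-3.0,), 0), ((3.0,), 1)]
pool = [(-4.0,), (-3.0,), (-2.0,), (-0.2,), (0.3,), (2.0,), (3.0,), (4.0,)]
labeled, queries = build_labels(seed, pool, oracle=lambda x: int(x[0] > 0),
                                confident=1.0, budget=1)
print(f"{queries} of {len(pool)} pool items needed a human label")
```

On this toy pool only the two points near the class boundary are routed to the oracle; the other six are labeled by the model. In a real study the classifier, confidence measure, and thresholds would need to be chosen and validated for the imaging task at hand.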

Author information

Authors and Affiliations

IBM Almaden Research Center, San Jose, CA, 95120, USA
Yaniv Gur, Mehdi Moradi, Hakan Bulu, Yufan Guo, Colin Compas & Tanveer Syeda-Mahmood

Corresponding author

Correspondence to Yaniv Gur.

Editor information

Editors and Affiliations

University College London, London, United Kingdom
M. Jorge Cardoso
McGill University, Montreal, Québec, Canada
Tal Arbel
Imperial College London, London, United Kingdom
Su-Lin Lee
Eindhoven University of Technology, Eindhoven, The Netherlands
Veronika Cheplygina
University of Barcelona, Barcelona, Spain
Simone Balocco
Technical University of Munich, Garching, Germany
Diana Mateus
Nara Institute of Science and Technology, Nara, Japan
Guillaume Zahnd
DKFZ, Heidelberg, Germany
Lena Maier-Hein
Technical University Munich, Munich, Germany
Stefanie Demirci
École de Technologie Supérieure, Montreal, Québec, Canada
Eric Granger
École de Technologie Supérieure, Montreal, Canada
Luc Duong
École de Technologie Supérieure, Montreal, Québec, Canada
Marc-André Carbonneau
Technical University Munich, Munich, Germany
Shadi Albarqouni
University of Adelaide, Adelaide, South Australia, Australia
Gustavo Carneiro

About this paper

Cite this paper

Gur, Y., Moradi, M., Bulu, H., Guo, Y., Compas, C., Syeda-Mahmood, T. (2017). Towards an Efficient Way of Building Annotated Medical Image Collections for Big Data Studies. In: Cardoso, M., et al. Intravascular Imaging and Computer Assisted Stenting, and Large-Scale Annotation of Biomedical Data and Expert Label Synthesis. LABELS 2017, STENT 2017, CVII 2017. Lecture Notes in Computer Science, vol 10552. Springer, Cham. https://doi.org/10.1007/978-3-319-67534-3_10

DOI: https://doi.org/10.1007/978-3-319-67534-3_10
Published: 08 September 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-67533-6
Online ISBN: 978-3-319-67534-3
eBook Packages: Computer Science, Computer Science (R0)
