How Inter-rater Variability Relates to Aleatoric and Epistemic Uncertainty: A Case Study with Deep Learning-Based Paraspinal Muscle Segmentation

Roshanzamir, Parinaz; Rivaz, Hassan; Ahn, Joshua; Mirza, Hamza; Naghdi, Neda; Anstruther, Meagan; Battié, Michele C.; Fortin, Maryse; Xiao, Yiming

doi:10.1007/978-3-031-44336-7_8


Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14291)

Included in the following conference series:

International Workshop on Uncertainty for Safe Utilization of Machine Learning in Medical Imaging


Abstract

Recent developments in deep learning (DL) techniques have led to substantial performance improvements in medical image segmentation tasks, particularly with Transformer models and their variants. While labels obtained by fusing multi-rater manual segmentations are often employed as ideal ground truths in DL model training, inter-rater variability due to factors such as training bias, image noise, and extreme anatomical variability can still affect the performance and uncertainty of the resulting algorithms. Understanding how inter-rater variability affects the reliability of the resulting DL algorithms, a key element in clinical deployment, can help inform better training data construction and DL models, but this question has not been explored extensively. In this paper, we measure aleatoric and epistemic uncertainties using test-time augmentation (TTA), test-time dropout (TTD), and deep ensembles to explore their relationship with inter-rater variability. Furthermore, we compare UNet and TransUNet to study the impact of Transformers on model uncertainty with two label fusion strategies. We conduct a case study using multi-class paraspinal muscle segmentation from T2-weighted (T2w) MRIs. Our study reveals the interplay between inter-rater variability and uncertainties, which is affected by the choice of label fusion strategy and DL model.
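As a rough illustration of how sampling-based methods such as TTA, TTD, and deep ensembles can yield a per-pixel uncertainty map, the sketch below averages several softmax predictions and takes the entropy of the mean. This is not taken from the paper: the abstract does not state which uncertainty metric is used, so predictive entropy is an assumption here, and `predictive_entropy`, `tta_samples`, and `toy_predict` are hypothetical names, with `toy_predict` standing in for a trained segmentation network.

```python
import numpy as np

def predictive_entropy(prob_samples, eps=1e-12):
    """Entropy of the mean prediction.

    prob_samples: array of shape (T, C, H, W) holding softmax outputs from
    T stochastic passes (augmented inputs, dropout masks, or ensemble members).
    Returns an (H, W) uncertainty map.
    """
    mean_probs = prob_samples.mean(axis=0)  # (C, H, W)
    return -(mean_probs * np.log(mean_probs + eps)).sum(axis=0)

def tta_samples(predict_fn, image):
    """Assumed TTA scheme using a single horizontal flip.

    predict_fn maps an (H, W) image to (C, H, W) class probabilities.
    """
    plain = predict_fn(image)
    # Predict on the flipped image, then flip the output back to align it.
    flipped = predict_fn(image[:, ::-1])[:, :, ::-1]
    return np.stack([plain, flipped])  # (T=2, C, H, W)

def toy_predict(image):
    """Hypothetical stand-in for a network: two-class pixel-wise softmax."""
    logits = np.stack([image, 1.0 - image])
    exp = np.exp(logits - logits.max(axis=0, keepdims=True))
    return exp / exp.sum(axis=0, keepdims=True)

rng = np.random.default_rng(0)
image = rng.random((4, 4))
samples = tta_samples(toy_predict, image)
entropy_map = predictive_entropy(samples)  # (4, 4) map, higher = less certain
```

The same `predictive_entropy` function could be fed samples from dropout-enabled forward passes or from independently trained ensemble members; only the way the `T` predictions are produced would change.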



Acknowledgment

We acknowledge the support of the Natural Sciences and Engineering Research Council of Canada (NSERC), and we thank NVIDIA for the donation of the GPU.

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, Concordia University, Montreal, Canada
Parinaz Roshanzamir & Hassan Rivaz
Faculty of Health Sciences, Western University, London, Canada
Joshua Ahn & Hamza Mirza
Health, Kinesiology, and Applied Physiology, Concordia University, Montreal, Canada
Neda Naghdi, Meagan Anstruther & Maryse Fortin
School of Physical Therapy and Western’s Bone and Joint Institute, Western University, London, Canada
Michele C. Battié
Department of Computer Science and Software Engineering, Concordia University, Montreal, Canada
Yiming Xiao


Corresponding author

Correspondence to Parinaz Roshanzamir.

Editor information

Editors and Affiliations

University College London, London, UK
Carole H. Sudre
University of Tübingen, Tübingen, Germany
Christian F. Baumgartner
Harvard Medical School, Charlestown, MA, USA
Adrian Dalca
McGill University, Montreal, QC, Canada
Raghav Mehta
Imperial College London, London, UK
Chen Qin
Department of Radiology, Brigham and Women’s Hospital, Boston, USA
William M. Wells


About this paper

Cite this paper

Roshanzamir, P. et al. (2023). How Inter-rater Variability Relates to Aleatoric and Epistemic Uncertainty: A Case Study with Deep Learning-Based Paraspinal Muscle Segmentation. In: Sudre, C.H., Baumgartner, C.F., Dalca, A., Mehta, R., Qin, C., Wells, W.M. (eds) Uncertainty for Safe Utilization of Machine Learning in Medical Imaging. UNSURE 2023. Lecture Notes in Computer Science, vol 14291. Springer, Cham. https://doi.org/10.1007/978-3-031-44336-7_8


DOI: https://doi.org/10.1007/978-3-031-44336-7_8
Published: 07 October 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-44335-0
Online ISBN: 978-3-031-44336-7
eBook Packages: Computer Science, Computer Science (R0)
