Taming Detection Transformers for Medical Object Detection

Ickler, Marc K.; Baumgartner, Michael; Roy, Saikat; Wald, Tassilo; Maier-Hein, Klaus H.

doi:10.1007/978-3-658-41657-7_39

Marc K. Ickler⁸,
Michael Baumgartner^8,10,11,
Saikat Roy^8,10,
Tassilo Wald^8,11 &
…
Klaus H. Maier-Hein^8,9

Part of the book series: Informatik aktuell ((INFORMAT))

Included in the following conference series:

BVM Workshop

868 Accesses
2 Citations
3 Altmetric

Abstract

The accurate detection of suspicious regions in medical images is an error-prone and time-consuming process required by many routinely performed diagnostic procedures. To support clinicians during this difficult task, several automated solutions were proposed relying on complex methods with many hyperparameters. In this study, we investigate the feasibility of detection transformer (DETR) models for volumetric medical object detection. In contrast to previous works, these models directly predict a set of objects without relying on the design of anchors or manual heuristics such as non-maximum-suppression to detect objects. We show by conducting extensive experiments with three models, namely DETR, Conditional DETR, and DINO DETR on four data sets (CADA, RibFrac, KiTS19, and LIDC) that these set prediction models can perform on par with or even better than currently existing methods. DINO DETR, the best-performing model in our experiments demonstrates this by outperforming a strong anchorbased one-stage detector, Retina U-Net, on three out of four data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

nnDetection: A Self-configuring Method for Medical Object Detection

Reg R-CNN: Lesion Detection and Grading Under Noisy Labels

Abstract: nnDetection

References

Baumgartner M, Jäger PF, Isensee F, Maier-Hein KH. NnDetection: a self-configuring method for medical object detection. Med Image Comput Comput Assist Interv. Springer, 2021:530–9.
Google Scholar
Jaeger PF, Kohl SA, Bickelhaupt S, Isensee F, Kuder TA, Schlemmer HP et al. Retina U-Net: embarrassingly simple exploitation of segmentation supervision for medical object detection. ML4H Workshop. PMLR. 2020:171–83.
Google Scholar
Carion N, Massa F, Synnaeve G, Usunier N, Kirillov A, Zagoruyko S. End-to-end object detection with transformers. Comput Vis ECCV. Springer, 2020:213–29.
Google Scholar
Meng D, Chen X, Fan Z, Zeng G, Li H, Yuan Y et al. Conditional DETR for fast training convergence. Proc IEEE Int Conf Comput Vis. 2021:3631–40.
Google Scholar
Zhang H, Li F, Liu S, Zhang L, Su H, Zhu J et al. Dino: Detr with improved denoising anchor boxes for end-to-end object detection. 2022.
Google Scholar
Wittmann B, Navarro F, Shit S, Menze B. Focused decoding enables 3D anatomical detection by transformers. 2022.
Google Scholar
Ivantsits M, Goubergrits L, Kuhnigk JM, Huellebrand M, Bruening J, Kossen T et al. Detection and analysis of cerebral aneurysms based on X-ray rotational angiography-the CADA 2020 challenge. Med Image Anal. 2022;77:102333.
Google Scholar
Jin L, Yang J, Kuang K, Ni B, Gao Y, Sun Y et al. Deep-learning-assisted detection and segmentation of rib fractures from CT scans: development and validation of FracNet. EBioMedicine. 2020;62.
Google Scholar
Heller N, Sathianathen N, Kalapara A,Walczak E, Moore K, Kaluzniak H et al. The KiTS19 challenge data: 300 kidney tumor cases with clinical context, CT semantic segmentations, and surgical outcomes. 2019.
Google Scholar
Armato III SG, McLennan G, Bidaut L, McNitt-Gray MF, Meyer CR, Reeves AP et al. The lung image database consortium (LIDC) and image database resource initiative (IDRI): a completed reference database of lung nodules on CT scans. Med Phys. 2011;38(2):915–31.
Google Scholar

Download references

Author information

Authors and Affiliations

Division of Medical Image Computing, German Cancer Research Center, Heidelberg, Germany
Marc K. Ickler, Michael Baumgartner, Saikat Roy, Tassilo Wald & Klaus H. Maier-Hein
Pattern Analysis and Learning Group, Heidelberg University Hospital, Heidelberg, Germany
Klaus H. Maier-Hein
Faculty of Mathematics and Computer Science, Heidelberg University, Heidelberg, Germany
Michael Baumgartner & Saikat Roy
Helmholtz Imaging, Heidelberg, Germany
Michael Baumgartner & Tassilo Wald

Authors

Marc K. Ickler
View author publications
You can also search for this author in PubMed Google Scholar
Michael Baumgartner
View author publications
You can also search for this author in PubMed Google Scholar
Saikat Roy
View author publications
You can also search for this author in PubMed Google Scholar
Tassilo Wald
View author publications
You can also search for this author in PubMed Google Scholar
Klaus H. Maier-Hein
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Michael Baumgartner .

Editor information

Editors and Affiliations

Peter L. Reichertz Institut für Medizinische, Informatik der TU Braunschweig und der Medizinischen Hochschule Hannover, Braunschweig, Niedersachsen, Deutschland
Thomas M. Deserno
Institut für Medizinische Informatik, Universität zu Lübeck, Lübeck, Schleswig-Holstein, Deutschland
Heinz Handels
Lehrstuhl für Mustererkennung, Friedrich-Alexander-Universität, Erlangen, Bayern, Deutschland
Andreas Maier
Medical Image Computing, E230, Deutsches Krebsforschungszentrum (DKFZ), Heidelberg, Baden-Württemberg, Deutschland
Klaus Maier-Hein
Fakultät für Informatik und Mathematik, Ostbayerische Technische Hochschule Regensburg, Regensburg, Deutschland
Christoph Palm
Institut für Medizinische Informatik, Charité – Universitätsmedizin Berlin, Berlin, Berlin, Deutschland
Thomas Tolxdorff

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ickler, M.K., Baumgartner, M., Roy, S., Wald, T., Maier-Hein, K.H. (2023). Taming Detection Transformers for Medical Object Detection. In: Deserno, T.M., Handels, H., Maier, A., Maier-Hein, K., Palm, C., Tolxdorff, T. (eds) Bildverarbeitung für die Medizin 2023. BVM 2023. Informatik aktuell. Springer Vieweg, Wiesbaden. https://doi.org/10.1007/978-3-658-41657-7_39

Download citation

DOI: https://doi.org/10.1007/978-3-658-41657-7_39
Published: 02 June 2023
Publisher Name: Springer Vieweg, Wiesbaden
Print ISBN: 978-3-658-41656-0
Online ISBN: 978-3-658-41657-7
eBook Packages: Computer Science and Engineering (German Language)

Publish with us

Policies and ethics