skip to main content
10.1145/3579654.3579727acmotherconferencesArticle/Chapter ViewAbstractPublication PagesacaiConference Proceedingsconference-collections
research-article

An Efficient Transformer-based Approach for Joint Nuclei Detection and Segmentation in Whole Slide Tissue Images

Authors Info & Claims
Published:14 March 2023Publication History

ABSTRACT

The detection and segmentation of cell nuclei in whole slide tissue images plays an important role in disease diagnosis and treatment. Automatic detection and segmentation of nuclei is very challenging due to high nuclei density, low contrast, overlapping and adhesion between cells. Recently, Transformer-based object detection and instance segmentation methods have made great progress on traditional computer vision datasets. These Transformer-based approaches are effective by removing the need for many hand-crafted components in the network, and post-processing steps like non-maximum suppression (NMS). However, such approaches consume tremendous amount of memory in detection and segmentation of cell nuclei in whole slide tissue images due to large number of cells in the images. Also, those methods may suffer from inferior performance on small cell instances. Inspired by Deformable DETR which makes use of small set of key sampling points in the attention module to reduce the computation and the usage of multi-scale feature maps, we propose an efficient Transformer-based approach for joint nuclei detection and segmentation in whole slide tissue images. Specifically, In Transformer decoder, it directly outputs the object detection results and instance segmentation masks. We evaluate our proposed method on the public MoNuSeg dataset, and the experimental results show that our method achieves promising performance on nuclei detection and segmentation.

References

  1. Rolf Adams and Leanne Bischof. 1994. Seeded region growing. IEEE Transactions on pattern analysis and machine intelligence 16, 6(1994), 641–647.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Alexey Bochkovskiy, Chien-Yao Wang, and Hong-Yuan Mark Liao. 2020. Yolov4: Optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934(2020).Google ScholarGoogle Scholar
  3. Daniel Bolya, Chong Zhou, Fanyi Xiao, and Yong Jae Lee. 2019. Yolact: Real-time instance segmentation. In Proceedings of the IEEE/CVF international conference on computer vision. 9157–9166.Google ScholarGoogle ScholarCross RefCross Ref
  4. P Bonnin, J Blanc Talon, J C Hayot, and Bertrand Zavidovique. 1990. A new edge point/region cooperative segmentation deduced from a 3d scene reconstruction application. In Applications of Digital Image Processing XII, Vol. 1153. SPIE, 579–591.Google ScholarGoogle Scholar
  5. Zhaowei Cai and Nuno Vasconcelos. 2018. Cascade r-cnn: Delving into high quality object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition. 6154–6162.Google ScholarGoogle ScholarCross RefCross Ref
  6. Jiale Cao, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Yanwei Pang, and Ling Shao. 2020. D2det: Towards high quality object detection and instance segmentation. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 11485–11494.Google ScholarGoogle ScholarCross RefCross Ref
  7. Nicolas Carion, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov, and Sergey Zagoruyko. 2020. End-to-End Object Detection with Transformers. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part I. 213–229.Google ScholarGoogle Scholar
  8. Kai Chen, Jiangmiao Pang, Jiaqi Wang, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Ziwei Liu, Jianping Shi, Wanli Ouyang, 2019. Hybrid task cascade for instance segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 4974–4983.Google ScholarGoogle ScholarCross RefCross Ref
  9. Yuxin Fang, Shusheng Yang, Xinggang Wang, Yu Li, Chen Fang, Ying Shan, Bin Feng, and Wenyu Liu. 2021. Instances as queries. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 6910–6919.Google ScholarGoogle ScholarCross RefCross Ref
  10. Ross Girshick. 2015. Fast r-cnn. In Proceedings of the IEEE international conference on computer vision. 1440–1448.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Xavier Glorot, Antoine Bordes, and Yoshua Bengio. 2011. Deep sparse rectifier neural networks. In Proceedings of the fourteenth international conference on artificial intelligence and statistics. JMLR Workshop and Conference Proceedings, 315–323.Google ScholarGoogle Scholar
  12. Kaiming He, Georgia Gkioxari, Piotr Dollár, and Ross Girshick. 2017. Mask r-cnn. In Proceedings of the IEEE international conference on computer vision. 2961–2969.Google ScholarGoogle ScholarCross RefCross Ref
  13. Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770–778.Google ScholarGoogle ScholarCross RefCross Ref
  14. Neeraj Kumar, Ruchika Verma, Deepak Anand, Yanning Zhou, Omer Fahri Onder, Efstratios Tsougenis, Hao Chen, Pheng-Ann Heng, Jiahui Li, Zhiqiang Hu, 2019. A multi-organ nucleus segmentation challenge. IEEE transactions on medical imaging 39, 5 (2019), 1380–1391.Google ScholarGoogle Scholar
  15. Hei Law and Jia Deng. 2018. Cornernet: Detecting objects as paired keypoints. In Proceedings of the European conference on computer vision (ECCV). 734–750.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. 2017. Focal loss for dense object detection. In Proceedings of the IEEE international conference on computer vision. 2980–2988.Google ScholarGoogle ScholarCross RefCross Ref
  17. Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C Lawrence Zitnick. 2014. Microsoft coco: Common objects in context. In European conference on computer vision. Springer, 740–755.Google ScholarGoogle ScholarCross RefCross Ref
  18. Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, and Alexander C Berg. 2016. Ssd: Single shot multibox detector. In European conference on computer vision. Springer, 21–37.Google ScholarGoogle ScholarCross RefCross Ref
  19. Jonathan Long, Evan Shelhamer, and Trevor Darrell. 2015. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition. 3431–3440.Google ScholarGoogle ScholarCross RefCross Ref
  20. Ilya Loshchilov and Frank Hutter. 2018. Decoupled Weight Decay Regularization. In International Conference on Learning Representations.Google ScholarGoogle Scholar
  21. Fausto Milletari, Nassir Navab, and Seyed-Ahmad Ahmadi. 2016. V-net: Fully convolutional neural networks for volumetric medical image segmentation. In 2016 fourth international conference on 3D vision (3DV). IEEE, 565–571.Google ScholarGoogle Scholar
  22. Mohammad Hasanzadeh Mofrad, Sana Sadeghi, Alireza Rezvanian, and Mohammad Reza Meybodi. 2015. Cellular edge detection: Combining cellular automata and cellular learning automata. AEU-International Journal of Electronics and Communications 69, 9(2015), 1282–1290.Google ScholarGoogle ScholarCross RefCross Ref
  23. Ethan H Nguyen, Haichun Yang, Ruining Deng, Yuzhe Lu, Zheyu Zhu, Joseph T Roland, Le Lu, Bennett A Landman, Agnes B Fogo, and Yuankai Huo. 2021. Circle Representation for Medical Object Detection. IEEE transactions on medical imaging 41, 3 (2021), 746–754.Google ScholarGoogle Scholar
  24. Joseph Redmon and Ali Farhadi. 2017. YOLO9000: better, faster, stronger. In Proceedings of the IEEE conference on computer vision and pattern recognition. 7263–7271.Google ScholarGoogle ScholarCross RefCross Ref
  25. Joseph Redmon and Ali Farhadi. 2018. Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767(2018).Google ScholarGoogle Scholar
  26. S Ren, K He, R Girshick, and J Sun. 2016. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.IEEE Transactions on Pattern Analysis and Machine Intelligence 39, 6(2016), 1137–1149.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Hamid Rezatofighi, Nathan Tsoi, JunYoung Gwak, Amir Sadeghian, Ian Reid, and Silvio Savarese. 2019. Generalized intersection over union: A metric and a loss for bounding box regression. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 658–666.Google ScholarGoogle ScholarCross RefCross Ref
  28. Guanglu Song, Yu Liu, and Xiaogang Wang. 2020. Revisiting the sibling head in object detector. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 11563–11572.Google ScholarGoogle ScholarCross RefCross Ref
  29. Zhi Tian, Chunhua Shen, and Hao Chen. 2020. Conditional convolutions for instance segmentation. In European conference on computer vision. Springer, 282–298.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Alain Tremeau and Nathalie Borel. 1997. A region growing and merging algorithm to color segmentation. Pattern recognition 30, 7 (1997), 1191–1203.Google ScholarGoogle Scholar
  31. Luc Vincent and Pierre Soille. 1991. Watersheds in digital spaces: an efficient algorithm based on immersion simulations. IEEE Transactions on Pattern Analysis & Machine Intelligence 13, 06(1991), 583–598.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Xinlong Wang, Tao Kong, Chunhua Shen, Yuning Jiang, and Lei Li. 2020. Solo: Segmenting objects by locations. In European Conference on Computer Vision. Springer, 649–665.Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Xiaodong Yu, Dahu Shi, Xing Wei, Ye Ren, Tingqun Ye, and Wenming Tan. 2022. Soit: Segmenting objects with instance-aware transformers. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36. 3188–3196.Google ScholarGoogle ScholarCross RefCross Ref
  34. Xingyi Zhou, Dequan Wang, and Philipp Krähenbühl. 2019. Objects as Points. arXiv e-prints (2019), arXiv–1904.Google ScholarGoogle Scholar
  35. Xingyi Zhou, Jiacheng Zhuo, and Philipp Krahenbuhl. 2019. Bottom-up object detection by grouping extreme and center points. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 850–859.Google ScholarGoogle ScholarCross RefCross Ref
  36. Xizhou Zhu, Weijie Su, Lewei Lu, Bin Li, Xiaogang Wang, and Jifeng Dai. 2020. Deformable DETR: Deformable Transformers for End-to-End Object Detection. In International Conference on Learning Representations.Google ScholarGoogle Scholar

Index Terms

  1. An Efficient Transformer-based Approach for Joint Nuclei Detection and Segmentation in Whole Slide Tissue Images

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ACAI '22: Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence
        December 2022
        770 pages
        ISBN:9781450398336
        DOI:10.1145/3579654

        Copyright © 2022 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 14 March 2023

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited

        Acceptance Rates

        Overall Acceptance Rate173of395submissions,44%
      • Article Metrics

        • Downloads (Last 12 months)44
        • Downloads (Last 6 weeks)0

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format .

      View HTML Format