Abstract
Interactive segmentation enables users to specify the object of interest (OOI) via various interaction strategies to obtain accurate segmentation results. An ideal interactive method should efficiently and accurately express users’ segmentation intentions. However, the existing methods can only use a single interactive mode, ignoring the differences in scale and shape between OOIs, resulting in an inflexibility labeling process. In this paper, we propose a grid-based interactive image segmentation method (GridIIS). Specifically, GridIIS overlays grids on the image, and users can specify the location and shape of the OOI by selecting the grid areas as the interactive guidance. Users can choose the appropriate grid selection method and size considering the OOI’s scale, shape, and boundary clarity to obtain guidance. We accordingly propose a novel grid sampling strategy, that considers the OOI’s scale and shapes to adaptively estimate the grid size and area. Experiments on several datasets from different domains (street views, medical images, scene texts, etc.) show that our method achieves superior performance with fewer interaction rounds and exhibits strong generalization ability in cross-domain datasets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bai, J., Wu, X.: Error-tolerant scribbles based interactive image segmentation. In: CVPR, pp. 392–399 (2014)
Bai, X., Sapiro, G.: A geodesic framework for fast interactive image and video segmentation and matting. In: ICCV, pp. 1–8. IEEE (2007)
Boykov, Y.Y., Jolly, M.P.: Interactive graph cuts for optimal boundary & region segmentation of objects in ND images. In: ICCV, vol. 1, pp. 105–112. IEEE (2001)
Caesar, H., Uijlings, J., Ferrari, V.: Coco-stuff: thing and stuff classes in context. In: CVPR, pp. 1209–1218 (2018)
Chen, X., et al.: Conditional diffusion for interactive segmentation. In: ICCV, pp. 7345–7354 (2021)
Chen, X., et al.: FocalClick: towards practical interactive image segmentation. In: CVPR, pp. 1300–1309 (2022)
Cheng, M.-M., et al.: Intelligent visual media processing: when graphics meets vision. JCST 32(1), 110–121 (2017)
Cordts, M., et al.: The cityscapes dataset for semantic urban scene understanding (2016). arXiv: 1604.01685 [cs.CV]
Ding, Z., et al.: A dual-stream framework guided by adaptive gaussian maps for interactive image segmentation. Knowl.-Based Syst. 223, 107033 (2021)
Everingham, M., et al.: The pascal visual object classes (VOC) challenge. IJCV 88(2), 303–338 (2010)
Gerhard, S., et al.: Segmented anisotropic ssTEM dataset of neural tissue. figshare (2013)
Grady, L.: Random walks for image segmentation. PAMI 28(11), 1768–1783 (2006)
Jang, W.D., Kim, C.S.: Interactive image segmentation via backpropagating refinement scheme. In: CVPR, pp. 5297–5306 (2019)
Kim, T.H., Lee, K.M., Lee, S.U.: Generative image segmentation using random walks with restart. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008. LNCS, vol. 5304, pp. 264–275. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-88690-7_20
Koohbanani, N.A., et al.: NuClick: a deep learning framework for interactive segmentation of microscopic images. MIA 65, 101771 (2020)
Li, Y., et al.: Lazy snapping. ACM Trans. Graph. (ToG) 23(3), 303–308 (2004)
Liew, J.H., et al.: Regional interactive image segmentation networks. In: ICCV. IEEE Computer Society, pp. 2746–2754 (2017)
Lin, D., et al.: Scribblesup: scribble-supervised convolutional networks for semantic segmentation. In: CVPR, pp. 3159–3167 (2016)
Lin, T.Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_48
Lin, Z., et al.: Interactive image segmentation with first click attention. In: CVPR, pp. 13339–13348 (2020)
Ling, H., et al.: Fast interactive object annotation with Curve-GCN. In: CVPR, pp. 5257–5266 (2019)
Luo, X., et al.: MIDeepSeg: minimally interactive segmentation of unseen objects from medical images using deep learning. MIA 72, 102102 (2021)
Mahadevan, S., Voigtlaender, P., Leibe, B.: Iteratively trained interactive segmentation. arXiv preprint: arXiv:1805.04398 (2018)
Majumder, S., Yao, A.: Content-aware multi-level guidance for interactive instance segmentation. In: CVPR, pp. 11602–11611 (2019)
Maninis, K.-K. et al.: Deep extreme cut: from extreme points to object segmentation. In: CVPR, pp. 616–625 (2018)
McGuinness, K., O’connor, N.E.: A comparative evaluation of interactive segmentation algorithms. Pattern Recogn. 43, 434–444 (2010)
Rother, C., Kolmogorov, V., Blake, A.: “GrabCut’’ interactive foreground extraction using iterated graph cuts. ACM Trans. Graph. (TOG) 23(3), 309–314 (2004)
Sofiiuk, K., Petrov, I.A., Konushin, A.: Reviving iterative training with mask guidance for interactive segmentation. arXiv preprint: arXiv:2102.06583 (2021)
Sofiiuk, K., et al.: F-BRS: rethinking backpropagating refinement for interactive segmentation. In: CVPR, pp. 8623–8632 (2020)
Wang, G., et al.: DeepIGeoS: a deep interactive geodesic framework for medical image segmentation. PAMI 41(7), 1559–1572 (2018)
Wu, J., et al.: Milcut: a sweeping line multiple instance learning paradigm for interactive image segmentation. In: CVPR, pp. 256–263 (2014)
Xu, N., et al.: Deep GrabCut for object selection. arXiv preprint: arXiv:1707.00243 (2017)
Xu, N., et al.: Deep interactive object selection. In: CVPR, pp. 373–381 (2016)
Yuliang, L., et al.: Detecting curve text in the wild: new dataset and new solution. arXiv preprint: arXiv:1712.02170 (2017)
Zhang, C., et al.: Intention-aware feature propagation network for interactive segmentation. arXiv preprint: arXiv:2203.05145 (2022)
Zhang, K., Zhuang, X.: CycleMix: a holistic strategy for medical image segmentation from scribble supervision. In: CVPR, pp. 11656–11665 (2022)
Zhang, S., et al.: Interactive object segmentation with inside-outside guidance. In: CVPR, pp. 12234–12244 (2020)
Zhuang, X.: Multivariate mixture model for cardiac segmentation from multi-sequence MRI. In: Ourselin, S., Joskowicz, L., Sabuncu, M.R., Unal, G., Wells, W. (eds.) MICCAI 2016. LNCS, vol. 9901, pp. 581–588. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46723-8_67
Acknowledgements
This work is supported by National Natural Science Foundation of China (No. 61773325), Industry-University Cooperation Project of Fujian Science and Technology Department (No. 2021H6035), Fujian Key Technological Innovation and Industrialization Projects (No. 2023XQ023), and Fu-Xia-Quan National Independent Innovation Demonstration Project (No. 2022FX4).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Zhu, P., Wang, DH., Zhu, S. (2024). GridIIS: Grid Based Interactive Image Segmentation. In: Liu, Q., et al. Pattern Recognition and Computer Vision. PRCV 2023. Lecture Notes in Computer Science, vol 14435. Springer, Singapore. https://doi.org/10.1007/978-981-99-8552-4_28
Download citation
DOI: https://doi.org/10.1007/978-981-99-8552-4_28
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8551-7
Online ISBN: 978-981-99-8552-4
eBook Packages: Computer ScienceComputer Science (R0)