Abstract
Segmenting an image into semantically meaningful parts is a fundamental and challenging task in computer vision. Automatic methods are able to segment an image into coherent regions, but such regions generally do not correspond to complete meaningful parts. In this paper, we show that even a single training example can greatly facilitate the induction of a semantically meaningful segmentation on novel images within the same domain: images depicting the same, or similar, objects in a similar setting.
Our approach constructs a non-parametric representation of the example segmentation by selecting patch-based representatives. This allows us to represent complex semantic regions containing a large variety of colors and textures. Given an input image, we first partition it into small homogeneous fragments, and the possible labelings of each fragment are assessed using a robust voting procedure. Graph-cuts optimization is then used to label each fragment in a globally optimal manner.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Efros, A.A., Leung, T.K.: Texture synthesis by non-parametric sampling. In: International Conference on Computer Vision, Corfu, Greece, pp. 1033–1038 (1999)
Hertzmann, A., Jacobs, C.E., Oliver, N., Curless, B., Salesin, D.H.: Image analogies. Computer Graphics and Interactive Techniques, 327–340 (2001)
Welsh, T., Ashikmin, M., Mueller, K.: Transferring color to greyscale images. Computer Graphics and Interactive Techniques, 277–280 (2002)
Drori, I., Cohen-Or, D., Yehurun, H.: Fragment-based image completion. ACM Transactions on Graphics (SIGGRAPH), 303–312 (2003)
Wexler, Y., Shechtman, E., Irani, M.: Space-time video completion. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 120–127 (2004)
Efros, A.A., Freeman, W.T.: Image quilting for texture synthesis and transfer. ACM Transactions on Graphics (SIGGRAPH), 341–346 (2001)
Vincent, L., Soille, P.: Watersheds in digital spaces: An efficient algorithm based on immersion simulations. Trans. on Pattern Analysis and Machine Intelligence 13, 583–598 (1991)
Comaniciu, D., Meer, P.: Mean shift: A robust approach toward feature space analysis. Trans. on Pattern Analysis and Machine Intelligence, 603–619 (2002)
Lucchese, L., Mitra, S.K.: Color image segmentation: A state-of-the-art survey. In: Proc. Indian National Science Academy (INSA-A), vol. 67, pp. 207–221 (2001)
Shi, J., Malik, J.: Normalized cuts and image segmentation. Trans. on Pattern Analysis and Machine Intelligence 22, 888–905 (2000)
Yedidia, J.S., Freeman, W.T., Weiss, Y.: Understanding belief propagation and its generalizations. Exploring artificial intelligence in the new millennium, 239–269 (2003)
Boykov, Y., Jolly, M.: Interactive graph cuts for optimal boundary and region segmentation of objects in N-D images. In: International Conference on Computer Vision, Vancouver, BC, pp. 105–112 (2001)
Freeman, W., Jones, T., Pasztor, E.: Example-based super-resolution. IEEE Comput. Graph. Appl. 22, 56–65 (2002)
Borenstein, E., Ullman, S.: Class-specific, top-down segmentation. In: European Conference on Computer Vision, Copenhagen, Denmark, vol. 2, pp. 109–124 (2002)
Li, Y., Sun, J., Tang, C.K., Shum, H.Y.: Lazy snapping. ACM Transactions on Graphics (SIGGRAPH) 23, 303–308 (2004)
Rother, C., Kolmogorov, V., Blake, A.: “GrabCut”: interactive foreground extraction using iterated graph cuts. ACM Transactions on Graphics 23, 309–314 (2004)
Boykov, Y., Veksler, O., Zabih, R.: Fast approximate energy minimization via graph cuts. IEEE Trans. Pattern Anal. Mach. Intell. 23, 1222–1239 (2001)
Wang, J., Xu, Y., Shum, H.Y., Cohen, M.F.: Video tooning. ACM Transactions on Graphics (SIGGRAPH) 23, 574–583 (2004)
Agarwala, A., Hertzmann, A., Salesin, D., Seitz, S.: Keyframe-based tracking for rotoscoping and animation. ACM Transactions on Graphics (SIGGRAPH) 23, 584–591 (2004)
Christoudias, C.M., Georgescu, B.: Edge detection and image segmentation (edison) system, http://www.caip.rutgers.edu/riul/research/robust.html
Boykov, Y., Kolmogorov, V.: Maxflow software, http://www.cs.cornell.edu/People/vnk/software.html
Mount, D., Arya, S.: Ann: Library for approximate nearest neighbor searching, http://www.cs.umd.edu/mount/ANN/
Wertheimer, M.: Productive Thinking. Collins, NY (1945)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Schnitman, Y., Caspi, Y., Cohen-Or, D., Lischinski, D. (2006). Inducing Semantic Segmentation from an Example. In: Narayanan, P.J., Nayar, S.K., Shum, HY. (eds) Computer Vision – ACCV 2006. ACCV 2006. Lecture Notes in Computer Science, vol 3852. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11612704_38
Download citation
DOI: https://doi.org/10.1007/11612704_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31244-4
Online ISBN: 978-3-540-32432-4
eBook Packages: Computer ScienceComputer Science (R0)