Inducing Semantic Segmentation from an Example

Schnitman, Yaar; Caspi, Yaron; Cohen-Or, Daniel; Lischinski, Dani

doi:10.1007/11612704_38

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 3852)

Included in the following conference series:

Asian Conference on Computer Vision

Abstract

Segmenting an image into semantically meaningful parts is a fundamental and challenging task in computer vision. Automatic methods are able to segment an image into coherent regions, but such regions generally do not correspond to complete meaningful parts. In this paper, we show that even a single training example can greatly facilitate the induction of a semantically meaningful segmentation on novel images within the same domain: images depicting the same, or similar, objects in a similar setting.

Our approach constructs a non-parametric representation of the example segmentation by selecting patch-based representatives. This allows us to represent complex semantic regions containing a large variety of colors and textures. Given an input image, we first partition it into small homogeneous fragments, and the possible labelings of each fragment are assessed using a robust voting procedure. Graph-cuts optimization is then used to label each fragment in a globally optimal manner.
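The voting stage described above can be illustrated with a minimal sketch. It is not the authors' implementation: the feature vectors, the plain majority vote (standing in for the paper's robust voting procedure), the cost definition, and all function names are assumptions made for illustration. The final step picks the cheapest label per fragment independently, whereas the paper uses graph-cuts optimization, which would also take neighboring fragments into account.

```python
from collections import Counter, defaultdict


def nearest_label(feature, representatives):
    """Return the label of the representative closest to `feature` (squared L2)."""
    best_label, best_dist = None, float("inf")
    for rep_feature, rep_label in representatives:
        dist = sum((a - b) ** 2 for a, b in zip(feature, rep_feature))
        if dist < best_dist:
            best_label, best_dist = rep_label, dist
    return best_label


def fragment_votes(pixel_features, fragment_ids, representatives):
    """Tally, per fragment, the labels voted for by the pixels inside it."""
    votes = defaultdict(Counter)
    for feature, frag in zip(pixel_features, fragment_ids):
        votes[frag][nearest_label(feature, representatives)] += 1
    return votes


def label_costs(votes, labels):
    """Hypothetical data cost: 1 minus the fraction of votes a label received."""
    costs = {}
    for frag, counter in votes.items():
        total = sum(counter.values())
        costs[frag] = {lab: 1.0 - counter[lab] / total for lab in labels}
    return costs


def greedy_labeling(costs):
    """Pick the cheapest label per fragment (a stand-in for graph cuts)."""
    return {frag: min(c, key=c.get) for frag, c in costs.items()}


# Toy data (made up): two labeled representatives taken from the example
# segmentation, and six pixels of a new image grouped into two fragments.
representatives = [((0.0, 0.0), "sky"), ((1.0, 1.0), "grass")]
pixel_features = [(0.1, 0.0), (0.2, 0.1), (0.9, 0.8),
                  (0.9, 1.0), (1.1, 0.9), (0.1, 0.2)]
fragment_ids = [0, 0, 0, 1, 1, 1]

votes = fragment_votes(pixel_features, fragment_ids, representatives)
costs = label_costs(votes, ["sky", "grass"])
labeling = greedy_labeling(costs)
```

In this toy input each fragment contains one outlier pixel, and the per-fragment vote overrides it. In a full pipeline, the per-fragment costs would serve as the data term of an energy that also penalizes label changes between adjacent fragments.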

Author information

Authors and Affiliations

Tel Aviv University, Israel
Yaar Schnitman, Yaron Caspi & Daniel Cohen-Or
The Hebrew University of Jerusalem, Israel
Dani Lischinski

Editor information

Editors and Affiliations

Center for Visual Information Technology, International Institute of Information Technology, Hyderabad, India
P. J. Narayanan
Department of Computer Science, Columbia University, 500 West 120th Street, NY 10027, New York, USA
Shree K. Nayar
Microsoft Research Asia, P.O. Box, Beijing, P.R. China
Heung-Yeung Shum

About this paper

Cite this paper

Schnitman, Y., Caspi, Y., Cohen-Or, D., Lischinski, D. (2006). Inducing Semantic Segmentation from an Example. In: Narayanan, P.J., Nayar, S.K., Shum, H.-Y. (eds) Computer Vision – ACCV 2006. ACCV 2006. Lecture Notes in Computer Science, vol 3852. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11612704_38

DOI: https://doi.org/10.1007/11612704_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31244-4
Online ISBN: 978-3-540-32432-4
eBook Packages: Computer Science, Computer Science (R0)
