Abstract
We present a novel algorithm for segmentation of natural images that harnesses the principle of minimum description length (MDL). Our method is based on observations that a homogeneously textured region of a natural image can be well modeled by a Gaussian distribution and the region boundary can be effectively coded by an adaptive chain code. The optimal segmentation of an image is the one that gives the shortest coding length for encoding all textures and boundaries in the image, and is obtained via an agglomerative clustering process applied to a hierarchy of decreasing window sizes as multi-scale texture features. The optimal segmentation also provides an accurate estimate of the overall coding length and hence the true entropy of the image. We test our algorithm on the publicly available Berkeley Segmentation Dataset. It achieves state-of-the-art segmentation results compared to other existing methods.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Arbelaez, P. (2006). Boundary extraction in natural images using ultrametric contour maps. In Workshop on perceptual organization in computer vision.
Arbelaez, P., Maire, M., Fowlkes, C., & Malik, J. (2009). From contours to regions: an empirical evaluation. In Proceedings of the IEEE conference on computer vision and pattern recognition.
Comanicu, D., & Meer, P. (2002). Mean shift: a robust approach toward feature space analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24, 603–619.
Cour, T., Benezit, F., & Shi, J. (2005). Spectral segmentation with multiscale graph decomposition. In Proceedings of the IEEE conference on computer vision and pattern recognition.
Deng, Y., & Manjunath, B. (2001). Unsupervised segmentation of color-texture regions in images and video. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23(8), 800–810.
Donoser, M., Urschler, M., Hirzer, M., & Bischof, H. (2009). Saliency driven total variation segmentation. In Proceedings of the international conference on computer vision.
Duda, R., Hart, P., & Stork, D. (2001). Pattern classification, 2nd edn. New York: Wiley.
Efros, A., & Leung, T. (1999). Texture synthesis by non-parametric sampling. In Proceedings of the international conference on computer vision.
Elder, J., & Zucker, S. (1996). Computing contour closures. In Proceedings of the European conference on computer vision.
Felzenszwalb, P., & Huttenlocher, D. (2004). Efficient graph-based image segmentation. International Journal of Computer Vision, 59(2), 167–181.
Freixenet, J., Munoz, X., Raba, D., Marti, J., & Cuff, X. (2002). Yet another survey on image segmentation. In Proceedings of European conference on computer vision (pp. 408–422).
Gevers, T., & Smeulders, A. (1997). Combining region splitting and edge detection through guided Delaunay image subdivision. In Proceedings of the IEEE conference on computer vision and pattern recognition.
Haralick, R., & Shapiro, L. (1985). Image segmentation techniques. Computer Vision, Graphics, and Image Processing, 29(1), 100–132.
Jain, A. (1989). Fundamentals of digital image processing. New York: Prentice Hall.
Kim, J., Fisher, J., Yezzi, A., Cetin, M., & Willsky, A. (2005). A nonparametric statistical method for image segmentation using information theory and curve evolution. IEEE Transactions on Pattern Analysis and Machine Intelligence, 14(10), 1486–1502.
Kim, T., Lee, K., & Lee, S. (2010). Learning full pairwise affinities for spectral segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition.
Kurita, T. (1995). An efficient clustering algorithm for region merging. IEICE Transactions of Information and Systems, E78-D(12), 1546–1551.
Levina, E., & Bickel, P. (2006). Texture synthesis and non-parametric resampling of random fields. Annals of Statistics, 34(4), 1751–1773.
Liu, Y., & Zalik, B. (2005). Efficient chain code with Huffman coding. Pattern Recognition, 38(4), 553–557.
Ma, Y., Derksen, H., Hong, W., & Wright, J. (2007). Segmentation of multivariate mixed data via lossy coding and compression. IEEE Transactions on Pattern Analysis and Machine Intelligence, 29(9), 1546–1562.
Malik, J., Belongie, S., Leung, T., & Shi, J. (2001). Contour and texture analysis for image segmentation. International Journal of Computer Vision, 43(1), 7–27.
Martin, D., Fowlkes, C., Tal, D., & Malik, J. (2001). A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In Proceedings of the international conference on computer vision.
Meila, M. (2005). Comparing clusterings: an axiomatic view. In Proceedings of the international conference on machine learning.
Mori, G., Ren, X., Efros, A., & Malik, J. (2004). Recovering human body configurations: combining segmentation and recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition.
Rand, W. (1971). Objective criteria for the evaluation of clustering methods. Journal of the American Statistical Association, 66(336), 846–850.
Rao, S., Mobahi, H., Yang, A., Sastry, S., & Ma, Y. (2009). Natural image segmentation with adaptive texture and boundary encoding. In Proceedings of the Asian conference on computer vision (Vol. 1, pp. 135–146).
Ren, X., Fowlkes, C., & Malik, J. (2005). Scale-invariant contour completion using condition random fields. In Proceedings of the international conference on computer vision.
Ren, X., Fowlkes, C., & Malik, J. (2008). Learning probabilistic models for contour completion in natural images. International Journal of Computer Vision, 77, 47–63.
Tremeau, A., & Borel, N. (1997). A region growing and merging algorithm to color segmentation. Pattern Recognition, 30(7), 1191–1204.
Tu, Z., & Zhu, S. (2002). Image segmentation by data-driven Markov Chain Monte Carlo. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(5), 657–673.
Varma, M., & Zisserman, A. (2003). Texture classification: are filter banks necessary? In Proceedings of the IEEE conference on computer vision and pattern recognition.
Wang, J, Jia, Y, Hua, X., Zhang, C., & Quan, L. (2008). Normalized tree partitioning for image segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition.
Yang, A., Wright, J., Ma, Y., & Sastry, S. (2008). Unsupervised segmentation of natural images via lossy data compression. Computer Vision and Image Understanding, 110(2), 212–225.
Yu, S. (2005). Segmentation induced by scale invariance. In Proceedings of the IEEE conference on computer vision and pattern recognition.
Zhu, Q., Song, G., & Shi, J. (2007). Untangling cycles for contour grouping. In Proceedings of the international conference on computer vision.
Author information
Authors and Affiliations
Corresponding author
Additional information
Research was supported in part by NSF IIS 07-03756, ONR N00014-09-1-0230, ARO MURI W911NF-06-1-0076, and ARL MAST-CTA W911NF-08-2-0004. Hossein Mobahi was supported by Computational Science & Engineering (CSE) Ph.D. fellowship of University of Illinois at Urbana-Champaign. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the Army Research Laboratory, US Government, or the CSE program. The US Government is authorized to reproduce and distribute for Government purposes notwithstanding any copyright notation hereon.
Rights and permissions
About this article
Cite this article
Mobahi, H., Rao, S.R., Yang, A.Y. et al. Segmentation of Natural Images by Texture and Boundary Compression. Int J Comput Vis 95, 86–98 (2011). https://doi.org/10.1007/s11263-011-0444-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11263-011-0444-0