Abstract
We propose efficiency of representation as a criterion for evaluating shape models, then apply this criterion to compare the boundary curve representation with the medial axis. We estimate the ⋮-entropy of two compact classes of curves. We then construct two adaptive encodings for non-compact classes of shapes, one using the boundary curve and the other using the medial axis, and determine precise conditions for when the medial axis is more efficient. Finally, we apply our results to databases of naturally occurring shapes, determining whether the boundary or medial axis is more efficient. Along the way we construct explicit near-optimal boundary-based approximations for compact classes of shapes, construct an explicit compression scheme for non-compact classes of shapes based on the medial axis, and derive some new results about the medial axis.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Beg, M.F., Miller, M.I., Trouve, A., and Younes, L. 2003. Computing metrics via geodesics on flows of diffeomorphisms. International Journal of Computer Vision.
Blum, H. 1973. Biological shape and visual science. J. Theor. Biol, 38:205–287.
Damon, J.N. 2005. Determining the geometry of boundaries of objects from medial data. IJCV, 63(1): 45–64.
Desolneux, A., Moisan, L., and Morel, J.-M. 2000. Meaningful alignments. IJCV, 40(1):7–23.
Folland, G. 1984. Real Analysis, John Wiley and Sons, p. 55.
Giblin, P. and Kimia, B. 2003. On the intrinsic reconstruction of shape from its symmetries. Trans. Pattern Analysis and Machine Intelligence, 25(7): 895–911.
Katz, R.A. and Pizer, S.M. 2003. Untangling the Blum medial axis transform. IJCV, 55(2–3):139–153.
Klassen, E., Srivastava, A., Mio, W., and Joshi, S. 2004. Analysis of planar shapes using geodesic paths in shape space. IEEE Trans. Pattern Analysis and Machine Intelligence, 26:372–383.
Kolmogorov, A.N. and Tikhomirov, V.M. 1959. ε-entropy and ε-capacity. American Mathematical Society Translations, Series 2, 17:277–364.
Langdon, G. and Rissanen, J. 1981. Compression of black and white images with arithmetic coding. IEEE Transactions in Communication, COMM-29:858–867.
Leonard, K. 2004. Measuring Shape Space: ε-entropy, adaptive coding and 2-dimensional shape. PhD Thesis, Brown University.
Leonard, K. 2005. On the massiveness of spaces of plane curves. Preprint.
Li, W. and Westheimer, G. 1997. Human discrimination of the implicit orientation of simple symmetrical patterns. Vision Research, 37(5):565–572.
Martin, D., Fowlkes, C., Tal, D., and Malik, J. 2001. A database of human segmented natural imaged and its application to evaluating segmentation algorithms and measuring ecological statistics. In Proc. 8th Int’l Conf. Computer Vision, 2:416–423.
Michor, P. and Mumford, D. 2006. Riemannian geometries on the space of plane curves. J. Eur. Math. Soc. 8:1–48.
Mokhtarian, F., Abbasi, S., and Kittler, J. 1996. Robust and efficient shape indexing through curvature scale space. In Proc. 6th British Machine Vision Conference, pp. 53–62.
Rissanen, J. 1989 Stochastic Complexity in Statistical Inquiry. World Scientific Press.
Rudin, L., Osher, S., and Fatemi, C. 1992. Nonlinear total variation based noise removal algorithms. Physica D, 60:259–268.
Sebastian, T., Klein, P., and Kimia, B. 2001. Recognition of shapes by editing shock graphs. In Proceedings of 8th International Conference of Computer Vision, Vancouver, IEEE Computer Society Press, pp. 755–762.
Shanks, D. 1993. Solved and Unsolved Problems in Number Theory. Chelsea Publishing Co., p. 143.
Shannon, C.E. 1948. A mathematical theory of communication. Bell System Technical Journal, 27:379–423 and 623–656.
Sharvit, D., Chan, J., and Kimia, B.B. 1998. Symmetry-based indexing of image databases. In Content-Based Access of Image and Video Libraries.
Shen, J. and Thalmann, D. 1996. Fast realistic human body deformation for animation and VR applications. Applications of Computer Graphics International, Pohang, pp. 166–174.
Siddiqi, K. and Kimia, B. 1996. A shock grammar for recognition. In Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 507–513.
Sloane, N. Online Encyclopedia of Integer Sequences, sequence A063074. http://www.research.att.com/~njas/sequences/Seis.html. Maintained by AT&T.
Tamrakar, A. and Kimia, B. 2004. Medial visual fragments as an intermediate image representation for segmentation and perceptual grouping. In Proceedings of IEEE Workshop on Perceptual Organization in Computer Vision.
Tu, Z.W., Chen, X.R., Yuille, A.L., and Zhu, S.-C. 2005. Image parsing: unifying segmentation, detection, and recognition. IJCV, 63(2):113–140.
Yu, S.X. and Shi, J. 2003. Concurrent object recognition and segmentation by graph partitioning. In IEEE Conference on Computer Vision and Pattern Recognition, Madison, Wisconsin.
Yushkevich, P., Fletcher, P., Joshi, S., Thall, A., and Pizer, S. 2002. Continuous medial representations for geometric object modeling in 2-D and 3-D. In Proceedings of the 1st Generative Model-based Vision Workshop, Copenhagen, DK, June.
Zhu, S.C. 1999. Stochastic jump diffusion processes for computing medial axes in markov random fields. IEEE Trans. Pattern Analysis and Machine Intelligence, 21(11):1158–1169.
Zhu, S.C. and Yuille, A.L. 1996. FORMS: A flexible object recognition and modeling system. International Journal of Computer Vision 20:187–212.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Leonard, K. Efficient Shape Modeling: ⋮-Entropy, Adaptive Coding, and Boundary Curves -vs- Blum’s Medial Axis. Int J Comput Vision 74, 183–199 (2007). https://doi.org/10.1007/s11263-006-0010-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11263-006-0010-3