Abstract
We consider particular instances of the segmentation problem and derive minimum message length (MML) expressions for the cost of stating region boundaries in some one- and two-dimensional examples. We find that the message length cost of stating a region boundary depends on the noise of the data in the separated regions and on the ‘degree of separation’ of the two regions.
The framework can be extended to differently shaped cuts and to non-constant fits within the regions. Possible applications include tree regression (e.g. CART) and image segmentation.
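To make the trade-off concrete, the following is a minimal sketch of a two-part code length for a single cut in a one-dimensional sequence with a constant fit on each side. It is not the MML expression derived in the paper: it assumes a crude approximation with a uniform code over cut positions, Gaussian noise, and (1/2) log2 n bits per estimated parameter. The names `segment_bits` and `best_single_cut`, the measurement precision `eps`, and the minimum segment size `min_size` are all hypothetical choices made for illustration.

```python
import math

def segment_bits(xs, eps=0.01):
    """Approximate bits to state xs under a constant-mean Gaussian fit.

    Assumption: data are measured to precision eps, and each of the two
    parameters (mean, variance) costs (1/2) * log2(n) bits.
    """
    n = len(xs)
    mean = sum(xs) / n
    # Floor the variance at eps**2 so the logarithm stays finite.
    var = max(sum((x - mean) ** 2 for x in xs) / n, eps ** 2)
    data_bits = 0.5 * n * math.log2(2 * math.pi * math.e * var) - n * math.log2(eps)
    param_bits = math.log2(n)  # 2 parameters * (1/2) * log2(n)
    return data_bits + param_bits

def best_single_cut(xs, min_size=2, eps=0.01):
    """Return (cut_index, total_bits); cut_index is None if no cut is cheaper."""
    n = len(xs)
    # One bit states whether a cut is present at all.
    best_cut, best_bits = None, 1.0 + segment_bits(xs, eps)
    n_positions = n - 2 * min_size + 1
    if n_positions < 1:
        return best_cut, best_bits
    cut_bits = math.log2(n_positions)  # uniform code over admissible cut positions
    for k in range(min_size, n - min_size + 1):
        bits = 1.0 + cut_bits + segment_bits(xs[:k], eps) + segment_bits(xs[k:], eps)
        if bits < best_bits:
            best_cut, best_bits = k, bits
    return best_cut, best_bits

# Two low-noise regions whose means are far apart: the cut pays for itself.
separated = [0.1, -0.1, 0.05, -0.05, 0.0, 0.1, -0.1, 0.0,
             5.1, 4.9, 5.05, 4.95, 5.0, 5.1, 4.9, 5.0]
# A single region: stating a boundary only adds to the message length.
single_region = [0.1, -0.1] * 8

cut_separated, _ = best_single_cut(separated)
cut_single, _ = best_single_cut(single_region)
```

Under these assumptions the cut at index 8 is selected for `separated`, while no cut is selected for `single_region`, which mirrors the dependence on noise and degree of separation described above.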
Copyright information
© 1996 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Baxter, R.A., Oliver, J.J. (1996). The kindest cut: Minimum message length segmentation. In: Arikawa, S., Sharma, A.K. (eds) Algorithmic Learning Theory. ALT 1996. Lecture Notes in Computer Science, vol 1160. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-61863-5_36
DOI: https://doi.org/10.1007/3-540-61863-5_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-61863-8
Online ISBN: 978-3-540-70719-6
eBook Packages: Springer Book Archive