Abstract
Many current recognition systems use variations on constrained tree search to locate objects in cluttered environments. If the system is simply finding instances of an object known to be in the scene, then previous formal analysis has shown that the expected amount of search is quadratic in the number of model and data features when all the data is known to come from a single object, but is exponential when spurious data is included. If one can group the data into subsets likely to have come from a single object, then terminating the search once a “good enough” interpretation is found reduces the expected search to cubic. Without successful grouping, terminated search is still exponential. These results apply to finding instances of a known object in the data. What happens when the object is not present? In this article, we turn to the problem of selecting models from a library, and examine the combinatorial cost of determining that an incorrectly chosen candidate object is not present in the data. We show that the expected search is again exponential, implying that naive approaches to library indexing are likely to carry an expensive overhead, since an exponential amount of work is needed to weed out each incorrect model. The analytic results are shown to be in agreement with empirical data for cluttered object recognition.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Ayache, N., and Faugeras, O.D. 1986. HYPER: A new approach for the recognition and positioning of two-dimensional objects. IEEE Trans. Patt. Anal. Mach. Intell. 8(1): 44–54.
Ballard, D.H. 1981. Generalizing the Hough transform to detect arbitrary patterns. Pattern Recognition 13(2): 111–122.
Bolles, R.C., and Cain, R.A. 1982. Recognizing and locating partially visible objects: The local feature focus method. Intern. J. Robotics Res. 1(3): 57–82.
Cass, T.A. 1988. A robust parallel implementation of 2D model-based recognition. IEEE Conf. Comput. Vision, Patt. Recog., Ann Arbor, MI, pp. 879–884.
Cyganski, D., and Orr, J.A. 1985. Applications of tensor theory to object recognition and orientation determination. IEEE Trans. Patt. Anal. Mach. Intell. 7(6): 662–673.
Davis, L.S. 1982. Hierarchical generalized Hough transforms and line-segment based generalized Hough transforms. Pattern Recognition. 15: pp. 277.
Flynn, P.J., and Jain, A.K. 1991. BONSAI: 3D object recognition using constrained search. IEEE Trans. Patt. Anal. Mach. Intell. 13(10):1066–1075.
Freuder, E.C., 1978. Synthesizing constraint expressions. Comm. ACM 21(11): 958–966.
Freuder, E.C. 1982. A sufficient condition for backtrack-free search. J. Assoc. Comput. Mach. 29(1):24–32
Gaschnig, J. 1979. Performance measurement and analysis of certain search algorithms. Ph.D. thesis, Carnegie-Mellon University, Computer Science.
Gleason, G., and Agin, G.J. 1979. A modular vision system for sensorcontrolled manipulation and inspection. Proc. 9th Intern. Symp. Indust. Robots, pp. 57–70.
Graham, R.L., Knuth, D.E., and Patashnik, O. 1989. Concrete Mathematics, Addison-Wesley: Reading, MA.
Grimson, W.E.L. 1989. On the recognition of curved objects. IEEE Trans. Patt. Anal. Mach. Intell. 11(6): 632–643.
Grimson, W.E.L. 1990a. The combinatorics of object recognition in cluttered environments using constrained search. Artificial Intelligence, 44(1–2): 121–166.
Grimson, W.E.L. 1990b. The effect of indexing on the complexity of object recognition. Proc. 3rd Intern. Conf. Comput. Vision, Osaka, Japan, pp. 644–651.
Grimson, W.E.L. 1991. The combinatorics of heuristic search termination for object recognition in cluttered environments. IEEE Trans. Patt. Anal. Mach. Intell. 13(9):920–935.
Grimson, W.E.L., and Huttenlocher, D.P. 1988. On the sensitivity of the Hough transform for object recognition. Proc. 2nd Intern. Conf. Comput. Vision, Tarpon Springs, FL., pp. 700–706.
Grimson, W.E.L., and Huttenlocher, D.P. 1989. On choosing thresholds for terminating search in object recognition. Memo 1110, M.I.T. Artificial Intelligence Laboratory (also to appear in IEEE Trans. Patt. Anal. Mach. Intell. 1991). 13(12):1201–1213.
Grimson, W.E.L., and Lozano-Pérez, T. 1984. Model-based recognition and localization from sparse range or tactile data. Intern. J. Robotics Res., 3(3): 3–35.
Grimson, W.E.L., and Lozano-Pérez, T. 1987. Localizing overlapping parts by searching the interpretation tree. IEEE Trans. Patt. Anal. Mach. Intell. 9(4): 469–482.
Haralick, R.M., and Elliot, G.L. 1980. Increasing tree search efficiency for constraint satisfaction problems. Artificial Intelligence 14: 263–313.
Haralick, R.M., and Shapiro, L.G. 1979. The consistent labeling problem: Part 1: IEEE Trans. Patt. Anal. Machine Intell., 1(4), pp. 173–184.
Hough, P.V.C. 1962. Methods and means for recognizing complex patterns. U.S. Patent 3069654.
Hu, M.K. 1962. Visual pattern recognition by moment invariants. IRE Trans. Inform. Theory 8: 179–187.
Huttenlocher, D.P. and Ullman, S. 1987. Object recognition using alignment. Proc. 1st Intern. Conf. Comput. Vision, London, pp. 102–111.
Illingworth, J., and Kittler, J. 1988. A survey of the Hough transform. Comput. Vision, Graphics, Image Proc. 44: 87–116.
Knapman, J. 1987. 3D model identification from stereo data. Proc. 1st Intern. Conf. Comput. Vision, London, pp. 547–551.
Lamdan, Y., Schwartz, J.T., and Wolfson, H.J. 1988. Object recognition by affine invariant matching. Proc. IEEE Conf. Comput. Vision. Patt. Recog., pp. 335–344.
Lowe, D.G. 1985. Perceptual Organization and Visual Recognition. Kluwer Academic Publishers: Boston.
Mackworth, A.K. 1977. Consistency in networks of constraints. Artificial Intelligence 8: 99–118.
Mackworth, A.K., and Freuder, E.C. 1985. The complexity of some polynomial network consistency algorithms for constraint satisfaction problems. Artificial Intelligence 25: 65–74.
Murray, D.W., and Cook, D.B. 1988. Using the orientation of fragmentary 3D edge segments for polyhedral object recognition. Intern J. Comput. Vision. 2(2): 153–169.
Nudel, B. 1983. Consistent-labeling problems and their algorithms: Expected-complexities and theory-based heuristics. Artificial Intelligence 21: 135–178.
Silberberg, T.M., Harwood, D.A., and Davis, L.S. 1986. Object recognition using oriented model points. Comput. Vision. Graphics. Image Proc. 35: 47–71.
Stockman, G., Kopstein, S., and Benett, S. 1982. Matching images to models for registration and object detection via clustering. IEEE Trans. Patt. Anal. Mach. Intell. 3(3): 229–241.
Teague, M.R. 1980. Image analysis via the general theory of moments. J. Optical Soc. Amer. 70: 920–930.
Thompson, D.W., and Mundy, J.L. 1987. Three-dimensional model matching from an unconstrained viewpoint. IEEE Intern. Conf. Robotics Autom., Raleigh, NC, pp. 28–220.
Wang, Y.F., Magee, M.J., and Aggarwal, J.K. 1984. Matching three-dimensional objects using silhouettes. IEEE Trans. Patt. Anal. and Mach. Intell. 6(4): 513–518.
Zahn, C.T. and Roskies, R.Z. 1972. Fourier descriptors for plane closed curves. IEEE Trans. Comput. 21(3): 269–281.
Author information
Authors and Affiliations
Additional information
This report describes research done at the Artificial Intelligence Laboratory of the Massachusetts Institute of Technology, and was funded in part by an Office of Naval Research University Research Initiative grant under contract N00014-86-K-0685, and in part by the Advanced Research Projects Agency of the Department of Defense under Army contract number DACA76-85-C-0010 and under Office of Naval Research contract N00014-85-K-0124. The author was also supported in part by the Matsushita Chair of Computer Science and Engineering, and by NSF contract number IRI-8900267. An earlier, shorter version of these results appeared in Grimson [1990b].
Rights and permissions
About this article
Cite this article
Eric, W., Grimson, L. The cost of choosing the wrong model in object recognition by constrained search. Int J Comput Vision 7, 195–210 (1992). https://doi.org/10.1007/BF00126393
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF00126393