Abstract
The cross ratio of four colinear points is of fundamental importance in model based vision, because it is the simplest numerical property of an object that is invariant under projection to an image. It provides a basis for algorithms to recognise objects from images without first estimating the position and orientation of the camera.
A quantitative analysis of the effectiveness of the cross ratio in model based vision is made. A given imageI of four colinear points is classified by making comparisons between the measured cross ratio τ of the four image points and the cross ratios stored in the model database. The imageI is accepted as a projection of an objectO σ with cross ratio σ if |τ−σ|≤ntu, wheren is the standard deviation of the image noise,t is a threshold andu=∥∇τ∥. The performance of the cross ratio is described quantitatively by the probability of rejectionR, the probability of false alarmF and the probability of misclassificationp σ(ς), defined for two model cross ratios σ, ς. The trade off between these different probabilities is determined byt.
It is assumed that in the absence of an object the image points have identical Gaussian distributions, and that in the presence of an object the image points have the appropriate conditional densities. The measurements of the image points are subject to small random Gaussian perturbations. Under these assumptions the trade offs betweenR,F andp σ(ς) are given to a good approximation byR=2(1−Ф(t)),F=r F ∈t,\(\sqrt {p_\sigma (\varsigma )} = e_\sigma \in t\left| {\sigma - \varsigma } \right|^{ - 1} \)∈t|σ−ς|−1, where ε is the relative noise level, Ф is cumulative distribution function for the normal distribution,r F is constant, ande σ is a function of σ only. The trade off betweenR andF is obtained in Maybank (1994). In this paper the trade off betweenR andp σ(ς) is obtained.
It is conjectured that the general form of the above trade offs betweenR,F andp σ(ς) is the same for a range of invariants useful in model based vision. The conjecture prompts the following definition: an invariant which has trade offs betweenR,F,p σ(ς) of the above form is said to benon-degenerate for model based vision.
The consequences of the trade off betweenR andp σ(ς) are examined. In particular, it is shown that for a fixed overall probability of misclassification there is a maximum possible model cross ratio σ m , and there is a maximum possible numberN of models. Approximate expressions for σ m andN are obtained. They indicate that in practice a model database containing only cross ratio values can have a size of order at most ten, for a physically plausible level of image noise, and for a probability of misclassification of the order 0.1.
Similar content being viewed by others
References
Cramér, H. 1945.Mathematical Models of Statistics, Princeton Mathematical Series, Vol. 9. Princeton University Press: Princeton (eighteenth printing 1991).
Devijver, P.A. and Kittler, J., 1982.Pattern Recognition: a statistical approach, Prentice-Hall: London, UK.
Forsyth, D., Mundy, J.L., Zisserman, A., Coelho, C., Heller, A., and Rothwell, C., 1991. Invariant descriptors for 3D object recognition and pose, IEEE Trans. Pattern Analysis and Machine Intelligence, 13, pp. 971–991.
Helstrom, C.W., 1960.Statistical Theory of Signal Detection, International Series of Monographs on Electronics and Instrumentation, Pergamon Press: Oxford.
Maybank, S.J., 1994c. Classification based on the cross ratio. In Mundy, J.L., Zisserman, A. and Forsyth, D. eds.Applications of Invariance in Computer Vision. Lecture Notes in Computer Science, Vol. 825, pp. 453–472, Springer-Verlag: Berlin.
Maybank, S.J., 1994. Probabilistic analysis of the application of the cross ratio to model based vision, Submitted toIJCV.
Maybank, S.J. and Beardsley, P.A., 1994a. Applications of invariants to model based vision, To appear inJournal of AppliedStatistics.
Maybank, S.J. and Beardsley, P.A. 1994b. Experimental investigation of the probability of misclassification, In preparation.
Maybank, S.J., 1993. Classification based on the cross ratio. In Mundy, J.L. and Zisserman, A. eds.Applications of Invariance to Computer Vision II, Proc. Second European-US Workshop on Invariance, Ponta Delgada, Azores, 9–14 October 1993, pp. 113–132
Mundy, J.L. and Zisserman, A., eds. 1992a.Geometric Invariance in Computer Vision, MIT Press: Cambridge MA, USA.
Mundy, J.L. and Zisserman, A., 1992b. Towards a new framework for vision, In Mundy, J.L. and Zisserman, A., eds.,Geometric Invariance in Computer Vision, pp. 1–39. MIT Press: Cambridge MA, USA.
Mundy, J.L. and Zisserman, A., eds. 1994.Applications of Invariance in Computer Vision II, Proc. Second European-US Workshop on Invariance, Ponta Delgada, Azores, 9–14 October 1993, To appear in Lecture Notes in Computer Science, Springer Verlag: Berlin.
Semple, J.G. and Kneebone, G.T., 1952.Algebraic Projective Geometry, Oxford University Press: Oxford, UK.
Wolfram, S., 1991.Mathematica: a system for doing mathematics by computer, Addison Wesley: Redwood City, USA. 2nd edition.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Maybank, S.J. Probabilistic analysis of the application of the cross ratio to model based vision: Misclassification. Int J Comput Vision 14, 199–210 (1995). https://doi.org/10.1007/BF01679682
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF01679682