Abstract
Bag of Words model has been widely used in the task of Object Categorization, and SIFT, computed for interest local regions, has been extracted from the image as the representative features, which can provide robustness and invariance to many kind of image transformation. Even though, they can only capture the local information, while be blind to the large picture of the image. Besides, the same part of different objects(like the head lamp of different cars) may also not able to be identically represented by SIFT and the like. In order to efficiently represent the object category, we design a new local descriptor–structural context, which shares the similar idea as Shape Context, capturing the relationship between current point and the remaining points, which is the extrema from the scale space of the image and can to some extent represent the structural of the image. This newly proposed descriptor can provide more discriminative representation of the object category, being invariant to intra-class difference, scale change, illumination variation, clutter noise, partial occlusion, small range of deformation, rotation and viewpoint change. Experiments on object categorization and image matching have proved the effectiveness of our newly proposed descriptor in describing the images of the same category.
Preview
Unable to display preview. Download preview PDF.
References
Belongie, S., Malik, J., Puzicha, J.: Shape matching and object recognition using shape context. IEEE Trans. Pattern Analysis and Machine Intelligence 2, 509–522 (2002)
Berg, A.C., Malik, J.: Geometric blur for template matching. In: Computer Vision and Pattern Recognition (2001)
Comaniciu, D., Meer, P.: Mean shift: A robust approach toward feature space analysis. IEEE Transaction on Pattern Analysis and Machine Intelligence (2002)
Csurka, G., Dance, C.R., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: ECCV 2004 (2004)
Fergus, R., Perona, P., Zisserman, A.: Object class recognition by unsupervised scale-invariant learning. In: CVPR 2003, pp. 264–271 (2003)
Ferrari, V., Fevrier, L., Jurie, F., Schmid, C.: Groups of adjacent contour segments for object detection. IEEE Transactions on Pattern Analysis and Machine Intelligence (2006)
Lindeberg, T.: Feature detection with automatic scale selection. International Journal of Computer Vision (1998)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints (2004)
Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis And Machine Intelligence 27 (2005)
Mikolajczyk, K., Tuytelaars, T., Schmid, C.: A comparison of affine region detectors. International Journal of Computer Vision (2006)
Mundy, J.L.: Object recognition in the geometric era: A retrospective. In: Toward Category Level Object Recognition, pp. 3–29. Springer, Heidelberg (2006)
Roberts, L.G.: Machine perception of three-dimensional solids. In: Optical and Electro-Optical Information Processing, pp. 159–197. MIT Press, Cambridge (1965)
Savarese, S., Winn, J., Criminisi, A.: Discriminative object class models of appearance and shape by correlatons. In: CVPR 2006, pp. 2033–2040. IEEE Computer Society, Los Alamitos (2006)
Scott, D.W.: Multivariate Density Estimation: Theory, Practice, and Visualization. John Wiley, New York (1992)
Sivic, J., Russell, B.C., Efros, A.A., Zisserman, A., Freeman, W.T.: Discovering Objects and their Localization in Images. In: ICCV, vol. 1 (2005)
Turk, M., Pentland, A.: Face recognition using eigenfaces. In: Proc. Conf. Computer Vision and Pattern Recognition (1991)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, W., Yang, Y. (2009). Structural Context for Object Categorization. In: Muneesawang, P., Wu, F., Kumazawa, I., Roeksabutr, A., Liao, M., Tang, X. (eds) Advances in Multimedia Information Processing - PCM 2009. PCM 2009. Lecture Notes in Computer Science, vol 5879. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10467-1_24
Download citation
DOI: https://doi.org/10.1007/978-3-642-10467-1_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10466-4
Online ISBN: 978-3-642-10467-1
eBook Packages: Computer ScienceComputer Science (R0)