Structural Context for Object Categorization

Liu, Wei; Yang, Yubin

doi:10.1007/978-3-642-10467-1_24

Wei Liu²² &
Yubin Yang²²

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5879))

Included in the following conference series:

Pacific-Rim Conference on Multimedia

1421 Accesses
2 Citations

Abstract

Bag of Words model has been widely used in the task of Object Categorization, and SIFT, computed for interest local regions, has been extracted from the image as the representative features, which can provide robustness and invariance to many kind of image transformation. Even though, they can only capture the local information, while be blind to the large picture of the image. Besides, the same part of different objects(like the head lamp of different cars) may also not able to be identically represented by SIFT and the like. In order to efficiently represent the object category, we design a new local descriptor–structural context, which shares the similar idea as Shape Context, capturing the relationship between current point and the remaining points, which is the extrema from the scale space of the image and can to some extent represent the structural of the image. This newly proposed descriptor can provide more discriminative representation of the object category, being invariant to intra-class difference, scale change, illumination variation, clutter noise, partial occlusion, small range of deformation, rotation and viewpoint change. Experiments on object categorization and image matching have proved the effectiveness of our newly proposed descriptor in describing the images of the same category.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Belongie, S., Malik, J., Puzicha, J.: Shape matching and object recognition using shape context. IEEE Trans. Pattern Analysis and Machine Intelligence 2, 509–522 (2002)
Article Google Scholar
Berg, A.C., Malik, J.: Geometric blur for template matching. In: Computer Vision and Pattern Recognition (2001)
Google Scholar
Comaniciu, D., Meer, P.: Mean shift: A robust approach toward feature space analysis. IEEE Transaction on Pattern Analysis and Machine Intelligence (2002)
Google Scholar
Csurka, G., Dance, C.R., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: ECCV 2004 (2004)
Google Scholar
Fergus, R., Perona, P., Zisserman, A.: Object class recognition by unsupervised scale-invariant learning. In: CVPR 2003, pp. 264–271 (2003)
Google Scholar
Ferrari, V., Fevrier, L., Jurie, F., Schmid, C.: Groups of adjacent contour segments for object detection. IEEE Transactions on Pattern Analysis and Machine Intelligence (2006)
Google Scholar
Lindeberg, T.: Feature detection with automatic scale selection. International Journal of Computer Vision (1998)
Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints (2004)
Google Scholar
Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis And Machine Intelligence 27 (2005)
Google Scholar
Mikolajczyk, K., Tuytelaars, T., Schmid, C.: A comparison of affine region detectors. International Journal of Computer Vision (2006)
Google Scholar
Mundy, J.L.: Object recognition in the geometric era: A retrospective. In: Toward Category Level Object Recognition, pp. 3–29. Springer, Heidelberg (2006)
Chapter Google Scholar
Roberts, L.G.: Machine perception of three-dimensional solids. In: Optical and Electro-Optical Information Processing, pp. 159–197. MIT Press, Cambridge (1965)
Google Scholar
Savarese, S., Winn, J., Criminisi, A.: Discriminative object class models of appearance and shape by correlatons. In: CVPR 2006, pp. 2033–2040. IEEE Computer Society, Los Alamitos (2006)
Google Scholar
Scott, D.W.: Multivariate Density Estimation: Theory, Practice, and Visualization. John Wiley, New York (1992)
Book MATH Google Scholar
Sivic, J., Russell, B.C., Efros, A.A., Zisserman, A., Freeman, W.T.: Discovering Objects and their Localization in Images. In: ICCV, vol. 1 (2005)
Google Scholar
Turk, M., Pentland, A.: Face recognition using eigenfaces. In: Proc. Conf. Computer Vision and Pattern Recognition (1991)
Google Scholar

Download references

Author information

Authors and Affiliations

State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, 210093, China
Wei Liu & Yubin Yang

Authors

Wei Liu
View author publications
You can also search for this author in PubMed Google Scholar
Yubin Yang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Naresuan University, 65000, Phisanulok, Thailand
Paisarn Muneesawang
Microsoft Research Asia, 100109, Beijing, China
Feng Wu
Tokyo Institute of Technology, 226-8503, Yokohama, Japan
Itsuo Kumazawa
Mahanakorn University of Technology, 10530, Bankok, Thailand
Athikom Roeksabutr
Institute of Information Science, Academia Sinica, Taipei, Taiwan
Mark Liao
Chinese University of Hong Kong, Shatin, N.T., Hong Kong,
Xiaoou Tang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liu, W., Yang, Y. (2009). Structural Context for Object Categorization. In: Muneesawang, P., Wu, F., Kumazawa, I., Roeksabutr, A., Liao, M., Tang, X. (eds) Advances in Multimedia Information Processing - PCM 2009. PCM 2009. Lecture Notes in Computer Science, vol 5879. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10467-1_24

Download citation

DOI: https://doi.org/10.1007/978-3-642-10467-1_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10466-4
Online ISBN: 978-3-642-10467-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics