Skip to main content

Structural Context for Object Categorization

  • Conference paper
Advances in Multimedia Information Processing - PCM 2009 (PCM 2009)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5879))

Included in the following conference series:

Abstract

Bag of Words model has been widely used in the task of Object Categorization, and SIFT, computed for interest local regions, has been extracted from the image as the representative features, which can provide robustness and invariance to many kind of image transformation. Even though, they can only capture the local information, while be blind to the large picture of the image. Besides, the same part of different objects(like the head lamp of different cars) may also not able to be identically represented by SIFT and the like. In order to efficiently represent the object category, we design a new local descriptor–structural context, which shares the similar idea as Shape Context, capturing the relationship between current point and the remaining points, which is the extrema from the scale space of the image and can to some extent represent the structural of the image. This newly proposed descriptor can provide more discriminative representation of the object category, being invariant to intra-class difference, scale change, illumination variation, clutter noise, partial occlusion, small range of deformation, rotation and viewpoint change. Experiments on object categorization and image matching have proved the effectiveness of our newly proposed descriptor in describing the images of the same category.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Belongie, S., Malik, J., Puzicha, J.: Shape matching and object recognition using shape context. IEEE Trans. Pattern Analysis and Machine Intelligence 2, 509–522 (2002)

    Article  Google Scholar 

  2. Berg, A.C., Malik, J.: Geometric blur for template matching. In: Computer Vision and Pattern Recognition (2001)

    Google Scholar 

  3. Comaniciu, D., Meer, P.: Mean shift: A robust approach toward feature space analysis. IEEE Transaction on Pattern Analysis and Machine Intelligence (2002)

    Google Scholar 

  4. Csurka, G., Dance, C.R., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: ECCV 2004 (2004)

    Google Scholar 

  5. Fergus, R., Perona, P., Zisserman, A.: Object class recognition by unsupervised scale-invariant learning. In: CVPR 2003, pp. 264–271 (2003)

    Google Scholar 

  6. Ferrari, V., Fevrier, L., Jurie, F., Schmid, C.: Groups of adjacent contour segments for object detection. IEEE Transactions on Pattern Analysis and Machine Intelligence (2006)

    Google Scholar 

  7. Lindeberg, T.: Feature detection with automatic scale selection. International Journal of Computer Vision (1998)

    Google Scholar 

  8. Lowe, D.G.: Distinctive image features from scale-invariant keypoints (2004)

    Google Scholar 

  9. Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis And Machine Intelligence 27 (2005)

    Google Scholar 

  10. Mikolajczyk, K., Tuytelaars, T., Schmid, C.: A comparison of affine region detectors. International Journal of Computer Vision (2006)

    Google Scholar 

  11. Mundy, J.L.: Object recognition in the geometric era: A retrospective. In: Toward Category Level Object Recognition, pp. 3–29. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  12. Roberts, L.G.: Machine perception of three-dimensional solids. In: Optical and Electro-Optical Information Processing, pp. 159–197. MIT Press, Cambridge (1965)

    Google Scholar 

  13. Savarese, S., Winn, J., Criminisi, A.: Discriminative object class models of appearance and shape by correlatons. In: CVPR 2006, pp. 2033–2040. IEEE Computer Society, Los Alamitos (2006)

    Google Scholar 

  14. Scott, D.W.: Multivariate Density Estimation: Theory, Practice, and Visualization. John Wiley, New York (1992)

    Book  MATH  Google Scholar 

  15. Sivic, J., Russell, B.C., Efros, A.A., Zisserman, A., Freeman, W.T.: Discovering Objects and their Localization in Images. In: ICCV, vol. 1 (2005)

    Google Scholar 

  16. Turk, M., Pentland, A.: Face recognition using eigenfaces. In: Proc. Conf. Computer Vision and Pattern Recognition (1991)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Liu, W., Yang, Y. (2009). Structural Context for Object Categorization. In: Muneesawang, P., Wu, F., Kumazawa, I., Roeksabutr, A., Liao, M., Tang, X. (eds) Advances in Multimedia Information Processing - PCM 2009. PCM 2009. Lecture Notes in Computer Science, vol 5879. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10467-1_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-10467-1_24

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-10466-4

  • Online ISBN: 978-3-642-10467-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics