Abstract
The progress in the area of object recognition in the last decade is impressive. The literature reports new descriptors, new strategies, new ways to combine descriptors and classifiers and new problems in a so fast pace that it is hard to follow the whole area. A recent problem in the area is the fine-grained categorization. In this work, to address this problem, we propose a descriptor based on the application of morphological granulometries in the map of edges of an image. This descriptor is used to characterize the distribution of lengths and orientations of edges and to build a model for generic objects. We also propose a new spatial quantization with an arbitrary number of levels and divisions in each level. This quantization is so flexible that adjacent regions may have overlapping areas to avoid breakages in the structures that are near the border of the regions as it happens in the traditional spatial pyramids. Both approaches are used in a challenging and recent object recognition problem, the categorization of very similar classes. The proposed descriptor was used along with other descriptors and the overall performance of our solution to this problem was about 8% better than other work using the bag-of-words approach reported in the literature. Our descriptor showed a result 12% better when compared to the results of other edge-related descriptor in the categorization of very similar classes.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Lazebnik, S., Schmid, C., Ponce, J.: Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories. In: CVPR, New York, pp. 2169–2178 (2006)
Wang, J., Yang, J., Yu, K., Lv, F., Huang, T., Gong, Y.: Locality-constrained linear coding for image classification. In: CVPR, San Francisco, pp. 3360–3367 (2010)
Bay, H., Ess, A., Tuytelaars, T., Gool, L.: Speeded-Up Robust Features (SURF). Computer Vision and Image Understanding 110, 346–359 (2008)
Selim, S., Ismail, M.: K-Means-Type Algorithms: A Generalized Convergence Theorem and Characterization of Local Optimality. IEEE Trans. on PAMI 6, 81–87 (1984)
Bosch, A., Zisserman, A., Munoz, X.: Representing shape with a spatial pyramid kernel. In: Proc. ACM Int. Conf. on Image and Video Retrieval, pp. 401–408 (2007)
Matheron, G.: Random sets and integral geometry, vol. 261. Wiley, New York (1975)
Newell III., J.: Pixel classification by morphological granulometric features. Thesis, Rochester Institute of Technology (1991)
Soille, P.: Morphological Image Analysis: Principles and Applications, 2nd edn. Springer (2002)
Dalal, N., Triggs, B.: Histograms of Oriented Gradients for Human Detection. In: CVPR, San Diego, USA, pp. 886–893 (2005)
Yao, B., Bradski, G., Fei-Fei, L.: A codebook-free and annotation-free approach for fine-grained image categorization. In: CVPR, Providence, USA, pp. 3466–3473 (2012)
Wah, C., Branson, S., Welinder, P., Perona, P., Belongie, S.: The Caltech-UCSD Birds-200-2011 Dataset. California Institute of Technology (2011) CNS-TR-2011-001
Branson, S., Wah, C., Schroff, F., Babenko, B., Welinder, P., Perona, P., Belongie, S.: Visual recognition with humans in the loop. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 438–451. Springer, Heidelberg (2010)
Lara, A., Hirata Jr., R.: Combining features to a class-specific model in an instance detection framework. SIBGRAPI (2011)
Canny, J.: A computational approach to edge detection. IEEE Trans. on PAMI 8, 679–698 (1986)
Lepetit, V., Fua, P.: Keypoint recognition using randomized trees. IEEE Trans. on PAMI 28, 1465–1479 (2006)
Bosch, A., Zisserman, A., Munoz, X.: Image Classification using Random Forests and Ferns. In: ICCV, Rio de Janeiro, Brazil, pp. 1–8 (2007)
Kohavi, R.: A study of cross-validation and bootstrap for accuracy estimation and model selection. In: International Joint Conference on Artificial Intelligence, pp. 1137–1145 (1995)
Fei-Fei, L., Fergus, R., Perona, P.: One-Shot Learning of Object Categories. IEEE Trans. on PAMI 28, 594–611 (2006)
Sivic, J., Zisserman, A.: Video Google: A Text Retrieval Approach to Object Matching in Videos. In: ICCV, Nice, France, pp. 1470–1478 (2003)
Baeza-Yates, R., Ribeiro-Neto, B.: Modern information retrieval. Addison-Wesley Longman Publishing Co. Inc., Boston (2011)
Chih-Fong, T.: Bag-of-Words Representation in Image Annotation: A Review. ISRN Artificial Intelligence 2010 (2012)
Ojala, T., Pietikainen, M., Harwood, D.: Performance evaluation of texture measures with classification based on Kullback discrimination of distributions. In: 12th IAPR Int. Conf. on Pattern Recognition, pp. 582–585 (1994)
Zadrozny, B., Elkan, C.: Transforming classifier scores into accurate multiclass probability estimates. In: 8th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, pp. 694–699 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lara, A.C., Hirata, R. (2013). A Granulometry Based Descriptor for Object Categorization. In: Hendriks, C.L.L., Borgefors, G., Strand, R. (eds) Mathematical Morphology and Its Applications to Signal and Image Processing. ISMM 2013. Lecture Notes in Computer Science, vol 7883. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38294-9_35
Download citation
DOI: https://doi.org/10.1007/978-3-642-38294-9_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38293-2
Online ISBN: 978-3-642-38294-9
eBook Packages: Computer ScienceComputer Science (R0)