Abstract
In this paper, we address a problem of detecting regions (bounding box) containing birds in images, which is closely related to the task of fine-grained visual classification (FGVC) of bird images. We note that there exist various sophisticated approaches proposed for this task within the overall framework of FGVC. However, we demonstrate that the problem of bird region detection, by itself, can be addressed in a rather simplistic manner. Our approach employs HOG features and a multi-scale detection framework using the SVM classifier, but where real-valued scores (or weights) from the SVM are used rather than the conventional binary decision labels. We validate our approach on a variety of bird images from the CUB-200 bird image data set and show that the proposed approach yields reasonable quality bird region detection.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
N. Kumar, P. Belhumeur, A. Biswas, D. Jacobs, W. John Kress, I. Lopez, and J. V. B. Soares, Leafsnap: A computer vision system for automatic plant species identification, European Conference on Computer Vision (ECCV), 2012.
M. Martineau, D. Conte, R. Raveaux, I. Arnault, D. Munier, and G. Venturini, A survey on image-based insect classification, Pattern Recognition, vol. 65, 2017, pp. 273–284.
L. Xie, Q. Tian, R. Hong, S. Yan, and B. Zhang, Hierarchical part matching for fine-grained visual categorization, Int. Conference on Computer Vision (ICCV), 2013.
N. Zhang, R. Farrell, F. Iandola, and T. Darrel, Deformable part descriptors for fine-grained recognition and attribute prediction, IEEE Int. Conference on Computer Vision (ICCV), 2013.
K. Duan, D. Parikh, D. Crandall, and K. Grauman, Discovering localized attributes for fine-grained recognition, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012.
N. Zhang, J. Donahue, R. Girshick, and T. Darrell, Part-based R-CNNs for fine-grained category detection, European Conference on Computer Vision (ECCV), 2014.
P. Welinder, S. Branson, T. Mita, C. Wah, F. Schroff, S. Belongie, and P. Perona, Caltech-UCSD Birds 200. California Institute of Technology. CNS-TR-2010-001. 2010.
J. Krause, H. Jin, J. Yang, and F. Li, Fine-grained recognition without part annotations, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
A. Angelova and A. Niculescu-Mizil, Feature combination with multi-kernel learning for fine-grained visual classification, IEEE Winter Conference on Applications of Computer Vision (WACV), 2014.
B. Yao, A. Khosla and F. Li, Combining randomization and discrimination for fine-grained image categorization, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011.
S. Huang, Z. Xu, D. Tao, and Y. Zhang, Part-Stacked CNN for Fine-Grained Visual Categorization, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
X. Zhang, H. Xiong, W. Zhou, W. Lin, Q. Tian, Picking deep filter responses for fine-grained image recognition, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
A. Angelova and S. Zhu, Efficient object detection and segmentation for fine-grained recognition, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013.
M. Das and R. Manmatha, Automatic segmentation and indexing in a database of bird images, IEEE Int. Conference on Computer Vision (ICCV), 2001.
I. Lillo, J. Niebles, and A. Soto, Bird species classification based on color features, IEEE Int. Conference on on Systems, Man, and Cybernetics (SMC), 2013.
R. Yoshihashi, R. Kawakami, M. Iida, and T. Naemura, Evaluation of bird detection using time-lapse images around a wind farm, EWEA 2015 Annual Event, 2015.
A. Takeki, T. Tuan Trinh, R. Yoshihashi, R. Kawakami, M. Iida, and T. Naemura, Detection of small birds in large images by combining a deep detector with semantic segmentation, Int. Conference on Image Processing (ICIP), 2016.
T. Lin, A. Roy Chowdhury, and S. Maji. Bilinear CNN models for fine-grained visual recognition, International Conference on Computer Vision (ICCV) 2015.
J. Kraus, B. Sapp, A. Howard, H. Zhou, A. Toshev, T. Duerig, J. Philbin, and F. Li, The unreasonable effectiveness of noisy data for fine-grained recognition, European Conference on Computer Vision (ECCV), 2016.
Z. Ge, A. Bewley, C. McCool, B. Upcroft, P. Corke, and C. Sanderson, Fine-grained classification via mixture of deep convolutional neural networks, IEEE Winter Conference on Applications of Computer Vision (WACV), 2016.
N. Dalal and B. Triggs, Histograms of oriented gradients for human detection, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2005.
Dlib C++ library, http://blog.dlib.net/2014/02/dlib-186-released-make-your-own-object.html
C. Chang and C. Lin, LIBSVM: A library for support vector machines, ACM Transactions on Intelligent Systems and Technology, vol. 2, no. 3, 2011, pp. 1–27.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kumar, R., Kumar, A., Bhavsar, A. (2018). Bird Region Detection in Images with Multi-scale HOG Features and SVM Scoring. In: Chaudhuri, B., Kankanhalli, M., Raman, B. (eds) Proceedings of 2nd International Conference on Computer Vision & Image Processing . Advances in Intelligent Systems and Computing, vol 704. Springer, Singapore. https://doi.org/10.1007/978-981-10-7898-9_29
Download citation
DOI: https://doi.org/10.1007/978-981-10-7898-9_29
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-7897-2
Online ISBN: 978-981-10-7898-9
eBook Packages: EngineeringEngineering (R0)