Enhanced Bird Detection from Low-Resolution Aerial Image Using Deep Neural Networks

Li, Ce; Zhang, Baochang; Hu, Hanwen; Dai, Jing

doi:10.1007/s11063-018-9871-z

Enhanced Bird Detection from Low-Resolution Aerial Image Using Deep Neural Networks

Published: 18 June 2018

Volume 49, pages 1021–1039, (2019)
Cite this article

Neural Processing Letters Aims and scope Submit manuscript

Ce Li ORCID: orcid.org/0000-0002-3081-6751¹,
Baochang Zhang^2,3,4,
Hanwen Hu² &
…
Jing Dai⁵

657 Accesses
8 Citations
3 Altmetric
Explore all metrics

Abstract

Bird detection in LR images is essential for the applications of unmanned aerial vehicles. It is still a challenging task because traditional discriminative features in high-resolution (HR) usually disappear in low-resolution (LR) images. Although recent advances in single image super-resolution (SISR) and object detection algorithms have offered unprecedented potential for computer-automated reconstructing LR images and detecting various objects, these algorithms are mainly evaluated using synthetic datasets. It is unclear how these algorithms would perform on bird images acquired in the wild and how we could gauge the progress in the real-time bird detection. This paper presents a novel bird detection framework in LR aerial images using deep neural networks (DNN). We collect a dataset named BIRD-50 and a public dataset named CUB-200 of real bird images with different scale low-resolutions. Using these datasets, we introduce a novel DNN based framework for bird detection in reconstructed HR images, which exploits the mapping function from LR to HR aerial image and detects the birds by the state-of-the-art object feature extraction and localization methods. By systematically analyzing the influence of the resolution reduction on the bird detection, the experimental results indicate that our approach has produced significantly improved detection precision for bird detection by the inclusion of SISR algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Deep learning techniques to classify agricultural crops through UAV imagery: a review

Article 05 March 2022

Abdelmalek Bouguettaya, Hafed Zarzour, … Amine Mohammed Taberkit

Automatic Flood Detection from Sentinel-1 Data Using a Nested UNet Model and a NASA Benchmark Dataset

Article Open access 12 March 2024

Binayak Ghosh, Shagun Garg, … Sandro Martinis

BEVFormer: Learning Bird’s-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers

Notes

BIRD-50 will be avaliable at the website: https://github.com/bczhang/bczhang/.

References

Stowell D, Wood M, Stylianou Y, Glotin H (2016) Bird detection in audio: a survey and a challenge. In: IEEE 26th international workshop on machine learning for signal processing
Huang C, Tsai C, Yang H (2011) An extended set of Haar-like features for bird detection based on AdaBoost. In: International conference SIP, Korea
Li W, Song D (2014) Automatic bird species detection from crowd sourced videos. IEEE Trans Autom Sci Eng 11(2):348–358
Article MathSciNet Google Scholar
Zhang J, Xu Q, Cao X et al (2014) Hierarchical incorporation of shape and shape dynamics for flying bird detection. Neurocomputing 131:179–190
Article Google Scholar
Timofte R, Smet V, Gool L (2013) Anchored neighborhood regression for fast example-based super-resolution. In: IEEE international conference on computer vision, pp 1920–1927
Rasti P, Uiboupin T, Escalera S, Anbarjafari G (2016) Convolutional neural network super resolution for face recognition in surveillance monitoring. Springer, Berlin, pp 175–184
Google Scholar
Kim J, Lee JK, Lee KM (2015) Accurate image super-resolution using very deep convolutional networks. IEEE Trans Pattern Anal Mach Intell 38(2):295–307
Google Scholar
Rusk N (2016) Accelerating the super-resolution convolutional neural network. Eur Conf Comput Vis 9905(1):35–35
Google Scholar
ElSayed A, Mahmood A, Sobh T (2017) Effect of super resolution on high dimensional features for unsupervised face recognition in the Wild. arXiv:1704.01464
Wang Z, Liu D, Yang J, Han W, Huang T (2015) Deep networks for image super-resolution with sparse prior. In: International conference on computer vision, pp 370–378
Dong C, Loy CC, He K, Tang X (2016) Image super-resolution using deep convolutional networks. IEEE Trans Pattern Anal Mach Intell 38(2):295–307
Article Google Scholar
Tai Y, Yang J, Liu X (2017) Image super-resolution via deep recursive residual network. In: IEEE conference on computer vision and pattern recognition
Dong C, Loy CC, He K, Tang X (2014) Learning a deep convolutional network for image super-resolution. In: European conference on computer vision, pp 184–199
Wang Z, Liu D, Yang J, Han W, Huang T (2015) Deep networks for image super-resolution with sparse prior. In: IEEE international conference on computer vision, pp 370–378
Zhang B, Gu J, Chen C, Han J, Su X, Cao X, Liu J (2018) One-two-one network for compression artifacts reduction in remote sensing. ISPRS J Photogramm Remote Sens. https://doi.org/10.1016/j.isprsjprs.2018.01.003
Yang L, Li C, Han J, Chen C, Ye Q, Zhang B, Cao X, Liu W (2017) Image reconstruction via manifold constrained convolutional sparse coding for image sets. IEEE J Sel Top Signal Process 11(7):1072–1081
Article Google Scholar
Dong C, Chen CL, Tang X (2016) Accelearting the super-resolution convolutional neural networks. In: European conference on computer vision
Uijlings JR, van de Sande KE, Gevers T, Smeul-ders AW (2013) Selective search for object recognition. Int J Comput Vis 104:154–171
Article Google Scholar
Zhang B, Perina A, Li Z, Murino V, Liu J, Ji R (2016) Bounding multiple gaussians uncertainty with application to object tracking. Int J Comput Vis 118(3):364–379
Article MathSciNet MATH Google Scholar
Zhang B, Luan S, Chen C, Han J, Wang W, Perina A, Shao L (2017) Latent constrained correlation filter. IEEE Trans Image Process 27:1038–1048
Article MathSciNet MATH Google Scholar
Zhang B, Li Z, Perina A, Bue A, Murino V, Liu J (2017) Adaptive local movement modeling for robust object tracking. IEEE Trans Circuits Syst Video Technol 27(7):1515–1526
Article Google Scholar
Yang CY, Yang MH (2013) Fast direct super-resolution by simple functions. In: IEEE international conference on computer vision, pp 561–568
Yang J, Wright J, Huang TS, Ma Y (2010) Image super-resolution via sparse representation. IEEE Trans Image Process 19(11):2861–2873
Article MathSciNet MATH Google Scholar
Huang JJ, Siu WC, Liu TR (2015) Fast image interpolation via random forests. IEEE Trans Image Process 24(10):3232–3245
Article MathSciNet MATH Google Scholar
Jiwon K, Lee JK, Lee KM (2016) Accurate image super-resolution using very deep convolutional networks. In: IEEE conference on computer vision and pattern recognition, pp 1646–1654
Keys RG (1981) Cubic convolution interpolation for digital image processing. IEEE Trans Acoust Speech Signal Process 29(6):1153–1160
Article MathSciNet MATH Google Scholar
Zhang K, Gao X, Tao D, Li X (2012) Single image super-resolution with non-local means and steering kernel regression. IEEE Trans Image Process 21(11):4544–4556
Article MathSciNet MATH Google Scholar
Bevilacqua M, Roumy A, Guillemot C, Alberi-Morel ML (2012) Low-complexity single-image super-resolution based on nonnegative neighbor embedding. In: British machine vision conference, pp 1–10
Dong C, Chen CL, He K (2014) Image super-resolution using deep convolutional networks. IEEE Trans Pattern Anal Mach Intell 38(2):295–307
Article Google Scholar
He K, Zhang X, Ren S, Sun J (2014) Spatial pyramid pooling in deep convolutional networks for visual recognition. In: European conference on computer vision
Girshick R (2015) Fast R-CNN. In: IEEE international conference on computer vision
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE conference on computer vision and pattern recognition
Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: IEEE conference on computer vision and pattern recognition
Sermanet P, Eigen D, Zhang X, Mathieu M, Fergus R, LeCun Y (2014) Overfeat: integrated recognition, localization and detection using convolutional networks. In: International conference on learning representations
Carreira J, Sminchisescu C (2012) CPMC: automatic object segmentation using constrained parametric min-cuts. IEEE Trans Pattern Anal Mach Intell 34:1312–1328
Article Google Scholar
Arbeláez P, Pont-Tuset J, Barron JT, Marques F, Malik J (2014) Multiscale combinatorial grouping. In: IEEE conference on computer vision and pattern recognition
Erhan D, Szegedy C, Toshev A, Anguelov D (2014) Scalable object detection using deep neural networks. In: IEEE conference on computer vision and pattern recognition
Dai J, He K, Sun J (2015) Convolutional feature masking for joint object and stuff segmentation. In: IEEE conference on computer vision and pattern recognition
Ren S, He K, Girshick R, Sun J (2017) Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137–1149
Article Google Scholar
Pinheiro PO, Collobert R, Dollar P (2015) Learning to segment object candidates. In: Neural information processing systems, pp 1990–1998
Sande KEV, Uijlings JR, Gevers T, Smeulders AW (2011) Segmentation as selective search for object recognition. In: IEEE international conference on computer vision, pp 1879–1886
Zitnick CL, Dollar P (2014) Edge boxes: locating object proposals from edges. In: European conference on computer vision, pp 391–405
Welinder P, Branson S, Mita T (2010) Caltech-UCSD Birds 200. California Institute of Technology, Pasadena
Google Scholar
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: IEEE conference on computer vision and pattern recognition, pp 770–778
Szegedy C, Liu W, Jia Y, Sermanet P et al (2015) Going deeper with convolutions. In: IEEE conference on computer vision and pattern recognition, pp 1–9
Song S, Xiao J (2016) Deep sliding shapes for a modal 3D object detection in RGB-D images. In: Computer vision and pattern recognition, pp 808–816
Zhu J, Chen X, Yuille AL (2015) DeePM: a deep part-based model for object detection and semantic part localization, arXiv:1511.07131
Johnson J, Karpathy A, Fei-Fei L (2016) Densecap: fully convolutional localization networks for dense captioning. In: Computer vision and pattern recognition, pp 4565–4574
Wang Y, Wang L, Wang H, Li P (2016) End-to-end image super-resolution via deep and shallow convolutional networks, arXiv:1607.07680
Ren S, He K, Girshick R, Zhang X, Sun J (2017) Object detection networks on convolutional feature maps. IEEE Trans Pattern Anal Mach Intell 39(7):1476–1481
Article Google Scholar
Redmon J, Farhadi A (2017) YOLO9000: better, faster, stronger. In: IEEE conference on computer vision and pattern recognition, pp 6517–6525
Verstraeten W, Vermeulen B, Stuckens J et al (2010) Webcams for bird detection and monitoring: a demonstration study. Sensors 10:3480–3503
Article Google Scholar

Download references

Acknowledgements

The work was supported by the Natural Science Foundation of China under Contract 61601466, 61672079 and 61473086, and Shenzhen Peacock Plan KQTD201611 2515134654. This work was also supported by the Open Projects Program of National Laboratory of Pattern Recognition. Ce Li and Baochang Zhang are the correspondence authors.

Author information

Authors and Affiliations

Department of Computer Science, China University of Mining and Technology, Beijing, China
Ce Li
School of Automation Science and Electrical Engineering, Beihang University, Beijing, China
Baochang Zhang & Hanwen Hu
State Key Laboratory of Satellite Navigation System and Equipment Techonology, Shijiazhuang, Hebei, China
Baochang Zhang
Shenzhen Academy of Aerospace Technology, Shenzhen, China
Baochang Zhang
China Academy of Launch Vehicle Technology R&D Center, Beijing, China
Jing Dai

Authors

Ce Li
View author publications
You can also search for this author in PubMed Google Scholar
Baochang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Hanwen Hu
View author publications
You can also search for this author in PubMed Google Scholar
Jing Dai
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Baochang Zhang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, C., Zhang, B., Hu, H. et al. Enhanced Bird Detection from Low-Resolution Aerial Image Using Deep Neural Networks. Neural Process Lett 49, 1021–1039 (2019). https://doi.org/10.1007/s11063-018-9871-z

Download citation

Published: 18 June 2018
Issue Date: 15 June 2019
DOI: https://doi.org/10.1007/s11063-018-9871-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Enhanced Bird Detection from Low-Resolution Aerial Image Using Deep Neural Networks

Abstract

Access this article

Similar content being viewed by others

Deep learning techniques to classify agricultural crops through UAV imagery: a review

Automatic Flood Detection from Sentinel-1 Data Using a Nested UNet Model and a NASA Benchmark Dataset

BEVFormer: Learning Bird’s-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation