Impact of compressed and down-scaled training images on vehicle detection in remote sensing imagery

Karim, Shahid; Zhang, Ye; Yin, Shoulin; Laghari, Asif Ali; Brohi, Ali Anwar

doi:10.1007/s11042-019-08033-x

Impact of compressed and down-scaled training images on vehicle detection in remote sensing imagery

Published: 07 August 2019

Volume 78, pages 32565–32583, (2019)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Shahid Karim¹,
Ye Zhang¹,
Shoulin Yin¹,
Asif Ali Laghari² &
…
Ali Anwar Brohi³

416 Accesses
29 Citations
Explore all metrics

Abstract

Vehicle detection in remote sensing imagery is a prominent issue over the last few years. In this application, the processing of optical remote sensing images becomes critical due to the complex environment, large size, occlusions and color variations. However, several approaches have been proposed to improve the training process but still, the efforts are moving towards optimal solutions. The training process is time-consuming and a large amount of memory is required to store those training images. Numerous traditional state-of-the-art approaches are suffering from problematic high computational time. In this paper, a new training methodology which is based on compressed and down-scaled images is implemented to reduce the training time. The training images are compressed at Quality Factor (QF) of 50 and down-scaled by scale factor of 0.5 to evaluate the performance for vehicle detection. The existing approaches of computer vision are taking advantage of high computational Graphical Processing Units (GPUs) to speed up the training process. The proposed framework is also a better way to reduce the computational time. To compare performance, we have trained the RCNN, Fast-RCNN, Faster-RCNN and Cascade detectors by using three types of training image sets and several experiments have been performed. More specifically, our approach makes the training faster than the training based on original images and training based on compressed images provides optimal results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A review of object detection based on deep learning

Article 12 June 2020

Youzi Xiao, Zhiqiang Tian, … Xuguang Lan

Convolutional neural network: a review of models, methodologies and applications to object detection

Article 20 December 2019

Anamika Dhillon & Gyanendra K. Verma

Traffic sign recognition based on deep learning

Article Open access 07 March 2022

Yanzhao Zhu & Wei Qi Yan

Notes

https://gis.utah.gov/data/aerial-photography/hro/

References

Bak J-H, Huh J-H (2017) A study on the framework design of Korean-model PLC-integrated drone landing site in mountain regions: A software engineering approach. In: 2017 17th International Conference on Control, Automation and Systems (ICCAS), pp 1185–1190
Bay H, Ess A, Tuytelaars T, Van Gool L (2008) Speeded-up robust features (SURF). Comput Vis Image Underst 110(3):346–359
Article Google Scholar
Benenson R, Mathias M, Timofte R, Van Gool L (2012) Pedestrian detection at 100 frames per second. In: Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, pp 2903–2910
Brubaker SC, Mullin MD, Rehg JM (2006) Towards optimal training of cascaded detectors. In: European Conference on Computer Vision, pp 325–337
Campos V, Sastre F, Yagües M, Bellver M, Giró-i-Nieto X, Torres J (2017) International Conference on Computational Science, ICCS 2017:12–14
Chen X, Xiang S, Liu C-L, Pan C-H (2013) Vehicle detection in satellite images by parallel deep convolutional neural networks. In: Pattern Recognition (ACPR), 2013 2nd IAPR Asian Conference on, pp 181–185
Chen X, Xiang S, Liu C-L, Pan C-H (2014) Vehicle detection in satellite images by hybrid deep convolutional neural networks. IEEE Geosci Remote Sens Lett 11(10):1797–1801
Article Google Scholar
Cheng G, Han J (2016) A survey on object detection in optical remote sensing images. ISPRS J Photogramm Remote Sens 117:11–28
Article Google Scholar
Cheng G, Zhou P, Han J (2016) Learning rotation-invariant convolutional neural networks for object detection in VHR optical remote sensing images. IEEE Trans Geosci Remote Sens 54(12):7405–7415
Article Google Scholar
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, vol 1, pp 886–893
Donoho DL (2006) Compressed sensing. IEEE Trans Inf Theory 52(4):1289–1306
Article MathSciNet Google Scholar
El-Bakry HM (2006) A new implementation of pca for fast face detection. Vectors 1:4
Google Scholar
Girshick R (2015) Fast r-cnn. In: Proceedings of the IEEE international conference on computer vision, pp 1440–1448
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. Proc IEEE Conf Comput Vis Pattern Recognit: 580–587
Girshick R, Donahue J, Darrell T, Malik J (2016) Region-based convolutional networks for accurate object detection and segmentation. IEEE Trans Pattern Anal Mach Intell 38(1):142–158
Article Google Scholar
Guyon I, Elisseeff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3:1157–1182
MATH Google Scholar
Han S, Mao H, Dally WJ (2015) Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. arXiv Prepr. arXiv1510.00149
Han S, Pool J, Tran J, Dally W (2015) Learning both weights and connections for efficient neural network. Adv Neural Inf Proces Syst: 1135–1143
Hassairi S, Ejbali R, Zaied M (2018) A deep stacked wavelet auto-encoders to supervised feature extraction to pattern classification. Multimed Tools Appl 77(5):5443–5459
Article Google Scholar
He K, Zhang X, Ren S, Sun J (2014) Spatial pyramid pooling in deep convolutional networks for visual recognition. In: European Conference on Computer Vision, pp 346–361
Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science 313(5786):504–507
Hosang J, Benenson R, Schiele B (2014) How good are detection proposals, really?. arXiv Prepr. arXiv1406.6962
Hu J, Xu T, Zhang J, Yang Y (2016) Fast Vehicle Detection in Satellite Images Using Fully Convolutional Network. In: Chinese Conference on Intelligent Visual Surveillance, pp 122–129
Huang Z, Pan Z, Lei B (2017) Transfer learning with deep convolutional neural network for SAR target classification with limited labeled data. Remote Sens 9(9):907
Article Google Scholar
Huh J-H (2018) PLC-integrated sensing technology in mountain regions for drone landing sites: focusing on software technology. Sensors 18(8):2693
Article Google Scholar
Ji C, Ma S (1997) Combined weak classifiers. Adv Neural Inf Proces Syst 9:494–500
Google Scholar
Jia Z, Saxena A, Chen T (2011) Robotic object detection: Learning to improve the classifiers using sparse graphs for path planning. In: IJCAI, pp 2072–2078
Jiang J (1999) Image compression with neural networks–a survey. Signal Process Image Commun 14(9):737–760
Article Google Scholar
Jin R, Kim J (2017) Tracking feature extraction techniques with improved SIFT for video identification. Multimed Tools Appl 76(4):5927–5936
Article Google Scholar
Jolliffe IT (1992) Principal component analysis and exploratory factor analysis. Stat Meth in Med Res 1(1):69–95.
Karim S, Zhang Y, Asif MR, Ali S (2017) Comparative analysis of feature extraction methods in satellite imagery. J Appl Remote Sens 11(4):42618
Article Google Scholar
Kato T, Ninomiya Y, Masaki I (2002) Preceding vehicle recognition based on learning from sample images. IEEE Trans Intell Transp Syst 3(4):252–260
Article Google Scholar
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Proces Syst, pp 1097–1105
Lampert CH, Blaschko MB, Hofmann T (2008) Beyond sliding windows: Object localization by efficient subwindow search. In: Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on, pp 1–8
Lebrun G, Charrier C, Cardot H (2004) SVM training time reduction using vector quantization. In: Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on, vol 1, pp 160–163
Leitloff J, Hinz S, Stilla U (2010) Vehicle detection in very high resolution satellite images of city areas. IEEE Trans Geosci Remote Sens 48(7):2795–2806
Article Google Scholar
Li M, Wang J (2008) Remote sensing image compression based on classification and detection. In: Progress in Electromagnetics Research Symposium 2008 (PIERS 2008), pp 1–5
Liu K, Mattyus G (2015) Fast multiclass vehicle detection on aerial images. IEEE Geosci Remote Sens Lett 12(9):1938–1942
Article Google Scholar
Liu J, Zeng G (2012) Description of interest regions with oriented local self-similarity. Opt Commun 285(10):2549–2557
Article Google Scholar
Long Y, Gong Y, Xiao Z, Liu Q (2017) Accurate object localization in remote sensing images based on convolutional neural networks. IEEE Trans Geosci Remote Sens 55(5):2486–2498
Article Google Scholar
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Article Google Scholar
Maggiori E, Tarabalka Y, Charpiat G, Alliez P (2017) Can Semantic Labeling Methods Generalize to Any City? The Inria Aerial Image Labeling Benchmark
Marcus M (2014) JPEG image compression. Dartmouth Coll https://math.dartmouth.edu/archive/m56s14/public_html/proj/Marcus_proj.pdf
Matas J, Chum O, Urban M, Pajdla T (2004) Robust wide-baseline stereo from maximally stable extremal regions. Image Vis Comput 22(10):761–767
Article Google Scholar
Mu Y, Yan S, Liu Y, Huang T, Zhou B (2008) Discriminative local binary patterns for human detection in personal album. In: Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on, pp 1–8
Naikal N, Yang AY, Sastry SS (2011) Informative feature selection for object recognition via sparse PCA. In: Computer Vision (ICCV), 2011 IEEE International Conference on, pp 818–825
Niyomugabo C, Choi H, Kim TY (2016) A modified Adaboost algorithm to reduce false positives in face detection. Math Probl Eng 2016:6. https://doi.org/10.1155/2016/5289413
O’Hanen B, Wisan M (2005) Jpeg compression
Ojala T, Pietikainen M, Maenpaa T (2002) Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans Pattern Anal Mach Intell 24(7):971–987
Article Google Scholar
Palubinskas G, Kurz F, Reinartz P (2008) Detection of traffic congestion in optical remote sensing imagery. In: Geoscience and Remote Sensing Symposium, 2008. IGARSS 2008. IEEE International, vol 2, pp II–426
Qu S, Wang Y, Meng G, Pan C (2016) Vehicle detection in satellite images by incorporating Objectness and convolutional neural network. J Ind Intell Inf Vol 4(2)
Razakarivony S, Jurie F (2016) Vehicle detection in aerial imagery: a small target detection benchmark. J Vis Commun Image Represent 34:187–203
Article Google Scholar
Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. Proc IEEE Conf Comput Vis Pattern Recognit, pp 779–788
Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: towards real-time object detection with region proposal networks. Adv Neural Inf Proces Syst, pp 91–99
Samuel AL (1967) Some studies in machine learning using the game of checkers. II—recent progress. IBM J Res Dev 11(6):601–617
Article Google Scholar
Shechtman E, Irani M (2007) Matching local self-similarities across images and videos. In: Computer Vision and Pattern Recognition, 2007. CVPR’07. IEEE Conference on, pp 1–8
Strang G (1999) The discrete cosine transform. SIAM Rev 41(1):135–147
Article MathSciNet Google Scholar
Sun Z, Bebis G, Miller R (2006) On-road vehicle detection: a review. IEEE Trans Pattern Anal Mach Intell 28(5):694–711
Article Google Scholar
Sundaram N (2012) Making computer vision computationally efficient. University of California, Berkeley
Google Scholar
Tang T, Zhou S, Deng Z, Lei L, Zou H (2017) Arbitrary-oriented vehicle detection in aerial imagery with single convolutional neural networks. Remote Sens 9(11):1170
Article Google Scholar
Taubman D, Marcellin M (2012) JPEG2000 image compression fundamentals, standards and practice: image compression fundamentals, standards and practice, vol 642. Springer Science & Business Media, New Year
Google Scholar
Tekalp AM (2015) Digital video processing. Prentice Hall Press, Upper Saddle River
Google Scholar
Teo CH, Tay YH, Lai WK (2005) A novel approach to improve the training time of convolutional networks for object recognition. In: Proceedings of the Twelfth International Conference on Neural Information Processing (ICONIP 2005), Taipei, Taiwan ROC, pp 17–22
Trier ØD, Jain AK, Taxt T (1996) Feature extraction methods for character recognition-a survey. Pattern Recogn 29(4):641–662
Article Google Scholar
Uijlings JRR, Van De Sande KEA, Gevers T, Smeulders AWM (2013) Selective search for object recognition. Int J Comput Vis 104(2):154–171
Article Google Scholar
Vedaldi A, Lenc K (2015) Matconvnet: convolutional neural networks for matlab. In: Proceedings of the 23rd ACM international conference on Multimedia, pp 689–692
Viola P, Jones M (2001) Rapid object detection using a boosted cascade of simple features. In: Computer Vision and Pattern Recognition, 2001. CVPR 2001. Proceedings of the 2001 IEEE Computer Society Conference on, vol 1, pp I–I
Viola P, Jones MJ (2004) Robust real-time face detection. Int J Comput Vis 57(2):137–154
Article Google Scholar
Wang Z, Bovik AC, Sheikh HR, Simoncelli EP (2004) Image quality assessment: from error visibility to structural similarity. IEEE Trans Image Process 13(4):600–612
Article Google Scholar
Wang L, Zhao X, Liu Y (2016) Reduce false positives for object detection by a priori probability in videos. Neurocomputing 208:325–332
Article Google Scholar
Wright J, Yang AY, Ganesh A, Sastry SS, Ma Y (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227
Article Google Scholar
Xia G-S et al (2018) DOTA: A large-scale dataset for object detection in aerial images. In: Proc. CVPR
Yao S, Wang T, Shen W, Pan S, Chong Y, Ding F (2015) Feature selection and pedestrian detection based on sparse representation. PLoS One 10(8):e0134242
Article Google Scholar
Zhang W, Zelinsky G, Samaras D (2007) Real-time accurate object detection using multiple resolutions. In: Computer Vision, 2007. ICCV 2007. IEEE 11th International Conference on, pp 1–8
Zhu X, Vondrick C, Ramanan D, Fowlkes CC (2012) Do We Need More Training Data or Better Models for Object Detection?. In: BMVC, vol 3, p 5
Zitnick CL, Dollár P (2014) Edge boxes: Locating object proposals from edges. In: European Conference on Computer Vision, pp 391–405
Zou H, Hastie T, Tibshirani R (2006) Sparse principal component analysis. J Comput Graph Stat 15(2):265–286
Article MathSciNet Google Scholar

Download references

Acknowledgments

This work was supported by the National Natural Science Foundation of China under Grants 61471148.

Author information

Authors and Affiliations

School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin, 150001, China
Shahid Karim, Ye Zhang & Shoulin Yin
School of Computer Science and Technology, Harbin Institute of Technology, Harbin, 150001, China
Asif Ali Laghari
School of Energy Science and Engineering, Harbin Institute of Technology, Harbin, 150001, China
Ali Anwar Brohi

Authors

Shahid Karim
View author publications
You can also search for this author in PubMed Google Scholar
Ye Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Shoulin Yin
View author publications
You can also search for this author in PubMed Google Scholar
Asif Ali Laghari
View author publications
You can also search for this author in PubMed Google Scholar
Ali Anwar Brohi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shahid Karim.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Karim, S., Zhang, Y., Yin, S. et al. Impact of compressed and down-scaled training images on vehicle detection in remote sensing imagery. Multimed Tools Appl 78, 32565–32583 (2019). https://doi.org/10.1007/s11042-019-08033-x

Download citation

Received: 24 October 2018
Revised: 12 May 2019
Accepted: 19 July 2019
Published: 07 August 2019
Issue Date: November 2019
DOI: https://doi.org/10.1007/s11042-019-08033-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Impact of compressed and down-scaled training images on vehicle detection in remote sensing imagery

Abstract

Access this article

Similar content being viewed by others

A review of object detection based on deep learning

Convolutional neural network: a review of models, methodologies and applications to object detection

Traffic sign recognition based on deep learning

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Impact of compressed and down-scaled training images on vehicle detection in remote sensing imagery

Abstract

Access this article

Similar content being viewed by others

A review of object detection based on deep learning

Convolutional neural network: a review of models, methodologies and applications to object detection

Traffic sign recognition based on deep learning

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation