Skip to main content
Log in

Low - resolution vehicle recognition based on deep feature fusion

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Recently, convolutional neural networks have achieved great success in image classification. However, the traditional convolutional neural network lacks the ability to distinguish image features, especially for the low resolution images with less feature information. In the vehicle recognition task, it is inevitable to lose some feature information by convolution during the process of the low-level feature is abstracted into the high-level semantic feature. In this paper, an improved convolutional neural network model with higher robustness is proposed, we call it feature fusion convolutional neural network (FFCNN), which can not only produce more discriminative features, but also can avoid interference caused by environmental factors to some extent. Firstly, the strategy of feature fusion is used to fuse the different low-level features in the convolution neural network. Secondly, in order to prevent overfitting, we combine with the network model of sparse and data augmentation to optimize the structure of the network model. The results of the experiment show that the model proposed in this paper has higher recognition accuracy compared with the traditional vehicle recognition methods and the original convolutional neural network models.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  1. Arora S, Bhaskara A, Ge R, Ma T (2014) Provable bounds for learning some deep representations. In: International Conference on Machine Learning, Beijing, pp 584–592

  2. Buch N, Orwell J, Velastin SA (2009) 3D Extended Histogram of Oriented Gradients (3DHOG) for classification of road users in urban scenes. In: British Machine Vision Conference, London

  3. Dong C, Chen CL, He K et al (2016) Image super-resolution using deep convolutional networks. IEEE Trans Pattern Anal Mach Intell 38(2):295–307

    Article  Google Scholar 

  4. Escalera ADL, Armingol JMA, Mata M (2003) Traffic sign recognition and analysis for intelligent vehicles. Image Vis Comput 21(3):247–258

    Article  Google Scholar 

  5. Girshick RB, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, Columbus, pp 580–587

  6. Glasner D, Bagon S, Irani M (2009) Super-resolution from a single image. In: IEEE International Conference on Computer Vision, Kyoto, pp 349–356

  7. Gupte S, Masoud O, Martin RFK et al (2002) Detection and classification of vehicles. IEEE Trans Intell Transp Syst 3(1):37–47

    Article  Google Scholar 

  8. Hasegawa O, Kanade T (2005) Type classification, color estimation, and specific target detection of moving targets on public streets. Mach Vis Appl 16(2):116–121

    Article  Google Scholar 

  9. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, pp 770–778

  10. Hinton GE, Osindero S, Teh YW (2006) A fast learning algorithm for deep belief nets. Neural Comput 18(7):1527–1554

    Article  MathSciNet  Google Scholar 

  11. Hong C, Yu J, Chen X et al (2015) Image-based 3D human pose recovery with locality sensitive sparse retrieval. IEEE Trans Ind Electron 2013:2103–2108

    Google Scholar 

  12. Hong, S, You T, Kwak S, Han B (2015) Online tracking by learning discriminative saliency map with convolutional neural network. In: Proceedings of International Conference on International Conference on Machine Learning, Lille, pp 597–606

  13. Jia D, Krause J, Li FF (2013) Fine-grained crowdsourcing for fine-grained recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, Portland, pp 580–587

  14. Jia Y, Shelhamer E, Donahue J, Karayev S, Long J (2014) Caffe: convolutional architecture for fast feature embedding. In: Proceedings of the 22nd ACM International Conference on Multimedia, Orlando, pp 675–678

  15. Krause J, Stark M, Jia D, Li FF (2014) 3D Object representations for fine-grained categorization. In: IEEE International Conference on Computer Vision Workshops, Sydney, pp 554–561

  16. Krause J, Gebru T, Deng J, Li LJ, Li FF (2014) Learning Features and Parts for Fine-Grained Recognition. In: International Conference on Pattern Recognition, Stockholm, pp 26–33

  17. Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. Adv Neural Inf Proces Syst 25(2):2012

    Google Scholar 

  18. Lécun Y, Bottou L, Bengio Y et al (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324

    Article  Google Scholar 

  19. Lin M, Chen Q, Yan S (2014) Network in network. In: International Conference on Learning Representations. ArXiv:1312.4400

  20. Ouwerkerk JDV (2006) Image super-resolution survey. Image Vis Comput 24(10):1039–1052

    Article  Google Scholar 

  21. Peng X, Hoffman J, Yu SX, Saenko K (2016) Fine-to-coarse knowledge transfer for low-res image classification. In: IEEE International Conference on Image Processing, Phoenix, pp 3683–3687

  22. Salvador J, Pérez-Pellitero E (2016) Naive bayes super-resolution forest. In: IEEE International Conference on Computer Vision, Santiago, pp 325–333

  23. Sermanet, P, Eigen D, Zhang X, Mathieu M, Fergus R, Lecun Y (2013) OverFeat: integrated recognition, localization and detection using convolutional networks. In: Computer Vision and Pattern Recognition. ArXiv:1312.6229

  24. Shelhamer E, Long J, Darrell T (2015) Fully convolutional networks for semantic segmentation. IEEE Trans Pattern Anal Mach Intell 79(10):1337–1342

    Google Scholar 

  25. Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. Computer Science. ArXiv:1409.1556

  26. Szegedy C, LiuW, Jia Y, Sermanet P, Reed, SE, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition, Boston, pp 1–9

  27. Tao D, Hong C, Yu J, Wan J, Wang M (2015) Multimodal deep autoencoder for human pose recovery. IEEE Trans Image Process 24(12):5659–5670

    Article  MathSciNet  Google Scholar 

  28. Wang J, Yang J, Yu K, Lv F, Huang T, Gong Y (2010) Locality-constrained linear coding for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, San Francisco, pp 3360–3367

  29. Wang Z, Chang S, Yang Y, Liu D, Huang TS (2016) Studying very low resolution recognition using deep networks. In: IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, pp 4792–4800

  30. Yang L, Luo P, Loy CC, Tang X (2015) A large-scale car dataset for fine-grained categorization and verification. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston

  31. Zeiler M D, Fergus R (2014) Visualizing and understanding convolutional networks. In: European Conference on Computer Vision. Springer, Cham, pp 818–833

    Google Scholar 

  32. Zhang C, Chen X, Chen W (2006) A PCA-Based Vehicle Classification Framework. In: International Conference on Data Engineering Workshops, 2006. Proceedings, Atlanta, pp 17–17. Multimed Tools Appl

Download references

Acknowledgments

We express our sincere thanks to the anonymous reviewers for their useful comments and suggestions to raise the standard of the paper. This study was supported by the National Natural Science Foundation of China under Grant No. 61672202.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Juan Yang.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xue, L., Zhong, X., Wang, R. et al. Low - resolution vehicle recognition based on deep feature fusion. Multimed Tools Appl 77, 27617–27639 (2018). https://doi.org/10.1007/s11042-018-5940-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-018-5940-6

Keywords

Navigation