Abstract
Recently, convolutional neural networks have achieved great success in image classification. However, the traditional convolutional neural network lacks the ability to distinguish image features, especially for the low resolution images with less feature information. In the vehicle recognition task, it is inevitable to lose some feature information by convolution during the process of the low-level feature is abstracted into the high-level semantic feature. In this paper, an improved convolutional neural network model with higher robustness is proposed, we call it feature fusion convolutional neural network (FFCNN), which can not only produce more discriminative features, but also can avoid interference caused by environmental factors to some extent. Firstly, the strategy of feature fusion is used to fuse the different low-level features in the convolution neural network. Secondly, in order to prevent overfitting, we combine with the network model of sparse and data augmentation to optimize the structure of the network model. The results of the experiment show that the model proposed in this paper has higher recognition accuracy compared with the traditional vehicle recognition methods and the original convolutional neural network models.
Similar content being viewed by others
References
Arora S, Bhaskara A, Ge R, Ma T (2014) Provable bounds for learning some deep representations. In: International Conference on Machine Learning, Beijing, pp 584–592
Buch N, Orwell J, Velastin SA (2009) 3D Extended Histogram of Oriented Gradients (3DHOG) for classification of road users in urban scenes. In: British Machine Vision Conference, London
Dong C, Chen CL, He K et al (2016) Image super-resolution using deep convolutional networks. IEEE Trans Pattern Anal Mach Intell 38(2):295–307
Escalera ADL, Armingol JMA, Mata M (2003) Traffic sign recognition and analysis for intelligent vehicles. Image Vis Comput 21(3):247–258
Girshick RB, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, Columbus, pp 580–587
Glasner D, Bagon S, Irani M (2009) Super-resolution from a single image. In: IEEE International Conference on Computer Vision, Kyoto, pp 349–356
Gupte S, Masoud O, Martin RFK et al (2002) Detection and classification of vehicles. IEEE Trans Intell Transp Syst 3(1):37–47
Hasegawa O, Kanade T (2005) Type classification, color estimation, and specific target detection of moving targets on public streets. Mach Vis Appl 16(2):116–121
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, pp 770–778
Hinton GE, Osindero S, Teh YW (2006) A fast learning algorithm for deep belief nets. Neural Comput 18(7):1527–1554
Hong C, Yu J, Chen X et al (2015) Image-based 3D human pose recovery with locality sensitive sparse retrieval. IEEE Trans Ind Electron 2013:2103–2108
Hong, S, You T, Kwak S, Han B (2015) Online tracking by learning discriminative saliency map with convolutional neural network. In: Proceedings of International Conference on International Conference on Machine Learning, Lille, pp 597–606
Jia D, Krause J, Li FF (2013) Fine-grained crowdsourcing for fine-grained recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, Portland, pp 580–587
Jia Y, Shelhamer E, Donahue J, Karayev S, Long J (2014) Caffe: convolutional architecture for fast feature embedding. In: Proceedings of the 22nd ACM International Conference on Multimedia, Orlando, pp 675–678
Krause J, Stark M, Jia D, Li FF (2014) 3D Object representations for fine-grained categorization. In: IEEE International Conference on Computer Vision Workshops, Sydney, pp 554–561
Krause J, Gebru T, Deng J, Li LJ, Li FF (2014) Learning Features and Parts for Fine-Grained Recognition. In: International Conference on Pattern Recognition, Stockholm, pp 26–33
Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. Adv Neural Inf Proces Syst 25(2):2012
Lécun Y, Bottou L, Bengio Y et al (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324
Lin M, Chen Q, Yan S (2014) Network in network. In: International Conference on Learning Representations. ArXiv:1312.4400
Ouwerkerk JDV (2006) Image super-resolution survey. Image Vis Comput 24(10):1039–1052
Peng X, Hoffman J, Yu SX, Saenko K (2016) Fine-to-coarse knowledge transfer for low-res image classification. In: IEEE International Conference on Image Processing, Phoenix, pp 3683–3687
Salvador J, Pérez-Pellitero E (2016) Naive bayes super-resolution forest. In: IEEE International Conference on Computer Vision, Santiago, pp 325–333
Sermanet, P, Eigen D, Zhang X, Mathieu M, Fergus R, Lecun Y (2013) OverFeat: integrated recognition, localization and detection using convolutional networks. In: Computer Vision and Pattern Recognition. ArXiv:1312.6229
Shelhamer E, Long J, Darrell T (2015) Fully convolutional networks for semantic segmentation. IEEE Trans Pattern Anal Mach Intell 79(10):1337–1342
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. Computer Science. ArXiv:1409.1556
Szegedy C, LiuW, Jia Y, Sermanet P, Reed, SE, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition, Boston, pp 1–9
Tao D, Hong C, Yu J, Wan J, Wang M (2015) Multimodal deep autoencoder for human pose recovery. IEEE Trans Image Process 24(12):5659–5670
Wang J, Yang J, Yu K, Lv F, Huang T, Gong Y (2010) Locality-constrained linear coding for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, San Francisco, pp 3360–3367
Wang Z, Chang S, Yang Y, Liu D, Huang TS (2016) Studying very low resolution recognition using deep networks. In: IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, pp 4792–4800
Yang L, Luo P, Loy CC, Tang X (2015) A large-scale car dataset for fine-grained categorization and verification. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston
Zeiler M D, Fergus R (2014) Visualizing and understanding convolutional networks. In: European Conference on Computer Vision. Springer, Cham, pp 818–833
Zhang C, Chen X, Chen W (2006) A PCA-Based Vehicle Classification Framework. In: International Conference on Data Engineering Workshops, 2006. Proceedings, Atlanta, pp 17–17. Multimed Tools Appl
Acknowledgments
We express our sincere thanks to the anonymous reviewers for their useful comments and suggestions to raise the standard of the paper. This study was supported by the National Natural Science Foundation of China under Grant No. 61672202.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Xue, L., Zhong, X., Wang, R. et al. Low - resolution vehicle recognition based on deep feature fusion. Multimed Tools Appl 77, 27617–27639 (2018). https://doi.org/10.1007/s11042-018-5940-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-018-5940-6