Abstract

To address the optimisation of multi-style fisheye image classification, we compare a feature-based method and a learning-based method in terms of training time, accuracy, and generalization capacity. To represent the feature-based method, we construct an SVM classifier based on the sRD-SIFT descriptor and use it to describe how the dataset size and the number of visual words influence the classifier. With 800 images per class in 12 classes and 1500 visual words, the SVM classifier achieves 15.98% accuracy on the test set after 162 h of training. For the learning-based method, we propose expanding the style variety of the training samples via style transformation to facilitate the retraining of contemporary architectures. Following this approach, we retrain ResNet-50 on an artificial multi-style fisheye image dataset without adding new training labels. The resulting ResNet classifier is evaluated on 6000 images collected in the real world. The results show that the retrained classifier generalizes well and reaches 97.19% top-3 accuracy.
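The top-3 accuracy quoted above counts a prediction as correct when the true class is among the three highest-scoring classes. The following is a minimal sketch of that metric in plain Python, not the authors' evaluation code; the function name `top_k_accuracy` and the toy scores and labels are made up for illustration.

```python
from typing import Sequence


def top_k_accuracy(scores: Sequence[Sequence[float]],
                   labels: Sequence[int],
                   k: int = 3) -> float:
    """Fraction of samples whose true label is among the k highest-scoring classes."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not scores:
        raise ValueError("at least one sample is required")
    hits = 0
    for row, label in zip(scores, labels):
        # Class indices sorted by descending score, truncated to the k best.
        top_k = sorted(range(len(row)), key=lambda c: row[c], reverse=True)[:k]
        if label in top_k:
            hits += 1
    return hits / len(labels)


# Hypothetical toy data: 3 samples, 4 classes (not taken from the paper).
toy_scores = [
    [0.10, 0.60, 0.20, 0.10],  # true class 1 is ranked first
    [0.40, 0.30, 0.20, 0.10],  # true class 3 is ranked last, so it misses for k=3
    [0.25, 0.05, 0.30, 0.40],  # true class 0 is ranked third
]
toy_labels = [1, 3, 0]

print(top_k_accuracy(toy_scores, toy_labels, k=3))
```

On this toy data two of the three samples have their true class within the top three scores, while only the first sample is correct under the stricter top-1 criterion. This gap is why a top-3 figure cannot be compared directly with a top-1 accuracy.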

Author information

Corresponding author

Correspondence to Zhen Chen.

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper

Chen, Z., Georgiadis, A. (2021). A Comparative Study for Fisheye Image Classification: SVM or DNN. In: Abraham, A., et al. Proceedings of the 12th International Conference on Soft Computing and Pattern Recognition (SoCPaR 2020). SoCPaR 2020. Advances in Intelligent Systems and Computing, vol 1383. Springer, Cham. https://doi.org/10.1007/978-3-030-73689-7_41
