Abstract
This paper presents a novel approach to solving the problem of poor learning data in complex object classification task. It efficiently combines the Visual Content Analysis technique known as the Scalable Vocabulary Tree (SVT) and contour-based descriptors to recommend new training samples. The SVT technique uses the SIFT features to identify and accurately localize objects of interest within the visual content of the processed query images. Despite the small learning data set its classification accuracy is pretty good and matches the accuracy of a dedicated CNN network trained under the same conditions. However, due to the ability of fast and effective incremental learning, it overcomes the convnet type networks. Contour-based classification based on Point Distance Histogram (PDH) is utilized then to increase the classification certainty. During this stage, the PDH descriptors representing a given object of interest are matched against descriptors stored in the pattern database, where each object is represented by a collection of 360 pattern outlines extracted from its 3D model. As finally reported, such an exact pattern representation allows for achieving a high classification accuracy of the entire approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
- 13.
- 14.
- 15.
References
http://www.live-counter.com/how-big-is-the-internet/. Accessed 26 Feb 2020
Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: DeepFace: closing the gap to human-level performance in face verification. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, pp. 1701–1708 (2014). https://doi.org/10.1109/cvpr.2014.220
Chollet, F.: Deep Learning with Python. Manning Publications Co., New York (2018)
Worring, M., Snoek, C.: Visual content analysis. In: Liu, L., Özsu, M.T. (eds.) Encyclopedia of Database Systems, pp 3360–3365. Springer, Boston (2009). https://doi.org/10.1007/978-0-387-39940-9_1019
Baran, R., Zeja, A.: The IMCOP system for data enrichment and content discovery and delivery. In: Proceedings of the 2015 International Conference on Computational Science and Computational Intelligence (CSCI 2015), pp 143–146, Las Vegas, USA (2015)
Wolff, E.: Microservices: Flexible Software Architectures. Addison-Wesley, Boston (2016)
Li, Z., Hoiem, D.: Learning without forgetting. PAMI 40, 2935–2947 (2018)
Tao, Y., Tu Y., Shyu, M..: Efficient incremental training for deep convolutional neural networks. In: 2019 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR), San Jose, CA, USA, pp. 286–291 (2019)
Cheng, Y., Wang, D., Zhou, P., Zhang, T.: A survey of model compression and acceleration for deep neural networks, CoRR, vol. abs/1710.09282 (2017)
Roy, D., Panda, P., Roy, K.: Tree-CNN: a hierarchical deep convolutional neural network for incremental learning. Neural Netw. 121, 148–160 (2018)
Nister, D., Stewenius, H.: Scalable recognition with a vocabulary tree. In: Conference on Computer Vision and Pattern Recognition, New York, NY, USA, pp. 2161–2168 (2006)
Baran, R.: Efficiency investigation of BoF, SVT and pyramid match algorithms in practical recognition applications. In: Proceedings of the 2017 IEEE International Conference on Mathematics and Computers in Sciences and in Industry (MCSI), pp 171–178, Corfu Island, Greece (2017)
Lowe, D.G.: Object recognition from local scale-invariant features. In: Proceedings of the ICCV 1999, vol 2, pp 1150–1157. IEEE Computer Society (1999)
Rublee, E., Rabaud, V., Konolige, K., Bradski, G. R.: ORB: an efficient alternative to SIFT or SURF. In: ICCV 2011, pp. 2564–2571 (2011)
Baran, R., Rudziński, F., Zeja, A.: Face recognition for movie character and actor discrimination based on similarity scores. In: Proceedings of the 2016 IEEE International Conference on Computational Science and Computational Intelligence (CSCI), pp 1333–1338, Las Vegas, USA (2016)
Rother, C., Kolmogorov, V., Blake, A.: GrabCut - interactive foreground extraction using iterated graph cuts. ACM Trans. Graph. 23(3), 309–314 (2004)
Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979)
Baran, R., Kleszcz, A.: The efficient spatial methods of contour approximation. In: Proceedings of the 2014 IEEE International Conference on Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA 2014), pp. 116–121, Poznań, Poland (2014)
Frejlichowski, D.: Shape representation using point distance histogram. Polish J. Environ. Stud. 16(4A), 90–93 (2007)
Frejlichowski, D.: application of the point distance histogram to the automatic identification of people by means of digital dental radiographic images. In: Chmielewski, L.J., Datta, A., Kozera, R., Wojciechowski, K. (eds.) ICCVG 2016. LNCS, vol. 9972, pp. 387–394. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46418-3_34
Acknowledgments
This work was supported by the Polish National Centre for Research and Development under the Smart Growth Operational Programme, INRED project no. POIR.01.01.01-00-0170/17. We want also to address our special thanks to our colleagues Iwo Ryszkowski and Krzysztof Nowakowski from AGH University of Science and Technology (Poland) for their valuable contributions to this work.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Baran, R., Zeja, A. (2020). A VCA-Based Approach to Enhance Learning Data Sets for Object Classification. In: Dziech, A., Mees, W., Czyżewski, A. (eds) Multimedia Communications, Services and Security. MCSS 2020. Communications in Computer and Information Science, vol 1284. Springer, Cham. https://doi.org/10.1007/978-3-030-59000-0_22
Download citation
DOI: https://doi.org/10.1007/978-3-030-59000-0_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-58999-8
Online ISBN: 978-3-030-59000-0
eBook Packages: Computer ScienceComputer Science (R0)