A VCA-Based Approach to Enhance Learning Data Sets for Object Classification

Baran, Remigiusz; Zeja, Andrzej

doi:10.1007/978-3-030-59000-0_22

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1284))

Included in the following conference series:

International Conference on Multimedia Communications, Services and Security

432 Accesses

Abstract

This paper presents a novel approach to solving the problem of poor learning data in complex object classification task. It efficiently combines the Visual Content Analysis technique known as the Scalable Vocabulary Tree (SVT) and contour-based descriptors to recommend new training samples. The SVT technique uses the SIFT features to identify and accurately localize objects of interest within the visual content of the processed query images. Despite the small learning data set its classification accuracy is pretty good and matches the accuracy of a dedicated CNN network trained under the same conditions. However, due to the ability of fast and effective incremental learning, it overcomes the convnet type networks. Contour-based classification based on Point Distance Histogram (PDH) is utilized then to increase the classification certainty. During this stage, the PDH descriptors representing a given object of interest are matched against descriptors stored in the pattern database, where each object is represented by a collection of 360 pattern outlines extracted from its 3D model. As finally reported, such an exact pattern representation allows for achieving a high classification accuracy of the entire approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

http://www.live-counter.com/how-big-is-the-internet/. Accessed 26 Feb 2020
Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: DeepFace: closing the gap to human-level performance in face verification. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, pp. 1701–1708 (2014). https://doi.org/10.1109/cvpr.2014.220
Chollet, F.: Deep Learning with Python. Manning Publications Co., New York (2018)
Google Scholar
Worring, M., Snoek, C.: Visual content analysis. In: Liu, L., Özsu, M.T. (eds.) Encyclopedia of Database Systems, pp 3360–3365. Springer, Boston (2009). https://doi.org/10.1007/978-0-387-39940-9_1019
Chapter Google Scholar
Baran, R., Zeja, A.: The IMCOP system for data enrichment and content discovery and delivery. In: Proceedings of the 2015 International Conference on Computational Science and Computational Intelligence (CSCI 2015), pp 143–146, Las Vegas, USA (2015)
Google Scholar
Wolff, E.: Microservices: Flexible Software Architectures. Addison-Wesley, Boston (2016)
Google Scholar
Li, Z., Hoiem, D.: Learning without forgetting. PAMI 40, 2935–2947 (2018)
Article Google Scholar
Tao, Y., Tu Y., Shyu, M..: Efficient incremental training for deep convolutional neural networks. In: 2019 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR), San Jose, CA, USA, pp. 286–291 (2019)
Google Scholar
Cheng, Y., Wang, D., Zhou, P., Zhang, T.: A survey of model compression and acceleration for deep neural networks, CoRR, vol. abs/1710.09282 (2017)
Google Scholar
Roy, D., Panda, P., Roy, K.: Tree-CNN: a hierarchical deep convolutional neural network for incremental learning. Neural Netw. 121, 148–160 (2018)
Article Google Scholar
Nister, D., Stewenius, H.: Scalable recognition with a vocabulary tree. In: Conference on Computer Vision and Pattern Recognition, New York, NY, USA, pp. 2161–2168 (2006)
Google Scholar
Baran, R.: Efficiency investigation of BoF, SVT and pyramid match algorithms in practical recognition applications. In: Proceedings of the 2017 IEEE International Conference on Mathematics and Computers in Sciences and in Industry (MCSI), pp 171–178, Corfu Island, Greece (2017)
Google Scholar
Lowe, D.G.: Object recognition from local scale-invariant features. In: Proceedings of the ICCV 1999, vol 2, pp 1150–1157. IEEE Computer Society (1999)
Google Scholar
Rublee, E., Rabaud, V., Konolige, K., Bradski, G. R.: ORB: an efficient alternative to SIFT or SURF. In: ICCV 2011, pp. 2564–2571 (2011)
Google Scholar
Baran, R., Rudziński, F., Zeja, A.: Face recognition for movie character and actor discrimination based on similarity scores. In: Proceedings of the 2016 IEEE International Conference on Computational Science and Computational Intelligence (CSCI), pp 1333–1338, Las Vegas, USA (2016)
Google Scholar
Rother, C., Kolmogorov, V., Blake, A.: GrabCut - interactive foreground extraction using iterated graph cuts. ACM Trans. Graph. 23(3), 309–314 (2004)
Article Google Scholar
Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979)
Article Google Scholar
Baran, R., Kleszcz, A.: The efficient spatial methods of contour approximation. In: Proceedings of the 2014 IEEE International Conference on Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA 2014), pp. 116–121, Poznań, Poland (2014)
Google Scholar
Frejlichowski, D.: Shape representation using point distance histogram. Polish J. Environ. Stud. 16(4A), 90–93 (2007)
Google Scholar
Frejlichowski, D.: application of the point distance histogram to the automatic identification of people by means of digital dental radiographic images. In: Chmielewski, L.J., Datta, A., Kozera, R., Wojciechowski, K. (eds.) ICCVG 2016. LNCS, vol. 9972, pp. 387–394. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46418-3_34
Chapter Google Scholar

Download references

Acknowledgments

This work was supported by the Polish National Centre for Research and Development under the Smart Growth Operational Programme, INRED project no. POIR.01.01.01-00-0170/17. We want also to address our special thanks to our colleagues Iwo Ryszkowski and Krzysztof Nowakowski from AGH University of Science and Technology (Poland) for their valuable contributions to this work.

Author information

Authors and Affiliations

Department of Computer Science, Electronics and Electrical Engineering, Kielce University of Technology, Kielce, Poland
Remigiusz Baran
Department of Teleinformatics, University of Computer Engineering and Telecommunications, Kielce, Poland
Andrzej Zeja

Authors

Remigiusz Baran
View author publications
You can also search for this author in PubMed Google Scholar
Andrzej Zeja
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Remigiusz Baran .

Editor information

Editors and Affiliations

AGH University of Science and Technology, Kraków, Poland
Andrzej Dziech
Royal Military Academy, Brussels, Belgium
Wim Mees
Gdańsk University of Technology, Gdańsk, Poland
Andrzej Czyżewski

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Baran, R., Zeja, A. (2020). A VCA-Based Approach to Enhance Learning Data Sets for Object Classification. In: Dziech, A., Mees, W., Czyżewski, A. (eds) Multimedia Communications, Services and Security. MCSS 2020. Communications in Computer and Information Science, vol 1284. Springer, Cham. https://doi.org/10.1007/978-3-030-59000-0_22

Download citation

DOI: https://doi.org/10.1007/978-3-030-59000-0_22
Published: 24 September 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-58999-8
Online ISBN: 978-3-030-59000-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics