Object Searching on Video Using ORB Descriptor and Support Vector Machine

Adhinata, Faisal Dharma; Harjoko, Agus; Wahyono

doi:10.1007/978-3-030-63119-2_20

Faisal Dharma Adhinata⁸,
Agus Harjoko⁹ &
Wahyono⁹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1287))

Included in the following conference series:

International Conference on Computational Collective Intelligence

1397 Accesses

Abstract

One of the main stages in object searching on video is extracting object regions from video. Template matching is popular technique for performing a such task. However, the use of template matching has a limitation that requires a large object as a template. If the template size is too small, it would obtain few features. On the other hand, ORB descriptors are often used for representing the object with a good accuracy and fast processing time. Therefore, this research proposed to use machine learning method combining with ORB descriptor for object searching on video data. Processing video in all frames is inefficient. Thus, frames are selected into keyframes using mutual information entropy. The ORB descriptors are then extracted from selected frame in order to find candidate region of objects. To verify and classify the object regions, multiclass support vector machine was used to train ORB descriptor of regions. For evaluation, the use of ORB would be compared with other descriptor, such as SIFT and SURF for showing its superiority in both accuracy and processing time. In experiment, it is found that object searching with ORB descriptor performs faster processing time, which is 0.219 s, while SIFT 1.011 s and SURF 0.503 s. Meanwhile, it also achieves the best F₁ value, which is 0.9 compared to SIFT 0.63 and SURF 0.65.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Tracking feature extraction techniques with improved SIFT for video identification

Article 22 May 2015

Improved object recognition results using SIFT and ORB feature detector

Article 19 October 2019

A Novel Video Reconstruction of Randomized Frames Using ORB Descriptor

References

Ouyang, S.Z., Zhong, L., Luo, R.Q.: The comparison and analysis of extracting video key frame. IOP Conf. Ser. Mater. Sci. Eng. 359(1), 012010 (2018)
Article Google Scholar
Sun, L., Zhou, Y.: A key frame extraction method based on mutual information and image entropy. In: 2011 International Conference on Multimedia Technology, vol. 1, no. 6077, pp. 35–38 (2011)
Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Article Google Scholar
Bay, H., Ess, A., Tuytelaars, T., Vangool, L.: Speeded-up robust features (SURF). Comput. Vis. Image Underst. 110(3), 346–359 (2006)
Article Google Scholar
Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: Proceedings of IEEE International Conference on Computer Vision, pp. 2564–2571 (2011)
Google Scholar
Zhou, D., Tian, Y., Li, X., Wu, J.: ORB-based template matching through convolutional features map. In: Proceedings of 2019 Chinese Automation Congress, pp. 4695–4699 (2019)
Google Scholar
Purohit, M., Yadav, A.R.: Comparison of feature extraction techniques to recognize traffic rule violations using low processing embedded system. In: 2018 5th International Conference on Signal Process. Integration Networks, pp. 154–158 (2018)
Google Scholar
Murugeswari, M., Veluchamy, S.: Hand gesture recognition system for real-time application. In: 2014 IEEE International Conference on Advanced Communications, Control and Computing Technologies ICACCCT 2014, no. 978, pp. 1220–1225 (2014)
Google Scholar
Li, Q., Wang, X.: Image classification based on SIFT and SVM. In: Proceedings of 17th IEEE/ACIS Conference on Computer and Information Science, vol. 1, no. 1, pp. 762–765 (2018)
Google Scholar
Jabnoun, H., Benzarti, F., Amiri, H.: Object recognition for blind people based on features extraction. In: International Image Processing, Applications and Systems Conference, pp. 1–6 (2014)
Google Scholar
Yu, H., Kong, L.: An optimization of video sequence stitching method. In: 2018 IEEE Conference on Mechatronics and Automation, pp. 387–391 (2018)
Google Scholar
Toapanta, S.M.T., Cruz, A.A. C., Gallegos, L.E.M., Trejo, J.A.O.: Algorithms for efficient biometric systems to mitigate the integrity of a distributed database. In: CITS 2018 - 2018 International Conference on Computer, Information and Telecommunication Systems, pp. 1–5 (2018)
Google Scholar
Sainui, J., Sugiyama, M.: Minimum dependency key frames selection via quadratic mutual information. In: 10th International Conference on Digital Information Management, pp. 148–153 (2015)
Google Scholar
Calonder, M., Lepetit, V., Strecha, C., Fua, P.: BRIEF: binary robust independent elementary features. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6314, pp. 778–792. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15561-1_56
Chapter Google Scholar
Rao, T., Ikenaga, T.: Quadrant segmentation and ring-like searching based FPGA implementation of ORB matching system for full-HD video. In: Proceedings of 15th IAPR International Conference on Machine Vision Applications MVA 2017, pp. 89–92 (2017)
Google Scholar
Comaniciu, D., Meer, P.: Mean shift: a robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 24(5), 603–619 (2002)
Article Google Scholar

Download references

Acknowledgment

This work was supported by the 2020 Final Assignment Recognition Program (RTA) of Universitas Gadjah Mada (No. 2488/UN1.P.III/DIT-LIT/PT/2020).

Author information

Authors and Affiliations

Master Program of Computer Science, Universitas Gadjah Mada, Yogyakarta, Indonesia
Faisal Dharma Adhinata
Department of Computer Science and Electronics, Universitas Gadjah Mada, Yogyakarta, Indonesia
Agus Harjoko & Wahyono

Authors

Faisal Dharma Adhinata
View author publications
You can also search for this author in PubMed Google Scholar
Agus Harjoko
View author publications
You can also search for this author in PubMed Google Scholar
Wahyono
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wahyono .

Editor information

Editors and Affiliations

Wroclaw University of Economics and Business, Wrocław, Poland
Marcin Hernes
Wrocław University of Science and Technology, Wrocław, Poland
Krystian Wojtkiewicz
University of Newcastle, Newcastle, Australia
Edward Szczerbicki

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Adhinata, F.D., Harjoko, A., Wahyono (2020). Object Searching on Video Using ORB Descriptor and Support Vector Machine. In: Hernes, M., Wojtkiewicz, K., Szczerbicki, E. (eds) Advances in Computational Collective Intelligence. ICCCI 2020. Communications in Computer and Information Science, vol 1287. Springer, Cham. https://doi.org/10.1007/978-3-030-63119-2_20

Download citation

DOI: https://doi.org/10.1007/978-3-030-63119-2_20
Published: 19 November 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-63118-5
Online ISBN: 978-3-030-63119-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics