Abstract
One of the main stages in object searching on video is extracting object regions from video. Template matching is popular technique for performing a such task. However, the use of template matching has a limitation that requires a large object as a template. If the template size is too small, it would obtain few features. On the other hand, ORB descriptors are often used for representing the object with a good accuracy and fast processing time. Therefore, this research proposed to use machine learning method combining with ORB descriptor for object searching on video data. Processing video in all frames is inefficient. Thus, frames are selected into keyframes using mutual information entropy. The ORB descriptors are then extracted from selected frame in order to find candidate region of objects. To verify and classify the object regions, multiclass support vector machine was used to train ORB descriptor of regions. For evaluation, the use of ORB would be compared with other descriptor, such as SIFT and SURF for showing its superiority in both accuracy and processing time. In experiment, it is found that object searching with ORB descriptor performs faster processing time, which is 0.219 s, while SIFT 1.011 s and SURF 0.503 s. Meanwhile, it also achieves the best F1 value, which is 0.9 compared to SIFT 0.63 and SURF 0.65.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Ouyang, S.Z., Zhong, L., Luo, R.Q.: The comparison and analysis of extracting video key frame. IOP Conf. Ser. Mater. Sci. Eng. 359(1), 012010 (2018)
Sun, L., Zhou, Y.: A key frame extraction method based on mutual information and image entropy. In: 2011 International Conference on Multimedia Technology, vol. 1, no. 6077, pp. 35–38 (2011)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Bay, H., Ess, A., Tuytelaars, T., Vangool, L.: Speeded-up robust features (SURF). Comput. Vis. Image Underst. 110(3), 346–359 (2006)
Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: Proceedings of IEEE International Conference on Computer Vision, pp. 2564–2571 (2011)
Zhou, D., Tian, Y., Li, X., Wu, J.: ORB-based template matching through convolutional features map. In: Proceedings of 2019 Chinese Automation Congress, pp. 4695–4699 (2019)
Purohit, M., Yadav, A.R.: Comparison of feature extraction techniques to recognize traffic rule violations using low processing embedded system. In: 2018 5th International Conference on Signal Process. Integration Networks, pp. 154–158 (2018)
Murugeswari, M., Veluchamy, S.: Hand gesture recognition system for real-time application. In: 2014 IEEE International Conference on Advanced Communications, Control and Computing Technologies ICACCCT 2014, no. 978, pp. 1220–1225 (2014)
Li, Q., Wang, X.: Image classification based on SIFT and SVM. In: Proceedings of 17th IEEE/ACIS Conference on Computer and Information Science, vol. 1, no. 1, pp. 762–765 (2018)
Jabnoun, H., Benzarti, F., Amiri, H.: Object recognition for blind people based on features extraction. In: International Image Processing, Applications and Systems Conference, pp. 1–6 (2014)
Yu, H., Kong, L.: An optimization of video sequence stitching method. In: 2018 IEEE Conference on Mechatronics and Automation, pp. 387–391 (2018)
Toapanta, S.M.T., Cruz, A.A. C., Gallegos, L.E.M., Trejo, J.A.O.: Algorithms for efficient biometric systems to mitigate the integrity of a distributed database. In: CITS 2018 - 2018 International Conference on Computer, Information and Telecommunication Systems, pp. 1–5 (2018)
Sainui, J., Sugiyama, M.: Minimum dependency key frames selection via quadratic mutual information. In: 10th International Conference on Digital Information Management, pp. 148–153 (2015)
Calonder, M., Lepetit, V., Strecha, C., Fua, P.: BRIEF: binary robust independent elementary features. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6314, pp. 778–792. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15561-1_56
Rao, T., Ikenaga, T.: Quadrant segmentation and ring-like searching based FPGA implementation of ORB matching system for full-HD video. In: Proceedings of 15th IAPR International Conference on Machine Vision Applications MVA 2017, pp. 89–92 (2017)
Comaniciu, D., Meer, P.: Mean shift: a robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 24(5), 603–619 (2002)
Acknowledgment
This work was supported by the 2020 Final Assignment Recognition Program (RTA) of Universitas Gadjah Mada (No. 2488/UN1.P.III/DIT-LIT/PT/2020).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Adhinata, F.D., Harjoko, A., Wahyono (2020). Object Searching on Video Using ORB Descriptor and Support Vector Machine. In: Hernes, M., Wojtkiewicz, K., Szczerbicki, E. (eds) Advances in Computational Collective Intelligence. ICCCI 2020. Communications in Computer and Information Science, vol 1287. Springer, Cham. https://doi.org/10.1007/978-3-030-63119-2_20
Download citation
DOI: https://doi.org/10.1007/978-3-030-63119-2_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-63118-5
Online ISBN: 978-3-030-63119-2
eBook Packages: Computer ScienceComputer Science (R0)