Skip to main content

Object Searching on Video Using ORB Descriptor and Support Vector Machine

  • Conference paper
  • First Online:
Advances in Computational Collective Intelligence (ICCCI 2020)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1287))

Included in the following conference series:

  • 1397 Accesses

Abstract

One of the main stages in object searching on video is extracting object regions from video. Template matching is popular technique for performing a such task. However, the use of template matching has a limitation that requires a large object as a template. If the template size is too small, it would obtain few features. On the other hand, ORB descriptors are often used for representing the object with a good accuracy and fast processing time. Therefore, this research proposed to use machine learning method combining with ORB descriptor for object searching on video data. Processing video in all frames is inefficient. Thus, frames are selected into keyframes using mutual information entropy. The ORB descriptors are then extracted from selected frame in order to find candidate region of objects. To verify and classify the object regions, multiclass support vector machine was used to train ORB descriptor of regions. For evaluation, the use of ORB would be compared with other descriptor, such as SIFT and SURF for showing its superiority in both accuracy and processing time. In experiment, it is found that object searching with ORB descriptor performs faster processing time, which is 0.219 s, while SIFT 1.011 s and SURF 0.503 s. Meanwhile, it also achieves the best F1 value, which is 0.9 compared to SIFT 0.63 and SURF 0.65.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Ouyang, S.Z., Zhong, L., Luo, R.Q.: The comparison and analysis of extracting video key frame. IOP Conf. Ser. Mater. Sci. Eng. 359(1), 012010 (2018)

    Article  Google Scholar 

  2. Sun, L., Zhou, Y.: A key frame extraction method based on mutual information and image entropy. In: 2011 International Conference on Multimedia Technology, vol. 1, no. 6077, pp. 35–38 (2011)

    Google Scholar 

  3. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)

    Article  Google Scholar 

  4. Bay, H., Ess, A., Tuytelaars, T., Vangool, L.: Speeded-up robust features (SURF). Comput. Vis. Image Underst. 110(3), 346–359 (2006)

    Article  Google Scholar 

  5. Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: Proceedings of IEEE International Conference on Computer Vision, pp. 2564–2571 (2011)

    Google Scholar 

  6. Zhou, D., Tian, Y., Li, X., Wu, J.: ORB-based template matching through convolutional features map. In: Proceedings of 2019 Chinese Automation Congress, pp. 4695–4699 (2019)

    Google Scholar 

  7. Purohit, M., Yadav, A.R.: Comparison of feature extraction techniques to recognize traffic rule violations using low processing embedded system. In: 2018 5th International Conference on Signal Process. Integration Networks, pp. 154–158 (2018)

    Google Scholar 

  8. Murugeswari, M., Veluchamy, S.: Hand gesture recognition system for real-time application. In: 2014 IEEE International Conference on Advanced Communications, Control and Computing Technologies ICACCCT 2014, no. 978, pp. 1220–1225 (2014)

    Google Scholar 

  9. Li, Q., Wang, X.: Image classification based on SIFT and SVM. In: Proceedings of 17th IEEE/ACIS Conference on Computer and Information Science, vol. 1, no. 1, pp. 762–765 (2018)

    Google Scholar 

  10. Jabnoun, H., Benzarti, F., Amiri, H.: Object recognition for blind people based on features extraction. In: International Image Processing, Applications and Systems Conference, pp. 1–6 (2014)

    Google Scholar 

  11. Yu, H., Kong, L.: An optimization of video sequence stitching method. In: 2018 IEEE Conference on Mechatronics and Automation, pp. 387–391 (2018)

    Google Scholar 

  12. Toapanta, S.M.T., Cruz, A.A. C., Gallegos, L.E.M., Trejo, J.A.O.: Algorithms for efficient biometric systems to mitigate the integrity of a distributed database. In: CITS 2018 - 2018 International Conference on Computer, Information and Telecommunication Systems, pp. 1–5 (2018)

    Google Scholar 

  13. Sainui, J., Sugiyama, M.: Minimum dependency key frames selection via quadratic mutual information. In: 10th International Conference on Digital Information Management, pp. 148–153 (2015)

    Google Scholar 

  14. Calonder, M., Lepetit, V., Strecha, C., Fua, P.: BRIEF: binary robust independent elementary features. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6314, pp. 778–792. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15561-1_56

    Chapter  Google Scholar 

  15. Rao, T., Ikenaga, T.: Quadrant segmentation and ring-like searching based FPGA implementation of ORB matching system for full-HD video. In: Proceedings of 15th IAPR International Conference on Machine Vision Applications MVA 2017, pp. 89–92 (2017)

    Google Scholar 

  16. Comaniciu, D., Meer, P.: Mean shift: a robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 24(5), 603–619 (2002)

    Article  Google Scholar 

Download references

Acknowledgment

This work was supported by the 2020 Final Assignment Recognition Program (RTA) of Universitas Gadjah Mada (No. 2488/UN1.P.III/DIT-LIT/PT/2020).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wahyono .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Adhinata, F.D., Harjoko, A., Wahyono (2020). Object Searching on Video Using ORB Descriptor and Support Vector Machine. In: Hernes, M., Wojtkiewicz, K., Szczerbicki, E. (eds) Advances in Computational Collective Intelligence. ICCCI 2020. Communications in Computer and Information Science, vol 1287. Springer, Cham. https://doi.org/10.1007/978-3-030-63119-2_20

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-63119-2_20

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-63118-5

  • Online ISBN: 978-3-030-63119-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics