Skip to main content

OGB: A Distinctive and Efficient Feature for Mobile Augmented Reality

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9516))

Abstract

The distinctiveness and efficiency of a feature descriptor used for object recognition and tracking are fundamental to the user experience of a mobile augmented reality (MAR) system. However, existing descriptors are either too compute-expensive to achieve real-time performance on a mobile device, or not sufficiently distinctive to identify correct matches from a large database. As a result, current MAR systems are still limited in both functionalities and capabilities, which greatly restrict their deployment in practice. In this paper, we propose a highly distinctive and efficient binary descriptor, called Oriented Gradients Binary (OGB). OGB captures the major edge/gradient structure that is an important characteristic of local shapes and appearance. Specifically, OGB computes the distribution of major edge/gradient directions within an image patch. To achieve high efficiency, aggressive down-sampling is applied to the patch to significantly reduce the computational complexity, while maintaining major edge/gradient directions within the patch. Comparing to the state-of-the-art binary descriptors including ORB, BRISK and FREAK, which are primarily designed for speed, OGB has similar construction efficiency, while achieves a superior performance for both object recognition and tracking tasks running on a mobile handheld device.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Compu. Vision 60(2), 91–110 (2004)

    Article  Google Scholar 

  2. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proceedings of CVPR 2005 (2005)

    Google Scholar 

  3. Calonder, M., Lepetit, V., Strecha, C., Fua, P.: Brief: binary robust independent elementary features. In: Proceedings of ECCV 2010 (2010)

    Google Scholar 

  4. Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: Proceedings of ICCV 2011, Barcelona, Spain (2011)

    Google Scholar 

  5. Leutenegger, S., Chli, M., Siegwart, R.: BRISK: binary robust invariant scalable keypoints. In: Proceedings of CVPR 2011 (2011)

    Google Scholar 

  6. Alahi, A., Ortiz, R., Vandergheynst, P.: FREAK: fast retinal keypoint. In: Proceedings of CVPR 2012 (2012)

    Google Scholar 

  7. Salakhutdinov, R., Hinton, G.: Semantic hashing. Int. J. Approximate Reasoning, 3 (2009)

    Google Scholar 

  8. Weiss, Y., Fergus, R., Torralba, A.: Spectral hashing. In: Proceedings of NIPS 2009, pp: 1753–1760 (2009)

    Google Scholar 

  9. Gong, Y., Lazebnik, S., Gordo, A., Perronnin, F.: Iterative quantization: a procrustean approach to learning binanry codes for large-scale image retrieval. IEEE Trans. PAMI (2012)

    Google Scholar 

  10. Wang, J., Kumar, S., Chang, S.-F. Sequential projection learning for hashing with compact codes. In: Proceedings of ICML 2010 (2010)

    Google Scholar 

  11. Wagner, D., Reitmayr, G., Mulloni, A., Drummond, T., Schmalstieg, D.: Pose tracking from natural features on mobile phones. In: Proceedings of ISMAR 2008 (2008)

    Google Scholar 

  12. Wagner, D., Schmalstieg, D., Bischof, H.: Multiple target detection and tracking with guaranteed framerates on mobile phones. In: Proceedings of ISMAR 2009 (2009)

    Google Scholar 

  13. Wagner, D., Mulloni, A., Langlotz, T., Schmalstieg, D.: Real-time panoramic mapping and tracking on mobile phones. In: Proceedings of IEEE VR 2010 (2010)

    Google Scholar 

  14. Klein, G., Murray, D.: Parallel tracking and mapping on a camera phone. In: Proceedings of ISMAR 2009, Orlando (October 2009)

    Google Scholar 

  15. Parker, J., Kenyon, R., Troxel, D.: Comparison of interpolating methods for image resampling. IEEE Trans. Med. Imaging 2(1), 31–39 (1983)

    Article  Google Scholar 

  16. Rosten, E., Porter, R., Drummond, T.: Faster and better: a machine learning approach to corner detection. IEEE Trans. PAMI 32, 105–119 (2010)

    Article  Google Scholar 

  17. OpenCV. http://sourceforge.net/projects/opencvlibrary/

  18. ImageSequences. http://www.robots.ox.ac.uk/~vgg/research/affine

  19. Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The PASCAL visual object classes challenge (2009)

    Google Scholar 

  20. Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Comm. ACM 24(6), 381–395 (1981)

    Article  MathSciNet  Google Scholar 

  21. Chum, O., Matas, J.: Matching with PROSAC – progressive sample consensus. In: Proceedings of CVPR 2005, vol. 1, pp. 220–226 (2005)

    Google Scholar 

  22. Gionis, A., Indyk, P., Motwani, R.: Similarity search in high dimensions via hashing. In: Proceedings of VLDB (1999)

    Google Scholar 

  23. Lv, Q., Josephson, W., Wang, Z., Charikar, M., Li, K.: Multi-probe LSH: efficient indexing for high-dimensional similarity search. In: Proceedings of VLDB (2007)

    Google Scholar 

  24. Hong, R.C., Tang, L.X., Hu, J., Li, G.D., Jiang, J.G.: Advertising object in web videos. Neurocomputing 119, 118–124 (2013)

    Article  Google Scholar 

  25. Wang, M., Li, G.D., Lu, Z., Gao, Y., Chua, T.-S.: When amazon meets google: product visualization by exploring multiple information sources. ACM Trans. Internet Technol. 12(4), Article 2 (2013)

    Article  Google Scholar 

  26. Wang, M., Li, H., Tao, D.C., Lu, K., Wu, X.D.: Multimodal graph-based reranking for web image search. IEEE Trans. Image Process. 21(11), 4649–4661 (2012)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xin Yang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Yang, X., Wang, X., Cheng, KT.(. (2016). OGB: A Distinctive and Efficient Feature for Mobile Augmented Reality. In: Tian, Q., Sebe, N., Qi, GJ., Huet, B., Hong, R., Liu, X. (eds) MultiMedia Modeling. MMM 2016. Lecture Notes in Computer Science(), vol 9516. Springer, Cham. https://doi.org/10.1007/978-3-319-27671-7_40

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-27671-7_40

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-27670-0

  • Online ISBN: 978-3-319-27671-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics