Skip to main content
Log in

STMVO: biologically inspired monocular visual odometry

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

Visual odometry (VO) is a fundamental and challenging problem in both the computer vision community and the robotics community. VO refers the process of recovering the relative movements of a camera by analyzing the associated image sequence. While VO is generally formulated as descriptors-based feature tracking with outliers rejection and global optimization, these algorithms are not only computationally expensive but also lack robustness. In the paper, a biologically inspired solution to the monocular visual odometry problem was presented, which was named as shunting short-term memory monocular visual odometry. The proposed method is simple and concise in both concept and implementation. To be more specific, it utilizes the shunting short-term memory to represent the key frames and the latest observations and also to adapt to uncertainties and ambiguities. And scan matching scheme is adopted to search the movement that best explained the difference between the latest observation and the key frame. Because of the dynamic properties of the neural network, the proposed method requires neither explicit extraction of features and descriptors, nor outliers detection and bundle optimization. Theoretical analysis in the paper showed that the proposed method has Lyapunov stability and constant computational complexity. The proposed method was also compared with the classical monocular VO algorithm in real indoor environments, and the experimental results proved that the proposed method outperforms the classical method on both effectiveness and robustness.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  1. Scaramuzza D, Fraundorfer F (2011) Visual odometry [tutorial]. Robot Autom Mag IEEE 18(4):80–92

    Article  Google Scholar 

  2. Nistér D, Naroditsky O, Bergen J (2004) Visual odometry. In: Proceedings of the 2004 IEEE computer society conference on computer vision and pattern recognition, 2004. CVPR 2004, vol 1, IEEE pp 1–652

  3. Fraundorfer F, Scaramuzza D (2012) Visual odometry: part II: matching, robustness, optimization, and applications. Robot Autom Mag IEEE 19(2):78–90

    Article  Google Scholar 

  4. Li Y, Olson EB (2011) Structure tensors for general purpose lidar feature extraction. In: 2011 IEEE international conference on robotics and automation (ICRA). IEEE, pp 1869–1874

  5. Cao Z, Cheng L, Zhou C, Gu N, Wang X, Tan M (2015) Spiking neural network-based target tracking control for autonomous mobile robots. Neural Comput Appl 26(8):1839–1847

    Article  Google Scholar 

  6. Forster C, Pizzoli M, Scaramuzza D (2014) Svo: fast semi-direct monocular visual odometry. In: 2014 IEEE international conference on robotics and automation (ICRA), May 2014, pp 15–22

  7. Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110

    Article  Google Scholar 

  8. Bay H, Ess A, Tuytelaars T, Van Gool L (2008) Speeded-up robust features (surf). Comput Vis Image Underst 110(3):346–359

    Article  Google Scholar 

  9. Zheng Y, Jeon B, Xu D, Wu Q, Zhang H (2015) Image segmentation by generalized hierarchical fuzzy c-means algorithm. J Intell Fuzzy Syst Appl Eng Technol 28(2):961–973

    Google Scholar 

  10. Chen B, Shu H, Coatrieux G, Chen G, Sun X, Coatrieux JL (2015) Color image analysis by quaternion-type moments. J Math Imaging Vis 51(1):124–144

    Article  MathSciNet  MATH  Google Scholar 

  11. Xia Z, Wang X, Sun X, Liu Q, Xiong N (2016) Steganalysis of LSB matching using differences between nonadjacent pixels. Multimed Tools Appl 75:1947–1962

    Article  Google Scholar 

  12. Li Y, Li S, Song Q, Liu H, Meng MQH (2014) Fast and robust data association using posterior based approximate joint compatibility test. IEEE Trans Ind Inf 10(1):331–339

    Article  Google Scholar 

  13. Li J, Li X, Yang B, Sun X (2015) Segmentation-based image copy-move forgery detection scheme. IEEE Trans Inf Forensics Secur 10(3):507–518

    Article  Google Scholar 

  14. Pan Z, Zhang Y, Kwong S (2015) Efficient motion and disparity estimation optimization for low complexity multiview video coding. IEEE Trans Broadcast 61(2):166–176

    Article  Google Scholar 

  15. Nistér D (2004) An efficient solution to the five-point relative pose problem. IEEE Trans Pattern Anal Mach Intell 26(6):756–770

    Article  Google Scholar 

  16. Williams B, Reid I (2010) On combining visual slam and visual odometry. In: 2010 IEEE international conference on robotics and automation (ICRA). IEEE pp 3494–3500

  17. Mur-Artal R, Montiel J, Tardós JD (2015) Orb-slam: a versatile and accurate monocular slam system. IEEE Trans Robot 31(5):1147–1163

    Article  Google Scholar 

  18. Gu B, Sheng V, Tay KY, Romano W, Li S (2015) Incremental support vector learning for ordinal regression. IEEE Trans Neural Netw Learn Syst 26(7):1403–1416

    Article  MathSciNet  Google Scholar 

  19. Li S, Liu B, Li Y (2013) Selective positive-negative feedback produces the winner-take-all competition in recurrent neural networks. IEEE Trans Neural Netw Learn Syst 24(2):301–309

    Article  MathSciNet  Google Scholar 

  20. Luo X, You Z, Zhou M, Li S, Leung H, Xia Y, Zhu Q (2015) A highly efficient approach to protein interactome mapping based on collaborative filtering framework. Sci Rep 5:7702

    Article  Google Scholar 

  21. Duan S, Dong Z, Hu X, Wang L, Li H (2016) Small-world hopfield neural networks with weight salience priority and memristor synapses for digit recognition. Neural Comput Appl 27(4):837–844

    Article  Google Scholar 

  22. Zhao D, Ferguson AA, Jiang N (2016) What makes up plant genomes: the vanishing line between transposable elements and genes. Biochim Biophys Acta (BBA) Gene Regul Mech 1859(2):366–380

    Article  Google Scholar 

  23. Li S, Cui H, Li Y, Liu B, Lou Y (2013) Decentralized control of collaborative redundant manipulators with partial command coverage via locally connected recurrent neural networks. Neural Comput Appl 23(3–4):1051–1060

    Article  Google Scholar 

  24. Li Y, Li S, Ge Y (2013) A biologically inspired solution to simultaneous localization and consistent mapping in dynamic environments. Neurocomputing 104:170–179

    Article  Google Scholar 

  25. Li S, Wang Z, Li Y (2013) Using Laplacian eigenmap as heuristic information to solve nonlinear constraints defined on a graph and its application in distributed range-free localization of wireless sensor networks. Neural Process Lett 37(3):411–424

    Article  Google Scholar 

  26. Gu B, Sheng VS, Wang Z, Ho D, Osman S, Li S (2015) Incremental learning for \(\nu\)-support vector regression. Neural Netw 67:140–150

    Article  Google Scholar 

  27. Chen L, Liu C, Wu R, He Y, Chai Y (2016) Finite-time stability criteria for a class of fractional-order neural networks with delay. Neural Comput Appl 27(3):549–556

    Article  Google Scholar 

  28. Grossberg S (1988) Nonlinear neural networks: principles, mechanisms, and architectures. Neural Netw 1(1):17–61

    Article  Google Scholar 

  29. Hopfield JJ (1995) Pattern recognition computation using action potential timing for stimulus representation. Nature 376(6535):33–36

    Article  Google Scholar 

  30. Scaramuzza D, Siegwart R (2008) Appearance-guided monocular omnidirectional visual odometry for outdoor ground vehicles. IEEE Trans Robot 24(5):1015–1026

    Article  Google Scholar 

  31. Hartley R, Zisserman A (2003) Multiple view geometry in computer vision. Cambridge University Press, Cambridge

    MATH  Google Scholar 

  32. Varga RS (1962) Matrix iterative analysis. Prentice Hall, Englewood Cliffs

    Google Scholar 

  33. Li Y, Olson EB (2012) IPJC: The incremental posterior joint compatibility test for fast feature cloud matching. In: 2012 IEEE/RSJ international conference on intelligent robots and systems (IROS) (Oct. 2012), pp 147 –153

Download references

Acknowledgments

The authors would like to thank projects “A Project Funded by the Priority Academic Program Development of Jiangsu Higher Education Institutions” and “Jiangsu Collaborative Innovation Center on Atmospheric Environment and Equipment Technology” for their final support.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yangming Li.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, Y., Zhang, J. & Li, S. STMVO: biologically inspired monocular visual odometry. Neural Comput & Applic 29, 215–225 (2018). https://doi.org/10.1007/s00521-016-2536-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-016-2536-9

Keywords

Navigation