Abstract
Visual odometry (VO) is a fundamental and challenging problem in both the computer vision community and the robotics community. VO refers the process of recovering the relative movements of a camera by analyzing the associated image sequence. While VO is generally formulated as descriptors-based feature tracking with outliers rejection and global optimization, these algorithms are not only computationally expensive but also lack robustness. In the paper, a biologically inspired solution to the monocular visual odometry problem was presented, which was named as shunting short-term memory monocular visual odometry. The proposed method is simple and concise in both concept and implementation. To be more specific, it utilizes the shunting short-term memory to represent the key frames and the latest observations and also to adapt to uncertainties and ambiguities. And scan matching scheme is adopted to search the movement that best explained the difference between the latest observation and the key frame. Because of the dynamic properties of the neural network, the proposed method requires neither explicit extraction of features and descriptors, nor outliers detection and bundle optimization. Theoretical analysis in the paper showed that the proposed method has Lyapunov stability and constant computational complexity. The proposed method was also compared with the classical monocular VO algorithm in real indoor environments, and the experimental results proved that the proposed method outperforms the classical method on both effectiveness and robustness.







Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Scaramuzza D, Fraundorfer F (2011) Visual odometry [tutorial]. Robot Autom Mag IEEE 18(4):80–92
Nistér D, Naroditsky O, Bergen J (2004) Visual odometry. In: Proceedings of the 2004 IEEE computer society conference on computer vision and pattern recognition, 2004. CVPR 2004, vol 1, IEEE pp 1–652
Fraundorfer F, Scaramuzza D (2012) Visual odometry: part II: matching, robustness, optimization, and applications. Robot Autom Mag IEEE 19(2):78–90
Li Y, Olson EB (2011) Structure tensors for general purpose lidar feature extraction. In: 2011 IEEE international conference on robotics and automation (ICRA). IEEE, pp 1869–1874
Cao Z, Cheng L, Zhou C, Gu N, Wang X, Tan M (2015) Spiking neural network-based target tracking control for autonomous mobile robots. Neural Comput Appl 26(8):1839–1847
Forster C, Pizzoli M, Scaramuzza D (2014) Svo: fast semi-direct monocular visual odometry. In: 2014 IEEE international conference on robotics and automation (ICRA), May 2014, pp 15–22
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Bay H, Ess A, Tuytelaars T, Van Gool L (2008) Speeded-up robust features (surf). Comput Vis Image Underst 110(3):346–359
Zheng Y, Jeon B, Xu D, Wu Q, Zhang H (2015) Image segmentation by generalized hierarchical fuzzy c-means algorithm. J Intell Fuzzy Syst Appl Eng Technol 28(2):961–973
Chen B, Shu H, Coatrieux G, Chen G, Sun X, Coatrieux JL (2015) Color image analysis by quaternion-type moments. J Math Imaging Vis 51(1):124–144
Xia Z, Wang X, Sun X, Liu Q, Xiong N (2016) Steganalysis of LSB matching using differences between nonadjacent pixels. Multimed Tools Appl 75:1947–1962
Li Y, Li S, Song Q, Liu H, Meng MQH (2014) Fast and robust data association using posterior based approximate joint compatibility test. IEEE Trans Ind Inf 10(1):331–339
Li J, Li X, Yang B, Sun X (2015) Segmentation-based image copy-move forgery detection scheme. IEEE Trans Inf Forensics Secur 10(3):507–518
Pan Z, Zhang Y, Kwong S (2015) Efficient motion and disparity estimation optimization for low complexity multiview video coding. IEEE Trans Broadcast 61(2):166–176
Nistér D (2004) An efficient solution to the five-point relative pose problem. IEEE Trans Pattern Anal Mach Intell 26(6):756–770
Williams B, Reid I (2010) On combining visual slam and visual odometry. In: 2010 IEEE international conference on robotics and automation (ICRA). IEEE pp 3494–3500
Mur-Artal R, Montiel J, Tardós JD (2015) Orb-slam: a versatile and accurate monocular slam system. IEEE Trans Robot 31(5):1147–1163
Gu B, Sheng V, Tay KY, Romano W, Li S (2015) Incremental support vector learning for ordinal regression. IEEE Trans Neural Netw Learn Syst 26(7):1403–1416
Li S, Liu B, Li Y (2013) Selective positive-negative feedback produces the winner-take-all competition in recurrent neural networks. IEEE Trans Neural Netw Learn Syst 24(2):301–309
Luo X, You Z, Zhou M, Li S, Leung H, Xia Y, Zhu Q (2015) A highly efficient approach to protein interactome mapping based on collaborative filtering framework. Sci Rep 5:7702
Duan S, Dong Z, Hu X, Wang L, Li H (2016) Small-world hopfield neural networks with weight salience priority and memristor synapses for digit recognition. Neural Comput Appl 27(4):837–844
Zhao D, Ferguson AA, Jiang N (2016) What makes up plant genomes: the vanishing line between transposable elements and genes. Biochim Biophys Acta (BBA) Gene Regul Mech 1859(2):366–380
Li S, Cui H, Li Y, Liu B, Lou Y (2013) Decentralized control of collaborative redundant manipulators with partial command coverage via locally connected recurrent neural networks. Neural Comput Appl 23(3–4):1051–1060
Li Y, Li S, Ge Y (2013) A biologically inspired solution to simultaneous localization and consistent mapping in dynamic environments. Neurocomputing 104:170–179
Li S, Wang Z, Li Y (2013) Using Laplacian eigenmap as heuristic information to solve nonlinear constraints defined on a graph and its application in distributed range-free localization of wireless sensor networks. Neural Process Lett 37(3):411–424
Gu B, Sheng VS, Wang Z, Ho D, Osman S, Li S (2015) Incremental learning for \(\nu\)-support vector regression. Neural Netw 67:140–150
Chen L, Liu C, Wu R, He Y, Chai Y (2016) Finite-time stability criteria for a class of fractional-order neural networks with delay. Neural Comput Appl 27(3):549–556
Grossberg S (1988) Nonlinear neural networks: principles, mechanisms, and architectures. Neural Netw 1(1):17–61
Hopfield JJ (1995) Pattern recognition computation using action potential timing for stimulus representation. Nature 376(6535):33–36
Scaramuzza D, Siegwart R (2008) Appearance-guided monocular omnidirectional visual odometry for outdoor ground vehicles. IEEE Trans Robot 24(5):1015–1026
Hartley R, Zisserman A (2003) Multiple view geometry in computer vision. Cambridge University Press, Cambridge
Varga RS (1962) Matrix iterative analysis. Prentice Hall, Englewood Cliffs
Li Y, Olson EB (2012) IPJC: The incremental posterior joint compatibility test for fast feature cloud matching. In: 2012 IEEE/RSJ international conference on intelligent robots and systems (IROS) (Oct. 2012), pp 147 –153
Acknowledgments
The authors would like to thank projects “A Project Funded by the Priority Academic Program Development of Jiangsu Higher Education Institutions” and “Jiangsu Collaborative Innovation Center on Atmospheric Environment and Equipment Technology” for their final support.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Li, Y., Zhang, J. & Li, S. STMVO: biologically inspired monocular visual odometry. Neural Comput & Applic 29, 215–225 (2018). https://doi.org/10.1007/s00521-016-2536-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-016-2536-9