Stable Monocular Visual Odometry Based on Optical Flow Matching

Liu, Yujia; Shi, Chaoxia; Wang, Yanqing

doi:10.1007/978-981-97-1332-5_2

Yujia Liu⁸,
Chaoxia Shi⁸ &
Yanqing Wang⁹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 2060))

Included in the following conference series:

International Artificial Intelligence Conference

48 Accesses

Abstract

A monocular visual odometry method combining deep learning and geometry is proposed to address the sparsity and position discontinuity problems. An unsupervised optical flow estimation network is constructed to obtain high-quality dense optical flow. Next, optical flow masks and depth masks are proposed for filtering key points. Finally, position estimation and scale recovery are performed based on multi-view geometry. Experiments are extensively validated on the KITTI dataset, and the visual odometry method achieves 3.48 (%) translation error and 0.67 rotation error (\(^{\circ }\)/100 m) outperforming the baseline DF-VO.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 64.99; Price excludes VAT (USA)

Softcover Book: USD 84.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Dai, J., et al.: Deformable convolutional networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 764–773 (2017)
Google Scholar
Forster, C., Pizzoli, M., Scaramuzza, D.: SVO: fast semi-direct monocular visual odometry. In: 2014 IEEE International Conference on Robotics and Automation (ICRA), pp. 15–22. IEEE (2014)
Google Scholar
Geiger, A., Lenz, P., Urtasun, R.: Are we ready for autonomous driving? The KITTI vision benchmark suite. In: Conference on Computer Vision and Pattern Recognition (CVPR) (2012)
Google Scholar
Godard, C., Mac Aodha, O., Firman, M., Brostow, G.J.: Digging into self-supervised monocular depth estimation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 3828–3838 (2019)
Google Scholar
Jonschkowski, R., Stone, A., Barron, J.T., Gordon, A., Konolige, K., Angelova, A.: What matters in unsupervised optical flow. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12347, pp. 557–572. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58536-5_33
Chapter Google Scholar
Kong, L., Yang, J.: MDFlow: unsupervised optical flow learning by reliable mutual knowledge distillation. IEEE Trans. Circuits Syst. Video Technol. 33(2), 677–688 (2023). https://doi.org/10.1109/TCSVT.2022.3205375
Article Google Scholar
Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)
Google Scholar
Liu, H., Huang, D.D., Geng, Z.Y.: Visual odometry algorithm based on deep learning. In: 2021 6th International Conference on Image, Vision and Computing (ICIVC), pp. 322–327. IEEE (2021)
Google Scholar
Liu, P., King, I., Lyu, M.R., Xu, J.: DDFlow: learning optical flow with unlabeled data distillation. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 8770–8777 (2019)
Google Scholar
Liu, P., Lyu, M., King, I., Xu, J.: SelFlow: self-supervised learning of optical flow. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4571–4580 (2019)
Google Scholar
Luo, K., Wang, C., Liu, S., Fan, H., Wang, J., Sun, J.: UPFlow: upsampling pyramid for unsupervised optical flow learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1045–1054 (2021)
Google Scholar
Menze, M., Heipke, C., Geiger, A.: Joint 3D estimation of vehicles and scene flow. In: ISPRS Workshop on Image Sequence Analysis (ISA) (2015)
Google Scholar
Mur-Artal, R., Tardós, J.D.: ORB-SLAM2: an open-source SLAM system for monocular, stereo, and RGB-D cameras. IEEE Trans. Rob. 33(5), 1255–1262 (2017)
Article Google Scholar
Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems 30 (2017)
Google Scholar
Wang, H., Fan, R., Liu, M.: CoT-AMFlow: adaptive modulation network with co-teaching strategy for unsupervised optical flow estimation. In: Conference on Robot Learning, pp. 143–155. PMLR (2021)
Google Scholar
Zhan, H., Garg, R., Weerasekera, C.S., Li, K., Agarwal, H., Reid, I.: Unsupervised learning of monocular depth estimation and visual odometry with deep feature reconstruction. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 340–349 (2018)
Google Scholar
Zhan, H., Weerasekera, C.S., Bian, J.W., Garg, R., Reid, I.: DF-VO: what should be learnt for visual odometry? arXiv preprint arXiv:2103.00933 (2021)
Zou, Y., Luo, Z., Huang, J.-B.: DF-Net: unsupervised joint learning of depth and flow using cross-task consistency. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11209, pp. 38–55. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01228-1_3
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Nanjing University of Science and Technology, Nanjing, 210094, China
Yujia Liu & Chaoxia Shi
Nanjing Xiaozhuang University, Nanjing, 211171, China
Yanqing Wang

Authors

Yujia Liu
View author publications
You can also search for this author in PubMed Google Scholar
Chaoxia Shi
View author publications
You can also search for this author in PubMed Google Scholar
Yanqing Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Chaoxia Shi or Yanqing Wang .

Editor information

Editors and Affiliations

Huazhong University of Science and Technology, Wuhan, Hubei, China
Hai Jin
Chinese Academy of Science, Shenzhen, China
Yi Pan
Nanjing University of Science and Technology, Nanjing, China
Jianfeng Lu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liu, Y., Shi, C., Wang, Y. (2024). Stable Monocular Visual Odometry Based on Optical Flow Matching. In: Jin, H., Pan, Y., Lu, J. (eds) Computer Networks and IoT. IAIC 2023. Communications in Computer and Information Science, vol 2060. Springer, Singapore. https://doi.org/10.1007/978-981-97-1332-5_2

Download citation

DOI: https://doi.org/10.1007/978-981-97-1332-5_2
Published: 03 April 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-1331-8
Online ISBN: 978-981-97-1332-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Stable Monocular Visual Odometry Based on Optical Flow Matching