MC-HDCNN: Computing the Stereo Matching Cost with a Hybrid Dilated Convolutional Neural Network

Liu, Yunhong; Huang, Yizhu

doi:10.1007/978-3-030-36808-1_16

Yunhong Liu⁹ &
Yizhu Huang⁹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1142))

Included in the following conference series:

International Conference on Neural Information Processing

2948 Accesses

Abstract

Designing a model to quickly obtain an accurate matching cost is a vital problem in the stereo matching method. We present an algorithm called MC-HDCNN, which is based on hybrid dilated convolution neural network, for computing matching cost of two image patches. HDCNN uses the dilated convolution of the series to obtain a larger receptive field, while avoiding the “gridding” effect and ensuring the integrity of the receptive field. In addition, by adding batch normalization layer after each layer of the convolution, the gradient dispersion in the backward propagation and the generalization of the network can be improved effectively. We evaluate our method on the KITTI stereo data set. The results show that the proposed algorithm has certain advantages in accuracy and speed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

MC-DCNN: Dilated Convolutional Neural Network for Computing Stereo Matching Cost

Guided aggregation and disparity refinement for real-time stereo matching

Article 11 March 2024

Improved Stereo Matching Accuracy Based on Selective Backpropagation and Extended Cost Volume

Article 29 April 2022

References

Scharstein, D., Szeliski, R.: A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. In: IEEE SMBV, pp. 131–140 (2001)
Google Scholar
Žbontar, J., LeCun, Y.: Stereo matching by training a convolutional neural network to compare image patches. JMLR 17(1), 2287–2318 (2016)
MATH Google Scholar
Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift. In: International Conference on Machine Learning, pp. 448–456 (2015)
Google Scholar
Yoon, K.J., Kweon, I.S.: Adaptive support-weight approach for correspondence search. IEEE Trans. Pattern Anal. Mach. Intell. 28, 650–656 (2006)
Article Google Scholar
Lei, C., Selzer, J., Yang, Y.H.: Region-tree based stereo using dynamic programming optimization. In: IEEE CVPR, pp. 2378–2385 (2006)
Google Scholar
Kolmogorov, V., Zabih, R.: Computing visual correspondence with occlusions using graph cuts. In: IEEE ICCV, pp. 508–515 (2001)
Google Scholar
Hirschmüller, H.: Semi-global matching-motivation, developments and applications. Photogram. Week 11, 173–184 (2011)
Google Scholar
Žbontar, J., LeCun, Y.: Computing the stereo matching cost with a convolutional neural network. In: IEEE CVPR, pp. 1592–1599 (2015)
Google Scholar
Luo, W., Schwing, A.G., Urtasun, R.: Efficient deep learning for stereo matching. In: IEEE CVPR, pp. 5695–5703 (2016)
Google Scholar
Park, H., Lee, K.M.: Look wider to match image patches with convolutional neural networks. IEEE Signal Process. Lett. 24, 1788–1792 (2017)
Article Google Scholar
Mayer, N., Ilg, E., Häusser, P., et al.: A large dataset to train convolutional networks for disparity, optical flow, and scene flow estimation. In: IEEE CVPR, pp. 4040–4048 (2016)
Google Scholar
Kendall, A., Martirosyan, H., Dasgupta, S., et al.: End-to-end learning of geometry and context for deep stereo regression. In: IEEE ICCV, pp. 66–75 (2017)
Google Scholar
Song, X., Zhao, X., Hu, H., Fang, L.: EdgeStereo: a context integrated residual pyramid network for stereo matching. In: Jawahar, C.V., Li, H., Mori, G., Schindler, K. (eds.) ACCV 2018. LNCS, vol. 11365, pp. 20–35. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-20873-8_2
Chapter Google Scholar
Wang, P., Chen, P., Yuan, Y., et al.: Understanding convolution for semantic segmentation. In: IEEE WACV, PP. 1451–1460 (2018)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian, China
Yunhong Liu & Yizhu Huang

Authors

Yunhong Liu
View author publications
You can also search for this author in PubMed Google Scholar
Yizhu Huang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yunhong Liu .

Editor information

Editors and Affiliations

Australian National University, Canberra, ACT, Australia
Tom Gedeon
Murdoch University, Murdoch, WA, Australia
Kok Wai Wong
Kyungpook National University, Daegu, Korea (Republic of)
Minho Lee

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liu, Y., Huang, Y. (2019). MC-HDCNN: Computing the Stereo Matching Cost with a Hybrid Dilated Convolutional Neural Network. In: Gedeon, T., Wong, K., Lee, M. (eds) Neural Information Processing. ICONIP 2019. Communications in Computer and Information Science, vol 1142. Springer, Cham. https://doi.org/10.1007/978-3-030-36808-1_16

Download citation

DOI: https://doi.org/10.1007/978-3-030-36808-1_16
Published: 05 December 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-36807-4
Online ISBN: 978-3-030-36808-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics