A Coarse to Fine Object Proposal Framework for Autonomous Driving Object Detection Using Binocular Image

Liu, Xiaolong; Cai, Wanzeng; Liang, Zhengfa; Feng, Yiliu

doi:10.1007/978-981-10-3969-0_25

Part of the book series: Communications in Computer and Information Science (CCIS, volume 699)

Included in the following conference series:

International Conference on Geo-Informatics in Resource Management and Sustainable Ecosystem

Abstract

Widely used object proposal methods commonly achieve satisfactory results on datasets captured in simple scenes, but their performance degrades in complicated real traffic scenes. In this paper, we propose a coarse-to-fine object proposal generation framework for autonomous driving object detection that provides a better object proposal solution in complex circumstances. By adding several low-level geometric features, which can be efficiently computed from binocular images, we recalculate the scores of the candidate bounding boxes generated by coarse region proposal approaches using a Bayesian probability model. Our proposal generation approach is validated on the challenging KITTI benchmark, achieving state-of-the-art object proposal performance for pedestrians, cars and cyclists.
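
The rescoring idea in the abstract can be illustrated with a minimal sketch. It is not the authors' model: the abstract does not name the geometric features or the form of the Bayesian model, so the feature names (`height_above_ground_m`, `point_density`), the Gaussian class-conditional densities, their parameters, and the use of the coarse score as a prior are all assumptions chosen only for illustration.

```python
import math

def log_gaussian(x, mean, std):
    """Log density of a 1-D Gaussian with the given mean and standard deviation."""
    z = (x - mean) / std
    return -0.5 * z * z - math.log(std * math.sqrt(2.0 * math.pi))

# Hypothetical per-feature (mean, std) parameters; in practice these would be
# estimated from labelled training boxes. They are NOT taken from the paper.
OBJ_MODEL = {"height_above_ground_m": (0.1, 0.3), "point_density": (0.7, 0.15)}
BG_MODEL = {"height_above_ground_m": (1.5, 1.0), "point_density": (0.3, 0.2)}

def rescore(coarse_score, features, obj_model=OBJ_MODEL, bg_model=BG_MODEL):
    """Posterior P(object | features) under a naive Bayes assumption.

    The coarse proposal score (assumed to lie in [0, 1]) is used as the prior.
    """
    prior = min(max(coarse_score, 1e-6), 1.0 - 1e-6)
    log_obj = math.log(prior)
    log_bg = math.log(1.0 - prior)
    for name, value in features.items():
        log_obj += log_gaussian(value, *obj_model[name])
        log_bg += log_gaussian(value, *bg_model[name])
    # Normalise in the log domain to avoid underflow.
    m = max(log_obj, log_bg)
    num = math.exp(log_obj - m)
    return num / (num + math.exp(log_bg - m))

def rerank(candidates):
    """Return (new_score, box) pairs sorted from most to least object-like."""
    scored = [(rescore(c["score"], c["features"]), c["box"]) for c in candidates]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)
```

Given two candidate boxes with the same coarse score, the one whose stereo-derived features look more object-like under the assumed densities moves to the top of the ranking, which is the coarse-to-fine behaviour the abstract describes.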

Author information

Authors and Affiliations

College of Computer, National University of Defense Technology, Changsha, China
Xiaolong Liu, Wanzeng Cai, Zhengfa Liang & Yiliu Feng

Corresponding author

Correspondence to Xiaolong Liu.

Editor information

Editors and Affiliations

Beijing Institute of Technology, Beijing, China
Hanning Yuan
Beijing Institute of Technology, Beijing, China
Jing Geng
Wuhan University, Wuhan, China
Fuling Bian

About this paper

Cite this paper

Liu, X., Cai, W., Liang, Z., Feng, Y. (2017). A Coarse to Fine Object Proposal Framework for Autonomous Driving Object Detection Using Binocular Image. In: Yuan, H., Geng, J., Bian, F. (eds) Geo-Spatial Knowledge and Intelligence. GRMSE 2016. Communications in Computer and Information Science, vol 699. Springer, Singapore. https://doi.org/10.1007/978-981-10-3969-0_25

DOI: https://doi.org/10.1007/978-981-10-3969-0_25
Published: 03 March 2017
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3968-3
Online ISBN: 978-981-10-3969-0
eBook Packages: Computer Science, Computer Science (R0)