Directional Edge Boxes: Exploiting Inner Normal Direction Cues for Effective Object Proposal Generation

Bai, Xiang; Zhang, Zheng; Wang, Hong-Yang; Shen, Wei

doi:10.1007/s11390-017-1752-9

Regular Paper
Published: 14 July 2017

Volume 32, pages 701–713, (2017)

Journal of Computer Science and Technology

Xiang Bai¹, Zheng Zhang¹, Hong-Yang Wang¹ & Wei Shen²


Abstract

Edges are important cues for localizing object proposals. Recent progress on this problem has been driven mostly by effective objectness measures defined on edge cues. In this paper, we develop a new representation named directional edges, in which each edge pixel is assigned a direction pointing toward the object center; the directions are obtained by learning a direction prediction model with convolutional neural networks in a holistic manner. Based on directional edges, two new objectness measures are designed for ranking object proposals. Experiments show that, with 1,000 proposals on the PASCAL VOC 2007 test set, the proposed method achieves 97.1% object recall at an overlap threshold of 0.5 and 81.9% object recall at an overlap threshold of 0.7, which is superior to state-of-the-art methods.
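
The recall figures in the abstract are reported at a fixed overlap threshold and a fixed proposal budget. A minimal sketch of how such a number can be computed is given below. It is not the authors' evaluation code: the names `iou` and `recall_at`, the box layout (x_min, y_min, x_max, y_max), the toy boxes, and the choice to count a ground-truth object as recalled when its overlap is greater than or equal to the threshold are all assumptions made for illustration.

```python
from typing import Sequence, Tuple

# Assumed box layout: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union (overlap) of two axis-aligned boxes."""
    inter_w = min(a[2], b[2]) - max(a[0], b[0])
    inter_h = min(a[3], b[3]) - max(a[1], b[1])
    if inter_w <= 0 or inter_h <= 0:
        return 0.0  # boxes do not intersect
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)


def recall_at(gt: Sequence[Box], proposals: Sequence[Box],
              threshold: float, top_k: int) -> float:
    """Fraction of ground-truth boxes matched by any of the top_k proposals.

    `proposals` is assumed to be sorted by objectness score, best first.
    A match is counted when IoU >= threshold (an assumed convention).
    """
    if not gt:
        return 0.0
    kept = proposals[:top_k]
    hits = sum(1 for g in gt if any(iou(g, p) >= threshold for p in kept))
    return hits / len(gt)


# Toy data (hypothetical): two objects and three ranked proposals.
gt_boxes = [(0, 0, 10, 10), (20, 20, 40, 40)]
ranked = [(0, 0, 10, 8), (28, 20, 40, 40), (100, 100, 110, 110)]

r50 = recall_at(gt_boxes, ranked, threshold=0.5, top_k=1000)
r70 = recall_at(gt_boxes, ranked, threshold=0.7, top_k=1000)
```

In this toy case the first object is covered with overlap 0.8 and the second with overlap 0.6, so recall is 1.0 at threshold 0.5 and 0.5 at threshold 0.7. This mirrors why the reported recall drops as the overlap threshold tightens from 0.5 to 0.7.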



Author information

Authors and Affiliations

School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, 430074, China
Xiang Bai, Zheng Zhang & Hong-Yang Wang
Key Laboratory of Specialty Fiber Optics and Optical Access Networks, Shanghai University, Shanghai, 200444, China
Wei Shen


Corresponding author

Correspondence to Wei Shen.


About this article

Cite this article

Bai, X., Zhang, Z., Wang, HY. et al. Directional Edge Boxes: Exploiting Inner Normal Direction Cues for Effective Object Proposal Generation. J. Comput. Sci. Technol. 32, 701–713 (2017). https://doi.org/10.1007/s11390-017-1752-9


Received: 16 December 2016
Revised: 15 May 2017
Published: 14 July 2017
Issue Date: July 2017
DOI: https://doi.org/10.1007/s11390-017-1752-9
