skip to main content
10.1145/3357000.3366142acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
research-article

Small Traffic Sign Detection Through Selective Feature Fusion Based Faster R-CNN With Arc-Softmax Loss

Published: 05 November 2019 Publication History

Abstract

Traffic signs are basic and important elements in maps. They are related to traffic regulations, profoundly affecting/managing the travel mode of human beings and efficiency of vehicle running. Traffic sign mining technology is applied in many research fields such as traditional map update, high-precision map establishment and automatic driving. Image based traffic sign identification technology has the advantages of low cost and high efficiency over manual processing mode, and traffic sign detection has thus become a significant task with the pacing advancement of autonomous driving. However, many common object detection methods cannot be directly applied to this task, as the size of traffic signs are very small yet they vary considerably. Due to such characteristics, features of traffic signs are difficult to capture, and are harder to discriminate between classes. To address this problem, we proposed a selective feature fusion based Faster R-CNN with Arc-Softmax loss, which optimizes the detection performance from the two following ways: network structure and loss function. We discover that each Faster R-CNN layer is only capable of detecting targets within a certain size range. By carefully selecting and combining different layers' feature maps, we can extract features that effectively represent traffic signs of various sizes. Then, Arc-Softmax loss penalizes the angular distances between the feature vectors of different signs, and their corresponding weight vectors of the last fully connected layers, thereby encouraging intra-class compactness and inter-class separability between learned features. Extensive analysis and experiments on the challenging Tsinghua-Tencent 100K benchmark demonstrate the superiority and implementation simplicity of our proposed method. Code will be made publicly available.

References

[1]
Martín Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Gregory S. Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Ian J. Goodfellow, Andrew Harp, Geoffrey Irving, Michael Isard, Yangqing Jia, Rafal Józefowicz, Lukasz Kaiser, Manjunath Kudlur, Josh Levenberg, Dan Mané, Rajat Monga, Sherry Moore, Derek Gordon Murray, Chris Olah, Mike Schuster, Jonathon Shlens, Benoit Steiner, Ilya Sutskever, Kunal Talwar, Paul A. Tucker, Vincent Vanhoucke, Vijay Vasudevan, Fernanda B. Viégas, Oriol Vinyals, Pete Warden, Martin Wattenberg, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng. 2016. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems. CoRR abs/1603.04467 (2016). arXiv:1603.04467 http://arxiv.org/abs/1603.04467
[2]
Sean Bell, C. Lawrence Zitnick, Kavita Bala, and Ross B. Girshick. 2015. Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks. CoRR abs/1512.04143 (2015). arXiv:1512.04143 http://arxiv.org/abs/1512.04143
[3]
Xiaozhi Chen, Kaustav Kundu, Yukun Zhu, Andrew Berneshawi, Huimin Ma, Sanja Fidler, and Raquel Urtasun. 2015. 3D object proposals for accurate object class detection. In International Conference on Neural Information Processing Systems. 424--432.
[4]
Dumitru Erhan, Christian Szegedy, Alexander Toshev, and Dragomir Anguelov. 2013. Scalable Object Detection using Deep Neural Networks. CoRR abs/1312.2249 (2013). arXiv:1312.2249 http://arxiv.org/abs/1312.2249
[5]
Mark Everingham, Luc Gool, Christopher K Williams, John Winn, and Andrew Zisserman. 2010. The Pascal Visual Object Classes (VOC) Challenge. International Journal of Computer Vision 88, 2 (2010), 303--338.
[6]
Cheng-Yang Fu, Wei Liu, Ananth Ranga, Ambrish Tyagi, and Alexander C. Berg. 2017. DSSD: Deconvolutional Single Shot Detector. CoRR abs/1701.06659 (2017). arXiv:1701.06659 http://arxiv.org/abs/1701.06659
[7]
Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron C. Courville, and Yoshua Bengio. 2014. Generative Adversarial Nets. In Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, December 8-13 2014, Montreal, Quebec, Canada, Zoubin Ghahramani, Max Welling, Corinna Cortes, Neil D. Lawrence, and Kilian Q. Weinberger (Eds.). 2672--2680. http://papers.nips.cc/paper/5423-generative-adversarial-nets
[8]
J. A Hartigan. 1979. A K-Means Clustering Algorithm. Appl Stat 28, 1 (1979), 100--108.
[9]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In IEEE Conference on Computer Vision and Pattern Recognition. 770--778.
[10]
Peiyun Hu and Deva Ramanan. 2016. Finding Tiny Faces. CoRR abs/1612.04402 (2016). arXiv:1612.04402 http://arxiv.org/abs/1612.04402
[11]
Jisoo Jeong, Hyojin Park, and Nojun Kwak. 2017. Enhancement of SSD by concatenating feature maps for object detection. CoRR abs/1705.09587 (2017). arXiv:1705.09587 http://arxiv.org/abs/1705.09587
[12]
A. Krizhevsky and G. Hinton. 2009. Learning multiple layers of features from tiny images. Master's thesis, Department of Computer Science, University of Toronto (2009).
[13]
Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner. 1998. Gradient-based learning applied to document recognition. Proc. IEEE 86, 11 (Nov 1998), 2278--2324. https://doi.org/10.1109/5.726791
[14]
Jianan Li, Xiaodan Liang, Yunchao Wei, Tingfa Xu, Jiashi Feng, and Shuicheng Yan. 2017. Perceptual Generative Adversarial Networks for Small Object Detection. CoRR abs/1706.05274 (2017). arXiv:1706.05274 http://arxiv.org/abs/1706.05274
[15]
Tsung-Yi Lin, Piotr Dollár, Ross B. Girshick, Kaiming He, Bharath Hariharan, and Serge J. Belongie. 2016. Feature Pyramid Networks for Object Detection. CoRR abs/1612.03144 (2016). arXiv:1612.03144 http://arxiv.org/abs/1612.03144
[16]
Tsung-Yi Lin, Priya Goyal, Ross B. Girshick, Kaiming He, and Piotr Dollár. 2017. Focal Loss for Dense Object Detection. CoRR abs/1708.02002 (2017). arXiv:1708.02002 http://arxiv.org/abs/1708.02002
[17]
Tsung-Yi Lin, Michael Maire, Serge J. Belongie, Lubomir D. Bourdev, Ross B. Girshick, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C. Lawrence Zitnick. 2014. Microsoft COCO: Common Objects in Context. CoRR abs/1405.0312 (2014). arXiv:1405.0312 http://arxiv.org/abs/1405.0312
[18]
Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott E. Reed, Cheng-Yang Fu, and Alexander C. Berg. 2015. SSD: Single Shot MultiBox Detector. CoRR abs/1512.02325 (2015). arXiv:1512.02325 http://arxiv.org/abs/1512.02325
[19]
Weiyang Liu, Yandong Wen, Zhiding Yu, and Meng Yang. 2016. Large-Margin Softmax Loss for Convolutional Neural Networks. CoRR abs/1612.02295 (2016). arXiv:1612.02295 http://arxiv.org/abs/1612.02295
[20]
Yu Liu, Hongyang Li, and Xiaogang Wang. 2017. Learning Deep Features via Congenerous Cosine Loss for Person Recognition. CoRR abs/1702.06890 (2017). arXiv:1702.06890 http://arxiv.org/abs/1702.06890
[21]
Zibo Meng, Xiaochuan Fan, Xin Chen, Min Chen, and Yan Tong. 2017. Detecting Small Signs from Large Images. In IEEE International Conference on Information Reuse and Integration.
[22]
Shaoqing Ren, Kaiming He, Ross B. Girshick, and Jian Sun. 2017. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans. Pattern Anal. Mach. Intell. 39, 6 (2017), 1137--1149. https://doi.org/10.1109/TPAMI.2016.2577031
[23]
Ruslan Salakhutdinov and Andriy Mnih. 2007. Probabilistic Matrix Factorization. In Advances in Neural Information Processing Systems 20, Proceedings of the Twenty-First Annual Conference on Neural Information Processing Systems, Vancouver, British Columbia, Canada, December 3-6, 2007, John C. Platt, Daphne Koller, Yoram Singer, and Sam T. Roweis (Eds.). Curran Associates, Inc., 1257--1264. http://papers.nips.cc/paper/3208-probabilistic-matrix-factorization
[24]
Klaus Schoeffmann, Thanarat H. Chalidabhongse, Chong-Wah Ngo, Supavadee Aramvith, Noel E. O'Connor, Yo-Sung Ho, Moncef Gabbouj, and Ahmed Elgammal (Eds.). 2018. MultiMedia Modeling- 24th International Conference, MMM 2018, Bangkok, Thailand, February 5-7, 2018, Proceedings, Part I. Lecture Notes in Computer Science, Vol. 10704. Springer. https://doi.org/10.1007/978-3-319-73603-7
[25]
Florian Schroff, Dmitry Kalenichenko, and James Philbin. 2015. FaceNet: A Unified Embedding for Face Recognition and Clustering. CoRR abs/1503.03832 (2015). arXiv:1503.03832 http://arxiv.org/abs/1503.03832
[26]
Pierre Sermanet, David Eigen, Xiang Zhang, Michael Mathieu, Rob Fergus, and Yann Lecun. 2013. OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks. Eprint Arxiv (2013).
[27]
Abhinav Shrivastava, Rahul Sukthankar, Jitendra Malik, and Abhinav Gupta. 2016. Beyond Skip Connections: Top-Down Modulation for Object Detection. CoRR abs/1612.06851 (2016). arXiv:1612.06851 http://arxiv.org/abs/1612.06851
[28]
Karen Simonyan and Andrew Zisserman. 2014. Very Deep Convolutional Networks for Large-Scale Image Recognition. Computer Science (2014).
[29]
Xu Tang, Daniel K. Du, Zeqiang He, and Jingtuo Liu. 2018. PyramidBox: A Context-assisted Single Shot Face Detector. CoRR abs/1803.07737 (2018). arXiv:1803.07737 http://arxiv.org/abs/1803.07737
[30]
Hao Wang, Yitong Wang, Zheng Zhou, Xing Ji, Zhifeng Li, Dihong Gong, Jingchao Zhou, and Wei Liu. 2018. CosFace: Large Margin Cosine Loss for Deep Face Recognition. CoRR abs/1801.09414 (2018). arXiv:1801.09414 http://arxiv.org/abs/1801.09414
[31]
Yandong Wen, Kaipeng Zhang, Zhifeng Li, and Yu Qiao. 2016. A Discriminative Feature Learning Approach for Deep Face Recognition. In European Conference on Computer Vision. 499--515.
[32]
Fan Yang, Wongun Choi, and Yuanqing Lin. 2016. Exploit All the Layers: Fast and Accurate CNN Object Detector with Scale Dependent Pooling and Cascaded Rejection Classifiers. In Computer Vision and Pattern Recognition. 2129--2137.
[33]
Zhe Zhu, Dun Liang, Song-Hai Zhang, Xiaolei Huang, Baoli Li, and Shi-Min Hu. 2016. Traffic-Sign Detection and Classification in the Wild. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27-30, 2016. IEEE Computer Society, 2110--2118. https://doi.org/10.1109/CVPR.2016.232

Cited By

View all
  • (2020)Object Detection for Automatic Driving Based on Deep Learning2020 International Conference on Computing and Data Science (CDS)10.1109/CDS49703.2020.00065(1-8)Online publication date: Aug-2020

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
IWCTS'19: Proceedings of the 12th ACM SIGSPATIAL International Workshop on Computational Transportation Science
November 2019
89 pages
ISBN:9781450369671
DOI:10.1145/3357000
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 November 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. map updates
  2. object detection
  3. traffic sign

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

SIGSPATIAL '19
Sponsor:

Acceptance Rates

Overall Acceptance Rate 42 of 57 submissions, 74%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)0
Reflects downloads up to 17 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2020)Object Detection for Automatic Driving Based on Deep Learning2020 International Conference on Computing and Data Science (CDS)10.1109/CDS49703.2020.00065(1-8)Online publication date: Aug-2020

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media