Non-pre-trained Mine Pedestrian Detection Based on Automatic Generation of Anchor Box

Wei, Xing; Wang, Changguang; Zhang, Haitao; Liu, Shaofan; Lu, Yang

doi:10.1007/978-3-030-59019-2_14

Non-pre-trained Mine Pedestrian Detection Based on Automatic Generation of Anchor Box

Xing Wei¹¹,
Changguang Wang¹¹,
Haitao Zhang¹¹,
Shaofan Liu¹¹ &
…
Yang Lu¹¹

Conference paper
First Online: 09 September 2020

420 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 12385))

Abstract

Mine pedestrian detection is an important part of computer vision and one of the key technologies of unmanned locomotive. In order to improve the structural adaptability of pedestrian detection network, reduce the workload of pre-training and reduce the risk of “negative migration” brought by migration learning, a non-pre-training underground pedestrian detection network based on anchor box is proposed. Firstly, the network model of mine pedestrian detection is introduced, including non-pretrained backbone network and branch structure detection network. Among them, the non-pre-trained backbone network mainly adds BatchNorm operation to make the gradient more stable and smooth. Anchor location prediction branch and anchor shape prediction branch in the detection network work together to improve the regression accuracy of anchor box. Secondly, the loss function of network training is described and the training parameters are adjusted by weighted loss. Finally, the experimental results based on the video of Taoyuan and Xinji mine in Anhui province are given. The experimental data show that the proposed algorithm can still maintain 96.3% AP at a real-time processing rate of 24 FPS. Compared with the RefineDet512, the AP increases by 2.4%.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Lin, T.Y., Dollár, P., Girshick, R., et al.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)
Google Scholar
Krishnamurthy, B., Sarkar, M.: Deep-learning network architecture for object detection. US Patent 10,152,655, 11 Dec 2018
Google Scholar
Zhu, R., Zhang, S., Wang, X., et al.: ScratchDet: training single-shot object detectors from scratch. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2268–2277 (2019)
Google Scholar
Anantaram, C., Kopparapu, S.K., Patel, C.R., et al.: Systems and methods for automatic repair of speech recognition engine output using a sliding window mechanism. US Patent 10,410,622, 10 September 2019
Google Scholar
Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift. arXiv: Learning (2015)
Ren, S., He, K., Girshick, R., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Neural Information Processing Systems, pp. 91–99 (2015)
Google Scholar
He, K., Zhang, X., Ren, S., et al.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Google Scholar
Redmon, J., Divvala, S.K., Girshick, R., et al.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)
Google Scholar
Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
Chapter Google Scholar
Lin, T., Goyal, P., Girshick, R., et al.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2999–3007 (2017)
Google Scholar
Zhou, X., Wang, D., Krahenbuhl, P., et al.: Objects as points. Computer Vision and Pattern Recognition. arXiv (2019)
Google Scholar
Zhao, Q., Sheng, T., Wang, Y., et al.: M2Det: a single-shot object detector based on multi-level feature pyramid network. In: National Conference on Artificial Intelligence, vol. 33, no. 01, pp. 9259–9266 (2019)
Google Scholar

Download references

Acknowledgments

This work was supported by Anhui Provincial Key R&D Program (201904d08020040) and National Key R&D Program of China (2018F YC0604404).

Author information

Authors and Affiliations

Hefei University of Technology, Hefei, China
Xing Wei, Changguang Wang, Haitao Zhang, Shaofan Liu & Yang Lu

Authors

Xing Wei
View author publications
You can also search for this author in PubMed Google Scholar
Changguang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Haitao Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Shaofan Liu
View author publications
You can also search for this author in PubMed Google Scholar
Yang Lu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Changguang Wang .

Editor information

Editors and Affiliations

Shandong University, Qingdao, China
Dongxiao Yu
TU Berlin, Berlin, Germany
Falko Dressler
Qilu University of Technology, Jinan, China
Jiguo Yu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wei, X., Wang, C., Zhang, H., Liu, S., Lu, Y. (2020). Non-pre-trained Mine Pedestrian Detection Based on Automatic Generation of Anchor Box. In: Yu, D., Dressler, F., Yu, J. (eds) Wireless Algorithms, Systems, and Applications. WASA 2020. Lecture Notes in Computer Science(), vol 12385. Springer, Cham. https://doi.org/10.1007/978-3-030-59019-2_14

Download citation

DOI: https://doi.org/10.1007/978-3-030-59019-2_14
Published: 09 September 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-59018-5
Online ISBN: 978-3-030-59019-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics