Abstract
Clearance intrusion of foreign objects is a great threat to high-speed railway operation safety. An accurate and fast detection system of foreign object intrusion is important. In this paper, a multi-scale foreign object detection algorithm named feature fusion CenterNet with variable focus multi-scale augmentation (FFCN-VFMS) is proposed for the high-speed railway clearance scene. We render the ground truth with two-dimensional Gaussian distributions to generate the confidence score of each region in the image. In addition, a variable focus multi-scale augmentation (VFMS) method is proposed for multi-scale object detection, which takes detection results as prior knowledge to find the range of subsequent detection that contains most small objects. Moreover, feature fusion CenterNet (FFCN) adopts bidirectional iterative deep aggregation (BiIDA) to fuse the features in different convolutional layers and a spatial pyramid pooling (SPP) module to fuse feature maps extracted by different receptive fields. Our method was tested on public PASCAL VOC2007 datasets and our railway clearance intrusion (RCI) datasets. In comparison with related methods, FFCN-VFMS achieves better performance than comparison detectors with respect to accuracy and speed.
Similar content being viewed by others
References
Wisultschew C, Mujica G, Lanza-Gutierrez JM, Portilla J (2021) 3D-LIDAR based object detection and tracking on the edge of IoT for railway level crossing. IEEE Access 9:35718–35729
Catalano A, Bruno FA, Galliano C, Pisco M, Persiano GV, Cutolo A, Cusano A (2017) An optical fiber intrusion detection system for railway security. Sensors and Actuators A-Physical 253:91– 100
Li Z, Zhang J, Wang M, Chai J, Wu Y, Peng F (2020) An anti-noise ϕ-OTDR based distributed acoustic sensing system for high-speed railway intrusion detection. Laser Phys 30 (8):085103
Xie Z, Qin Y (2019) High-speed railway perimeter intrusion detection approach based on internet of things. Advances in Mechanical Engineering 11(2):1687814018821511
Wang Y, Yu Z, Zhu L (2018) Foreground detection with deeply learned multi-scale spatial-temporal features. Sensors 18(4269):1–16
Tastimur C, Karakose M, Akin E (2017) Image processing based level crossing detection and foreign objects recognition approach in railways. International Journal of Applied Mathematics Electronics and Computers 2017 Special Issue(1), pp 19–23
Guo B, Xingfang Z, Yingzi L, Liqiang Z, Zujun Y (2018) Novel registration and fusion algorithm for multimodal railway images with different field of views. J Adv Trans 2018(7836169):1–16
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444
Guo Y, Liu Y, Oerlemans A, Lao S, Wu S, Lew MS (2016) Deep learning for visual understanding: A review. Neurocomputing 187:27–48
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: 26th Advances in neural information processing systems (NIPS 2012), pp 1106–1114. Lake Tahoe, Nevada, USA
Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: 3rd International conference on learning representations (ICLR 2015), pp 1–14. San Diego, CA, USA
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR 2016), pp 770–778. Las Vegas, Nevada, USA
Huang G, Liu Z, Maaten LVD, Weinberger KQ (2017) Densely connected convolutional networks. In: 2017 IEEE conference on computer vision and pattern recognition (CVPR), pp 2261–2269. Honolulu, HI, USA
Tan M, Le Q (2019) Efficientnet: Rethinking model scaling for convolutional neural networks. In: Proceedings of the 36th international conference on machine learning, pp 6105–6114. Long Beach, California, USA
Sun C, Ai Y, Wang S, Zhang W (2020) Mask-guided SSD for small-object detection. Appl Intell
Wang Y, Zhu L, Yu Z (2019) Foreground detection for infrared videos with multiscale 3-D fully convolutional network. IEEE Geosci Remote Sens Lett 16(5):712–716
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR 2014), pp 580–587. Columbus, OH, USA
He K, Zhang X, Ren S, Sun J (2015) Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 37(9):1904–1916
Girshick RB (2015) Fast R-CNN. In: Proceedings of the IEEE international conference on computer vision (ICCV 2015), pp 1440–1448. Santiago, Chile
Ren S, He K, Girshick R, Sun J (2017) Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence 39(6):1137–1149
He K, Gkioxari G, Dollar P, Girshick R (2020) Mask R-CNN. IEEE Transactions on Pattern Analysis and Machine Intelligence 42(2):386–397
Cai Z, Vasconcelos N (2018) Cascade R-CNN: Delving into high quality object detection. In: Proceedings of the 2018 IEEE/CVF conference on computer vision and pattern recognition, pp 6154–6162. Salt Lake City, UT, USA
Redmon J, Divvala SK, Girshick RB, Farhadi A (2016) You only look once: Unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR 2016), pp 779–788. Las Vegas, NV, USA
Redmon J, Farhadi A (2017) YOLO9000: better, faster, stronger. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR 2017), pp 6517–6525. Honolulu, HI, USA
Redmon J, Farhadi A (2018) YOLOv3: An incremental improvement. arXiv:1804.02767
Liu W, Anguelov D, Erhan D, Szegedy C, Reed SE, Fu C-Y, Berg AC (2016) SSD: single shot multibox detector. In: Proceedings of the European conference on computer vision (ECCV 2016), pp 21–37
Fu C-Y, Liu W, Ranga A, Tyagi A, Berg AC (2017) DSSD : Deconvolutional single shot detector. arXiv:1701.06659
Zhang S, Wen L, Bian X, Lei Z, Li SZ (2018) Single-shot refinement neural network for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR 2018), pp 4203–4212. Salt Lake City, UT, USA
Tian Z, Shen C, Chen H, He T (2019) FCOS: Fully convolutional one-stage object detection. In: 2019 IEEE/CVF international conference on computer vision (ICCV), pp 9627–9636. Seoul, Korea (South)
Law H, Deng J (2020) Cornernet: Detecting objects as paired keypoints. Int J Comput Vis 128(3):642–656
Duan K, Bai S, Xie L, Qi H, Huang Q, Tian Q (2019) CenterNet: Keypoint triplets for object detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV 2019), pp 6569–6578. Seoul, Korea
Zhou X, Wang D, Krähenbühl P (2019) Objects as points. arXiv:1904.07850
Lin TY, Goyal P, Girshick R, He K, Dollar P (2020) Focal loss for dense object detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 42(2):318–327
Zhou Y, Tao X, Yu Z, Fujita H (2019) Train-movement situation recognition for safety justification using moving-horizon TBM-based multisensor data fusion. Knowl-Based Syst 177:117– 126
Li Z, Zhang J, Wang M, Zhong Y, Peng F (2020) Fiber distributed acoustic sensing using convolutional long short-term memory network: a field test on high-speed railway intrusion detection. Opt Express 28(3):2925–2938
Zhangyu W, Guizhen Y, Xinkai W, Haoran L, Da L (2021) A camera and lidar data fusion method for railway object detection. IEEE Sensors J, pp 1–13
Gao P, Zhang Q, Wang F, Xiao L, Fujita H, Zhang Y (2020) Learning reinforced attentional representation for end-to-end visual tracking. Inf Sci 517:52–67
Gao P, Yuan R, Wang F, Xiao L, Fujita H, Zhang Y (2020) Siamese attentional keypoint network for high performance visual tracking. Knowl-Based Syst 193:105448
Yang W, Liqiang Z, Zujun Y, Baoqing G (2016) High-speed railway clearance surveillance system based on convolutional neural networks. In: Proceedings of the 8th international conference on digital image processing (ICDIP 2016), pp 1–6. Chengu, China
Huang H, Liang L, Zhao G, Yang Y, Ou K (2019) Railway clearance intrusion detection in aerial video based on convolutional neural network. In: Proceedings of the Chinese control and decision conference (CCDC 2019), pp 1644–1648. Nanchang, China
Wang W, Zhu L, Guo B (2019) Reliable identification of redundant kernels for convolutional neural network compression. J Vis Commun Image Represent 63(102582):1–12
Yan X, Huiqing T, Lili HU (2020) Railway foreign body intrusion detection based on faster R-CNN network model. J China Railway Society 42(5):91–98
Xiaoying Y, Hongsheng S, Ze J, Yu D (2020) Detection method of railway intruding obstacle based on YOLO algorithm. J Lanzhou Jiaotong University 39(2):37–42
Guo B, Shi J, Zhu L, Yu Z (2019) High-speed railway clearance intrusion detection with improved SSD network. Appl Sci 9(15):1–20
Li Y, Dong H, Li H, Zhang X, Zhang B, Xiao Z (2020) Multi-block SSD based on small object detection for UAV railway scene surveillance. Chin J Aeronaut 33(6):1747–1755
Lin TY, Dollar P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR 2017), pp 936–944. Honolulu, HI, USA
Liu S, Qi L, Qin H, Shi J, Jia J (2018) Path aggregation network for instance segmentation. In: 2018 IEEE/CVF conference on computer vision and pattern recognition (CVPR 2018), pp 8759–8768. Salt Lake City, UT, USA
Tan M, Pang R, Le QV (2020) Efficientdet: Scalable and efficient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR 2020), pp 10781–10790. Seattle, WA, USA
Yu F, Wang D, Shelhamer E, Darrell T (2018) Deep layer aggregation. In: 2018 IEEE/CVF conference on computer vision and pattern recognition, pp 2403–2412. Salt Lake City, UT, USA
Zhao H, Shi J, Qi X, Wang X, Jia J (2017) Pyramid scene parsing network. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR 2017), pp 6230–6239. Honolulu, HI, USA
Huang Z, Wang J, Fu X, Yu T, Guo Y, Wang R (2020) DC-SPP-YOLO: Dense connection and spatial pyramid pooling based YOLO for object detection. Inf Sci 522:241–258
Dai J, Qi H, Xiong Y, Li Y, Zhang G, Hu H, Wei Y (2017) Deformable convolutional networks. In: Proceedings of the IEEE international conference on computer vision (ICCV 2017), pp 764–773
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interests
The authors declare that they have no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This work was supported by the National Natural Science Foundation of China under Grant 62076022.
Rights and permissions
About this article
Cite this article
Tian, R., Shi, H., Guo, B. et al. Multi-scale object detection for high-speed railway clearance intrusion. Appl Intell 52, 3511–3526 (2022). https://doi.org/10.1007/s10489-021-02534-9
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-021-02534-9