AFTD-Net: real-time anchor-free detection network of threat objects for X-ray baggage screening

Wei, Yiru; Zhu, Zhiliang; Yu, Hai; Zhang, Wei

doi:10.1007/s11554-021-01136-5

AFTD-Net: real-time anchor-free detection network of threat objects for X-ray baggage screening

Special Issue Paper
Published: 31 May 2021

Volume 18, pages 1343–1356, (2021)
Cite this article

Journal of Real-Time Image Processing Aims and scope Submit manuscript

Yiru Wei¹,
Zhiliang Zhu¹,
Hai Yu¹ &
…
Wei Zhang¹

309 Accesses
5 Citations
Explore all metrics

Abstract

X-ray baggage screening is a vitally important task to detect all kinds of threat objects at controlled access positions, which can prevent crime and guard personal safety. It is generally performed by screeners to visually determine whether a bag contains threat objects. However, manual detection shows several limitations, from the detection errors to different detection results produced by different screeners. To address these issues, several automated detection approaches have been proposed; nevertheless, none of the methods can achieve end-to-end detection and the results have only classification information without positional information. In this paper, we propose a real-time anchor-free detector of threat objects that can recognize threat objects without using pre-designed anchor boxes. We employ a lightweight but strong backbone network: MobileNetV2 to extract the multi-level information. The backbone network is followed by a deformation layer which aims at handling the nonrigid deformation of threat objects in X-ray images. To further strengthen the proposed network, we design a context enhancement module to aggregate the multi-scale features and generate the enhanced features. We name the network as anchor-free detection network of threat objects (AFTD-Net). We demonstrate the effectiveness of the proposed method against other object detection algorithms on the GDXray database. Our AFTD-Net is a fully convolutional network which does not need any pre-designed anchors and achieves a real-time computation speed of 44.8 FPS.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An automated detection model of threat objects for X-ray baggage inspection based on depthwise separable convolution

Article 02 January 2021

MC-CDPNet: Multi-Channel Correlated Detail Preserving Network for X-Ray-Based Baggage Screening

Article 25 May 2023

Detection of threat objects in baggage inspection with X-ray images using deep learning

Article 19 November 2020

References

Gaus, Y.F.A., Bhowmik, N., Breckon, T.P.: On the use of deep learning for the detection of firearms in X-ray baggage security imagery. In: 2019 IEEE International Symposium on Technologies for Homeland Security (HST), pp. 1–7 (2019). https://doi.org/10.1109/HST47167.2019.9032917
Bayat, A., Nand, A.K., Koh, D.H., Pereira, M., Pomplun, M.: Scene grammar in human and machine recognition of objects and scenes. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 2073–20737 (2018). https://doi.org/10.1109/CVPRW.2018.00268
Khazaie, V.R., AkhavanPour, A., Ebrahimpour, R.: Occluded visual object recognition using deep conditional generative adversarial nets and feedforward convolutional neural networks. In: 2020 International Conference on Machine Vision and Image Processing (MVIP), pp. 1–6 (2020). https://doi.org/10.1109/MVIP49855.2020.9116887
Tang, S., Roberts, D., Golparvar-Fard, M.: Human-object interaction recognition for automatic construction site safety inspection. Autom. Constr. 120, 103356 (2020). https://doi.org/10.1016/j.autcon.2020.103356
Article Google Scholar
Bicanski, A., Burgess, N.: A computational model of visual recognition memory via grid cells. Curr. Biol. 29(6), 979-990.e4 (2019). https://doi.org/10.1016/j.cub.2019.01.077
Article Google Scholar
Michel, S., Ruiter, J.C.D., Hogervorst, M., Koller, S.M., Schwaninger, A.: Computer-based training increases efficiency in X-ray image interpretation by aviation security screeners. In: IEEE International Carnahan Conference on Security Technology, pp. 201–206 (2007)
Riffo, V., Mery, D.: Automated detection of threat objects using adapted implicit shape model. IEEE Trans. Syst. Man Cybern. Syst. 46(4), 472–482 (2017)
Article Google Scholar
Mery, D., Svec, E., Arias, M.: Object recognition in baggage inspection using adaptive sparse representations of X-ray images. In: Pacific-rim Symposium on Image and Video Technology, pp. 709–720 (2015)
Uroukov, I., Speller, R.: A preliminary approach to intelligent X-ray imaging for baggage inspection at airports. Signal Process. Res. 4(5), 1–11 (2015)
Article Google Scholar
Turcsany, D., Mouton, A., Breckon, T.P.: Improving feature-based object recognition for X-ray baggage security screening using primed visual words. In: IEEE International Conference on Industrial Technology, pp. 1140–1145 (2013)
Bastan, M., Yousefi, M.R., Breuel, T.M.: Visual Words on Baggage X-ray Images. Springer (2011)
Google Scholar
Franzel, T., Schmidt, U., Roth, S.: Object Detection in Multi-view X-Ray Images. Springer (2012)
Book Google Scholar
Flitton, G., Breckon, T.P., Megherbi, N.: A comparison of 3D interest point descriptors with application to airport baggage object detection in complex CT imagery. Pattern Recognit. 46(9), 2420–2436 (2013)
Article Google Scholar
Megherbi, N., Han, J., Breckon, T.P., Flitton, G.T.: A comparison of classification approaches for threat detection in CT based baggage screening. In: IEEE International Conference on Image Processing, pp. 3109–3112 (2013)
Mery, D., Svec, E., Arias, M., Riffo, V., Saavedra, J.M., Banerjee, S.: Modern computer vision techniques for X-ray testing in baggage inspection. IEEE Trans. Syst. Man Cybern. Syst. 47(4), 682–692 (2017)
Article Google Scholar
Akcay, S., Kundegorski, M.E., Willcocks, C.G., Breckon, T.P.: Using deep convolutional neural network architectures for object classification and detection within x-ray baggage security imagery. IEEE Trans. Inf. Forensics Secur. 13(9), 2203–2215 (2018). https://doi.org/10.1109/TIFS.2018.2812196
Article Google Scholar
Mery, D., Riffo, V., Zscherpel, U., Mondragón, G., Lillo, I., Zuccar, I., Lobel, H., Carrasco, M.: GDXray: the database of X-ray images for nondestructive testing. J. Nondestruct. Eval. 34(4), 42 (2015)
Article Google Scholar
Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement. arXiv:1804.02767 (2018)
Law, H., Deng, J.: CornerNet: detecting objects as paired keypoints. Int. J. Comput. Vis. 128, 642–656 (2020)
Article Google Scholar
Law, H., Teng, Y., Russakovsky, O., Deng, J.: CornerNet-Lite: efficient keypoint based object detection. arXiv:1904.08900 (2019)
Duan, K., Bai, S., Xie, L., Qi, H., Huang, Q., Tian, Q.: CenterNet: keypoint triplets for object detection. arXiv:1904.08189 (2019)
Newell, A., Yang, K., Deng, J.: Stacked hourglass networks for human pose estimation. In: European Conference on Computer Vision, pp. 483–499 (2016)
Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., Chen, L.: Inverted residuals and linear bottlenecks: mobile networks for classification, detection and segmentation. arXiv:1801.04381v2 (2018)
Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)
Jaderberg, M., Simonyan, K., Zisserman, A., Kavukcuoglu, K.: Spatial transformer networks. In: Advances in Neural Information Precessing Systems, pp. 2017–2025 (2015)
Ma, N., Zhang, X., Zheng, H.T., Sun, J.: ShuffleNet V2: practical guidelines for efficient CNN architecture design. In: European Conference on Computer Vision, pp. 122–138 (2018)
Iandola, F.N., Han, S., Moskewicz, M.W., Ashraf, K., Dally, W.J., Keutzer, K.: SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size. arXiv:1602.07360 (2016)
Girshick, R.: Fast R-CNN. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1440–1448 (2015)
Bodla, N., Singh, B., Chellappa, R., Davis, L.S.: Soft-NMS-improving object detection with one line of code. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 5561–5569 (2017)
Paszke, A., Gross, S., Chintala, S., Chanan, G., Yang, E., DeVito, Z., Lin, Z., Desmaison, A., Antiga, L., Lerer, A.: Automatic differentiation in pytorch. In: 31st Conference on Neural Information Processing Systems (NIPS) (2017)
Kingma, D., Ba, J.: Adam: A method for stochastic optimization. arXiv:1412.6980 (2014)
Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein, M.: Imagenet large scale visual recognition challenge. Int. J. Comput. Vis. 115(3), 211–252 (2014)
Article MathSciNet Google Scholar

Download references

Acknowledgements

This research was supported by the National Natural Science Foundation of China (Grant nos. 61977014, 61902056, 61603082), the Fundamental Research Funds for the Central Universities (Grant nos. N2017011, N2017016).

Author information

Authors and Affiliations

Software College, Northeastern University, Shenyang, 110819, China
Yiru Wei, Zhiliang Zhu, Hai Yu & Wei Zhang

Authors

Yiru Wei
View author publications
You can also search for this author in PubMed Google Scholar
Zhiliang Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Hai Yu
View author publications
You can also search for this author in PubMed Google Scholar
Wei Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhiliang Zhu.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wei, Y., Zhu, Z., Yu, H. et al. AFTD-Net: real-time anchor-free detection network of threat objects for X-ray baggage screening. J Real-Time Image Proc 18, 1343–1356 (2021). https://doi.org/10.1007/s11554-021-01136-5

Download citation

Received: 09 July 2020
Accepted: 18 May 2021
Published: 31 May 2021
Issue Date: August 2021
DOI: https://doi.org/10.1007/s11554-021-01136-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

AFTD-Net: real-time anchor-free detection network of threat objects for X-ray baggage screening

Abstract

Access this article

Similar content being viewed by others

An automated detection model of threat objects for X-ray baggage inspection based on depthwise separable convolution

MC-CDPNet: Multi-Channel Correlated Detail Preserving Network for X-Ray-Based Baggage Screening

Detection of threat objects in baggage inspection with X-ray images using deep learning

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

AFTD-Net: real-time anchor-free detection network of threat objects for X-ray baggage screening

Abstract

Access this article

Similar content being viewed by others

An automated detection model of threat objects for X-ray baggage inspection based on depthwise separable convolution

MC-CDPNet: Multi-Channel Correlated Detail Preserving Network for X-Ray-Based Baggage Screening

Detection of threat objects in baggage inspection with X-ray images using deep learning

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation