Skip to main content
Log in

Dangerous goods detection based on transfer learning in X-ray images

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

Computer vision technology is used to analyze X-ray images and detect dangerous goods in the process of logistics and express delivery. It is a security technology which can reduce labor strength and improve working efficiency. At present, there are many excellent detection models and methods in the field of object detection for visible light images, such as R-CNN, Fast R-CNN, Faster R-CNN, YOLO, SSD. These deep neural network-based detection methods achieved excellent performance on ImageNet. The training of object detection models on X-ray image datasets for dangerous goods detection is the focus of research in the field. Due to practical reasons, it is difficult to collect a comprehensive image dataset of dangerous goods (positive samples). In order to overcome this problem, this paper uses a multi-task transfer learning method on the basis of classification task and location search task on SSD network. The research in this paper focuses on adding additional convolutional layers in the SSD network to re-learn the knowledge of the model learned from the source domain. Experiments show that compared with the traditional method of fine-tuning, this method has better transfer learning ability on SSD network. This method was used to perform experiments in SSD300 on the image datasets screened from GDXray and achieved a mean average precision (mAP) of 0.915.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Notes

  1. GDXray is the GRIMA X-ray database, published by the Machine Intelligence Group at the Department of Computer Science of the Pontificia Universidad Catolica de Chile on http://dmery.ing.puc.cl/index.php/material/gdxray/.

  2. For the convenience of comparison, we are labeled it as SSD300 * in Tables 5 and 6.

References

  1. Agrawal P, Carreira J, Malik J (2015) Learning to see by moving. In: 2015 IEEE international conference on computer vision (ICCV), pp 37–45

  2. Akcay S, Kundegorski ME, Willcocks CG, Breckon TP (2018) Using deep convolutional neural network architectures for object classification and detection within X-ray baggage security imagery. IEEE Trans Inf Forensics Secur 13(9):2203–2215

    Google Scholar 

  3. Bay H, Ess A, Tuytelaars T, Gool LJV (2008) Speeded-up robust features (surf). Comput Vis Image Underst 110(3):346–359

    Google Scholar 

  4. Belongie S, Malik J, Puzicha J (2001) Shape context: a new descriptor for shape matching and object recognition. In: Advances in neural information processing systems 13: proceedings of the 2000 conference, pp 831–837

  5. Blalock G, Kadiyali V, Simon DH (2007) The impact of post-9/11 airport security measures on the demand for air travel. J Law Econ 50(4):731–755

    Google Scholar 

  6. Everingham M, Gool LJV, Williams CKI, Winn JM, Zisserman A (2010) The Pascal visual object classes (VOC) challenge. Int J Comput Vis 88(2):303–338

    Google Scholar 

  7. Farfade SS, Saberian MJ, Li LJ (2015) Multi-view face detection using deep convolutional neural networks. In: ACM on international conference on multimedia retrieval, pp 643–650

  8. Felzenszwalb PF, Girshick RB, Mcallester DA, Ramanan D (2010) Object detection with discriminatively trained part-based models. IEEE Trans Pattern Anal Mach Intell 32(9):1627–1645

    Google Scholar 

  9. Franzel T, Schmidt U, Roth S (2012) Object detection in multi-view X-ray images. In: Pinz A, Pock T, Bischof H, Leberl F (eds) DAGM/OAGM 2012. LNCS, vol 7476. Springer, Heidelberg, pp 144–154

    Google Scholar 

  10. Girshick RB, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: CVPR ’14 proceedings of the 2014 IEEE conference on computer vision and pattern recognition, pp 580–587

  11. Hay GA (1978) X-ray imaging. J Phys E 11(5):377–385

    Google Scholar 

  12. Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science 313(5786):504–507

    MathSciNet  MATH  Google Scholar 

  13. Huang G, Liu Z, Der Maaten LV, Weinberger KQ (2017) Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4700–4708

  14. Kazlauciunas A (2001) Digital imaging: theory and application. Part I: theory. Surf Coat Int Part B Coat Trans 84(1):1–9

    Google Scholar 

  15. Khotanzad A, Hong YH (1990) Invariant image recognition by Zernike moments. IEEE Trans Pattern Anal Mach Intell 12(5):489–497

    Google Scholar 

  16. Kingma DP, Ba J (2015) Adam: a method for stochastic optimization. In: International conference on learning representations

  17. Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: International conference on neural information processing systems, pp 1097–1105

  18. Lee SI, Chatalbashev V, Vickrey D, Koller D (2007) Learning a meta-level prior for feature relevance from multiple related tasks. In: Proceedings of the 24th international conference on Machine learning, pp 489–496

  19. Li FF, Fergus R, Perona P (2007) Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories. Comput Vis Image Underst 106(1):59–70

    Google Scholar 

  20. Lin TY, Maire M, Belongie SJ, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL (2014) Microsoft coco: Common objects in context. In: European conference on computer vision, pp 740–755

  21. Liu W, Anguelov D, Erhan D, Szegedy C, Reed SE, Fu CY, Berg AC (2016) Ssd: Single shot multibox detector. In: European conference on computer vision, pp 21–37

  22. Mery D (2013) X-ray testing by computer vision. In: 2013 IEEE conference on computer vision and pattern recognition workshops, pp 360–367

  23. Mery D (2014) Computer vision technology for X-ray testing. Insight 56(3):147–155

    Google Scholar 

  24. Mery D, Riffo V (2014) Automated object recognition using multiple X-ray views. Mater Eval 72(11):1362–1372

    Google Scholar 

  25. Mery D, Riffo V, Zscherpel U, Mondragón G, Lillo I, Zuccar I, Lobel H, Carrasco M (2015) Gdxray: the database of X-ray images for nondestructive testing. J Nondestr Eval 34(4):42

    Google Scholar 

  26. Mery D, Svec E, Arias M (2015) Object recognition in baggage inspection using adaptive sparse representations of X-ray images. In: Pacific-rim symposium on image and video technology, pp 709–720

  27. Mery D, Svec E, Arias M, Riffo V, Saavedra JM, Banerjee S (2017) Modern computer vision techniques for X-ray testing in baggage inspection. IEEE Trans Syst Man Cybern Syst 47(4):682–692

    Google Scholar 

  28. Oquab M, Bottou L, Laptev I, Sivic J (2014) Learning and transferring mid-level image representations using convolutional neural networks. In: CVPR ’14 proceedings of the 2014 IEEE conference on computer vision and pattern recognition, pp 1717–1724

  29. Pan SJ, Yang Q (2010) A survey on transfer learning. IEEE Trans Knowl Data Eng 22(10):1345–1359

    Google Scholar 

  30. Redmon J, Divvala SK, Girshick RB, Farhadi A (2016) You only look once: Unified, real-time object detection. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR), pp 779–788

  31. Redmon J, Farhadi A (2017) Yolo9000: better, faster, stronger. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7263–7271

  32. Ren S, He K, Girshick RB, Sun J (2017) Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137–1149

    Google Scholar 

  33. Riffo V, Mery D (2017) Automated detection of threat objects using adapted implicit shape model. IEEE Trans Syst Man Cybern Syst 46(4):472–482

    Google Scholar 

  34. Roomi MM (2012) Detection of concealed weapons in X-ray images using fuzzy k-NN. Int J Comput Sci Eng Inf 2(2):187–196

    MathSciNet  Google Scholar 

  35. Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M, Berg AC, Fei-Fei L (2015) Imagenet large scale visual recognition challenge. Int J Comput Vis 115(3):211–252

    MathSciNet  Google Scholar 

  36. Rusu AA, Rabinowitz NC, Desjardins G, Soyer H, Kirkpatrick J, Kavukcuoglu K, Pascanu R, Hadsell R (2016) Progressive neural networks. arXiv:1606.04671

  37. Shetty S (2016) Application of convolutional neural network for image classification on pascal VOC challenge 2012 dataset. arXiv:1607.03785

  38. Szegedy C, Liu W, Jia Y, Sermanet P, Reed SE, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: 2015 IEEE conference on computer vision and pattern recognition (CVPR), pp 1–9

  39. Tulsiani S, Carreira J, Malik J (2015) Pose induction for novel object categories. In: 2015 IEEE international conference on computer vision (ICCV), pp 64–72

  40. Turcsany D, Mouton A, Breckon TP (2013) Improving feature-based object recognition for X-ray baggage security screening using primed visualwords. In: IEEE international conference on industrial technology, pp 1140–1145

  41. Uroukov I, Speller R (2015) A preliminary approach to intelligent X-ray imaging for baggage inspection at airports. Signal Process Res 4(5):1–11

    Google Scholar 

  42. Vardhan PH, Priyadarsini PSU (2016) Transfer learning using convolutional neural networks for object classification within X-ray baggage security imagery. Res J Pharm Biol Chem Sci 7:222–229

    Google Scholar 

  43. Yosinski J, Clune J, Bengio Y, Lipson H (2014) How transferable are features in deep neural networks? In: Advances in neural information processing systems, pp 3320–3328

  44. Zhang H, Cao X, Ho JKL, Chow TWS (2017) Object-level video advertising: an optimization framework. IEEE Trans Ind Inform 13(2):520–531

    Google Scholar 

  45. Zhang H, Ji Y, Huang W, Liu L (2018) Sitcom-star-based clothing retrieval for video advertising: a deep learning framework. Neural Comput Appl. https://doi.org/10.1007/s00521-018-3579-x

    Google Scholar 

  46. Zhang N, Zhu J (2015) A study of X-ray machine image local semantic features extraction model based on bag-of-words for airport security. Int J Smart Sens Intell Syst 8(1):45–64

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yuanxi Wei.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wei, Y., Liu, X. Dangerous goods detection based on transfer learning in X-ray images. Neural Comput & Applic 32, 8711–8724 (2020). https://doi.org/10.1007/s00521-019-04360-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-019-04360-0

Keywords

Navigation