Abstract
Computer vision technology is used to analyze X-ray images and detect dangerous goods in the process of logistics and express delivery. It is a security technology which can reduce labor strength and improve working efficiency. At present, there are many excellent detection models and methods in the field of object detection for visible light images, such as R-CNN, Fast R-CNN, Faster R-CNN, YOLO, SSD. These deep neural network-based detection methods achieved excellent performance on ImageNet. The training of object detection models on X-ray image datasets for dangerous goods detection is the focus of research in the field. Due to practical reasons, it is difficult to collect a comprehensive image dataset of dangerous goods (positive samples). In order to overcome this problem, this paper uses a multi-task transfer learning method on the basis of classification task and location search task on SSD network. The research in this paper focuses on adding additional convolutional layers in the SSD network to re-learn the knowledge of the model learned from the source domain. Experiments show that compared with the traditional method of fine-tuning, this method has better transfer learning ability on SSD network. This method was used to perform experiments in SSD300 on the image datasets screened from GDXray and achieved a mean average precision (mAP) of 0.915.





Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Notes
GDXray is the GRIMA X-ray database, published by the Machine Intelligence Group at the Department of Computer Science of the Pontificia Universidad Catolica de Chile on http://dmery.ing.puc.cl/index.php/material/gdxray/.
References
Agrawal P, Carreira J, Malik J (2015) Learning to see by moving. In: 2015 IEEE international conference on computer vision (ICCV), pp 37–45
Akcay S, Kundegorski ME, Willcocks CG, Breckon TP (2018) Using deep convolutional neural network architectures for object classification and detection within X-ray baggage security imagery. IEEE Trans Inf Forensics Secur 13(9):2203–2215
Bay H, Ess A, Tuytelaars T, Gool LJV (2008) Speeded-up robust features (surf). Comput Vis Image Underst 110(3):346–359
Belongie S, Malik J, Puzicha J (2001) Shape context: a new descriptor for shape matching and object recognition. In: Advances in neural information processing systems 13: proceedings of the 2000 conference, pp 831–837
Blalock G, Kadiyali V, Simon DH (2007) The impact of post-9/11 airport security measures on the demand for air travel. J Law Econ 50(4):731–755
Everingham M, Gool LJV, Williams CKI, Winn JM, Zisserman A (2010) The Pascal visual object classes (VOC) challenge. Int J Comput Vis 88(2):303–338
Farfade SS, Saberian MJ, Li LJ (2015) Multi-view face detection using deep convolutional neural networks. In: ACM on international conference on multimedia retrieval, pp 643–650
Felzenszwalb PF, Girshick RB, Mcallester DA, Ramanan D (2010) Object detection with discriminatively trained part-based models. IEEE Trans Pattern Anal Mach Intell 32(9):1627–1645
Franzel T, Schmidt U, Roth S (2012) Object detection in multi-view X-ray images. In: Pinz A, Pock T, Bischof H, Leberl F (eds) DAGM/OAGM 2012. LNCS, vol 7476. Springer, Heidelberg, pp 144–154
Girshick RB, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: CVPR ’14 proceedings of the 2014 IEEE conference on computer vision and pattern recognition, pp 580–587
Hay GA (1978) X-ray imaging. J Phys E 11(5):377–385
Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science 313(5786):504–507
Huang G, Liu Z, Der Maaten LV, Weinberger KQ (2017) Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4700–4708
Kazlauciunas A (2001) Digital imaging: theory and application. Part I: theory. Surf Coat Int Part B Coat Trans 84(1):1–9
Khotanzad A, Hong YH (1990) Invariant image recognition by Zernike moments. IEEE Trans Pattern Anal Mach Intell 12(5):489–497
Kingma DP, Ba J (2015) Adam: a method for stochastic optimization. In: International conference on learning representations
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: International conference on neural information processing systems, pp 1097–1105
Lee SI, Chatalbashev V, Vickrey D, Koller D (2007) Learning a meta-level prior for feature relevance from multiple related tasks. In: Proceedings of the 24th international conference on Machine learning, pp 489–496
Li FF, Fergus R, Perona P (2007) Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories. Comput Vis Image Underst 106(1):59–70
Lin TY, Maire M, Belongie SJ, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL (2014) Microsoft coco: Common objects in context. In: European conference on computer vision, pp 740–755
Liu W, Anguelov D, Erhan D, Szegedy C, Reed SE, Fu CY, Berg AC (2016) Ssd: Single shot multibox detector. In: European conference on computer vision, pp 21–37
Mery D (2013) X-ray testing by computer vision. In: 2013 IEEE conference on computer vision and pattern recognition workshops, pp 360–367
Mery D (2014) Computer vision technology for X-ray testing. Insight 56(3):147–155
Mery D, Riffo V (2014) Automated object recognition using multiple X-ray views. Mater Eval 72(11):1362–1372
Mery D, Riffo V, Zscherpel U, Mondragón G, Lillo I, Zuccar I, Lobel H, Carrasco M (2015) Gdxray: the database of X-ray images for nondestructive testing. J Nondestr Eval 34(4):42
Mery D, Svec E, Arias M (2015) Object recognition in baggage inspection using adaptive sparse representations of X-ray images. In: Pacific-rim symposium on image and video technology, pp 709–720
Mery D, Svec E, Arias M, Riffo V, Saavedra JM, Banerjee S (2017) Modern computer vision techniques for X-ray testing in baggage inspection. IEEE Trans Syst Man Cybern Syst 47(4):682–692
Oquab M, Bottou L, Laptev I, Sivic J (2014) Learning and transferring mid-level image representations using convolutional neural networks. In: CVPR ’14 proceedings of the 2014 IEEE conference on computer vision and pattern recognition, pp 1717–1724
Pan SJ, Yang Q (2010) A survey on transfer learning. IEEE Trans Knowl Data Eng 22(10):1345–1359
Redmon J, Divvala SK, Girshick RB, Farhadi A (2016) You only look once: Unified, real-time object detection. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR), pp 779–788
Redmon J, Farhadi A (2017) Yolo9000: better, faster, stronger. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7263–7271
Ren S, He K, Girshick RB, Sun J (2017) Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137–1149
Riffo V, Mery D (2017) Automated detection of threat objects using adapted implicit shape model. IEEE Trans Syst Man Cybern Syst 46(4):472–482
Roomi MM (2012) Detection of concealed weapons in X-ray images using fuzzy k-NN. Int J Comput Sci Eng Inf 2(2):187–196
Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M, Berg AC, Fei-Fei L (2015) Imagenet large scale visual recognition challenge. Int J Comput Vis 115(3):211–252
Rusu AA, Rabinowitz NC, Desjardins G, Soyer H, Kirkpatrick J, Kavukcuoglu K, Pascanu R, Hadsell R (2016) Progressive neural networks. arXiv:1606.04671
Shetty S (2016) Application of convolutional neural network for image classification on pascal VOC challenge 2012 dataset. arXiv:1607.03785
Szegedy C, Liu W, Jia Y, Sermanet P, Reed SE, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: 2015 IEEE conference on computer vision and pattern recognition (CVPR), pp 1–9
Tulsiani S, Carreira J, Malik J (2015) Pose induction for novel object categories. In: 2015 IEEE international conference on computer vision (ICCV), pp 64–72
Turcsany D, Mouton A, Breckon TP (2013) Improving feature-based object recognition for X-ray baggage security screening using primed visualwords. In: IEEE international conference on industrial technology, pp 1140–1145
Uroukov I, Speller R (2015) A preliminary approach to intelligent X-ray imaging for baggage inspection at airports. Signal Process Res 4(5):1–11
Vardhan PH, Priyadarsini PSU (2016) Transfer learning using convolutional neural networks for object classification within X-ray baggage security imagery. Res J Pharm Biol Chem Sci 7:222–229
Yosinski J, Clune J, Bengio Y, Lipson H (2014) How transferable are features in deep neural networks? In: Advances in neural information processing systems, pp 3320–3328
Zhang H, Cao X, Ho JKL, Chow TWS (2017) Object-level video advertising: an optimization framework. IEEE Trans Ind Inform 13(2):520–531
Zhang H, Ji Y, Huang W, Liu L (2018) Sitcom-star-based clothing retrieval for video advertising: a deep learning framework. Neural Comput Appl. https://doi.org/10.1007/s00521-018-3579-x
Zhang N, Zhu J (2015) A study of X-ray machine image local semantic features extraction model based on bag-of-words for airport security. Int J Smart Sens Intell Syst 8(1):45–64
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Wei, Y., Liu, X. Dangerous goods detection based on transfer learning in X-ray images. Neural Comput & Applic 32, 8711–8724 (2020). https://doi.org/10.1007/s00521-019-04360-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-019-04360-0