Abstract
Innovations in Internet of Everything (IoE)-enabled systems are changing the settings in which we interact, giving rise to smart units recognized globally as smart city environments. Intelligent video-surveillance systems are critical to increasing the security of these smart cities. In particular, person re-identification (Re-ID) has attracted growing attention from researchers working on smart video surveillance. Many researchers have designed deep learning-based algorithms for person Re-ID because deep learning has achieved substantial breakthroughs in computer vision problems. In this line of research, we designed an adaptive feature refinement-based deep learning architecture for person Re-ID. The proposed architecture learns spatial and channel attention by focusing on the inter-channel and inter-spatial relationships of features between images of the same individual taken from different camera viewpoints. In addition, a spatial pyramid pooling layer is inserted to extract multiscale, fixed-dimension feature vectors irrespective of the size of the feature maps. The model’s effectiveness is validated on the CUHK01 and CUHK02 datasets. Compared with existing approaches, the approach presented in this paper achieves encouraging Rank-1 and Rank-5 scores of 24.6% and 54.8%, respectively.
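To illustrate why a spatial pyramid pooling layer yields a fixed-dimension vector regardless of feature-map size, the following is a minimal NumPy sketch, not the architecture used in the paper. The function name `spatial_pyramid_pool`, the pyramid levels (1, 2, 4), and the use of max pooling are assumptions chosen for illustration; the abstract does not state the actual configuration.

```python
import numpy as np


def spatial_pyramid_pool(feature_map, levels=(1, 2, 4)):
    """Max-pool a (C, H, W) feature map over n x n grids and concatenate.

    The output length is C * sum(n * n for n in levels), independent of H and W.
    The levels and the max operator are illustrative assumptions.
    """
    channels, height, width = feature_map.shape
    pooled = []
    for n in levels:
        # Bin edges cover the whole axis even when H or W is not divisible by n.
        row_edges = np.linspace(0, height, n + 1)
        col_edges = np.linspace(0, width, n + 1)
        for i in range(n):
            r0 = int(np.floor(row_edges[i]))
            r1 = int(np.ceil(row_edges[i + 1]))
            for j in range(n):
                c0 = int(np.floor(col_edges[j]))
                c1 = int(np.ceil(col_edges[j + 1]))
                # One max value per channel for this spatial bin.
                pooled.append(feature_map[:, r0:r1, c0:c1].max(axis=(1, 2)))
    return np.concatenate(pooled)


# Two feature maps with different spatial sizes but the same channel count.
small = spatial_pyramid_pool(np.random.rand(8, 13, 7))
large = spatial_pyramid_pool(np.random.rand(8, 32, 16))
print(small.shape, large.shape)
```

Both calls return vectors of the same length (8 channels times 1 + 4 + 16 bins), which is what allows a fully connected layer to follow feature maps of varying size.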
Acknowledgements
This paper was supported by a Korea Institute for Advancement of Technology (KIAT) grant funded by the Korea Government (MOTIE) (P0008703, The Competency Development Program for Industry Specialist) and by the MSIT (Ministry of Science and ICT), Republic of Korea, under the ITRC (Information Technology Research Center) support program (IITP-2022-2018-0-01799) supervised by the IITP (Institute for Information & Communications Technology Planning & Evaluation).
Author information
Additional information
Muazzam Maqsood is an Assistant Professor at the Department of Computer Science, COMSATS University Islamabad, Attock Campus, Pakistan. He holds a PhD in software engineering with a keen interest in artificial intelligence and deep learning-based systems. His main research focus is the use of the latest machine learning and deep learning algorithms to develop automated solutions, especially in the fields of pattern recognition and data analytics. He has published papers in top-ranked impact factor journals in the areas of image processing, medical imaging, recommender systems, stock exchange prediction, and big data analytics. He is also a reviewer for many impact factor journals and a program committee member of various international conferences.
Sadaf Yasmin is currently working as an Assistant Professor at the Department of Computer Science, COMSATS University Islamabad, Attock Campus, Pakistan. She completed her MS and PhD in Computer Science at Capital University of Science and Technology, Pakistan, and her BS in Software Engineering at (APCOMS) NUML Islamabad, Pakistan. She has worked on several research projects during and after her PhD. She is also serving as a reviewer for various reputed journals. Her research interests include network protocol design, computer vision, medical imaging, and pattern recognition.
Saira Gillani received her PhD degree in Information Sciences from Corvinus University of Budapest, Hungary. She joined the COMSATS Institute of Information Technology, Pakistan in 2016. She also served as an assistant professor at Saudi Electronic University, Saudi Arabia. She is currently serving as an associate professor at Bahria University Lahore, Pakistan. Previously, she worked as a research scholar at Corvinno, Technology Transfer Center of Information Technology and Services in Budapest, Hungary, and as a research associate at CoReNet (Center of Research in Networks and Telecom), CUST, Pakistan. Her areas of interest include data sciences, text mining, data mining, machine learning, vehicular networks, mobile edge computing, and the Internet of Things.
Maryam Bukhari is pursuing her MS degree at COMSATS University Islamabad, Attock Campus, Pakistan. Her research areas include machine learning and image processing.
Seungmin Rho is currently an associate professor at the Department of Industrial Security at Chung-Ang University, Republic of Korea. His current research interests include databases, big data analysis, music retrieval, multimedia systems, machine learning, knowledge management, and computational intelligence. He has published 300 papers in refereed journals and conference proceedings in these areas. He has been involved in more than 20 conferences and workshops in various chair roles and in more than 30 conferences/workshops as a program committee member. He has edited a number of international journal special issues as a guest editor, for journals such as Multimedia Systems, Information Fusion, and Engineering Applications of Artificial Intelligence.
Sang-Soo Yeo received a PhD degree in Computer Science & Engineering from Chung-Ang University, Republic of Korea in 2005. He is a professor at the Department of Computer Engineering, Mokwon University, Republic of Korea. He worked for the Ministry of Interior and Safety (MOIS) and for the Personal Information Protection Commission (PIPC), Republic of Korea, from Feb. 2020 to Jul. 2021. He is President of the Institution of Creative Research Professionals (ICRP) and Vice President of the ICT Platform Society (ICTPS). He is serving as Steering Chair of the PlatCon conference series, a comprehensive conference series on platform technology and services. Dr. Yeo’s research interests include security, privacy, personal information protection, ubiquitous computing, multimedia services, embedded systems, and bioinformatics.
Electronic supplementary material
11704_2022_2050_MOESM1_ESM.pdf
An efficient deep learning-assisted person re-identification solution for intelligent video surveillance in smart cities
About this article
Cite this article
Maqsood, M., Yasmin, S., Gillani, S. et al. An efficient deep learning-assisted person re-identification solution for intelligent video surveillance in smart cities. Front. Comput. Sci. 17, 174329 (2023). https://doi.org/10.1007/s11704-022-2050-4
DOI: https://doi.org/10.1007/s11704-022-2050-4