Skip to main content
Log in

An efficient deep learning-assisted person re-identification solution for intelligent video surveillance in smart cities

  • Research Article
  • Published:
Frontiers of Computer Science Aims and scope Submit manuscript

Abstract

Innovations on the Internet of Everything (IoE) enabled systems are driving a change in the settings where we interact in smart units, recognized globally as smart city environments. However, intelligent video-surveillance systems are critical to increasing the security of these smart cities. More precisely, in today’s world of smart video surveillance, person re-identification (Re-ID) has gained increased consideration by researchers. Various researchers have designed deep learning-based algorithms for person Re-ID because they have achieved substantial breakthroughs in computer vision problems. In this line of research, we designed an adaptive feature refinement-based deep learning architecture to conduct person Re-ID. In the proposed architecture, the inter-channel and inter-spatial relationship of features between the images of the same individual taken from nonidentical camera viewpoints are focused on learning spatial and channel attention. In addition, the spatial pyramid pooling layer is inserted to extract the multiscale and fixed-dimension feature vectors irrespective of the size of the feature maps. Furthermore, the model’s effectiveness is validated on the CUHK01 and CUHK02 datasets. When compared with existing approaches, the approach presented in this paper achieves encouraging Rank 1 and 5 scores of 24.6% and 54.8%, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Neirotti P, De Marco A, Cagliano A C, Mangano G, Scorrano F. Current trends in smart city initiatives: some stylised facts. Cities, 2014, 38: 25–36

    Article  Google Scholar 

  2. Vlacheas P, Giaffreda R, Stavroulaki V, Kelaidonis D, Foteinos V, Poulios G, Demestichas P, Somov A, Biswas A R, Moessner K. Enabling smart cities through a cognitive management framework for the internet of things. IEEE Communications Magazine, 2013, 51(6): 102–111

    Article  Google Scholar 

  3. Singh P, Nayyar A, Kaur A, Ghosh U. Blockchain and fog based architecture for internet of everything in smart cities. Future Internet, 2020, 12(4): 61

    Article  Google Scholar 

  4. Zheng L, Yang Y, Hauptmann A G. Person re-identification: past, present and future. 2016, arXiv preprint arXiv: 1610.02984

  5. Wu D, Zheng S J, Zhang X P, Yuan C A, Cheng F, Zhao Y, Lin Y J, Zhao Z-Q, Jiang Y L, Huang D S. Deep learning-based methods for person re-identification: a comprehensive review. Neurocomputing, 2019, 337: 354–371

    Article  Google Scholar 

  6. Zahra A, Perwaiz N, Shahzad M, Fraz M M. Person re-identification: a retrospective on domain specific open challenges and future trends. 2022, arXiv preprint arXiv: 2202.13121

  7. Ye M, Shen J, Lin G, Xiang T, Shao L, Hoi S C H. Deep learning for person re-identification: a survey and outlook. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44(6): 2872–2893

    Article  Google Scholar 

  8. Wu W, Tao D, Li H, Yang Z, Cheng J. Deep features for person re-identification on metric learning. Pattern Recognition, 2021, 110: 107424

    Article  Google Scholar 

  9. Chen X, Xu H, Li Y, Bian M. Person re-identification by low-dimensional features and metric learning. Future Internet, 2021, 13(11): 289

    Article  Google Scholar 

  10. Li R, Zhang B, Teng Z, Fan J. A divide-and-unite deep network for person re-identification. Applied Intelligence, 2021, 51(3): 1479–1491

    Article  Google Scholar 

  11. Ming Z, Zhu M, Wang X, Zhu J, Cheng J, Gao C, Yang Y, Wei X. Deep learning-based person re-identification methods: a survey and outlook of recent works. Image and Vision Computing, 2022, 119: 104394

    Article  Google Scholar 

  12. Lin S, Li C T. Person re-identification with soft biometrics through deep learning. In: Jiang R, Li C T, Crookes D, Meng W, Rosenberger C, eds. Deep Biometrics. Cham: Springer, 2020, 21–36

    Chapter  Google Scholar 

  13. Shoukry N, Abd El Ghany MA, Salem M A M. Multi-modal long-term person re-identification using physical soft bio-metrics and body figure. Applied Sciences, 2022, 12(6): 2835

    Article  Google Scholar 

  14. Nambiar A, Bernardino A, Nascimento J C. Gait-based person re-identification: a survey. ACM Computing Surveys, 2020, 52(2): 33

    Article  Google Scholar 

  15. Woo S, Park J, Lee J-Y, Kweon I S. CBAM: convolutional block attention module. In: Proceedings of the 15th European Conference on Computer Vision. 2018, 3–19

  16. Li W, Zhao R, Xiao T, Wang X DeepReID: deep filter pairing neural network for person re-identification. In: Proceedings of 2014 IEEE Conference on Computer Vision and Pattern Recognition. 2014, 152–159

  17. Ahmed E, Jones M, Marks T K. An improved deep learning architecture for person re-identification. In: Proceedings of 2015 IEEE Conference on Computer Vision and Pattern Recognition. 2015, 3908–3916

  18. Chen S, Qin J, Ji X, Lei B, Wang T, Ni D, Cheng J-Z. Automatic scoring of multiple semantic attributes with multi-task feature leverage: a study on pulmonary nodules in CT images. IEEE Transactions on Medical Imaging, 2017, 36(3): 802–814

    Article  Google Scholar 

  19. Huang Y, Sheng H, Zheng Y, Xiong Z. DeepDiff: learning deep difference features on human body parts for person re-identification. Neurocomputing, 2017, 241: 191–203

    Article  Google Scholar 

  20. Zhao H, Tian M, Sun S, Shao J, Yan J, Yi S, Wang X, Tang X. Spindle net: person re-identification with human body region guided feature decomposition and fusion. In: Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition. 2017, 907–915

  21. Hermans A, Beyer L, Leibe B. In defense of the triplet loss for person re-identification. 2017, arXiv preprint arXiv: 1703.07737

  22. He Z, Jung C, Fu Q, Zhang Z. Deep feature embedding learning for person re-identification based on lifted structured loss. Multimedia Tools and Applications, 2019, 78(5): 5863–5880

    Article  Google Scholar 

  23. Wu L, Wang Y, Li X, Gao J. What-and-where to match: deep spatially multiplicative integration networks for person re-identification. Pattern Recognition, 2018, 76: 727–738

    Article  Google Scholar 

  24. Chatfield K, Simonyan K, Vedaldi A, Zisserman A. Return of the devil in the details: delving deep into convolutional nets. In: Proceedings of British Machine Vision Conference. 2014

  25. Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. In: Proceedings of the 3rd International Conference on Learning Representations. 2015

  26. Wu L, Hong R, Wang Y, Wang M. Cross-entropy adversarial view adaptation for person re-identification. IEEE Transactions on Circuits and Systems for Video Technology, 2020, 30(7): 2081–2092

    Google Scholar 

  27. Zhu X, Liu J, Xie H, Zha Z-J. Adaptive alignment network for person re-identification. In: Proceedings of the 25th International Conference on Multimedia Modeling. 2019, 16–27

  28. Wu A, Zheng W-S, Lai J-H. Robust depth-based person re-identification. IEEE Transactions on Image Processing, 2017, 26(6): 2588–2603

    Article  MathSciNet  MATH  Google Scholar 

  29. Imani Z, Soltanizadeh H. Histogram of the node strength and histogram of the edge weight: two new features for RGB-D person re-identification. Science China Information Sciences, 2018, 61(9): 092108

    Article  Google Scholar 

  30. Ren L, Lu J, Feng J, Zhou J. Multi-modal uniform deep learning for RGB-D person re-identification. Pattern Recognition, 2017, 72: 446–457

    Article  Google Scholar 

  31. Wu A, Zheng W-S, Yu H-X, Gong S, Lai J. RGB-infrared cross-modality person re-identification. In: Proceedings of 2017 IEEE International Conference on Computer Vision. 2017, 5390–5399

  32. Møgelmose A, Bahnsen C, Moeslund T B, Clapes A, Escalera S. Tri-modal person re-identification with RGB, depth and thermal features. In: Proceedings of 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops. 2013, 301–307

  33. Silva B N, Khan M, Han K. Towards sustainable smart cities: a review of trends, architectures, components, and open challenges in smart cities. Sustainable Cities and Society, 2018, 38: 697–713

    Article  Google Scholar 

  34. Majeed U, Khan L U, Yaqoob I, Kazmi S M A, Salah K, Hong C S. Blockchain for IoT-based smart cities: recent advances, requirements, and future challenges. Journal of Network and Computer Applications, 2021, 181: 103007

    Article  Google Scholar 

  35. Ullah F, Al-Turjman F, Nayyar A. IoT-based green city architecture using secured and sustainable android services. Environmental Technology & Innovation, 2020, 20: 101091

    Article  Google Scholar 

  36. Li J, Wang J, Ullah F. An end-to-end task-simplified and anchor-guided deep learning framework for image-based head pose estimation. IEEE Access, 2020, 8: 42458–42468

    Article  Google Scholar 

  37. Hubel D H, Wiesel T N. Receptive fields and functional architecture of monkey striate cortex. The Journal of Physiology, 1968, 195(1): 215–243

    Article  Google Scholar 

  38. Bukhari M, Bajwa K B, Gillani S, Maqsood M, Durrani M Y, Mehmood I, Ugail H, Rho S. An efficient gait recognition method for known and unknown covariate conditions. IEEE Access, 2021, 9: 6465–6477

    Article  Google Scholar 

  39. Ashraf R, Afzal S, Rehman A U, Gul S, Baber J, Bakhtyar M, Mehmood I, Song O Y, Maqsood M. Region-of-interest based transfer learning assisted framework for skin cancer detection. IEEE Access, 2020, 8: 147858–147871

    Article  Google Scholar 

  40. Maqsood M, Bukhari M, Ali Z, Gillani S, Mehmood I, Rho S, Jung Y-A. A residual-learning-based multi-scale parallel-convolutions- assisted efficient CAD system for liver tumor detection. Mathematics, 2021, 9(10): 1133

    Article  Google Scholar 

  41. Maqsood M, Yasmin S, Mehmood I, Bukhari M, Kim M. An efficient DA-net architecture for lung nodule segmentation. Mathematics, 2021, 9(13): 1457

    Article  Google Scholar 

  42. Niu Z, Zhong G, Yu H. A review on the attention mechanism of deep learning. Neurocomputing, 2021, 452: 48–62

    Article  Google Scholar 

  43. Guo M H, Xu T X, Liu J J, Liu Z N, Jiang P T, Mu T J, Zhang S H, Martin R R, Cheng M M, Hu S M. Attention mechanisms in computer vision: a survey. Computational Visual Media, 2022, 8(3): 331–368

    Article  Google Scholar 

  44. He K, Zhang X, Ren S, Sun J. Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(9): 1904–1916

    Article  Google Scholar 

  45. Lazebnik S, Schmid C, Ponce J. Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proceedings of 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2006, 2169–2178

  46. Li W, Wang X. Locally aligned feature transforms across views. In: Proceedings of 2013 IEEE Conference on Computer Vision and Pattern Recognition. 2013, 3594–3601

  47. Li W, Zhao R, Wang X. Human reidentification with transferred metric learning. In: Proceedings of the 11th Asian Conference on Computer Vision. 2012, 31–44

  48. Köstinger M, Hirzer M, Wohlhart P, Roth P M, Bischof H. Large scale metric learning from equivalence constraints. In: Proceedings of 2012 IEEE Conference on Computer Vision and Pattern Recognition. 2012, 2288–2295

  49. Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q. Scalable person re-identification: a benchmark. In: Proceedings of 2015 IEEE International Conference on Computer Vision. 2015, 1116–1124

  50. Fan H, Zheng L, Yan C, Yang Y. Unsupervised person re-identification: clustering and fine-tuning. ACM Transactions on Multimedia Computing, Communications, and Applications, 2018, 14(4): 83

    Google Scholar 

  51. Feng G, Liu W, Tao D, Zhou Y. Hessian regularized distance metric learning for people re-identification. Neural Processing Letters, 2019, 50(3): 2087–2100

    Article  Google Scholar 

Download references

Acknowledgements

This paper was supported by Korea Institute for Advancement of Technology (KIAT) grant funded by the Korea Government (MOTIE) (P0008703, The Competency Development Program for Industry Specialist) and also the MSIT (Ministry of Science and ICT), Republic of Korea, under the ITRC (Information Technology Research Center) support program (IITP-2022-2018-0-01799) supervised by the IITP (Institute for Information & Communications Technology Planning & Evaluation).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Seungmin Rho or Sang-Soo Yeo.

Additional information

Muazzam Maqsood is serving as an Assistant Professor at the Department of Computer Science, COMSATS University Islamabad, Attock Campus, Pakistan. He holds a PhD in software engineering with a keen interest in artificial intelligence and deep learning-based systems. His main research focus is to use the latest machine learning and deep learning algorithms to develop automated solutions, especially in the field of pattern recognition and data analytics. He has published various top-ranked impact factor papers in the area of image processing, medical imaging, recommender systems, stock exchange prediction, and big data analytics. He is also a reviewer of many impact factor journals and a program committee member of various international conferences.

Sadaf Yasmin is currently working as Assistant Professor at the Department of Computer Science, COMSATS University Islamabad, Attock Campus, Pakistan. She has completed her MS and PhD in Computer Science from Capital University of Science and Technology, Pakistan, and BS in Software Engineering from (APCOMS) NUML Islamabad, Pakistan. She has worked on several research projects during and after her PhD She is also serving as a reviewer for various reputed journals. Her research interests include network protocol design, computer vision, medical imaging, and pattern recognition.

Saira Gillani received her PhD degree in Information Sciences from Corvinus University of Budapest, Hungary. She joined the COMSATS Institute of Information Technology, Pakistan in 2016. She also served as an assistant professor in Saudi Electronic University, Saudi Arabia. She is currently serving as an associate professor in Bahria University Lahore, Pakistan. Previously, she worked as research scholar in Corvinno, Technology Transfer Center of Information Technology and Services in Budapest, Hungary and also worked as research associate in CoReNet (Center of Research in Networks and Telecom), CUST, Pakistan. Her areas of interest include data sciences, text mining, data mining, machine learning, vehicular networks, mobile edge computing and Internet of Things.

Maryam Bukhari is perusing her MS degree at COMSATS University Islamabad, Attock Campus, Pakistan. Her research areas include machine learning and image processing.

Seungmin Rho is currently an associate professor at Department of Industrial Security at Chung-Ang University, Republic of Korea. His current research interests include database, big data analysis, music retrieval, multimedia systems, machine learning, knowledge management as well as computational intelligence. He has published 300 papers in refereed journals and conference proceedings in these areas. He has been involved in more than 20 conferences and workshops as various chairs and more than 30 conferences/workshops as a program committee member. He has edited a number of international journal special issues as a guest editor, such as multimedia systems, information fusion, and engineering applications of artificial intelligence.

Sang-Soo Yeo received a PhD degree in Computer Science & Engineering from Chung-Ang University, Republic of Korea in 2005. He is a professor at the Department of Computer Engineering, Mokwon University, Republic of Korea. He worked for MOIS, Ministry of Interior and Safety and worked for PIPC, Personal Information Protection Commission, Republic of Korea from Feb. 2020 to Jul. 2021. He is President of the Institution of Creative Research Professionals (ICRP), and Vice President of ICT Platform Society (ICTPS). He is serving as Steering Chair of the PlatCon conference series, a very comprehensive conference series on platform technology and services. Dr. Yeo’s research interests include security, privacy, personal information Protection, ubiquitous computing, multimedia service, ubiquitous computing, embedded system, and bioinformatics.

Electronic supplementary material

11704_2022_2050_MOESM1_ESM.pdf

An efficient deep learning-assisted person re-identification solution for intelligent video surveillance in smart cities

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Maqsood, M., Yasmin, S., Gillani, S. et al. An efficient deep learning-assisted person re-identification solution for intelligent video surveillance in smart cities. Front. Comput. Sci. 17, 174329 (2023). https://doi.org/10.1007/s11704-022-2050-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11704-022-2050-4

Keywords

Navigation