Abstract
The Vista-Lite system labels major challenges in video surveillance involving computational complexity, restricted transferability over datasets, and the absence of an impactful approach to examine data from various cameras. This system emphasis three methodologies to solve these issues UOAL, TempoNet and BDSO. UOAL identifies abnormalities in video content via a segmentation approach improving accuracy in complex environments. TempoNet concentrates on forecasting motions and behaviors utilizing modern neural network frameworks, enhancing response times in identifying possibly malicious situations. BDSO enhances the computational resources by tuning system parameters thus assuring flexibility and decreasing false alarms. This fusion improves system persistence, sensibility and functional cost-efficiency making the solution versatile to vast surveillance scenarios. Comprehensive experiments using pedestrian, UCSD, and mall datasets established increased performance with 99% accuracy indicating the system’s capacity to maintain real-time, multi-camera data. Vista-Lite provides a novel, innovative, flexible approach to video surveillance combining anomaly detection, motion prediction, and resource optimization for improving and enhancing the domain.

















Similar content being viewed by others
Data availability
Data sharing not applicable to this article as no datasets were generated or analyzed during the current study. The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request. All data generated or analyzed during this study are included in this published article.
Change history
19 January 2025
Affiliation details for authors were incorrect and now are corrected.
20 January 2025
A Correction to this paper has been published: https://doi.org/10.1007/s11227-024-06877-1
References
Ganapathy S, Ajmera D (2024) An intelligent video surveillance system for detecting the vehicles on road using refined yolov4. Comput Electr Eng 113:109036
Banelas D, Petrakis EG (2024) Motioninsights: real-time object tracking in streaming video. Mach Vision Appl 35:1–16
Dhillon A, Verma GK (2020) Convolutional neural network: a review of models, methodologies and applications to object detection. Prog Artif Intell 9:85–112
Chen Y, Tang Y, Xiao Y, Yuan Q, Zhang Y, Liu F et al (2024) Satellite video single object tracking: a systematic review and an oriented object tracking benchmark. ISPRS J Photogramm Remote Sens 210:212–240
Vijayakumar A and Vairavasundaram S (2024) Yolo-based object detection models: a review and its applications. Multimed Tools Appl pp. 1–40
Marvasti-Zadeh SM, Cheng L, Ghanei-Yakhdan H, Kasaei S (2021) Deep learning for visual tracking: a comprehensive survey. IEEE Trans Intell Trans Syst 23:3943–3968
Kaur J, Singh W (2024) A systematic review of object detection from images using deep learning. Multimed Tools Appl 83:12253–12338
Gopalakrishnan S, Sheela MS, Saranya K, Hephzipah JJ (2023) A novel deep learning-based heart disease prediction system using convolutional neural networks (CNN) algorithm. Int J Intell Syst Appl Eng 11:516–522
Sharma C, Ghosh S, Shenoy KA, Poornalatha G (2024) A novel multiclass object detection dataset enriched with frequency data. IEEE Access 12:85551–85564
Liu Y, Li B, Zhou X, Li D, Duan Q (2024) Online multi-object tracking method for shrimps in high density using multi-task you only look once exceeding and cascade strategy. Eng Appl Artif Intell 136:108882
Gupta A, Anpalagan A, Guan L, Khwaja AS (2021) Deep learning for object detection and scene perception in self-driving cars: survey, challenges, and open issues. Array 10:100057
Miao B, Bennamoun M, Gao Y, and Mian A (2024) Region aware video object segmentation with deep motion modeling. IEEE Transactions on Image Processing,
Shi L, Lan X, Lan X, Zhang T (2024) Quick calibration of massive urban outdoor surveillance cameras. ISPRS J Photogramm Remote Sens 214:231–243
Liu S, Liu D, Srivastava G, Połap D, Woźniak M (2021) Overview and methods of correlation filter algorithms in object tracking. Complex Intell Syst 7:1895–1917
He Z, Zhao K, Zeng D (2024) TLtrack: combining transformers and a linear model for robust multi-object tracking. AI 5:938–947
Yadav SP, Jindal M, Rani P, de Albuquerque VHC, dos Santos Nascimento C, Kumar M (2024) An improved deep learning-based optimal object detection system from images. Multimed Tools Appl 83:30045–30072
Zhu P, Wen L, Du D, Bian X, Fan H, Hu Q et al (2021) Detection and tracking meet drones challenge. IEEE Trans Pattern Anal Mach Intell 44:7380–7399
M. E. Wibowo, A. Ashari, and Putra MPK (2021) Improvement of deep learning-based human detection using dynamic thresholding for intelligent surveillance system. Int J Adv Comput Sci Appl 12
Xu Z, Jain DK, Shamsolmoali P, Goli A, Neelakandan S, Jain A (2024) Slime mold optimization with hybrid deep learning enabled crowd-counting approach in video surveillance. Neural Comput Appl 36:2215–2229
Chiranjeevi VR and Malathi D (2024) Anomaly graph: leveraging dynamic graph convolutional networks for enhanced video anomaly detection in surveillance and security applications. Neural Comput Applpp. 1–18
Rajkumar R, Gopalakrishnan S, Praveena K, Venkatesan M, Ramamoorthy K, Hephzipah JJ (2024) DARKNET-53 convolutional neural network-based image processing for breast cancer detection. Mesopotamian J Artif Intell Healthc 2024:59–68
A. Kamboj (2024) The progression of transformers from language to vision to MOT: a literature review on multi-object tracking with transformers, arXiv preprint arXiv:2406.16784
Maheswari S and Gopalakrishnan S (2024) Dynamic channel attention for enhanced spatial feature extraction in medical image analysis using advanced attention capsule network. In 2024 International Conference on Integrated Circuits and Communication Systems (ICICACS), pp. 1–7
Amosa TI, Sebastian P, Izhar LI, Ibrahim O, Ayinla LS, Bahashwan AA et al (2023) Multi-camera multi-object tracking: a review of current trends and future advances. Neurocomputing 552:126558
Onyema EM, Balasubaramanian S, Iwendi C, Prasad BS, Edeh CD (2023) Remote monitoring system using slow-fast deep convolution neural network model for identifying anti-social activities in surveillance applications. Measurement: Sensors 27:100718
Ullah W, Hussain T, Baik SW (2023) Vision transformer attention with multi-reservoir echo state network for anomaly recognition. Inf Process Manag 60:103289
Raza A, Chelloug SA, Alatiyyah MH, Jalal A, Park J (2023) Multiple pedestrian detection and tracking in night vision surveillance systems. CMC 75:3275–3289
Jha S, Seo C, Yang E, Joshi GP (2021) Real time object detection and trackingsystem for video surveillance system. Multimed Tools Appl 80:3981–3996
Elharrouss O, Almaadeed N, Al-Maadeed S (2021) A review of video surveillance systems. J Visual Commun Image Represent 77:103116
Ciaparrone G, Sánchez FL, Tabik S, Troiano L, Tagliaferri R, Herrera F (2020) Deep learning in video multi-object tracking: a survey. Neurocomputing 381:61–88
Meinhardt T, Kirillov A, Leal-Taixe L, and Feichtenhofer C (2022) Trackformer: multi-object tracking with transformers. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8844-8854
Zhang Y, Wang C, Wang X, Zeng W, Liu W (2021) Fairmot: on the fairness of detection and re-identification in multiple object tracking. Int J Comput Vis 129:3069–3087
Ji Y, Zhang H, Jie Z, Ma L, Wu QJ (2020) CASNet: a cross-attention siamese network for video salient object detection. IEEE Trans on Neural Netw Learn Syst 32:2676–2690
Dang B, Ma D, Li S, Dong X, Zang H, Ding R (2024) Enhancing kitchen independence: deep learning-based object detection for visually impaired assistance. Acad J Sci Technol 9:180–184
Islam M, Dukyil AS, Alyahya S, Habib S (2023) An IoT enable anomaly detection system for smart city surveillance. Sensors 23:2358
Acknowledgments
The authors extend their appreciation to the Deanship of Research and Graduate Studies at King Khalid University for funding this work through Research Project under grant number RGP.1/321/45.
Funding
This work is funded by the Deanship of Research and Graduate Studies at King Khalid University under grant number RGP.1/321/45.
Author information
Authors and Affiliations
Contributions
All authors contributed to the study conception and design. Material preparation, data collection and analysis were performed by Dr. Areej Alasiry. The first draft of the manuscript was written by Mr. Mohammed Qayyum and Dr. Areej commented on previous versions of the manuscript. The author Dr. Areej Alasiry involved in the background study of the paper and helped the mathematical derivations. Also technically involved and provided a factual review and helped edit the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Ethical Statement
I will conduct myself with integrity, fidelity, and honesty. I will openly take responsibility for my actions, and only make agreements, which I intend to keep. I will not intentionally engage in or participate in any form of malicious harm to another person or animal.
Informed Consent for data Used
All subjects gave their informed consent for inclusion before they participated in the study. The study was conducted in accordance with the Declaration of Helsinki. I consent to participate in the research project, and the following has been explained to me: the research may not be of direct benefit to me. My participation is completely voluntary. My right to withdraw from the study at any time without any implications to me.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Affiliation details for authors were incorrect and now are corrected.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Alasiry, A., Qayyum, M. A smart vista-lite system for anomaly detection and motion prediction for video surveillance in vibrant urban settings. J Supercomput 81, 297 (2025). https://doi.org/10.1007/s11227-024-06753-y
Accepted:
Published:
DOI: https://doi.org/10.1007/s11227-024-06753-y