Automatic robot Manoeuvres detection using computer vision and deep learning techniques: a perspective of internet of robotics things (IoRT)

Mahajan, Hemant B.; Uke, Nilesh; Pise, Priya; Shahade, Makarand; Dixit, Vandana G.; Bhavsar, Swapna; Deshpande, Sarita D.

doi:10.1007/s11042-022-14253-5

Automatic robot Manoeuvres detection using computer vision and deep learning techniques: a perspective of internet of robotics things (IoRT)

Published: 16 November 2022

Volume 82, pages 23251–23276, (2023)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Hemant B. Mahajan ORCID: orcid.org/0000-0001-6703-7711¹,
Nilesh Uke²,
Priya Pise³,
Makarand Shahade⁴,
Vandana G. Dixit⁵,
Swapna Bhavsar⁵ &
…
Sarita D. Deshpande⁵

1153 Accesses
1 Altmetric
Explore all metrics

Abstract

To minimize any impediments in real-time Internet of Things (IoT)-enabled robotics applications, this study demonstrated how to build and deploy a revolutionary framework using computer vision and deep learning. In contrast to robotic path planning algorithms based on geolocation. We focus on sensor-captured streams/images and geographical information to enable the Internet of Robotic Things (IoRT) to evolve. The application will collect real-time data from moving robotics at various situations and intervals and use it for research projects. The data collected in videos/image forms are delivered in the robotics application using visual sensor nodes. In this study, anticipating moving robot moves automatically early on can aid in issuing commands to monitor and regulate robots’ future activities before they occur. To do so, we propose the framework using efficient computer vision techniques and a deep learning classifier. The computer vision methods are designed for frame quality improvement, object segmentation, and feature estimation. The Long-Term Short Memory (LSTM) classifier detects robot motions automatically from initial sequential features. We mainly designed the proposed model using an LSTM classifier to perform the earlier prediction from the initial sequential features of partial video frames and to overcome the problems of exploding and vanishing gradients. LSTM helps to reduce the prediction duration with higher accuracy. It also enables the central system of a certain robotic application to prevent collisions caused by impediments in the interior or outdoor situation. The simulation results utilizing publicly available research datasets demonstrate the proposed model’s efficiency and robustness compared to state-of-the-art approaches. The overall accuracy of the proposed model has improved approximately by 5% and reduced computational complexity by 84% approximately.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Automatic Training Method of Deep Neural Network for Robot Vision

Improved Deep Neural Network Object Tracking System for Applications in Home Robotics

Efficient deep learning-based semantic mapping approach using monocular vision for resource-limited mobile robots

Article 25 April 2022

Data availability

The datasets analysed during the current study are available from the corresponding author on reasonable request.

References

Alcácer, V, Cruz-Machado, V (2019) Scanning the industry 4.0: a literature review on Technologies for Manufacturing Systems. Eng Sci Technol Int J
Alhayani B, Abbas ST, Mohammed HJ, Mahajan HB (2021) Intelligent secured two-way image transmission using Corvus Corone module over WSN. Wirel Pers Commun 120:665–700. https://doi.org/10.1007/s11277-021-08484-2
Article Google Scholar
Bar-Hillel A, Lerner R, Levi D, Raz G (2011) Recent progress in road and lane detection: a survey. Mach Vis Appl 25:727–745
Article Google Scholar
Bhatti UA, Huang M, Wang H, Zhang Y, Mehmood A, Di W (2017) Recommendation system for immunization coverage and monitoring. Human Vaccines Immunotherapeutics 14(1):165–171. https://doi.org/10.1080/21645515.2017.1379639
Article Google Scholar
Bhatti UA, Huang M, Wu D, Zhang Y, Mehmood A, Han H (2018) Recommendation system using feature extraction and pattern recognition in clinical care systems. Enterp Inf Syst 13:1–23. https://doi.org/10.1080/17517575.2018.1557256
Article Google Scholar
Bhatti, U, Yu, Z, Yuan, L, Nawaz, SA, Bhatti, M, Mehmood, A, Ain, QUl, Zeehsan, Z, Wen, L. (2020) Geometric algebra applications in geospatial artificial intelligence and remote sensing image processing. IEEE Access. PP. 1–1. https://doi.org/10.1109/ACCESS.2020.3018544.
Bhatti, U, Yu, Z, Chanussot, J, Zeeshan Z, Yuan, L, Luo, W, Nawaz, SA, Bhatti, M, Ain, QUl, Mehmood, A. (2021) Local Similarity-Based Spatial-Spectral Fusion Hyperspectral Image Classification With Deep CNN and Gabor Filtering. IEEE Trans Geosci Remote Sensing. PP. 1–15. https://doi.org/10.1109/TGRS.2021.3090410.
Bhatti UA, Yan Y, Zhou M, Ali S, Hussain A, Qingsong H, Yu Z, Yuan L (2021) Time series analysis and forecasting of air pollution particulate matter (PM2.5): An SARIMA and factor analysis approach. IEEE Access 9:41019–41031. https://doi.org/10.1109/access.2021.3060744
Article Google Scholar
Bhatti UA, Zeeshan Z, Nizamani MM, Bazai S, Yu Z, Yuan L (2022) Assessing the change of ambient air quality patterns in Jiangsu Province of China pre-to post-COVID-19. Chemosphere 288(Pt 2):132569. https://doi.org/10.1016/j.chemosphere.2021.132569
Article Google Scholar
Calli B, Singh A, Bruce J, Walsman A, Konolige K, Srinivasa S, Abbeel P, Dollar AM (2017) Yale-CMU-Berkeley dataset for robotic manipulation research. Int J Robot Res 36(3):261–268. https://doi.org/10.1177/0278364917700714
Article Google Scholar
Caruso, D, Engel, J, Cremers, D (2015) Large-scale direct SLAM for omnidirectional cameras. 2015 IEEE/RSJ international conference on intelligent robots and systems (IROS), 141–148
Chang, C, Siagian, C, Itti, L (2010) Mobile robot vision navigation & localization using gist and saliency. 2010 IEEE/RSJ international conference on intelligent robots and systems, 4147–4154
Chauhan A, Jakhar SK, Chauhan C (2020) The interplay of circular economy with industry 4.0 enabled smart city drivers of healthcare waste disposal. J Clean Prod 279:123854–123854
Article Google Scholar
Chen H, Sun D (2012) Moving groups of microparticles into Array with a robot–tweezers manipulation system. IEEE Trans Robot 28:1069–1080
Article Google Scholar
Chen M, Sinha A, Hu K, Shah MI (2021) Impact of technological innovation on energy efficiency in industry 4.0 era: moderation of shadow economy in sustainable development. Technol Forecast Soc Change 164:120521
Article Google Scholar
Cummins M, Newman P (2008) FAB-MAP: probabilistic localization and mapping in the space of appearance. Int J Robot Res 27(6):647–665. https://doi.org/10.1177/0278364908090961
Article Google Scholar
Duan, Y, Huang, X, Yu, X (2016) Multi-robot dynamic virtual potential point hunting strategy based on FIS. 2016 IEEE Chinese guidance, Navigation and Control Conference (CGNCC), 332–335
Fang Y, Liu J, Li J, Cheng J, Hu J, Yi D, Xiao X, Bhatti UA (2022) Robust zero-watermarking algorithm for medical images based on SIFT and Bandelet-DCT. Multimed Tools Appl 81:16863–16879. https://doi.org/10.1007/s11042-022-12592-x
Article Google Scholar
Gil-Martin M, San-Segundo R, Fernandez-Martinez F, de Cordoba R (2020) Human activity recognition adapted to the type of movement. Comput Electr Eng 88:106822. https://doi.org/10.1016/j.compeleceng.2020.106822
Article Google Scholar
Gualtieri L, Rauch E, Vidoni R (2021) Emerging research fields in safety and ergonomics in industrial collaborative robotics: A systematic literature review. Rob Comput Integr Manuf 67:Article 101998
Article Google Scholar
Ha J, Shin J, Park H, Paik J (2021) Action recognition network using stacked short-term deep features and bidirectional moving average. Appl Sci 11(12):5563. https://doi.org/10.3390/app11125563
Article Google Scholar
Hari Pavan A, Anvitha P, Prem Sai A, Sunil I, Maruthi Y, Radhesyam V (2022) Human action recognition in videos using deep neural network. In: Chowdary PSR, Anguera J, Satapathy SC, Bhateja V (eds) Evolution in signal processing and telecommunication networks. Lecture notes in electrical engineering, vol 839. Springer, Singapore. https://doi.org/10.1007/978-981-16-8554-5_31
Chapter Google Scholar
Hasnain A, Sheng Y, Hashmi et al (2022) Time Series Analysis and Forecasting of Air Pollutants Based on Prophet Forecasting Model in Jiangsu Province, China Citation. Front Environ Sci 10:1. https://doi.org/10.3389/fenvs.2022.945628
Article Google Scholar
Hu, S, Cao, C, Pan, J (2017) Deep-learned pedestrian avoidance policy for robot navigation. 2017 IEEE international conference on robotics and biomimetics (ROBIO), 338–343
Hussain A, Hussain T, Ullah W, Baik SW (2022) Vision transformer and deep sequence learning for human activity recognition in surveillance videos. Comput Intell Neurosci 2022:3454167. https://doi.org/10.1155/2022/3454167
Article Google Scholar
Ijaz MF, Attique M, Son Y (2020) Data-driven cervical Cancer prediction model with outlier detection and over-sampling methods. Sensors 20(10):2809. https://doi.org/10.3390/s20102809
Article Google Scholar
Jain, A, Koppula, HS, Soh, S, Raghavan, B, Singh, A, Saxena, A (2016) Brain4Cars: Car that knows before you do via sensory-fusion deep learning architecture. ArXiv, abs/1601.00740
Jama, M; Schinstock, D (2011) Parallel Tracking and Mapping for Controlling VTOL Airframe. J Control Sci Eng, 2011(), 1–10. https://doi.org/10.1155/2011/413074
Khan IU, Afzal S, Lee JW (2022) Human activity recognition via hybrid deep learning based model. Sensors (Basel, Switzerland) 22(1):323. https://doi.org/10.3390/s22010323
Article Google Scholar
Kim J-H, Won CS (2020) Action recognition in videos using pre-trained 2D convolutional neural networks. IEEE Access 8:60179–60188. https://doi.org/10.1109/access.2020.2983427
Article Google Scholar
Kipper LM, Iepsen S, Dal Forno AJ, Frozza R, Furstenau L, Agnes J, Cossul D (2021) Scientific mapping to identify competencies required through industry 4.0 Technol. Soc 64:Article 101454
Google Scholar
Kose, N, Kopuklu, O, Unnervik, A, Rigoll, G (2019) Real-time driver state monitoring using a CNN based Spatio-temporal approach*. 2019 IEEE intelligent transportation systems conference (ITSC). https://doi.org/10.1109/itsc.2019.8917460.
Kruijff, GM, Colas, F, Svoboda, T, Diggelen, JV, Balmer, P, Pirri, F, Worst, R (2012) Designing intelligent robots for human-robot teaming in Urban search and rescue. AAAI Spring Symposium: Designing Intelligent Robots
Kumar P, Singh RK, Kumar V (2021) Managing supply chains for sustainable operations in the era of industry 4.0 and circular economy: analysis of barriers Resour. Conserv Recycl 164:Article 105215
Article Google Scholar
Lee, S-H, Lee, B. (2015). Kalman Consensus Based Multi-Robot SLAM with a Rao-Blackwellized Particle Filter. J Autom Control Eng. 368–372. https://doi.org/10.12720/joace.3.5.368-372.
Liang X, Wang H, Chen W, Guo D, Liu T (2015) Adaptive image-based trajectory tracking control of wheeled Mobile robots with an uncalibrated fixed camera. IEEE Trans Control Syst Technol 23:2266–2282
Article Google Scholar
Liu M (2016) Robotic online path planning on point cloud. IEEE Trans Cybern 46:1217–1228
Article Google Scholar
Liu, M, Colas, F, Siegwart, RY (2011) Regional topological segmentation based on mutual information graphs. 2011 IEEE international conference on robotics and automation, 3269–3274
Liu M, Pradalier C, Siegwart R (2013) Visual homing from scale with an uncelebrated omnidirectional camera. IEEE Trans Robot 29(6):1353–1365
Article Google Scholar
Lu K, Li J, An X, He H (2015) Vision Sensor-Based Road Detection for Field Robot Navigation. Sensors (Basel, Switzerland) 15:29594–29617
Article Google Scholar
Ma C, Hu X, Xiao J, Du H, Zhang G (2020) Improved ORB algorithm using three-patch method and local gray difference. Sensors 20(4):975. https://doi.org/10.3390/s20040975
Article Google Scholar
Mahajan, HB, Badarla, A (2018) Application of internet of things for smart precision farming: solutions and challenges. Int J Adv Sci Technol, Vol Dec 2018, PP. 37–45
Mahajan HB, Badarla A (2019) Experimental analysis of recent clustering algorithms for wireless sensor network: application of IoT based smart precision farming. J Adv Res Dyn Control Syst 11(9):116–125. https://doi.org/10.5373/JARDCS/V11I9/20193162
Article Google Scholar
Mahajan HB, Badarla A (2020) Detecting HTTP vulnerabilities in IoT-based precision farming connected with cloud environment using artificial intelligence. Int J Adv Sci Technol 29(3):214–226
Google Scholar
Mahajan HB, Badarla A (2021) Cross-layer protocol for WSN-assisted IoT smart farming applications using nature inspired algorithm. Wirel Pers Commun 121:3125–3149. https://doi.org/10.1007/s11277-021-08866-6
Article Google Scholar
Mahajan HB, Badarla A, Junnarkar AA (2021) CL-IoT: cross-layer internet of things protocol for intelligent manufacturing of smart farming. J Ambient Intell Humaniz Comput 12:7777–7791. https://doi.org/10.1007/s12652-020-02502-0
Article Google Scholar
Mahajan, HB, Rashid, AS, Junnarkar, AA et al. (2022) Integration of Healthcare 4.0 and blockchain into secure cloud-based electronic health records systems. Appl Nanosci, https://doi.org/10.1007/s13204-021-02164-0.
Mandal M, Singh PK, Ijaz MF, Shafi J, Sarkar R (2021) A tri-stage wrapper-filter feature selection framework for disease classification. Sensors 21(16):5571. https://doi.org/10.3390/s21165571
Article Google Scholar
Mikhail, A, Kamil, IA, Mahajan, H (2017) Increasing SCADA System Availability by Fault Tolerance Techniques. 2017 International conference on computing, communication, Control and Automation (ICCUBEA) https://doi.org/10.1109/iccubea.2017.8463911
Mikhail, A, Kareem, HH, Mahajan, H (2017) Fault Tolerance to Balance for Messaging Layers in Communication Society. 2017 International conference on computing, communication, Control and Automation (ICCUBEA) https://doi.org/10.1109/iccubea.2017.8463871
Momen, S, Lima, TI, Siddika, R (2016) Group decision by house-hunting agents in multi-robot systems. 2016 2nd international symposium on agent, multi-agent systems and robotics (ISAMSR), 73-78
Newcombe, RA, Lovegrove, SJ, Davison, AJ (2011) 2011 International Conference on Computer Vision - DTAM: Dense tracking and mapping in real-time. (0), 2320–2327. https://doi.org/10.1109/iccv.2011.6126513.
Newcombe, RA, Izadi, S, Hilliges, O, Molyneaux, D, Kim, D, Davison, AJ, Kohli, P, Shotton, J, Hodges, S, Fitzgibbon, AW (2011) KinectFusion: real-time dense surface mapping and tracking. 2011 10th IEEE international symposium on mixed and augmented reality, 127-136
Ran, L, Zhang, Y, Zhang, Q, Yang, T (2017) Convolutional neural network-based robot navigation using uncalibrated spherical images †. Sensors (Basel, Switzerland), 17
Ruiz-Sarmiento JR, Galindo C, Gonzalez-Jimenez J (2017) Robot@home, a robotic dataset for semantic mapping of home environments. Int J Robot Res 36(2):131–141. https://doi.org/10.1177/0278364917695640
Article Google Scholar
Serpush F, Rezaei M (2021) Complex human action recognition using a hierarchical feature reduction and deep learning-based method. SN Comput Sci 2:94. https://doi.org/10.1007/s42979-021-00484-0
Article Google Scholar
Shen, H, Li, N, Rojas, S, Zhang, L (2016) Multi-robot cooperative hunting. 2016 international conference on collaboration technologies and systems (CTS), 349–353
Shi Q, Zhang HB, Ren HT, du JX, Lei Q (2020) Consistent constraint-based video-level learning for action recognition. J Image Video Proc 2020(35). https://doi.org/10.1186/s13640-020-00519-1
Srinivasu PN, SivaSai JG, Ijaz MF, Bhoi AK, Kim W, Kang JJ (2021) Classification of skin disease using deep learning neural networks with MobileNet V2 and LSTM. Sensors 21(8):2852. https://doi.org/10.3390/s21082852
Article Google Scholar
Srinivasu PN, Ahmed S, Alhumam A, Bhoi AK, Ijaz MF (2021) An AW-HARIS Based Automated Segmentation of Human Liver Using CT Images. Comput Mater Continua 69:3303–3319. https://doi.org/10.32604/cmc.2021.018472
Article Google Scholar
Tai, L, Liu, M (2016) Deep-learning in Mobile robotics - from perception to control systems: a survey on why and why not. ArXiv, abs/1612.07139
Tai, L, Li, S, Liu, M (2017) Autonomous exploration of mobile robots through deep neural networks. Int J Adv Robot Syst, 14
Tai, L, Li, S, Liu, M (2017) Autonomous exploration of mobile robots through deep neural networks. Int J Adv Robot Syst, 14
Uke, N, Pise, P, Mahajan, HB, et.al. (2021) Healthcare 4.0 Enabled Lightweight Security Provisions for Medical Data Processing. Turkish Journal of Computer and Mathematics (2021), Vol. 12, No. 11. https://doi.org/10.17762/turcomat.v12i11.5858.
Ullah A, Muhammad K, Ding W, Palade V, Haq IU, Baik SW (2021) Efficient activity recognition using lightweight CNN and DS-GRU network for surveillance applications. Appl Soft Comput 103:107102. https://doi.org/10.1016/j.asoc.2021.107102
Article Google Scholar
Urban, S, Hinz, S (2016) MultiCol-SLAM - A Modular Real-Time Multi-Camera SLAM System. ArXiv, abs/1610.07336
Vulli A, Srinivasu PN, Sashank M, Shafi J, Choi J, Ijaz MF (2022) Fine-tuned DenseNet-169 for breast Cancer metastasis prediction using FastAI and 1-cycle policy. Sensors (Basel, Switzerland) 22(8):2988. https://doi.org/10.3390/s22082988
Article Google Scholar
Wang H, Guo D, Liang X, Chen W, Hu G, Leang KK (2017) Adaptive vision-based leader–follower formation control of Mobile robots. IEEE Trans Ind Electron 64:2893–2902
Article Google Scholar
Werthen-Brabants L, Bhavanasi G, Couckuyt I, Dhaene T, Deschrijver D (2022) Split BiRNN for real-time activity recognition using radar and deep learning. Sci Rep 12:7436. https://doi.org/10.1038/s41598-022-08240-x
Article Google Scholar
Xia, C (2015) Intelligent Mobile robot learning in autonomous navigation
Yoneda K, Kato C, Sugawara T (2013) Autonomous learning of target decision strategies without Communications for Continuous Coordinated Cleaning Tasks. 2013 IEEE/WIC/ACM Int Joint Conf Web Intell (WI) Intell Agent Technol (IAT) 2:216–223
Google Scholar

Download references

Author information

Authors and Affiliations

Research Analysis and Data Scientist, Godwit Technologies, Pune, India
Hemant B. Mahajan
Trinity Academy of Engineering, Pune, India
Nilesh Uke
Indira College of Engineering and Management, Pune, India
Priya Pise
SVKM’s Institute of Technology, Mumbai Agra Highway, Behind Gurudwara, Dhule, India
Makarand Shahade
PES Modern College of Engineering, Pune, India
Vandana G. Dixit, Swapna Bhavsar & Sarita D. Deshpande

Authors

Hemant B. Mahajan
View author publications
You can also search for this author in PubMed Google Scholar
Nilesh Uke
View author publications
You can also search for this author in PubMed Google Scholar
Priya Pise
View author publications
You can also search for this author in PubMed Google Scholar
Makarand Shahade
View author publications
You can also search for this author in PubMed Google Scholar
Vandana G. Dixit
View author publications
You can also search for this author in PubMed Google Scholar
Swapna Bhavsar
View author publications
You can also search for this author in PubMed Google Scholar
Sarita D. Deshpande
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hemant B. Mahajan.

Ethics declarations

Ethical approval

This article does not contain any studies with human participants performed by any of the authors.

Conflict of interest

All authors declares that they has no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Mahajan, H.B., Uke, N., Pise, P. et al. Automatic robot Manoeuvres detection using computer vision and deep learning techniques: a perspective of internet of robotics things (IoRT). Multimed Tools Appl 82, 23251–23276 (2023). https://doi.org/10.1007/s11042-022-14253-5

Download citation

Received: 23 March 2022
Revised: 17 September 2022
Accepted: 04 November 2022
Published: 16 November 2022
Issue Date: June 2023
DOI: https://doi.org/10.1007/s11042-022-14253-5

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Automatic robot Manoeuvres detection using computer vision and deep learning techniques: a perspective of internet of robotics things (IoRT)

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Automatic Training Method of Deep Neural Network for Robot Vision

Improved Deep Neural Network Object Tracking System for Applications in Home Robotics

Efficient deep learning-based semantic mapping approach using monocular vision for resource-limited mobile robots

Data availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Ethical approval

Conflict of interest

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now