research-article

An assistive model for visually impaired people using YOLO and MTCNN

Authors:

Ferdousi Rahman,

Israt Jahan Ritun,

Nafisa Farhin,

Jia Uddin

ICCSP '19: Proceedings of the 3rd International Conference on Cryptography, Security and Privacy

Pages 225 - 230

https://doi.org/10.1145/3309074.3309114

Published: 19 January 2019

Abstract

Visually impaired people face difficulties in moving safely and independently, which deprives them of regular professional and social activities both indoors and outdoors. Similarly, they have difficulty identifying the fundamental elements of their surrounding environment. This paper presents a model that detects brightness and major colors in real-time images using the RGB method with an external camera, and then identifies fundamental objects and recognizes faces from a personal dataset. The YOLO algorithm is used for object identification and the MTCNN network for facial recognition. Software support is provided by Python's OpenCV libraries together with a machine learning process. The main processor of the model, a Raspberry Pi, scans and detects facial edges via the Pi camera, while objects in the image are captured and recognized using a mobile camera. Image recognition results are conveyed to blind users through a text-to-speech library. The device is made portable by running on a battery. Object detection ran at 6-7 FPS with an accuracy of 63-80%. Face identification achieved 80-100% accuracy.
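
The abstract does not spell out how the "RGB method" computes brightness and major colors, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes brightness is the mean Rec. 601 luma of the frame and that major colors are found by snapping each pixel to the nearest entry in a small named palette. The palette, the thresholds, and all function names are hypothetical, and the frame is a plain list of RGB tuples rather than an OpenCV image so that the example runs with the standard library alone.

```python
from collections import Counter

# Hypothetical reference palette; the paper does not list its color set.
PALETTE = {
    "red": (255, 0, 0),
    "green": (0, 255, 0),
    "blue": (0, 0, 255),
    "yellow": (255, 255, 0),
    "white": (255, 255, 255),
    "black": (0, 0, 0),
}


def brightness(pixels):
    """Mean perceived brightness (0-255) using Rec. 601 luma weights."""
    total = sum(0.299 * r + 0.587 * g + 0.114 * b for r, g, b in pixels)
    return total / len(pixels)


def nearest_color(pixel):
    """Name of the palette color closest to pixel in squared RGB distance."""
    return min(
        PALETTE,
        key=lambda name: sum((p - q) ** 2 for p, q in zip(pixel, PALETTE[name])),
    )


def major_colors(pixels, top=2):
    """The `top` most frequent palette names across all pixels."""
    counts = Counter(nearest_color(p) for p in pixels)
    return [name for name, _ in counts.most_common(top)]


def describe(pixels, dark_threshold=85, bright_threshold=170):
    """Build the sentence that would be handed to a text-to-speech engine.

    The two thresholds are assumed values chosen for illustration.
    """
    level = brightness(pixels)
    if level < dark_threshold:
        label = "dark"
    elif level > bright_threshold:
        label = "bright"
    else:
        label = "moderately lit"
    colors = " and ".join(major_colors(pixels))
    return f"The scene is {label}; the major colors are {colors}."


# A tiny synthetic "frame": six reddish pixels, three bluish, one white.
frame = [(250, 10, 10)] * 6 + [(10, 10, 240)] * 3 + [(255, 255, 255)]
print(describe(frame))
```

In a real pipeline the pixel list would come from a camera frame, and the returned sentence would be passed to whichever text-to-speech library the device uses; both of those steps are left out here because the abstract does not name the specific calls.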

Index Terms

An assistive model for visually impaired people using YOLO and MTCNN
1. Computing methodologies
  1. Machine learning
    1. Machine learning approaches
      1. Classification and regression trees

Published In

ICCSP '19: Proceedings of the 3rd International Conference on Cryptography, Security and Privacy

January 2019

303 pages

ISBN:9781450366182

DOI:10.1145/3309074

Conference Chairs:
Yulin Wang (Wuhan University, China),
Chin-Chen Chang (Feng Chia University, Taiwan)

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article

Conference

ICCSP 2019

ICCSP 2019: 2019 the 3rd International Conference on Cryptography, Security and Privacy

January 19 - 21, 2019

Kuala Lumpur, Malaysia
