Abstract
In spite of the fact that various electronic assistive devices have been designed for individuals who are impaired visually in recent decades, but very few solutions have been developed for aiding them in order to recognize general objects in the environment. In any case, the studies in this area are gaining momentum. Among the different innovations being used for this reason, solutions based on computer vision are developing as one of the most encouraging alternatives principally because of their accessibility and affordability. This study presents an integrated approach towards detection of obstacles for assisting the visually impaired persons. Here, we employed Gaussian filtering based improved discrete cosine transform (IDCT) and convolutional neural network (CNN) (IDCT-CNN) which can enhance the detection accuracy. User defined functionality of DCT is implemented here as an advancement to existing DCT and results show better performance. Also, the proposed Gaussian filtering based IDCT-CNN performance is compared with the existing KNN classifier and related existing research. Results showed promising outcomes where the accuracy of the proposed approach outscored with 99% in comparison with the second best of morphological closing technique which achieved 81%.













Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.Data Availability
The dataset used for this research work is available at: YouTube—https://www.youtube.com/watch?v=JdKkFuINzYI
References
Agarwal, N., Iyer, A., Naidu, S., & Rodrigues, S. (2015, February). Electronic guidance system for the visually impaired: A framework. In 2015 international conference on technologies for sustainable development (ICTSD) (pp. 1–5). IEEE.
Mariotti, S. P. (2012). Global data on visual impairments 2010. World Health Organization, 20.
Ye, H., Malu, M., Oh, U., & Findlater, L. (2014, April). Current and future mobile and wearable device use by people with visual impairments. In Proceedings of the SIGCHI conference on human factors in computing systems (pp. 3123–3132). ACM.
Lee, C. L., Chen, C. Y., Sung, P. C., & Lu, S. Y. (2014). Assessment of a simple obstacle detection device for the visually impaired. Applied ergonomics, 45(4), 817–824.
Elmannai, W., & Elleithy, K. (2017). Sensor-based assistive devices for visually-impaired people: Current status, challenges, and future directions. Sensors, 17(3), 565.
Tapu, R., Mocanu, B., & Tapu, E. (2014, November). A survey on wearable devices used to assist the visual impaired user navigation in outdoor environments. In 2014 11th international symposium on electronics and telecommunications (ISETC) (pp. 1–4). IEEE.
Seneviratne, S., Hu, Y., Nguyen, T., Lan, G., Khalifa, S., Thilakarathna, K., Hassan, M., & Seneviratne, A. (2017). A survey of wearable devices and challenges. IEEE Communications Surveys & Tutorials, 19(4), 2573–2620.
Rodríguez, A., Yebes, J. J., Alcantarilla, P., Bergasa, L., Almazán, J., & Cela, A. (2012). Assisting the visually impaired: Obstacle detection and warning system by acoustic feedback. Sensors, 12(12), 17476–17496.
Mocanu, B., Tapu, R., & Zaharia, T. (2016). When ultrasonic sensors and computer vision join forces for efficient obstacle detection and recognition. Sensors, 16(11), 1807.
Nguyen, V. D., Van Nguyen, H., Tran, D. T., Lee, S. J., & Jeon, J. W. (2016). Learning framework for robust obstacle detection, recognition, and tracking. IEEE Transactions on Intelligent Transportation Systems, 18(6), 1633–1646.
Bashiri, F. S., LaRose, E., Badger, J. C., D’Souza, R. M., Yu, Z., & Peissig, P. (2018, November). Object detection to assist visually impaired people: A deep neural network adventure. In International symposium on visual computing (pp. 500–510). Springer.
Joshi, R. C., Yadav, S., Dutta, M. K., & Travieso-Gonzalez, C. M. (2020). Efficient multi-object detection and smart navigation using artificial intelligence for visually impaired people. Entropy, 22(9), 941.
Afif, M., Ayachi, R., Pissaloux, E., Said, Y., & Atri, M. (2020). Indoor objects detection and recognition for an ICT mobility assistance of visually impaired people. Multimedia Tools and Applications, 79(41), 31645–31662.
Socher, R., Huval, B., Bath, B., Manning, C. D., & Ng, A. Y. (2012). Convolutional-recursive deep learning for 3d object classification. In Advances in neural information processing systems (pp. 656–664).
Bhowmick, A., Prakash, S., Bhagat, R., Prasad, V., & Hazarika, S. M. (2014, December). IntelliNavi: Navigation for blind based on Kinect and machine learning. In International workshop on multi-disciplinary trends in artificial intelligence (pp. 172–183). Springer.
Chaudhry, S., & Chandra, R. (2015). Design of a mobile face recognition system for visually impaired persons. arXiv preprint arXiv:1502.00756.
Poggi, M., & Mattoccia, S. (2016, June). A wearable mobility aid for the visually impaired based on embedded 3d vision and deep learning. In 2016 IEEE symposium on computers and communication (ISCC) (pp. 208–213). IEEE.
Tapu, R., Mocanu, B., & Zaharia, T. (2017). DEEP-SEE: Joint object detection, tracking and recognition with application to visually impaired navigational assistance. Sensors, 17(11), 2473.
Kaur, B., & Bhattacharya, J. (2018). A scene perception system for visually impaired based on object detection and classification using multi-modal DCNN. arXiv preprint arXiv:1805.08798.
Cabello, F., León, J., Iano, Y., & Arthur, R. (2015, September). Implementation of a fixed-point 2D Gaussian filter for image processing based on FPGA. In 2015 signal processing: algorithms, architectures, arrangements, and applications (SPA) (pp. 28–33). IEEE.
Chen, Q., Zhao, L., Lu, J., Kuang, G., Wang, N., & Jiang, Y. (2012). Modified two-dimensional Otsu image segmentation algorithm and fast realisation. IET Image Processing, 6(4), 426–433.
Makkar, H., & Pundir, A. (2014). Image analysis using improved Otsu’s thresholding method. International Journal on Recent and Innovation Trends in Computing and Communication, 2(8), 2122–2126.
Belloulata, K., Belallouche, L., Belalia, A., & Kpalma, K. (2014, July). Region based image retrieval using shape-adaptive dct. In 2014 IEEE China summit & international conference on signal and information processing (ChinaSIP) (pp. 470–474). IEEE.
Yuan, L. U. O., Wu, C. M., & Zhang, Y. (2013). Facial expression feature extraction using hybrid PCA and LBP. The Journal of China Universities of Posts and Telecommunications, 20(2), 120–124.
Haralick, R. M., Shanmugam, K., & Dinstein, I. H. (1973). Textural features for image classification. IEEE Transactions on systems, man, and cybernetics, 6, 610–621.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Singh, Y., Kaur, L. & Neeru, N. A New Improved Obstacle Detection Framework Using IDCT and CNN to Assist Visually Impaired Persons in an Outdoor Environment. Wireless Pers Commun 124, 3685–3702 (2022). https://doi.org/10.1007/s11277-022-09533-0
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11277-022-09533-0