Towards More Natural Social Interactions of Visually Impaired Persons

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 9386)

Abstract

We review recent computer vision techniques with reference to the specific goal of assisting the social interactions of a person affected by very severe visual impairment or by total blindness. We consider a scenario in which a sequence of images is acquired and processed by a wearable device, and we focus on the basic tasks of detecting and recognizing people and their facial expressions. We review some Visual Domain Adaptation methodologies that could be employed to adapt existing classification strategies to this scenario. We also consider other sources of information that could be exploited to improve the performance of the system.

Author information

Corresponding author

Correspondence to Giovanni Ramponi.

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Carrato, S., Fenu, G., Medvet, E., Mumolo, E., Pellegrino, F.A., Ramponi, G. (2015). Towards More Natural Social Interactions of Visually Impaired Persons. In: Battiato, S., Blanc-Talon, J., Gallo, G., Philips, W., Popescu, D., Scheunders, P. (eds) Advanced Concepts for Intelligent Vision Systems. ACIVS 2015. Lecture Notes in Computer Science, vol 9386. Springer, Cham. https://doi.org/10.1007/978-3-319-25903-1_63

  • DOI: https://doi.org/10.1007/978-3-319-25903-1_63

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-25902-4

  • Online ISBN: 978-3-319-25903-1

  • eBook Packages: Computer Science, Computer Science (R0)
