Towards More Natural Social Interactions of Visually Impaired Persons

Carrato, Sergio; Fenu, Gianfranco; Medvet, Eric; Mumolo, Enzo; Pellegrino, Felice Andrea; Ramponi, Giovanni

doi:10.1007/978-3-319-25903-1_63

Conference paper
First Online: 06 November 2015

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 9386)

Abstract

We review recent computer vision techniques with reference to the specific goal of assisting the social interactions of a person affected by very severe visual impairment or by total blindness. We consider a scenario in which a sequence of images is acquired and processed by a wearable device, and we focus on the basic tasks of detecting and recognizing people and their facial expression. We review some methodologies of Visual Domain Adaptation that could be employed to adapt existing classification strategies to the specific scenario. We also consider other sources of information that could be exploited to improve the performance of the system.

Author information

Authors and Affiliations

DIA, University of Trieste, Trieste, Italy
Sergio Carrato, Gianfranco Fenu, Eric Medvet, Enzo Mumolo, Felice Andrea Pellegrino & Giovanni Ramponi

Corresponding author

Correspondence to Giovanni Ramponi.

Editor information

Editors and Affiliations

Dipartimento di Matematica e Informatica, Università di Catania, Catania, Catania, Italy
Sebastiano Battiato
Arcueil CX, France
Jacques Blanc-Talon
Catania, Italy
Giovanni Gallo
Gent, Belgium
Wilfried Philips
CSIRO, Sydney, New South Wales, Australia
Dan Popescu
Vision Lab., University of Antwerp, Antwerpen, Belgium
Paul Scheunders

About this paper

Cite this paper

Carrato, S., Fenu, G., Medvet, E., Mumolo, E., Pellegrino, F.A., Ramponi, G. (2015). Towards More Natural Social Interactions of Visually Impaired Persons. In: Battiato, S., Blanc-Talon, J., Gallo, G., Philips, W., Popescu, D., Scheunders, P. (eds) Advanced Concepts for Intelligent Vision Systems. ACIVS 2015. Lecture Notes in Computer Science, vol 9386. Springer, Cham. https://doi.org/10.1007/978-3-319-25903-1_63

DOI: https://doi.org/10.1007/978-3-319-25903-1_63
Published: 06 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25902-4
Online ISBN: 978-3-319-25903-1
eBook Packages: Computer Science, Computer Science (R0)