Abstract
We review recent computer vision techniques with reference to the specific goal of assisting the social interactions of a person affected by very severe visual impairment or by total blindness. We consider a scenario in which a sequence of images is acquired and processed by a wearable device, and we focus on the basic tasks of detecting and recognizing people and their facial expression. We review some methodologies of Visual Domain Adaptation that could be employed to adapt existing classification strategies to the specific scenario. We also consider other sources of information that could be exploited to improve the performance of the system.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Be My Eyes: Web site. http://www.bemyeyes.org/ (accessed August 5, 2015)
FaceSpeaker: Web site. http://www.facespeaker.org/ (accessed August 5, 2015)
Horus Technology: Web site. http://horus.technology/en/ (accessed August 5, 2015)
vEyes: Web site. http://www.veyes.it/ (accessed August 5, 2015)
Krishna, S., Little, G., Black, J., Panchanathan, S.: A wearable face recognition system for individuals with visual impairments. In: Proceedings of the 7th international ACM SIGACCESS conference on Computers and accessibility, pp. 106–113. ACM (2005)
McDaniel, T., Krishna, S., Balasubramanian, V., Colbry, D., Panchanathan, S.: Using a haptic belt to convey non-verbal communication cues during social interactions to individuals who are blind. In: IEEE International Workshop on Haptic Audio visual Environments and Games, HAVE 2008, pp. 13–18. IEEE (2008)
Bonetto, M., Carrato, S., Fenu, G., Medvet, E., Mumolo, E., Pellegrino, F.A., Ramponi, G.: Image processing issues in a social assistive system for the blind. In: 9th International Symposium on Image and Signal Processing and Analysis (ISPA), September 2015
Jain, V., Farfade, S.S.: Adapting classification cascades to new domains. In: IEEE International Conference on Computer Vision (ICCV), pp. 105–112. IEEE (2013)
Patel, V., Gopalan, R., Li, R., Chellappa, R.: Visual domain adaptation: A survey of recent advances. IEEE Signal Processing Magazine 32(3), 53–69 (2015)
Jain, V., Learned-Miller, E.: Online domain adaptation of a pre-trained cascade of classifiers. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 577–584. IEEE (2011)
Daumé III, H.: Frustratingly easy domain adaptation. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pp. 256–263 (2007)
Li, W., Duan, L., Xu, D., Tsang, I.W.: Learning with augmented features for supervised and semi-supervised heterogeneous domain adaptation. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(6), 1134–1148 (2014)
Gopalan, R., Li, R., Chellappa, R.: Domain adaptation for object recognition: An unsupervised approach. In: IEEE International Conference on Computer Vision (ICCV), pp. 999–1006. IEEE (2011)
Gopalan, R., Li, R., Chellappa, R.: Unsupervised adaptation across domain shifts by generating intermediate data representations. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(11), 2288–2302 (2014)
Zhang, C., Zhang, Z.: A survey of recent advances in face detection. Microsoft Research, Technical report MSR-TR-2010-66 (2010)
Saenko, K., Kulis, B., Fritz, M., Darrell, T.: Adapting visual category models to new domains. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 213–226. Springer, Heidelberg (2010)
Kulis, B., Saenko, K., Darrell, T.: What you saw is not what you get: Domain adaptation using asymmetric kernel transforms. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1785–1792. IEEE (2011)
Jhuo, I.H., Liu, D., Lee, D., Chang, S.F.: Robust visual domain adaptation with low-rank reconstruction. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2168–2175. IEEE (2012)
Yang, J., Yan, R., Hauptmann, A.G.: Cross-domain video concept detection using adaptive SVMs. In: Proceedings of the 15th international conference on Multimedia, pp. 188–197. ACM (2007)
Aytar, Y., Zisserman, A.: Tabula rasa: Model transfer for object category detection. In: IEEE International Conference on Computer Vision (ICCV), pp. 2252–2259. IEEE (2011)
Dal Col, L., Pellegrino, F.A.: Fast and accurate object detection by means of recursive monomial feature elimination and cascade of SVM. In: Fanti, M., Giua, A. (eds.) Proceedings of the IEEE Conference on Automation Science and Engineering, pp. 304–309, Trieste (2011)
Viola, P., Jones, M.J.: Robust real-time face detection. International Journal of Computer Vision 57(2), 137–154 (2004)
Patil, R., Vineet, S., Mandal, A.S.: Facial expression recognition in image sequences using active shape model and svm. In: Proceedings of the UKSim 5th European Symposium on Computer Modeling and Simulation, pp. 16–18, December 2011
Gu, W., Xiang, C., Venkatesh, Y., Huang, D., Lin, H.: Facial expression recognition using radial encoding of local gabor features and classifier synthesis. Pattern Recognition, pp. 80–91 (2012)
Zhang, S., Zhao, X., Lei, B.: Facial expression recognition based on local binary patterns and local fisher discriminant analysis. WSEAS Trans. Signal Process, pp. 21–31 (2012)
Xiaoming, Z., Shiqing, Z.: Facial expression recognition based on local binary patterns and least squares support vector machines. Lecture Notes in Electrical Engineering 140, 707–712 (2012)
He, L., Wang, X., Yu, C., Wu, K.: Facial expression recognition using embedded hidden markov model. In: IEEE International Conference on Systems, Man and Cybernetics, pp. 1568–1572 (2009)
Piparsaniyan, Y., Sharma, V.K., Mahapatr, K.K.: Robust facial expression recognition using Gabor feature and bayesian discriminating classifier. In: Proc. of Int. Conf. on Comm. and Signal Processing, pp. 538–541 (2014)
Suk, M., Prabhakaran, B.: Real-time facial expression recognition on smartphones. In: Proc. of IEEE Winter Conference on Applications of Computer Vision, pp. 1054–1059 (2015)
Cid, F., Prado, J., Bustos, P., Nunez, P.: A real time and robust facial expression recognition and imitation approach for affective human-robot interaction using Gabor filtering. In: Proc. of IROS, pp. 2188–2193 (2013)
Song, I., Kim, H.J., Jeon, P.B.: Deep learning for real-time robust facial expression recognition on a smartphone. In: Proc. of IEEE Int. Conf. on Cons. Electronics (2014)
Krizhevsky, A., Sutskever, I., Hinton, G.: Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp. 1097–1105 (2012)
Davis, M., Smith, M., Canny, J., Good, N., King, S., Janakiraman, R.: Towards context-aware face recognition. In: Proceedings of the 13th annual ACM international conference on Multimedia, pp. 483–486. ACM (2005)
O’Hare, N., Smeaton, A.F.: Context-aware person identification in personal photo collections. IEEE Transactions on Multimedia 11(2), 220–228 (2009)
Kapoor, A., Lin, D., Baker, S., Hua, G., Akbarzadeh, A.: How to make face recognition work: The power of modeling context. AAAI Work (2012)
Zhou, H., Mian, A., Wei, L., Creighton, D., Hossny, M., Nahavandi, S.: Recent advances on singlemodal and multimodal face recognition: A survey. IEEE Transactions on Human-Machine Systems 44(6), 701–716 (2014)
Paleari, M., Huet, B., Chellali, R.: Towards multimodal emotion recognition: a new approach. In: Proceedings of the ACM International Conference on Image and Video Retrieval, pp. 174–181. ACM (2010)
Acquisti, A., Gross, R., Stutzman, F.: Face recognition and privacy in the age of augmented reality. Journal of Privacy and Confidentiality 6(2), 1 (2014)
Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: Deepface: closing the gap to human-level performance in face verification. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1701–1708. IEEE (2014)
Bharadwaj, S., Vatsa, M., Singh, R.: Aiding face recognition with social context association rule based re-ranking. In: IEEE International Joint Conference on Biometrics (IJCB), pp. 1–8. IEEE (2014)
Medvet, E., Bartoli, A., Davanzo, G., De Lorenzo, A.: Automatic face annotation in news images by mining the web. In: Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, vol. 01, pp. 47–54. IEEE Computer Society (2011)
Wang, D., Hoi, S.C.H., He, Y.: A unified learning framework for auto face annotation by mining web facial images. In: Proceedings of the 21st ACM international conference on Information and knowledge management, pp. 1392–1401. ACM (2012)
Dantone, M., Bossard, L., Quack, T., Van Gool, L.: Augmented faces. In: IEEE International Conference on Computer Vision Workshops (ICCV Workshops), pp. 24–31. IEEE (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Carrato, S., Fenu, G., Medvet, E., Mumolo, E., Pellegrino, F.A., Ramponi, G. (2015). Towards More Natural Social Interactions of Visually Impaired Persons. In: Battiato, S., Blanc-Talon, J., Gallo, G., Philips, W., Popescu, D., Scheunders, P. (eds) Advanced Concepts for Intelligent Vision Systems. ACIVS 2015. Lecture Notes in Computer Science(), vol 9386. Springer, Cham. https://doi.org/10.1007/978-3-319-25903-1_63
Download citation
DOI: https://doi.org/10.1007/978-3-319-25903-1_63
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25902-4
Online ISBN: 978-3-319-25903-1
eBook Packages: Computer ScienceComputer Science (R0)