Abstract
Person identification is of paramount importance in security, surveillance, human-computer interfaces, and smart spaces. All these applications attempt the recognition of people based on audiovisual data. The way the systems collect these data divides them into two categories: Near-field systems: Both the sensor and the person to be identified focus on each other. Far-field systems: The sensors monitor an entire space in which the person appears, occasionally collecting useful data (face and/or speech) about that person. Also, the person pays no attention to the sensors and is possibly unaware of their existence.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
C. Barras, X. Zhu, J.-L. Gauvain, and L. Lamel. The CLEAR’06 LIMSI acoustic speaker identification system for CHIL seminars. In Multimodal Technologies for Perception of Humans. First International Evaluation Workshop on Classification of Events, Activities and Relationships, CLEAR 2006, LNCS 4122, pages 233–240, 2006.
C. Barras, X. Zhu, C.-C. Leung, J.-L. Gauvain, and L. Lamel. Acoustic speaker identification: The LIMSI CLEAR’07 system. In Multimodal Technologies for Perception of Humans, Proceedings of the International Evaluation Workshops CLEAR 2007 and RT 2007, LNCS 4625, pages 233–239, Baltimore, MD, May 8-11 2007.
P. Ejarque, A. Garde, J. Anguita, and J. Hernando. On the use of genuine-impostor statistical information for score fusion in multimodal biometrics. Annals of Telecommunications, Special Issue on Multimodal Biometrics, 62(1-2):109–129, Apr. 2007.
H. K. Ekenel, M. Fischer, Q. Jin, and R. Stiefelhagen. Multi-modal person identification in a smart environment. In CVPR Biometrics Workshop 2007, IEEE Conference on Computer Vision and Pattern Recognition, Minneapolis, Jun. 2007.
5. H. K. Ekenel, M. Fischer, and R. Stiefelhagen. Face recognition in smart rooms. In Machine Learning for Multimodal Interaction, Fourth International Workshop, MLMI 2007, Brno, Czech Republic, Jun. 2007.
H. K. Ekenel and Q. Jin. ISL person identification systems in the CLEAR evaluations. In Multimodal Technologies for Perception of Humans. First International Evaluation Workshop on Classification of Events, Activities and Relationships, CLEAR 2006, LNCS 4122, pages 249–257, Southampton, UK, Apr. 6-7 2007.
H. K. Ekenel, Q. Jin, and M. Fischer. ISL person identification systems in the CLEAR 2007 evaluations. In Multimodal Technologies for Perception of Humans, Proceedings of the International Evaluation Workshops CLEAR 2007 and RT 2007, LNCS 4625, pages 256–265, Baltimore, MD, May 8-11 2007.
H. K. Ekenel and A. Pnevmatikakis. Video-based face recognition evaluation in the CHIL project – run 1. In 7th IEEE International Conference on Automatic Face and Gesture Recognition, FG06, pages 85–90, 2006.
H. K. Ekenel and R. Stiefelhagen. Analysis of local appearance-based face recognition: Effects of feature selection and feature normalization. In CVPR Biometrics Workshop, New York, Jun. 2006.
M. Farrús, P. Ejarque, A. Temko, and J. Hernando. Histogram Equalization in SVM Multimodal Person Verification. In ICB, pages 819–827, 2007.
J.-L. Gauvain and C. Lee. Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Transactions on Speech and Audio Processing, 2(2):291–298, Apr. 1994.
S. Z. Li, L. Zhang, S. Liao, X. Zhu, R. Chu, M. Ao, and R. He. A near-infrared image based face recognition system. In 7th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2006), pages 455–460, Southampton, UK, April 2006.
J. Luque and J. Hernando. Robust speaker identification for meetings: UPC CLEAR07 meeting room evaluation system. In Multimodal Technologies for Perception of Humans, Proceedings of the International EvaluationWorkshops CLEAR 2007 and RT 2007, LNCS 4625, pages 266–275, Baltimore, MD, May 8-11 2007.
J. Luque, R. Morros, A. Garde, J. Anguita, M. Farrus, D. Macho, F. Marqués, C. Martínez, V. Vilaplana, and J. Hernando. Audio, video and multimodal person identification in a smart room. In Multimodal Technologies for Perception of Humans. First International Evaluation Workshop on Classification of Events, Activities and Relationships, CLEAR 2006, LNCS 4122, pages 258–269, Southampton, UK, Apr. 6-7 2006. Springer-Verlag.
A. Pnevmatikakis and L. Polymenakos. Far-Field Multi-Camera Video-to-Video Face Recognition. I-Tech Education and Publishing, 2007.
A. Stergiou, A. Pnevmatikakis, and L. Polymenakos. A decision fusion system across time and classifiers for audio-visual person identification. In Multimodal Technologies for Perception of Humans, Proceedings of the first International CLEAR evaluation workshop, CLEAR 2006, LNCS 4122, pages 223–232, Southampton, UK, Apr. 6-7 2006.
A. Stergiou, A. Pnevmatikakis, and L. Polymenakos. The AIT multimodal person identification system for CLEAR 2007. In Multimodal Technologies for Perception of Humans, Proceedings of the International EvaluationWorkshops CLEAR 2007 and RT 2007, LNCS 4625, pages 221–232, Baltimore, MD, May 8-11 2007.
R. Stiefelhagen, K. Bernardin, R. Bowers, J. Garofolo, D. Mostefa, and P. Soundararajan. The CLEAR 2006 evaluation. In Multimodal Technologies for Perception of Humans, Proceedings of the First International CLEAR Evaluation Workshop, CLEAR 2006, LNCS 4122, pages 1–45, Southampton, UK, Apr. 6-7 2006.
R. Stiefelhagen, K. Bernardin, R. Bowers, R. T. Rose, M. Michel, and J. Garofolo. The CLEAR 2007 evaluation. In Multimodal Technologies for Perception of Humans, Proceedings of the International Evaluation Workshops CLEAR 2007 and RT 2007, LNCS 4625, pages 3–34, Baltimore, MD, May 8-11 2007.
P. A. Viola and M. J. Jones. Rapid object detection using a boosted cascade of simple features. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), pages 511–518, Kauai, HI, Dec. 2001.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag London Limited
About this chapter
Cite this chapter
Pnevmatikakis, A., Ekenel, H.K., Barras, C., Hernando, J. (2009). Multimodal Person Identification. In: Waibel, A., Stiefelhagen, R. (eds) Computers in the Human Interaction Loop. Human–Computer Interaction Series. Springer, London. https://doi.org/10.1007/978-1-84882-054-8_4
Download citation
DOI: https://doi.org/10.1007/978-1-84882-054-8_4
Publisher Name: Springer, London
Print ISBN: 978-1-84882-053-1
Online ISBN: 978-1-84882-054-8
eBook Packages: Computer ScienceComputer Science (R0)