Skip to main content

Part of the book series: Human–Computer Interaction Series ((HCIS))

  • 984 Accesses

Abstract

Person identification is of paramount importance in security, surveillance, human-computer interfaces, and smart spaces. All these applications attempt the recognition of people based on audiovisual data. The way the systems collect these data divides them into two categories: Near-field systems: Both the sensor and the person to be identified focus on each other. Far-field systems: The sensors monitor an entire space in which the person appears, occasionally collecting useful data (face and/or speech) about that person. Also, the person pays no attention to the sensors and is possibly unaware of their existence.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. C. Barras, X. Zhu, J.-L. Gauvain, and L. Lamel. The CLEAR’06 LIMSI acoustic speaker identification system for CHIL seminars. In Multimodal Technologies for Perception of Humans. First International Evaluation Workshop on Classification of Events, Activities and Relationships, CLEAR 2006, LNCS 4122, pages 233–240, 2006.

    Google Scholar 

  2. C. Barras, X. Zhu, C.-C. Leung, J.-L. Gauvain, and L. Lamel. Acoustic speaker identification: The LIMSI CLEAR’07 system. In Multimodal Technologies for Perception of Humans, Proceedings of the International Evaluation Workshops CLEAR 2007 and RT 2007, LNCS 4625, pages 233–239, Baltimore, MD, May 8-11 2007.

    Google Scholar 

  3. P. Ejarque, A. Garde, J. Anguita, and J. Hernando. On the use of genuine-impostor statistical information for score fusion in multimodal biometrics. Annals of Telecommunications, Special Issue on Multimodal Biometrics, 62(1-2):109–129, Apr. 2007.

    Google Scholar 

  4. H. K. Ekenel, M. Fischer, Q. Jin, and R. Stiefelhagen. Multi-modal person identification in a smart environment. In CVPR Biometrics Workshop 2007, IEEE Conference on Computer Vision and Pattern Recognition, Minneapolis, Jun. 2007.

    Google Scholar 

  5. 5. H. K. Ekenel, M. Fischer, and R. Stiefelhagen. Face recognition in smart rooms. In Machine Learning for Multimodal Interaction, Fourth International Workshop, MLMI 2007, Brno, Czech Republic, Jun. 2007.

    Google Scholar 

  6. H. K. Ekenel and Q. Jin. ISL person identification systems in the CLEAR evaluations. In Multimodal Technologies for Perception of Humans. First International Evaluation Workshop on Classification of Events, Activities and Relationships, CLEAR 2006, LNCS 4122, pages 249–257, Southampton, UK, Apr. 6-7 2007.

    Google Scholar 

  7. H. K. Ekenel, Q. Jin, and M. Fischer. ISL person identification systems in the CLEAR 2007 evaluations. In Multimodal Technologies for Perception of Humans, Proceedings of the International Evaluation Workshops CLEAR 2007 and RT 2007, LNCS 4625, pages 256–265, Baltimore, MD, May 8-11 2007.

    Google Scholar 

  8. H. K. Ekenel and A. Pnevmatikakis. Video-based face recognition evaluation in the CHIL project – run 1. In 7th IEEE International Conference on Automatic Face and Gesture Recognition, FG06, pages 85–90, 2006.

    Google Scholar 

  9. H. K. Ekenel and R. Stiefelhagen. Analysis of local appearance-based face recognition: Effects of feature selection and feature normalization. In CVPR Biometrics Workshop, New York, Jun. 2006.

    Google Scholar 

  10. M. Farrús, P. Ejarque, A. Temko, and J. Hernando. Histogram Equalization in SVM Multimodal Person Verification. In ICB, pages 819–827, 2007.

    Google Scholar 

  11. J.-L. Gauvain and C. Lee. Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Transactions on Speech and Audio Processing, 2(2):291–298, Apr. 1994.

    Article  Google Scholar 

  12. S. Z. Li, L. Zhang, S. Liao, X. Zhu, R. Chu, M. Ao, and R. He. A near-infrared image based face recognition system. In 7th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2006), pages 455–460, Southampton, UK, April 2006.

    Google Scholar 

  13. J. Luque and J. Hernando. Robust speaker identification for meetings: UPC CLEAR07 meeting room evaluation system. In Multimodal Technologies for Perception of Humans, Proceedings of the International EvaluationWorkshops CLEAR 2007 and RT 2007, LNCS 4625, pages 266–275, Baltimore, MD, May 8-11 2007.

    Google Scholar 

  14. J. Luque, R. Morros, A. Garde, J. Anguita, M. Farrus, D. Macho, F. Marqués, C. Martínez, V. Vilaplana, and J. Hernando. Audio, video and multimodal person identification in a smart room. In Multimodal Technologies for Perception of Humans. First International Evaluation Workshop on Classification of Events, Activities and Relationships, CLEAR 2006, LNCS 4122, pages 258–269, Southampton, UK, Apr. 6-7 2006. Springer-Verlag.

    Google Scholar 

  15. A. Pnevmatikakis and L. Polymenakos. Far-Field Multi-Camera Video-to-Video Face Recognition. I-Tech Education and Publishing, 2007.

    Google Scholar 

  16. A. Stergiou, A. Pnevmatikakis, and L. Polymenakos. A decision fusion system across time and classifiers for audio-visual person identification. In Multimodal Technologies for Perception of Humans, Proceedings of the first International CLEAR evaluation workshop, CLEAR 2006, LNCS 4122, pages 223–232, Southampton, UK, Apr. 6-7 2006.

    Google Scholar 

  17. A. Stergiou, A. Pnevmatikakis, and L. Polymenakos. The AIT multimodal person identification system for CLEAR 2007. In Multimodal Technologies for Perception of Humans, Proceedings of the International EvaluationWorkshops CLEAR 2007 and RT 2007, LNCS 4625, pages 221–232, Baltimore, MD, May 8-11 2007.

    Google Scholar 

  18. R. Stiefelhagen, K. Bernardin, R. Bowers, J. Garofolo, D. Mostefa, and P. Soundararajan. The CLEAR 2006 evaluation. In Multimodal Technologies for Perception of Humans, Proceedings of the First International CLEAR Evaluation Workshop, CLEAR 2006, LNCS 4122, pages 1–45, Southampton, UK, Apr. 6-7 2006.

    Google Scholar 

  19. R. Stiefelhagen, K. Bernardin, R. Bowers, R. T. Rose, M. Michel, and J. Garofolo. The CLEAR 2007 evaluation. In Multimodal Technologies for Perception of Humans, Proceedings of the International Evaluation Workshops CLEAR 2007 and RT 2007, LNCS 4625, pages 3–34, Baltimore, MD, May 8-11 2007.

    Google Scholar 

  20. P. A. Viola and M. J. Jones. Rapid object detection using a boosted cascade of simple features. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), pages 511–518, Kauai, HI, Dec. 2001.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag London Limited

About this chapter

Cite this chapter

Pnevmatikakis, A., Ekenel, H.K., Barras, C., Hernando, J. (2009). Multimodal Person Identification. In: Waibel, A., Stiefelhagen, R. (eds) Computers in the Human Interaction Loop. Human–Computer Interaction Series. Springer, London. https://doi.org/10.1007/978-1-84882-054-8_4

Download citation

  • DOI: https://doi.org/10.1007/978-1-84882-054-8_4

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-84882-053-1

  • Online ISBN: 978-1-84882-054-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics