Skip to main content

A speaker identification agent

  • Text-independent Speaker Authentication
  • Conference paper
  • First Online:
Audio- and Video-based Biometric Person Authentication (AVBPA 1997)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1206))

  • 2406 Accesses

Abstract

This paper describes a prototype application which combines speaker identification technology and an agent architecture to provide user-definable monitors for incoming voicemail messages. Through a Web-distributable Java user interface, the user may enter requests by using spoken or typed natural language. Multiple distributed agents process the requests, periodically testing the user's voicemail system to identify the composer of the message from a set of selected speakers. When a message meets the conditions specified by the user, agents locate the requester's position and notify him or her of the arrival of the important message by using various communication media (email, fax, telephone, pager). The technology responsible for identifying a speaker from voice is a text-independent method developed at SRI International. Encapsulating this capability as an agent permits plug- and-play reusability in the growing number of applications being developed within the agent-based framework.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Automated Office Assistant System: http://www.ai.sri.com/cgi-bin/oaa/office.pl

    Google Scholar 

  2. Cheyer A. and Julia L. (1995). Multimodal Maps: An Agent-based Approach. CMC'95: Eindhoven, Netherlands, pp. 103–113.

    Google Scholar 

  3. CIC Signature SentinelTM: http://www.cic.com

    Google Scholar 

  4. Heck L.P. and Weintraub M. (1997). Handset-Dependent Background Models for Robust Text-Independent Speaker Recognition. Submitted to IEEE Proc. Intern. Conf. on Acoust., Speech, and Signal Proc.: Munich, Germany.

    Google Scholar 

  5. Higgins A., Bahler L. and Porter J. (1992). Speaker verification using randomized phrase prompting. Digital Signal Processing, Vol. 1, pp. 89–106.

    Google Scholar 

  6. Open Agent ArchitectureTM: http://www.ai.sri.com/∼oaa

    Google Scholar 

  7. Reynolds D.A. (1995). Speaker identification and verification using Gaussian mixture speaker models. Speech Communications, Vol. 17, pp. 91–108.

    Google Scholar 

  8. Rosenberg A.E., DeLong J., Lee C.H., Juang B.H. and Soong F.K. (1992). The use of cohort normalized scores for speaker verification. IEEE Proc. Intern. Conf. Speech and Signal Proc., pp. 599–602.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Josef Bigün Gérard Chollet Gunilla Borgefors

Rights and permissions

Reprints and permissions

Copyright information

© 1997 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Julia, L.E., Heck, L.P., Cheyer, A.J. (1997). A speaker identification agent. In: Bigün, J., Chollet, G., Borgefors, G. (eds) Audio- and Video-based Biometric Person Authentication. AVBPA 1997. Lecture Notes in Computer Science, vol 1206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0016003

Download citation

  • DOI: https://doi.org/10.1007/BFb0016003

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-62660-2

  • Online ISBN: 978-3-540-68425-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics