Abstract
In this paper we address the problem of extracting caller information from voicemail messages, such as the identity and phone number of the caller. Previous work in information extraction from speech includes spoken document retrieval and named entity detection. This task differs from the named entity task in that the information we are interested in is a subset of the named entities in the message, and consequently, the need to pick the correct subset makes the problem more difficult. Also, the caller’s identity may include information that is not typically associated with a named entity. In this work, we present two information extraction methods, one based on hand-crafted rules, one based on statistically trained maximum entropy model.We evaluate their performance on both manually transcribed messages and on the output of a speech recognition system.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
NIST: Proceedings of the Eighth Text REtrieval Conference (TREC-8) (1999)
DARPA: Proceedings of the DARPA Broadcast News Workshop(1999)
Miller D., Boisen S., Schwartz R., Stone R., Weischedel R.: Named Entity Extraction from Noisy Input: Speech and OCR. Proceedings of ANLP-NAACL (2000) 316–324
Kim J.-H., Woodland P.C.: A Rule-based Named Entity Recognition System for Speech Input. Sixth International Conference on Spoken Language Processing (2000)
Koumpis K., Renals S.: Transcription and Summarization of Voicemail Speech. Sixth International Conference on Spoken Language Processing (2000)
Huang J., Kingsbury B., Mangu L., Padmanabhan M., Saon G., Zweig G.: Recent Improvements in Speech Recognition Performance on Large Vocabulary Conversational Speech (Voicemail and Switchboard). Sixth International Conference on Spoken Language Processing (2000)
Ratnaparkhi A.: A Maximum Entropy Part of Speech Tagger: In Brill E., Church K. (eds.) Conference on Empirical Methods in Natural Language Processing (1996)
Ratnaparkhi A., Reynar J., Roukos S.: A Maximum EntropyModel for Prepositional Phrase Attachment. Proceedings of the Human Language Technology Workshop (1994) 250–255
Andrew Borthwick, John Sterling, Eugene Agichtein, and Ralph Grishman. NYU: Description of the MENE Named Entity System as Used in MUC-7. Seventh Message Understanding Conference(MUC-7) (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Huang, J., Zweig, G., Padmanabhan, M. (2002). Extracting Caller Information from Voicemail. In: Coden, A.R., Brown, E.W., Srinivasan, S. (eds) Information Retrieval Techniques for Speech Applications. IRTSA 2001. Lecture Notes in Computer Science, vol 2273. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45637-6_6
Download citation
DOI: https://doi.org/10.1007/3-540-45637-6_6
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43156-5
Online ISBN: 978-3-540-45637-7
eBook Packages: Springer Book Archive