Abstract
This paper describes the issues and preliminary work involved in the creation of an information retrieval system that will manage the retrieval from collections composed of both speech recognised and ordinary text documents. In previous work, it has been shown that because of recognition errors, ordinary documents are generally retrieved in preference to recognised ones. Means of correcting or eliminating the observed bias is the subject of this paper. Initial ideas and some preliminary results are presented.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
John S. Garofolo, Cedric G. P. Auzanne, Ellen M. Voorhees; “The TREC Spoken Document Retrieval Track: A Success Story”; Text Retrieval Conference (TREC) 8, E. Voorhees, Ed.; Gaithersburg, Maryland, USA; November 16–19, 1999
D. Abberley, S. Renals and G. Cook; “Retrieval of broadcast news documents with the THISL system”; In Proceeding IEEE ICASSP, pp 3781–3784; Seattle, 1998
Steve Renals and Dave Abberley; “The THISL SDR system at TREC-9”; Proceedings of TREC-9; http://www.dcs.shef.ac.uk/~sjr/pubs/2001/trec9.html, last accessed: September 2001
Jean-Manuel Van Thong, David Goddeau, Anna Litvinova, Beth Logan, Pedro Moreno and Michael Swain; “SpeechBot: A Speech Recognition based Audio Indexing System for the Web”; International Conference on Computer-Assisted Information Retrieval, Recherche d’Informations Assistee par Ordinateur (RIAO2000); Paris, April 2000; pp 106–115
Pedro Moreno, JM Van Thong, Beth Logan, Blair Fidler, Katrina Maffey,and Matthew Moores; “SpeechBot: A Content-based Search Index for Multimedia on the Web”; First IEEE Pacific-Rim Conference on Multimedia, (IEEE-PCM 2000), 2000
Amit Singhal, Fernando C. N. Pereira; “Document Expansion for Speech Retrieval”; SIGIR 1999; pp34–41
M.A. Siegler, M.J. Witbrock, S.T. Slattery, K. Seymore, R.E. Jones and A.G. Hauptmann; “Experiments in Spoken Document Retrieval at CMU”; Proceeding of the 6th Text REtrieval Conference (TREC 6); November 19–21, 1997; pp291–302
G.J.F. Jones and M. Han; “Retrieving Scanned Documents from a Mixed-Media Document Collection”; Proceedings of the BCS-IRSG European Colloquium on IR Research; Darmstadt, Germany, pp136–149, April 2001
Franz J.S. McCarley, S. Roukos; “Ad hoc and Multilingual Information Retrieval at IBM”; Proceeding of The Seventh Text REtrieval Conference (TREC 7); November 9–11, 1998; pp157–168
Mark Sanderson and Fabio Crestani; “Mixing and Merging for Spoken Document Retrieval”; Proceedings of the 2nd European Conference on Digital Libraries; Heraklion, Greece, September 1998, pp397–407. Lecture Notes in Computer Science N. 1513, Springer Verlag, Berlin, Germany.
J.S. Garofolo, E.M. Voorhees, C.G.P. Auzanne, M. Stanford, B.A. Lund; “1998 TREC-7 Spoken Document Retrieval Track Overview and Results”, in Proceedings of the DARPA Broadcast News Workshop, 1999
Yoshihiko Gotoh and Steve Renals; “Variable Word Rate N-Grams”; Proceedings of IEEE ICASSP 2000, pp1591–1594.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sanderson, M., Shou, X.M. (2002). Speech and Hand Transcribed Retrieval. In: Coden, A.R., Brown, E.W., Srinivasan, S. (eds) Information Retrieval Techniques for Speech Applications. IRTSA 2001. Lecture Notes in Computer Science, vol 2273. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45637-6_7
Download citation
DOI: https://doi.org/10.1007/3-540-45637-6_7
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43156-5
Online ISBN: 978-3-540-45637-7
eBook Packages: Springer Book Archive