Speech and Hand Transcribed Retrieval

Sanderson, Mark; Shou, Xiao Mang

doi:10.1007/3-540-45637-6_7

Mark Sanderson⁶ &
Xiao Mang Shou⁶

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2273))

Included in the following conference series:

Workshop on Information Retrieval Techniques for Speech Applications

217 Accesses

Abstract

This paper describes the issues and preliminary work involved in the creation of an information retrieval system that will manage the retrieval from collections composed of both speech recognised and ordinary text documents. In previous work, it has been shown that because of recognition errors, ordinary documents are generally retrieved in preference to recognised ones. Means of correcting or eliminating the observed bias is the subject of this paper. Initial ideas and some preliminary results are presented.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Using Related Text Sources to Improve Classification of Transcribed Speech Data

Comparison of Retrieval Approaches and Blind Relevance Feedback Methods Within the Czech Speech Information Retrieval

Statistical language models for query-by-example spoken document retrieval

Article 03 January 2020

References

John S. Garofolo, Cedric G. P. Auzanne, Ellen M. Voorhees; “The TREC Spoken Document Retrieval Track: A Success Story”; Text Retrieval Conference (TREC) 8, E. Voorhees, Ed.; Gaithersburg, Maryland, USA; November 16–19, 1999
Google Scholar
D. Abberley, S. Renals and G. Cook; “Retrieval of broadcast news documents with the THISL system”; In Proceeding IEEE ICASSP, pp 3781–3784; Seattle, 1998
Google Scholar
Steve Renals and Dave Abberley; “The THISL SDR system at TREC-9”; Proceedings of TREC-9; http://www.dcs.shef.ac.uk/~sjr/pubs/2001/trec9.html, last accessed: September 2001
Jean-Manuel Van Thong, David Goddeau, Anna Litvinova, Beth Logan, Pedro Moreno and Michael Swain; “SpeechBot: A Speech Recognition based Audio Indexing System for the Web”; International Conference on Computer-Assisted Information Retrieval, Recherche d’Informations Assistee par Ordinateur (RIAO2000); Paris, April 2000; pp 106–115
Google Scholar
Pedro Moreno, JM Van Thong, Beth Logan, Blair Fidler, Katrina Maffey,and Matthew Moores; “SpeechBot: A Content-based Search Index for Multimedia on the Web”; First IEEE Pacific-Rim Conference on Multimedia, (IEEE-PCM 2000), 2000
Google Scholar
Amit Singhal, Fernando C. N. Pereira; “Document Expansion for Speech Retrieval”; SIGIR 1999; pp34–41
Google Scholar
M.A. Siegler, M.J. Witbrock, S.T. Slattery, K. Seymore, R.E. Jones and A.G. Hauptmann; “Experiments in Spoken Document Retrieval at CMU”; Proceeding of the 6^th Text REtrieval Conference (TREC 6); November 19–21, 1997; pp291–302
Google Scholar
G.J.F. Jones and M. Han; “Retrieving Scanned Documents from a Mixed-Media Document Collection”; Proceedings of the BCS-IRSG European Colloquium on IR Research; Darmstadt, Germany, pp136–149, April 2001
Google Scholar
Franz J.S. McCarley, S. Roukos; “Ad hoc and Multilingual Information Retrieval at IBM”; Proceeding of The Seventh Text REtrieval Conference (TREC 7); November 9–11, 1998; pp157–168
Google Scholar
Mark Sanderson and Fabio Crestani; “Mixing and Merging for Spoken Document Retrieval”; Proceedings of the 2^nd European Conference on Digital Libraries; Heraklion, Greece, September 1998, pp397–407. Lecture Notes in Computer Science N. 1513, Springer Verlag, Berlin, Germany.
Google Scholar
J.S. Garofolo, E.M. Voorhees, C.G.P. Auzanne, M. Stanford, B.A. Lund; “1998 TREC-7 Spoken Document Retrieval Track Overview and Results”, in Proceedings of the DARPA Broadcast News Workshop, 1999
Google Scholar
Yoshihiko Gotoh and Steve Renals; “Variable Word Rate N-Grams”; Proceedings of IEEE ICASSP 2000, pp1591–1594.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information Studies, University of Sheffield, Western Bank, S10 2TN, Sheffield, UK
Mark Sanderson & Xiao Mang Shou

Authors

Mark Sanderson
View author publications
You can also search for this author in PubMed Google Scholar
Xiao Mang Shou
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

IBM T.J. Watson Research Center, P.O.Box 704, 10598, Yorktown Heights, NY, USA
Anni R. Coden & Eric W. Brown &
IBM Almaden Research Center, 650 Harry Road, 95120, San Jose, CA, USA
Savitha Srinivasan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sanderson, M., Shou, X.M. (2002). Speech and Hand Transcribed Retrieval. In: Coden, A.R., Brown, E.W., Srinivasan, S. (eds) Information Retrieval Techniques for Speech Applications. IRTSA 2001. Lecture Notes in Computer Science, vol 2273. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45637-6_7

Download citation

DOI: https://doi.org/10.1007/3-540-45637-6_7
Published: 22 January 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43156-5
Online ISBN: 978-3-540-45637-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics