DOI: 10.1145/1008992.1009002
Article

Building an information retrieval test collection for spontaneous conversational speech

Published: 25 July 2004

Abstract

Test collections model use cases in ways that facilitate evaluation of information retrieval systems. This paper describes the use of search-guided relevance assessment to create a test collection for retrieval of spontaneous conversational speech. Approximately 10,000 thematically coherent segments were manually identified in 625 hours of oral history interviews with 246 individuals. Automatic speech recognition results, manually prepared summaries, controlled vocabulary indexing, and name authority control are available for every segment. Those features were leveraged by a team of four relevance assessors to identify topically relevant segments for 28 topics developed from actual user requests. Search-guided assessment yielded sufficient inter-annotator agreement to support formative evaluation during system development. Baseline results for ranked retrieval are presented to illustrate use of the collection.
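
As a rough illustration of how inter-annotator agreement of the kind mentioned above can be quantified, the sketch below computes Cohen's kappa for two assessors. This is an assumption made for illustration: the abstract does not say which agreement statistic the authors used, and the function name `cohen_kappa` and the judgment lists are hypothetical, not data from the collection.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two assessors who judged the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two equal-length, non-empty label lists")
    n = len(labels_a)
    # Observed agreement: fraction of items given the same label by both.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of each assessor's marginal label rates.
    freq_a = Counter(labels_a)
    freq_b = Counter(labels_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1.0:
        # Both assessors used one identical label throughout.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical binary judgments (1 = relevant, 0 = not relevant)
# for ten segments on a single topic; invented for this example.
assessor_1 = [1, 1, 0, 0, 1, 0, 1, 0, 0, 0]
assessor_2 = [1, 0, 0, 0, 1, 0, 1, 1, 0, 0]
kappa = cohen_kappa(assessor_1, assessor_2)
print(f"kappa = {kappa:.3f}")
```

Kappa corrects raw agreement for the agreement expected by chance, which matters when most segments are non-relevant and two assessors would agree often even if they judged at random.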

    Published In

    SIGIR '04: Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
    July 2004
    624 pages
    ISBN:1581138814
    DOI:10.1145/1008992
    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. assessment
    2. automatic speech recognition
    3. oral history
    4. search-guided relevance

    Acceptance Rates

    Overall Acceptance Rate 792 of 3,983 submissions, 20%
