Skip to main content

ATVS-CSLT-HCTLab System for NIST 2013 Open Keyword Search Evaluation

  • Conference paper
Advances in Speech and Language Technologies for Iberian Languages

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8854))

  • 808 Accesses

Abstract

This paper presents the ATVS-CSLT-HCTLab spoken term detection (STD) system submitted to the NIST 2013 Open Keyword Search evaluation. The evaluation consists of searching a list of query terms in Vietnamese conversational speech data. Our submission involves an automatic speech recognition (ASR) subsystem which converts speech signals into word/phone lattices, and an STD subsystem which indexes and searches for query terms. The submission is a hybrid approach which employs a word-based system to search for in-vocabulary (INV) terms and a phone-based system to search for out-of-vocabulary (OOV) terms. A term-dependent discriminative confidence estimation is employed to score confidence of detections. Although the ASR performance is not state-of-the-art, our submission achieves a moderate STD performance in the evaluation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abad, A., Rodríguez-Fuentes, L.J., Peñagarikano, M., Varona, A., Bordel, G.: On the calibration and fusion of heterogeneous spoken term detection systems. In: Proc. of Interspeech, pp. 20–24 (2013)

    Google Scholar 

  2. Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: An update. SIGKDD Explorations 11(1) (2009)

    Google Scholar 

  3. Katsurada, K., Miura, S., Seng, K., Iribe, Y., Nitta, T.: Acceleration of spoken term detection using a suffix array by assigning optimal threshold values to subkeywords. In: Proc. of Interspeech, pp. 11–14 (2013)

    Google Scholar 

  4. Li, H., Han, J., Zheng, T., Zheng, G.: A novel confidence measure based on context consistency for spoken term detection. In: Proc. of Interspeech, pp. 2429–2430 (2012)

    Google Scholar 

  5. Liu, C., Wang, D., Tejedor, J.: N-gram FST indexing for spoken term detection. In: Proc. of Interspeech, pp. 2093–2096 (2012)

    Google Scholar 

  6. Martin, A., Doddington, G., Kamm, T., Ordowski, M., Przybocki, M.: The DET curve in assessment of detection task performance. In: Proc. of Eurospeech, pp. 1895–1898 (1997)

    Google Scholar 

  7. NIST: The spoken term detection (STD) 2006 evaluation plan, 10 edn. National Institute of Standards and Technology (NIST), Gaithersburg, MD, USA (September 2006), http://www.nist.gov/speech/tests/std

  8. Norouzian, A., Rose, R.: An approach for efficient open vocabulary spoken term detection. Speech Communication 57, 50–62 (2014)

    Article  Google Scholar 

  9. Povey, D., Ghoshal, A., Boulianne, G., Burget, L., Glembek, O., Goel, N., Hannemann, M., Motlicek, P., Qian, Y., Schwarz, P., Silovsky, J., Stemmer, G., Vesely, K.: The KALDI speech recognition toolkit. In: Proc. of ASRU (2011)

    Google Scholar 

  10. Stolcke, A.: SRILM - an extensible language modeling tool. In: Proc. of ICSLP, pp. 901–904 (2002)

    Google Scholar 

  11. Szoke, I.: Hybrid word-subword spoken term detection. Ph.D. thesis, Brno University of Technology (June 2010)

    Google Scholar 

  12. Wang, D.: Out-of-vocabulary Spoken Term Detection. Ph.D. thesis, University of Edinburgh (December 2009)

    Google Scholar 

  13. Wang, D., King, S.: Letter-to-sound pronunciation prediction using conditional random fields. IEEE Signal Processing Letters 18(2), 122–125 (2011)

    Article  Google Scholar 

  14. Wang, D., Tejedor, J., King, S., Frankel, J.: Term-dependent confidence normalization for out-of-vocabulary spoken term detection. Journal of Computer Science and Technology 27(2), 358–375 (2012)

    Article  Google Scholar 

  15. Wessel, F., Macherey, K., Schluter, R.: Using word probabilities as confidence measures. In: Proc. of ICASSP, pp. 225–228 (1998)

    Google Scholar 

  16. Young, S., Evermann, G., Gales, M., Hain, T., Kershaw, D., Liu, X., Moore, G., Odell, J., Ollason, D., Povey, D., Valtchev, V., Woodland, P.: The HTK v3.4 Book. Engineering Department, Cambridge University (March 2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Tejedor, J., Toledano, D.T., Wang, D. (2014). ATVS-CSLT-HCTLab System for NIST 2013 Open Keyword Search Evaluation. In: Navarro Mesa, J.L., et al. Advances in Speech and Language Technologies for Iberian Languages. Lecture Notes in Computer Science(), vol 8854. Springer, Cham. https://doi.org/10.1007/978-3-319-13623-3_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-13623-3_26

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-13622-6

  • Online ISBN: 978-3-319-13623-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics