Skip to main content

Detection of Publicity Mentions in Broadcast Radio: Preliminary Results

  • Conference paper
  • First Online:
Advances in Speech and Language Technologies for Iberian Languages (IberSPEECH 2016)

Abstract

The advertising mentions are publicity contents that are not prerecorded, usually are said by radio or TV broadcasters to publicize a product or a company. The main difficulty of detecting advertising mentions is that the audio is not exactly repeated every time, as happens with conventional prerecorded advertising where more efficient techniques such as audio fingerprinting can be used. This paper proposes the use of a keyword search system in Spanish for the detection of advertising mentions. For that, it has been necessary to train and evaluate a new speech recognizer in Spanish (LVCSR) using the Kaldi tool and databases Fisher Spanish and Callhome Spanish. The best word error rate we have obtained on conversational telephone speech is 41.10 %. For the evaluation of mentions detection a specific database in Spanish has been created, containing 300 h of audio, 25 of which have been tagged with different types of information, including mentions appearing in the audio. The recognizer has been applied to all advertising mentions in search for mention specific keywords, achieving a detection rate of about 74 %.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. IARPA: Babel Program. Intelligence Advanced Research Projects Activity (IARPA), Washington DC, USA (2011). http://www.iarpa.gov/images/files/programs/babel/Babel-Kickoff-Summary.pdf

  2. NIST Open KeyWord Search (OpenKWS). http://www.nist.gov/itl/iad/mig/openkws.cfm. Accessed 19 June 2016

  3. MEDIAEVAL Evaluations. http://www.multimediaeval.org/. Accessed 19 June 2016

  4. MEDIAEVAL Query by Example Search on Speech Task (QUESST) Evaluation. http://www.multimediaeval.org/mediaeval2015/quesst2015/. Accessed 19 June 2016

  5. ALBAYZIN Search-on-Speech Evaluation (2016). https://iberspeech2016.inesc-id.pt/index.php/albayzin-evaluation/#sos-identifier. Accessed 19 June 2016

  6. Tejedor, J., et al.: Spoken term detection ALBAYZIN 2014 evaluation: overview, systems, results, and discussion. EURASIP J. Audio Speech Music Process. 2015(1), 1–27 (2015)

    Article  Google Scholar 

  7. Chen, G., Yilmaz, O., Trmal, J., Povey, D., Khudanpur, S.: Using proxies for OOV keywords in the keyword search task. In: 2013 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp. 416–421. IEEE (2013)

    Google Scholar 

  8. Povey, D., Ghoshal, A., Boulianne, G., Burget, L., Glembek, O., Goel, N., Silovsky, J.: The Kaldi speech recognition toolkit. In: IEEE 2011 Workshop on Automatic Speech Recognition and Understanding. IEEE Signal Processing Society (2011)

    Google Scholar 

  9. Corpus Switchboard. It is available in Linguistic Data Consortium. Ref. LDC2010S01. https://catalog.ldc.upenn.edu/LDC2010S01. Accessed 16 May 2016)

  10. Corpus Switchboard. It is available in Linguistic Data Consortium .Ref. LDC96S35. https://catalog.ldc.upenn.edu/LDC96S35. Accessed 16 May 2016

  11. Linguistic Data Consortium. https://ldc.upenn.edu/. Accessed 18 May 2016

  12. Kaldi. http://kaldi-asr.org/doc/examples.html. Accessed 20 Sep 2016

  13. Post, M., Kumar, G., López, A., Karakos, D., Callison-Burch, C., Khudanpur, Y.S.: Improved speech-to-text translation with the fisher and Callhome Spanish–English speech translation corpus. In: International Workshop on Spoken Language Translation, p. 4, December 2013

    Google Scholar 

  14. Xu, J., Toledano, D.T., Tejedor, J.: The ATVS-GEINTRA STD system for ALBAYZIN 2014 search-on-speech evaluation. In: Proceedings of the IberSPEECH 2014, Las Palmas de Gran Canaria, Spain, pp. 290–298. http://iberspeech2014.ulpgc.es/images/Iberspeech2014_OnlineProceedings.pdf. Accessed 20 June 2016

Download references

Acknowledgements

This work has been partly funded by the Spanish Ministry of Economy and Competitiveness under project TEC2015-68172-C2-1-P (DSSL) and project TEC2012-37585-C02-01 (CMC-V2).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Doroteo T. Toledano .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Fernández-Gallego, M.P., Mesa-Castellanos, Á., Lozano-Díez, A., Toledano, D.T. (2016). Detection of Publicity Mentions in Broadcast Radio: Preliminary Results. In: Abad, A., et al. Advances in Speech and Language Technologies for Iberian Languages. IberSPEECH 2016. Lecture Notes in Computer Science(), vol 10077. Springer, Cham. https://doi.org/10.1007/978-3-319-49169-1_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-49169-1_11

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-49168-4

  • Online ISBN: 978-3-319-49169-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics