Abstract
The advertising mentions are publicity contents that are not prerecorded, usually are said by radio or TV broadcasters to publicize a product or a company. The main difficulty of detecting advertising mentions is that the audio is not exactly repeated every time, as happens with conventional prerecorded advertising where more efficient techniques such as audio fingerprinting can be used. This paper proposes the use of a keyword search system in Spanish for the detection of advertising mentions. For that, it has been necessary to train and evaluate a new speech recognizer in Spanish (LVCSR) using the Kaldi tool and databases Fisher Spanish and Callhome Spanish. The best word error rate we have obtained on conversational telephone speech is 41.10 %. For the evaluation of mentions detection a specific database in Spanish has been created, containing 300 h of audio, 25 of which have been tagged with different types of information, including mentions appearing in the audio. The recognizer has been applied to all advertising mentions in search for mention specific keywords, achieving a detection rate of about 74 %.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
IARPA: Babel Program. Intelligence Advanced Research Projects Activity (IARPA), Washington DC, USA (2011). http://www.iarpa.gov/images/files/programs/babel/Babel-Kickoff-Summary.pdf
NIST Open KeyWord Search (OpenKWS). http://www.nist.gov/itl/iad/mig/openkws.cfm. Accessed 19 June 2016
MEDIAEVAL Evaluations. http://www.multimediaeval.org/. Accessed 19 June 2016
MEDIAEVAL Query by Example Search on Speech Task (QUESST) Evaluation. http://www.multimediaeval.org/mediaeval2015/quesst2015/. Accessed 19 June 2016
ALBAYZIN Search-on-Speech Evaluation (2016). https://iberspeech2016.inesc-id.pt/index.php/albayzin-evaluation/#sos-identifier. Accessed 19 June 2016
Tejedor, J., et al.: Spoken term detection ALBAYZIN 2014 evaluation: overview, systems, results, and discussion. EURASIP J. Audio Speech Music Process. 2015(1), 1–27 (2015)
Chen, G., Yilmaz, O., Trmal, J., Povey, D., Khudanpur, S.: Using proxies for OOV keywords in the keyword search task. In: 2013 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp. 416–421. IEEE (2013)
Povey, D., Ghoshal, A., Boulianne, G., Burget, L., Glembek, O., Goel, N., Silovsky, J.: The Kaldi speech recognition toolkit. In: IEEE 2011 Workshop on Automatic Speech Recognition and Understanding. IEEE Signal Processing Society (2011)
Corpus Switchboard. It is available in Linguistic Data Consortium. Ref. LDC2010S01. https://catalog.ldc.upenn.edu/LDC2010S01. Accessed 16 May 2016)
Corpus Switchboard. It is available in Linguistic Data Consortium .Ref. LDC96S35. https://catalog.ldc.upenn.edu/LDC96S35. Accessed 16 May 2016
Linguistic Data Consortium. https://ldc.upenn.edu/. Accessed 18 May 2016
Kaldi. http://kaldi-asr.org/doc/examples.html. Accessed 20 Sep 2016
Post, M., Kumar, G., López, A., Karakos, D., Callison-Burch, C., Khudanpur, Y.S.: Improved speech-to-text translation with the fisher and Callhome Spanish–English speech translation corpus. In: International Workshop on Spoken Language Translation, p. 4, December 2013
Xu, J., Toledano, D.T., Tejedor, J.: The ATVS-GEINTRA STD system for ALBAYZIN 2014 search-on-speech evaluation. In: Proceedings of the IberSPEECH 2014, Las Palmas de Gran Canaria, Spain, pp. 290–298. http://iberspeech2014.ulpgc.es/images/Iberspeech2014_OnlineProceedings.pdf. Accessed 20 June 2016
Acknowledgements
This work has been partly funded by the Spanish Ministry of Economy and Competitiveness under project TEC2015-68172-C2-1-P (DSSL) and project TEC2012-37585-C02-01 (CMC-V2).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Fernández-Gallego, M.P., Mesa-Castellanos, Á., Lozano-Díez, A., Toledano, D.T. (2016). Detection of Publicity Mentions in Broadcast Radio: Preliminary Results. In: Abad, A., et al. Advances in Speech and Language Technologies for Iberian Languages. IberSPEECH 2016. Lecture Notes in Computer Science(), vol 10077. Springer, Cham. https://doi.org/10.1007/978-3-319-49169-1_11
Download citation
DOI: https://doi.org/10.1007/978-3-319-49169-1_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-49168-4
Online ISBN: 978-3-319-49169-1
eBook Packages: Computer ScienceComputer Science (R0)