Abstract
The information is one of the most valuable commodities nowadays. The information retrieval mechanisms from broadcast news recordings is then becoming the one of the most requested services from the end-users. The planned Slovak automatic broadcast news (BN) processing service provides automatic transcribing and metadata extracting abilities, enabling users to obtain information from the processed recordings using a web interface and the search engine. The resulted information is then provided trough multimodal interface, which allows users to see not only recorded audio-visual material, but also all automatically extracted metadata (verbal and nonverbal), and also to select incorrectly automatically identified data. The architecture of the present system is linear, which means every module starts after the previous has finished the data processing.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Nouza, J., Nejedlova, D., Zdansky, J., Kolorenc, J.: Very Large Vocabulary Speech Recognition System for Automatic Transcription of Czech Broadcast. In: Proceedings of ICSLP 2004, Jeju Island, Korea, October 2004, pp. 409–412 (2004) ISSN 1225-441x
Nouza, J., Zdansky, J., Cerva, P., Kolorenc, J.: Continual on-line monitoring of Czech spoken broadcast programs. In: INTERSPEECH-2006, paper 1478-Wed1CaP.13 (2006)
Seymore, K., Chen, S., Doh, S.J., Eskenazi, M., Gouvea, E., Raj, B., Ravishankar, M., Rosenfeld, R., Siegler, M., Stern, R., Thayer, E.: The 1997 CMU Sphinx-3 English Broadcast News transcription system. In: Proceedings of the DARPA Speech Recognition Workshop (1998)
Gauvain, J.-L.: The LIMSI 1999 Hub-4E Transcription System. In: Proceedings of DARPA Speech Transcription Workshop 2000 (2000)
Gauvain, J.L., Lamel, L., Adda, G.: The LIMSI Broadcast News Transcription System. In: Speech Communication (2002), http://citeseer.ist.psu.edu/gauvain02limsi.html
McTait, K., Adda-Decker, M.: The 300k LIMSI German Broadcast News Transcription System. In: Eurospeech 2003, Genova,
Huerta, J.M., Thayer, E., Ravishankar, M., Stern, R.M.: The Development of the 1997 CMU Spanish Broadcast News Transcription System. In: Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne, Virginia (February 1998)
Meinedo, H., Caseiro, D., Neto, J., Trancoso, I.: AUDIMUS.media: a broadcast news speech recognition system for the European Portuguese language. In: Mamede, N.J., Baptista, J., Trancoso, I., Nunes, M.d.G.V. (eds.) PROPOR 2003. LNCS, vol. 2721, pp. 9–17. Springer, Heidelberg (2003)
Riedler, J., Katsikas, S.: Development of a Modern Greek Broadcast-News Corpus and Speech Recognition System. In: Nivre, J., Kaalep, H.-J., Muischnek, K., Koit, M. (eds.) Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA 2007, pp. 380–383. University of Tartu, Tartu (2007)
Marcello, F.: A System for the Retrieval of Italian Broadcast News. Speech Communication 33(1-2) (2000)
Brugnara, F., Cettolo, M., Federico, M., Giuliani, D.: Advances in automatic transcription of Italian broadcast news. In: Proceedings of ICSLP, Beijing, China, vol. II, pp. 660–663 (2000)
Che, C., Yuk, D., Chennoukh, S., Flanagan, J.: Development of the RU Hub4 system. In: Proceedings of DARPA Speech Recognition Workshop (1997)
Zibert, J., Mihelic, F., Martens, J.-P., Meinedo, H., Neto, J., Docio, L., Garcia-Mateo, C., David, P., Zdansky, J., Pleva, M., Cizmar, A., Zgank, A., Kacic, Z., Teleki, C., Vicsi, K.: COST278 broadcast news segmentation and speaker clustering evaluation. In: Interspeech 2005 Proceedings of the 9th European Conference on Speech Communication and Technology, Lisboa, pp. 629–632. Universität Bonn, Bonn (2005)
Pitz, M., Molau, S., Schluter, R., Ney, H.: Automatic transcription verification of broadcast news and similar speech corpora. In: Proceedings of the DARPA Broadcast News Workshop (March 1999)
Manta, M., Antoine, F., Galliano, S., Barras, C., Geoffrois, E., Liberman, M., Wu, Z.: Transcriber tool website, http://trans.sourceforge.net/en/presentation.php
Pleva, M., Juhár, J., Čižmár, A.: Slovak broadcast news speech corpus for automatic speech recognition. In: RTT 2007: Research in Telecommunication Technology: 8th international conference: Žilina - Liptovský Ján, Slovak Republic, September 10-12, 2007, pp. 334–337 (2007) ISBN 978-80-8070-735-4
Young, S.: ATK: An application Toolkit for HTK, version 1.3. Cambridge University, Cambridge (2004)
Žibert, J., Mihelič, F.: Development, Evaluation and Automatic Segmentation of Slovenian Broadcast News Speech Database. In: Proceedings of the 7th International Multi-Conference Information Society IS 2004, Jozef Stefan Institute, Ljubljana, Slovenia, October 13th - 14th 2004, vol. B, pp. 72–78 (2004) ISBN 961-6303-64-3
Pollak, P., Černocký, J., Boudy, J., Choukri, K., Rusko, M., Trnka, M.: SpeechDat(E) „Eastern European Telephone Speech Databases. In: Proceedings of LREC 2000 Satellite workshop XLDB - Very large Telephone Speech Databases, Athens, Greece, pp. 20–25 (May 2000)
Juhár, J., Ondáš, S., Čižmár, A., Rusko, M., Rozinaj, G., Jarina, R.: Development of Slovak GALAXY/VoiceXML based spoken language dialogue system to retrieve information from the Internet. In: Interspeech 2006 - ICSLP, Pittsburgh, Pennsylvania, USA, September 17-21, pp. 485–488. Universität Bonn, Bonn (2006) ISSN 1990-9772
Rusko, M., Trnka, M., Darjaa, S.: MobilDat-SK - A Mobile Telephone Extension to the SpeechDat-E SK Telephone Speech Database in Slovak. In: SPEECOM 2006, Sankt Petersburg, Russia (July 2006) (accepted)
Simkova, M.: Slovak National Corpus – history and current situation. In: Insight into Slovak and Czech Corpus Linguistics, Bratislava: Veda, pp. 152–159 (2005)
Mirilovič, M., Lihan, S., Juhár, J., Čižmár, A.: Slovak speech recognition based on Sphinx-4 and SpeechDat-SK. In: Proceedings of DSP-MCOM 2005 international conference, Košice, Slovakia, pp. 76–79 (Septembert 2005)
Mirilovič, M., Juhár, J., Čižmár, A.: Steps towards the stochastic language modeling in Slovak. In: Proceedings of ECMS 2007: 8th International Workshop on Electronics, Control, Modeling, Measurement and Signals, May 21-23, 2007, p. 19. Technical University of Liberec, Liberec (2007) ISBN 978-80-7372-202-9
Mirilovič, M., Juhár, J., Čižmár, A.: Automatic segmentation of Slovak words into morphemes. In: Proceedings of RTT 2007: Research in Telecommunication Technology: 8th international conference, Žilina - Liptovský Ján, Slovak Republic, September 10-12, 2007, pp. 259–263 (2007) ISBN 978-80-8070-735-4
Zgank, A., Kacic, Z., Diehl, F., Juhar, J., Lihan, S., Vicsi, K., Szaszak, G.: Graphemes as basic units for crosslingual speech recognition. In: Proceedings of ASIDE 2005: ISCA Tutorial and Research Workshop (ITRW), 10th and 11th November 2005, pp. 23–27. Aalborg University, Aalborg (2005)
Hain, T., Woodland, P.C.: Segmentation and Classification of Broadcast News Audio. In: Proceedings of ICSLP 1998 - 5th International Conference on Spoken Language Processing, Sydney, Australia, November 30 - December 4 (1998)
Navas, E., Hernaéz, I., Luengo, I., Sainz, I., Saraxaga, I., Sanchez, J.: Meaningful Parameters in Emotion Characterisation. In: Esposito, A., Faundez-Zanuy, M., Keller, E., Marinaro, M. (eds.) COST Action 2102. LNCS (LNAI), vol. 4775, pp. 74–84. Springer, Heidelberg (2007)
Lihan, S., Juhár, J., Čižmár, A.: Comparison of Slovak and Czech speech recognition based on grapheme and phoneme acoustic models. In: Proceedings of Interspeech 2006 ICSLP: Proceedings of the Ninth International Conference on Spoken Language Processing, Pittsburgh, Pensylvania, USA, September 17-21, 2006, pp. 149–152. Universität Bonn, Bonn (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pleva, M., Čižmár, A., Juhár, J., Ondáš, S., Mirilovič, M. (2008). Towards Slovak Broadcast News Automatic Recording and Transcribing Service. In: Esposito, A., Bourbakis, N.G., Avouris, N., Hatzilygeroudis, I. (eds) Verbal and Nonverbal Features of Human-Human and Human-Machine Interaction. Lecture Notes in Computer Science(), vol 5042. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70872-8_12
Download citation
DOI: https://doi.org/10.1007/978-3-540-70872-8_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70871-1
Online ISBN: 978-3-540-70872-8
eBook Packages: Computer ScienceComputer Science (R0)