ABSTRACT
This paper describes a keyword spotting based audio searching engine that can through a lpng recordings of radio broadcast with a low missing rate. Bothth sub-sylllable (Initial- Final) and base-syllable based keyword spotting strategies are investigated. The system can achieves a 20% misssing rate with around 1.5 false alarm per hour (FA/KW/H) and the performance cam be further improved to zero missing rate with as low as 0.5 FA/KW/H by incorporating better keyword specifications.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Philippe Gelin, Chris J. Wellekens, “Keyword spotting for video soundtrack indexing” ICASSP’96
Alexander G. Haupmann and Howard D. Wactlar, “Indexing and search of multimodal information”, ICASSP’97
S. J. Young, M. G. Brown, J. T. Foote, G. J. F. Jones, K. Sparck JOnes, “AcoustiC indexing for multimedia retrieval and browsing,” COOPIS’97
Yasou Ariki, Yoshiaki Sugiyama, “A TV News Retrieval System with Interactive Query Function”, COOPIS’97
Dave Abberley, Steve Renals and Gary Cook, “Retrieval of broadcast news dOcuments with the THISL system”, ICASSP’98
Steven Wegmann, Punning Zhan, and Larry Gillick, “Progress in broadcast news transcription at Dragon Systems”, ICASSP’99
R. Rose; E. Chang and R. Lipmann “Techniques for information retrieval from voice message”, ICASSP’91
W.K. Lo et al., “Development of Cantonese spoken language corpora for speech applications,” ISCSLP’98
Tan Lee W. K. LO, P. C. Ching and Helen Meng, “Spoken language resources for Cantonese speech processing”, to appear in Speech Communications.
the linguistic Society of Hong Kong, 1997.
Tan Lee and P. C. Ching; “Cantonese Syllable Recognition Using Neural Networks”, IEEE Transactions on Speech and Audio Processing, Vol.7. No. 4, July 1999
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lam, H.S., Lee, T., Ching, P.C. (2001). A Low Missing Rate Audio Search Technique for Cantonese Radio Broadcast Recording. In: Shum, HY., Liao, M., Chang, SF. (eds) Advances in Multimedia Information Processing — PCM 2001. PCM 2001. Lecture Notes in Computer Science, vol 2195. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45453-5_70
Download citation
DOI: https://doi.org/10.1007/3-540-45453-5_70
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42680-6
Online ISBN: 978-3-540-45453-3
eBook Packages: Springer Book Archive