Abstract
Today we require sophisticated speech processing technologies that process massive speech data simultaneously. In this paper we describe the implementation and evaluation of a Julius-backended parallel and scalable speech recognition system on the data stream management system “System S” developed by IBM Research. Our experimental result on our parallel and distributed environment with 4 nodes and 16 cores shows that the throughput can be significantly increased by a factor of 13.8 when compared with that on a single core. We also demonstrate that the beam management module in our system can keep throughput and recognition accuracy with varying input data rate.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Abadi, D.J., et al.: The Design of the Borealis Stream Processing Engine. In: Proc. CIDR, pp. 277–289 (2005)
Wolf, J., Bansal, N., Hildrum, K., Parekh, S., Rajan, D., Wagle, R., Wu, K.-L., Fleischer, L.K.: SODA: An Optimizing Scheduler for Large-Scale Stream-Based Distributed Computer Systems. In: Issarny, V., Schantz, R. (eds.) Middleware 2008. LNCS, vol. 5346, pp. 306–325. Springer, Heidelberg (2008)
Gedik, B., et al.: A Code Generation Approach to Optimizing High-Performance Distributed Data Stream Processing. In: Proc. USENIX, pp. 847–856 (2009)
Arakawa, Y., et al.: A Study for a Scalability Evaluation Model of Spoken Dialogue System. Transactions of Information Processing Society of Japan 46(9), 2269–2278 (2005) (in Japanese)
Tatbul, N., et al.: Load Shedding in a Data Stream Manager. In: Proc. VLDB (2003)
Gedik, B., et al.: SPADE: The System S Declarative Stream Processing Engine. In: Proc. SIGMOD, pp. 1123–1134 (2008)
Amini, L., et al.: SPC: A Distributed, Scalable Platform for Data Mining. In: DM-SSP, pp. 27–37 (2006)
Jain, N., et al.: Design, implementation, and evaluation of the linear road benchmark on the stream processing core. In: International Conference on Management of Data, ACM SIGMOD, Chicago, IL (2006)
Young, S., et al.: The HTK book (for HTK Version 3.2) (2002)
Lee, A., et al.: Recent Development of Open-Source Speech Recognition Engine Julius. In: Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC (2009)
Lee, A.: Large Vocabulary Continuous Speech Recognition Engine Julius ver. 4. IEICE technical report. Speech 107(406), pp.307-312 (2007) (in Japanese)
Dixon, P.R., et al.: The Titech Large Vocabulary WFST Speech Recognition System. In: IEEE ASRU, pp. 443–448 (2007)
Lee, A., et al.: An Efficient Two-pass Search Algorithm using Word Trellis Index. In: Proc. ICSLP, pp. 1831–1834 (1998)
Itahashi, S., et al.: Development of ASJ Japanese newspaper article sentences corpus. Annual Meeting of Acoustic Society of Japan 1997(2), 187–188 (1997) (in Japanese)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nishii, S., Suzumura, T. (2012). Highly Scalable Speech Processing on Data Stream Management System. In: Lee, Sg., Peng, Z., Zhou, X., Moon, YS., Unland, R., Yoo, J. (eds) Database Systems for Advanced Applications. DASFAA 2012. Lecture Notes in Computer Science, vol 7239. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29035-0_14
Download citation
DOI: https://doi.org/10.1007/978-3-642-29035-0_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29034-3
Online ISBN: 978-3-642-29035-0
eBook Packages: Computer ScienceComputer Science (R0)