Skip to main content

Highly Scalable Speech Processing on Data Stream Management System

  • Conference paper
Database Systems for Advanced Applications (DASFAA 2012)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7239))

Included in the following conference series:

Abstract

Today we require sophisticated speech processing technologies that process massive speech data simultaneously. In this paper we describe the implementation and evaluation of a Julius-backended parallel and scalable speech recognition system on the data stream management system “System S” developed by IBM Research. Our experimental result on our parallel and distributed environment with 4 nodes and 16 cores shows that the throughput can be significantly increased by a factor of 13.8 when compared with that on a single core. We also demonstrate that the beam management module in our system can keep throughput and recognition accuracy with varying input data rate.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Abadi, D.J., et al.: The Design of the Borealis Stream Processing Engine. In: Proc. CIDR, pp. 277–289 (2005)

    Google Scholar 

  2. Wolf, J., Bansal, N., Hildrum, K., Parekh, S., Rajan, D., Wagle, R., Wu, K.-L., Fleischer, L.K.: SODA: An Optimizing Scheduler for Large-Scale Stream-Based Distributed Computer Systems. In: Issarny, V., Schantz, R. (eds.) Middleware 2008. LNCS, vol. 5346, pp. 306–325. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  3. Gedik, B., et al.: A Code Generation Approach to Optimizing High-Performance Distributed Data Stream Processing. In: Proc. USENIX, pp. 847–856 (2009)

    Google Scholar 

  4. Arakawa, Y., et al.: A Study for a Scalability Evaluation Model of Spoken Dialogue System. Transactions of Information Processing Society of Japan 46(9), 2269–2278 (2005) (in Japanese)

    MathSciNet  Google Scholar 

  5. Tatbul, N., et al.: Load Shedding in a Data Stream Manager. In: Proc. VLDB (2003)

    Google Scholar 

  6. Gedik, B., et al.: SPADE: The System S Declarative Stream Processing Engine. In: Proc. SIGMOD, pp. 1123–1134 (2008)

    Google Scholar 

  7. Amini, L., et al.: SPC: A Distributed, Scalable Platform for Data Mining. In: DM-SSP, pp. 27–37 (2006)

    Google Scholar 

  8. Jain, N., et al.: Design, implementation, and evaluation of the linear road benchmark on the stream processing core. In: International Conference on Management of Data, ACM SIGMOD, Chicago, IL (2006)

    Google Scholar 

  9. Young, S., et al.: The HTK book (for HTK Version 3.2) (2002)

    Google Scholar 

  10. Lee, A., et al.: Recent Development of Open-Source Speech Recognition Engine Julius. In: Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC (2009)

    Google Scholar 

  11. Lee, A.: Large Vocabulary Continuous Speech Recognition Engine Julius ver. 4. IEICE technical report. Speech 107(406), pp.307-312 (2007) (in Japanese)

    Google Scholar 

  12. Dixon, P.R., et al.: The Titech Large Vocabulary WFST Speech Recognition System. In: IEEE ASRU, pp. 443–448 (2007)

    Google Scholar 

  13. Lee, A., et al.: An Efficient Two-pass Search Algorithm using Word Trellis Index. In: Proc. ICSLP, pp. 1831–1834 (1998)

    Google Scholar 

  14. Itahashi, S., et al.: Development of ASJ Japanese newspaper article sentences corpus. Annual Meeting of Acoustic Society of Japan 1997(2), 187–188 (1997) (in Japanese)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nishii, S., Suzumura, T. (2012). Highly Scalable Speech Processing on Data Stream Management System. In: Lee, Sg., Peng, Z., Zhou, X., Moon, YS., Unland, R., Yoo, J. (eds) Database Systems for Advanced Applications. DASFAA 2012. Lecture Notes in Computer Science, vol 7239. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29035-0_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-29035-0_14

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-29034-3

  • Online ISBN: 978-3-642-29035-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics