Efficient FPGA Implementation of a Knowledge-Based Automatic Speech Classifier

Siniscalchi, Sabato M.; Gennaro, Fulvio; Vitabile, Salvatore; Gentile, Antonio; Sorbello, Filippo

doi:10.1007/11599555_21

Sabato M. Siniscalchi^22,24,
Fulvio Gennaro²²,
Salvatore Vitabile^23,25,
Antonio Gentile^22,25 &
…
Filippo Sorbello^22,25

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3820))

Included in the following conference series:

International Conference on Embedded Software and Systems

949 Accesses
1 Citations

Abstract

Speech recognition has become common in many application domains, from dictation systems for professional practices to vocal user interfaces for people with disabilities or hands-free system control. However, so far the performance of Automatic Speech Recognition (ASR) systems are comparable to Human Speech Recognition (HSR) only under very strict working conditions, and in general far lower. Incorporating acoustic-phonetic knowledge into ASR design has been proven a viable approach to rise ASR accuracy. Manner of articulation attributes such as vowel, stop, fricative, approximant, nasal, and silence are examples of such knowledge. Neural networks have already been used successfully as detectors for manner of articulation attributes starting from representations of speech signal frames. In this paper an optimized digital Knowledge-based Automatic Speech Classifier for real-time applications is implemented on FPGA using six attribute scoring Multi-Layer Perceptrons (MLP). Digital MLP key features are a virtual neuron architecture and use of sinusoidal activation functions for the hidden layer. Implementation results on FPGA show that use of sinusoidal activation functions decrease hardware resource usage of more than 50% for slices, FFs, LUTs and more than 35% for FPGA RAM blocks when compared with the standard sigmoid-based neuron implementation. Furthermore, neuron virtualization allows for a significant decrease of concurrent memory access, resulting in improved performance for the entire attribute scoring module.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Vitabile, S., Gentile, A.L., Dammone, G.B., Sorbello, F.: MLP Neural Network Implementation on a SIMD Architecture. In: Marinaro, M., Tagliaferri, R. (eds.) WIRN 2002. LNCS, vol. 2486, pp. 99–108. Springer, Heidelberg (2002)
Chapter Google Scholar
Sorbello, F., Gioiello, G.A.M., Vitabile, S.: Handwritten Character Recognition using a MLP. In: Knowledge-Based Intelligent Techniques in Character Recognition, ch. 5, pp. 91–119. CRC Press Publishers, Boca Raton (1999)
Google Scholar
Vitabile, S., Gentile, A., Sorbello, F.: Real-Time Road Signs Recognition on a SIMD Architecture. WSEAS Transactions on Circuits and Systems 3(3), 664–669 (2004) ISSN: 1109-2734
Google Scholar
Huelsbergen, L.: A Representation for Dynamic Graphs in Reconfigurable Hardware and its Application to Fundamental Graph Algorithms. In: 8th International Symposium on Field Programmable Gate Arrays, ISBN 1-58113-193-3
Google Scholar
Porrmann, M., Witkowski, U., Kalte, H., Ruckert, U.: Implementation of Artificial Neural Hardware Accelerator. In: 10th Euromicro Workshop on Parallel, Distributed and Network-based Processing, Spain, January 9-11, pp. 243–250 (2002)
Google Scholar
RC203 Software Manual, http://www.celoxica.com/support/documentation
Ortigosa, E.M., Ortigosa, P.M., Canas, A., Ros, E., Agis, R., Ortega, J.: FPGA Implementation of Multi-layer Perceptrons for Speech Recognition. In: Cheung, P.Y.K., Constantinides, G.A. (eds.) FPL 2003. LNCS, vol. 2778, pp. 1048–1052. Springer, Heidelberg (2003)
Chapter Google Scholar
Kirchhoff, K.: Combining Articulatory and Acoustic Information for Speech Recognition in Noisy and Reverberant Environments. In: Proc. of the International Conference on Spoken Language Processing, Sydney, Australia, pp. 891–894
Google Scholar
Lee, K.F., Hon, H.W.: Speaker-independent phone recognition using hidden Markov models. IEEE Trans. On Acoust., Speech and Signal Process. 37(11), 1641–1648 (1989)
Article Google Scholar
Li, J., Tsao, Y., Lee, C.-H.: A Study on Knowledge source integration for candidate rescoring in automatic speech recognition. In: Proc. of ICASSP 2005 (2005)
Google Scholar
Lee, C.-H.: From knowledge-ignorant to knowledge-rich modeling: a new speech research paradigm for next generation automatic speech recognition. In: Proc. ICSLP (2004)
Google Scholar
Garofolo, J.S., Lamel, L.F., Fisher, W.M., Fiscus, J.G., Pallett, D.S., Dahlgren, N.L.: DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus. U.S. Dept. of Commerce, NIST, Gaithersburg, MD (February 1993)
Google Scholar
Wang, J.-C., et al.: Chipdesign of MFCC extraction for speech recognition. INTEGRATION, the VLSI journal 32, 111–131 (2002)
Article MATH Google Scholar
Siniscalchi, S.M., Li, J., Pilato, G., Vassallo, G., Clements, M.A., Gentile, A., Sorbello, F.: Application of E-aNets to Feature Recognition of Manner of Articulation in Knowledge-based Automatic Speech Recognition. In: Apolloni, B., Marinaro, M., Nicosia, G., Tagliaferri, R. (eds.) WIRN 2005 and NAIS 2005. LNCS, vol. 3931, pp. 140–146. Springer, Heidelberg (2006)
Chapter Google Scholar
Vitabile, S., Conti, V., Gennaro, F., Sorbello, F.: Efficient MLP Digital Implementation on FPGA. In: 8⁰ EUROMICRO Conference on Digital System Design (DSD 2005), pp. 218–222. IEEE Computer Society Press, Los Alamitos (2005)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Dipartimento di Ingegneria Informatica, Università di Palermo, V.le delle Scienze (Edif. 6), 90128, Palermo, Italy
Sabato M. Siniscalchi, Fulvio Gennaro, Antonio Gentile & Filippo Sorbello
Dipartimento di Biotecnologie Mediche e Medicina Legale, Università di Palermo, Via del Vespro, 90127, Palermo, Italy
Salvatore Vitabile
Center for Signal and Image Processing, School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, Georgia, 30332, USA
Sabato M. Siniscalchi
Istituto di CAlcolo e Reti ad alte prestazioni, Consiglio Nazionale delle Ricerche, V.le delle Scienze (Edif. 11), 90128, Palermo, Italy
Salvatore Vitabile, Antonio Gentile & Filippo Sorbello

Authors

Sabato M. Siniscalchi
View author publications
You can also search for this author in PubMed Google Scholar
Fulvio Gennaro
View author publications
You can also search for this author in PubMed Google Scholar
Salvatore Vitabile
View author publications
You can also search for this author in PubMed Google Scholar
Antonio Gentile
View author publications
You can also search for this author in PubMed Google Scholar
Filippo Sorbello
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, St. Francis Xavier University, Antigonish, Canada
Laurence T. Yang
School of Computer Science, Northwestern Polytechnical University, 710072, Xi’an, P.R. China
Xingshe Zhou
Department of Radiology, State University of New York at Stony Brook, L-4, 120 Health Sciences Center, 1793-8460, Stony Brook, New York
Wei Zhao
College of Computer Science, Zhejiang University, 310027, Hangzhou, Zhejiang, China
Zhaohui Wu
Northwestern Polytechnical University, No. 127 West Youyi Road, P.O. Box 404, 710072, Xi’an City, Shaanxi Province, China
Yian Zhu
Department of Mathematics, Statistics and Computer Science, St. Francis Xavier University,
Man Lin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Siniscalchi, S.M., Gennaro, F., Vitabile, S., Gentile, A., Sorbello, F. (2005). Efficient FPGA Implementation of a Knowledge-Based Automatic Speech Classifier. In: Yang, L.T., Zhou, X., Zhao, W., Wu, Z., Zhu, Y., Lin, M. (eds) Embedded Software and Systems. ICESS 2005. Lecture Notes in Computer Science, vol 3820. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11599555_21

Download citation

DOI: https://doi.org/10.1007/11599555_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-30881-2
Online ISBN: 978-3-540-32297-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics