Skip to main content

Advertisement

Log in

An efficient hybrid learning algorithm for neural network–based speech recognition systems on FPGA chip

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

This paper implemented an artificial neural network (ANN) on a field programmable gate array (FPGA) chip for Mandarin speech measurement and recognition of nonspecific speaker. A three-layer hybrid learning algorithm (HLA), which combines genetic algorithm (GA) and steepest descent method, was proposed to fulfill a faster global search of optimal weights in ANN. Some other popular evolutionary algorithms, such as differential evolution, particle swarm optimization and improve GA, were compared to the proposed HLA. It can be seen that the proposed HLA algorithm outperforms the other algorithms. Finally, the designed system was implemented on an FPGA chip with an SOC architecture to measure and recognize the speech signals.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Sivaram GSVS, Nemala SK, Mesgarani N, Hermansky H (2010) Data-driven and feedback based spectro-temporal features for speech recognition. IEEE Signal Process Lett 17(11):957–960

    Article  Google Scholar 

  2. Lauria S (2007) Talking to machines: introducing Robot perception to resolve speech recognition uncertainties. Circuits Syst Signal Process 26(4):513–526

    Article  MATH  Google Scholar 

  3. Wan CY, Lee LS (2008) Histogram-based quantization for robust and/or distributed speech recognition. IEEE Trans Audio Speech Lang Processing 16(4):859–873

    Article  Google Scholar 

  4. Hagon MT, Demuth HB, Beale M (1996) Neural network design. Thomson Learning, Stamford

    Google Scholar 

  5. Kwong S, Chau CW (1997) Analysis of parallel genetic algorithms on HMM based speech recognition system. IEEE Trans Consumer Electron 43(4):1229–1233

    Article  Google Scholar 

  6. Shi Y, Liu J, Liu R (2001) Single-chip speech recognition system based on 8051 microcontroller core. IEEE Trans Consumer Electron 47(1):149–153

    Article  Google Scholar 

  7. Lin FJ, Huang PK, Chou WD (2007) Recurrent-fuzzy-neural-network-controlled linear induction motor servo drive using genetic algorithms. IEEE Trans Ind Electron 54(3):1449–1461

    Article  Google Scholar 

  8. Karamalis PD, Kanatas AG, Constantinou P (2009) A genetic algorithm applied for optimization of antenna arrays used in mobile radio channel characterization devices. IEEE Trans Instrum Meas 58:2475–2487

    Article  Google Scholar 

  9. Chu WC (2003) Speech coding algorithms. Wiley, Wiley-IEEE, New Jersey

    Book  MATH  Google Scholar 

  10. Huang X, Acero A, Wuenon H (2005) Spoken language processing a guide to theory algorithm and system development. Pearson, London

    Google Scholar 

  11. Leung HF, Lam HK, Ling SH (2003) Tuning of the structure and parameters of a neural network using an improved genetic algorithm. IEEE Trans Neural Netw 14:79–88

    Article  Google Scholar 

  12. Kennedy J, Eberhart RC (1995) “Particle swarm optimization.” In: Proceedings IEEE International Conference on Neural Networks (Perth, Australia), IEEE Service Center, Piscataway, NJ, pp IV:1942–1948, 1995

  13. Storn R, Price K (1997) Differential evolution- A simple and efficient heurist for global optimization over continuous spaces. J. of Global Optimization 11:341–359

    Article  MathSciNet  MATH  Google Scholar 

  14. Runstein F, Violaro F (1995)”An isolated-word speech recognition system using neural networks”.In: Proceeding of the 38th midwest symposium on circuit and systems, Vol 1, pp 550–553, 1995

  15. Sadaoki F, Dekker M (2001) Digital speech processing, synthesis, and recognition. Marcel Dekker, New York

    Google Scholar 

Download references

Acknowledgments

This research work was supported by the National Science Council of the Republic of China under contract NSC 100-2221-E-390-025-MY2.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shing-Tai Pan.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pan, ST., Lan, ML. An efficient hybrid learning algorithm for neural network–based speech recognition systems on FPGA chip. Neural Comput & Applic 24, 1879–1885 (2014). https://doi.org/10.1007/s00521-013-1428-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-013-1428-5

Keywords

Navigation