Skip to main content

A New Step in Arabic Speech Identification: Spoken Digit Recognition

  • Conference paper
Information Processing and Security Systems

Abstract

This work presents a new Algorithm to recognize separate voices of some Arabic words, the digits form zero to ten. Firstly we prepare our signal by pre-processing trial. Next the speech signal is processed as an image by Power Spectrum Estimation. For feature extraction, transformation and hence recognition, the algorithm of minimal eigenvalues of Toeplitz matrices together with other methods of speech processing and recognition are used. At the stage of classification many methods are tested from classical ones, which depend on the matrix theory, to different types of neuron networks, mainly radial basis functions neural networks. The success rate obtained in the presented experiments is almost ideal and exceeded 98% for many cases. The results have shown flexibility to extend the algorithm to speaker identification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

6 References

  1. K. Saeed, M. Nammous, “Experimental Image-Based Algorithm for Spoken Arabic Digits Identification,” Computer Information Systems and Applications, Vol.1, pp.55–66, WSFiZ Press, Bialystok, Poland 2004.

    Google Scholar 

  2. K. Saeed, “Computer Graphics Analysis: A Criterion for Image Feature Extraction and Recognition,” Vol. 10, Issue 2, 2001, pp. 185–194, MGV-International Journal on Machine Graphics and Vision, Institute of Computer Science, Polish Academy of Sciences, Warsaw.

    Google Scholar 

  3. R. W. Schafer, L. R. Rabiner, “System for Automatic Formant Analysis of Voiced Speech,” /J. Acoust. Soc. Amer. Vol.47, Feb. 1970.

    Google Scholar 

  4. Andreas A., “Digital Filters: Analysis and Design,” McGraw-Hill, New York 1979.

    Google Scholar 

  5. Cz. Basztura, “Modele analizy i procedury w komputerowym rozpoznawaniu głosów,” (in Polish), prace naukowe ITiA Politechniki Wrocławskiej, no. 30, Wrocław 1989.

    Google Scholar 

  6. L. S. Marple, “Digital Spectral Analysis,” Englewood Cliffs, NJ: Prentice Hall, 1987.

    Google Scholar 

  7. Sadaoki Furui, “Digital Speech Processing, Synthesis, and Recognition,” Marcel Dekker, Inc. 2001.

    Google Scholar 

  8. R. Tadeusiewicz, “Sygnał mowy,” WKiŁ (in Polish), Warsaw 1988.

    Google Scholar 

  9. V. K. Ingle, J. G. Proakis, “Digital Signal Processing Using MATLAB,” Brooks Cole, July 1999.

    Google Scholar 

  10. K. Saeed, M. Kozłowski, A. Kaczanowski, “Metoda do rozpoznawania obrazów akustycznych izolowanych liter mowy”, Zeszyty Politechniki Białostockiej (in Polish), I-1/2002, pp. 181–207, Bialystok 2002.

    Google Scholar 

  11. K. Saeed, M. Kozłowski, “An Image-Based System for Spoken-Letter Recognition,” 10th Int. Conference CAIP'03 on Computer Analysis of Images and Patterns, August 2003, Groningen. Proceedings published in: Lecture Notes in Computer Science, Petkov and Westenberg (Eds.), pp. 494–502, LNCS 2756, Springer-Verlag Heidelberg: Berlin 2003.

    Google Scholar 

  12. K. Saeed, M. Tabedzki, “A New Hybrid System for Recognition of Handwritten-Script,” Invited for publication in International Scientific Journal of Computing, Institute of Computer Information Technologies, Volume 3, Issue 1, pp. 50–57, 2004, Ternopil, Ukraine 2004.

    Google Scholar 

  13. Shigeru Katagiri, “Handbook of Neural Networks for Speech Processing,” Artech House, Boston 2000.

    Google Scholar 

  14. M.W. Mak, W.G. Allen and G.G. Sexton, “Speaker identification using radial basis functions”, The Third International Conference on Artifical Neural networks, University of Northumbria at Newcastle, U.K 1998.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer Science+Business Media, Inc.

About this paper

Cite this paper

Saeed, K., Nammous, M.K. (2005). A New Step in Arabic Speech Identification: Spoken Digit Recognition. In: Saeed, K., Pejaś, J. (eds) Information Processing and Security Systems. Springer, Boston, MA. https://doi.org/10.1007/0-387-26325-X_6

Download citation

  • DOI: https://doi.org/10.1007/0-387-26325-X_6

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-387-25091-5

  • Online ISBN: 978-0-387-26325-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics