Quaternion Fourier Descriptors for the Preprocessing and Recognition of Spoken Words Using Images of Spatiotemporal Representations

Journal of Mathematical Imaging and Vision

Abstract

This paper presents an application of the quaternion Fourier transform to preprocessing for neural computing. In a new approach, the 1D acoustic signals of French spoken words are represented as 2D signals in the frequency and time domains. These images are then convolved in the quaternion Fourier domain with a quaternion Gabor filter to extract features. This approach greatly reduces the dimension of the feature vector. Two methods of feature extraction are tested. The feature vectors were used to train a simple MLP, a TDNN, and a system of neural experts. The improvement in the classification rates of the neural network classifiers is very encouraging and amply justifies the preprocessing in the quaternion frequency domain. This work also suggests applying the quaternion Fourier transform to other image processing tasks.
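To make the central operation concrete, the following is a minimal sketch of a two-sided discrete quaternion Fourier transform of a real-valued 2D array, such as a time-frequency image of a spoken word. The abstract does not state which definition is used, so the form F(u, v) = sum over x, y of exp(-i 2 pi u x / M) f(x, y) exp(-j 2 pi v y / N) is an assumption here, and the function name `qft2_real` is hypothetical. The sum is evaluated directly for readability rather than with a fast transform, and the Gabor filtering and feature extraction steps are not shown.

```python
import math


def qft2_real(image):
    """Two-sided discrete QFT of a real 2D array given as a list of rows.

    Assumed definition (not taken from the abstract):
        F(u, v) = sum_{x, y} exp(-i*2*pi*u*x/M) * f(x, y) * exp(-j*2*pi*v*y/N)
    Returns four M x N arrays holding the real, i, j and k components.
    """
    M, N = len(image), len(image[0])
    real = [[0.0] * N for _ in range(M)]
    part_i = [[0.0] * N for _ in range(M)]
    part_j = [[0.0] * N for _ in range(M)]
    part_k = [[0.0] * N for _ in range(M)]
    for u in range(M):
        for v in range(N):
            for x in range(M):
                a = 2.0 * math.pi * u * x / M
                for y in range(N):
                    b = 2.0 * math.pi * v * y / N
                    f = image[x][y]
                    # (cos a - i sin a) * f * (cos b - j sin b), using i*j = k
                    real[u][v] += f * math.cos(a) * math.cos(b)
                    part_i[u][v] -= f * math.sin(a) * math.cos(b)
                    part_j[u][v] -= f * math.cos(a) * math.sin(b)
                    part_k[u][v] += f * math.sin(a) * math.sin(b)
    return real, part_i, part_j, part_k


# Toy 4 x 4 "image" that is odd along both axes.
toy = [[math.sin(2 * math.pi * x / 4) * math.sin(2 * math.pi * y / 4)
        for y in range(4)] for x in range(4)]
real, part_i, part_j, part_k = qft2_real(toy)
```

For a real image the four components separate the even-even, odd-even, even-odd and odd-odd symmetry parts of the signal. In the toy example all of the energy at (u, v) = (1, 1) falls in the k component, which is information a single complex 2D spectrum mixes into its real part.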

Author information

Corresponding author

Correspondence to Eduardo Bayro-Corrochano.

About this article

Cite this article

Bayro-Corrochano, E., Trujillo, N. & Naranjo, M. Quaternion Fourier Descriptors for the Preprocessing and Recognition of Spoken Words Using Images of Spatiotemporal Representations. J Math Imaging Vis 28, 179–190 (2007). https://doi.org/10.1007/s10851-007-0004-y

  • DOI: https://doi.org/10.1007/s10851-007-0004-y
