Classification of speech signal based on gender: a hybrid approach using neuro-fuzzy systems

Gomathy, M.; Meena, K.; Subramaniam, K. R.

doi:10.1007/s10772-011-9118-0

Classification of speech signal based on gender: a hybrid approach using neuro-fuzzy systems

Published: 12 October 2011

Volume 14, pages 377–391, (2011)
Cite this article

International Journal of Speech Technology Aims and scope Submit manuscript

M. Gomathy¹,
K. Meena^2,3 &
K. R. Subramaniam⁴

251 Accesses
2 Citations
Explore all metrics

Abstract

One of the most important processes in speech processing is gender classification. Generally gender classification is done by considering pitch as feature. In general the pitch value of female is higher than the male. In some cases, pitch value of male is higher and female is low, in that cases this classification will not obtain the exact result. By considering this drawback here proposed a gender classification method which considers three features and uses fuzzy logic and neural network to identify the given speech signal belongs to which gender. For training fuzzy logic and neural network, training dataset is generated by considering the above three features. After completion of training, a speech signal is given as input, fuzzy and neural network gives an output, for that output mean value is taken and this value gives the speech signal belongs to which gender. The result shows the performance of our method in gender classification.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Automatic speech recognition: a survey

Article 10 November 2020

A comprehensive survey on automatic speech recognition using neural networks

Article 15 August 2023

Comparative analysis of audio classification with MFCC and STFT features using machine learning techniques

Article Open access 03 January 2024

References

Devi, T. M., Kasthuri, N., & Natarajan, A. M. (2010). Performance comparison of noise classification using intelligent networks. International Journal of Electronics Engineering, 2(1), 49–54.
Google Scholar
Faúndez-Zanuy, M., McLaughlin, S., Esposito, A., Hussain, A., Schoentgen, J., Kubin, G., Kleijn, W. B., & Maragos, P. (2002). Non-linear speech processing: overview and applications. Control & Intelligent Systems, 30(1), 1–10.
Google Scholar
Gomathy, M., Meena, K., & Subramaniam, K. R. (2011, to be published). Gender grouping in speech recognition using statistical metrics of pitch strength. EJSR J.
Gudi, A. B., & Nagaraj, H. C. (2009). Optimal curve fitting of speech signal for disabled children. International Journal of Computer science & Information Technology (IJCSIT), 1(2), 99–107.
Google Scholar
Gudi, A. B., Shreedhar, H. K., & Nagaraj, H. C. (2010). Signal processing techniques to estimate the speech disability in children. IACSIT International Journal of Engineering and Technology, 2(2), 169–176.
Google Scholar
Haraty, R. A., & El Ariss, O. (2007). CASRA+: a colloquial Arabic speech recognition application. American Journal of Applied Sciences, 4(1), 23–32.
Article Google Scholar
Hasegawa, Y., & Hata, K. (1994). Non-physiological differences between male and female speech: Evidence from the delayed F0 fall phenomenon in Japanese. In Proceedings of international conference on spoken language processing (pp. 1179–1182).
Google Scholar
Hasegawa, Y., & Hata, K. (1995). The function of F0-peak delay in Japanese. In Proceedings of 21st annual meeting of the Berkeley linguistics society (pp. 141–151).
Google Scholar
Kotti, M., & Kotropoulos, C. (2008). Gender classification in two emotional speech databases. In Proceedings of 19th international conference on pattern recognition (pp. 1–4). Tampa.
Chapter Google Scholar
Mahdi, A. E., & Jafer, E. (2008). Two-feature voiced/unvoiced classifier using wavelet transform. The Open Electrical and Electronic Engineering Journal, 2, 8–13.
Article Google Scholar
McAulay, R. J., & Quatieri, T. F. (1988). Speech processing based on a sinusoidal model. The Lincoln Laboratory Journal, 1(2), 153–168.
Google Scholar
Othman, A. M., & Riadh, M. H. (2008). Speech recognition using scaly neural networks. World Academy of Science, Engineering and Technology, 38, 253–258.
Google Scholar
Patel, I., & Rao, Y. S. (2010). Speech recognition using HMM with MFCC—an analysis using frequency spectral decomposition technique. Signal & Image Processing: An International Journal (SIPIJ), 1(2), 101–110.
Article Google Scholar
Qi, Y., & Hunt, B. R. (1993). Voiced-unvoiced-silence classifications of speech using hybrid features and a network classifier. IEEE Transactions on Speech and Audio Processing, 1(2), 250–255.
Article Google Scholar
Rakesh, K., Dutta, S., & Shama, K. (2011). Gender recognition using speech processing techniques in LABVIEW. International Journal of Advances in Engineering & Technology, 1(2), 51–63.
Google Scholar
Rao, R. R., & Prasad, A. (2011). Glottal excitation feature based gender identification system using ergodic HMM. International Journal of Computers & Applications, 17(3), 31–36.
Article Google Scholar
Rodger, J. A., & Pendharkar, P. C. (2004). A field study of the impact of gender and user’s technical experience on the performance of voice-activated medical tracking application International Journal of Human-Computer Studies, 60, 529–544.
Article Google Scholar
Sedaaghi, M. H. (2009). A comparative study of gender and age classification in speech signals. Iranian Journal of Electrical & Electronic Engineering, 5(1), 1–12.
Google Scholar
Shue, Y.-L., & Iseli, M. (2008). The role of voice source measures on automatic gender classification. In Proceedings of IEEE international conference on acoustics, speech and signal processing (pp. 4493–4496). Las Vegas.
Chapter Google Scholar
Sigmund, M. (2008). Gender distinction using short segments of speech signal. International Journal of Computer Science and Network Security, 8(10), 159–162.
Google Scholar
Silovsky, J., & Nouza, J. (2006). Speech, speaker and speaker’s gender identification in automatically processed broadcast stream. Radio Engineering Journal, 15(3), 42–48.
Google Scholar
Singh, G., Junghare, A., & Chokhani, P. (2010). Multi utility E-controlled cum voice operated farm vehicle. International Journal of Computers & Applications, 1(13), 109–113.
Google Scholar
Zengi, Y.-M., Wu, Z.-Y., Falk, T., & Chan, W.-Y. (2006). Robust GMM based gender classification using pitch and rasta-PLP parameters of speech. In Proceedings of fifth international conference on machine learning and cybernetics (pp. 13–16). Dalian.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Shrimathi Indira Gandhi College, Trichy-2, India
M. Gomathy
Bharathidhasan University, Trichirapalli, India
K. Meena
Shrimathi Indira Gandhi College, Trichirapalli, India
K. Meena
Department of Computer Application, Shrimathi Indira Gandhi College, Trichy-2, India
K. R. Subramaniam

Authors

M. Gomathy
View author publications
You can also search for this author in PubMed Google Scholar
K. Meena
View author publications
You can also search for this author in PubMed Google Scholar
K. R. Subramaniam
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to M. Gomathy.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gomathy, M., Meena, K. & Subramaniam, K.R. Classification of speech signal based on gender: a hybrid approach using neuro-fuzzy systems. Int J Speech Technol 14, 377–391 (2011). https://doi.org/10.1007/s10772-011-9118-0

Download citation

Received: 13 July 2011
Accepted: 22 September 2011
Published: 12 October 2011
Issue Date: December 2011
DOI: https://doi.org/10.1007/s10772-011-9118-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Classification of speech signal based on gender: a hybrid approach using neuro-fuzzy systems

Abstract

Access this article

Similar content being viewed by others

Automatic speech recognition: a survey

A comprehensive survey on automatic speech recognition using neural networks

Comparative analysis of audio classification with MFCC and STFT features using machine learning techniques

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Classification of speech signal based on gender: a hybrid approach using neuro-fuzzy systems

Abstract

Access this article

Similar content being viewed by others

Automatic speech recognition: a survey

A comprehensive survey on automatic speech recognition using neural networks

Comparative analysis of audio classification with MFCC and STFT features using machine learning techniques

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation