skip to main content
10.1145/2708463.2709051acmotherconferencesArticle/Chapter ViewAbstractPublication PagesperminConference Proceedingsconference-collections
research-article

Classification of Stop Consonants using Modulation Spectrogram-Based Features

Authors Info & Claims
Published:26 February 2015Publication History

ABSTRACT

In this paper, we propose use of modulation spectrogram-based features for stop consonants classification based on their place of articulation. Stop sounds are classified as bilabial, alveolar and velar according to their place of articulation. The modulation spectrogram which is a two- dimensional (i.e., 2-D) feature represents modulation of low frequency components with acoustic frequency. In this work, modulation spectrogram has been obtained for all stop consonants from TIMIT database and then a dimension reduction algorithm, viz., higher order singular value decomposition (HOSVD) is applied on the feature vectors. The reduced dimension feature set is then applied to a Support Vector Machine (SVM) classifier which gives an overall accuracy of 94.25% for stop classification and 95.29% for place of articulation classification.

References

  1. Ali, A. M. A., Spiegel, J. V. and Muller, P., "Robust classification of stop consonants using auditaory-based speech processing", Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Salt Lake City, UT, vol.1, pp.81--84, 2001.Google ScholarGoogle Scholar
  2. Ali, A. M. A., Spiegel, J. V. and Muller, P., "Acoustic-phonetic features for the automatic classification of stop consonants", IEEE Trans. Speech and Audio Proc., vol. 9, no. 8, pp. 833--841, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  3. Ali, A. M. A., Spiegel, J. V. and Muller, P., "Acoustic-phonetic features for the automatic recognition of stop consonants", J. Acoust. Soc. Am., vol. 103, no. 5, pp. 2777--2778, 1998.Google ScholarGoogle ScholarCross RefCross Ref
  4. Zheng, Y., Mark H. J., and Borys S., "Stop consonant classification by dynamic formant trajectory," In Proc. Int. Conf. on Speech and Language Proc. (ICSLP), Jeju Island, Korea, 2004.Google ScholarGoogle Scholar
  5. Gidas, B.; Murua, A., "Classification and clustering of stop consonants via nonparametric transformations and wavelets," in Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), Detroit, MI, vol.1, pp.872--875 May 1995.Google ScholarGoogle Scholar
  6. Bush, M.A., Kopec, G.E. and Zue V.W., "Selecting acoustic features for stop consonant identification", in Proc. of Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), vol.8, no. 5, pp.742, 74, 1983Google ScholarGoogle ScholarCross RefCross Ref
  7. Greenberg, S. and Kingsbury, B. E. D., "The Modulation Spectrogram: In pursuit of an invariant representation of speech", Proc. IEEE on Int. Conf. Acoustics, Speech, Signal Process.(ICASSP), Munich, Germany, 1647--1650, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Hermansky, H., "The modulation spectrum in the automatic recognition of speech,", in Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), Santa Barbara, CA, pp.140, 147, Dec 1997.Google ScholarGoogle ScholarCross RefCross Ref
  9. Ganapathy, S., Thomas, S., and Hermansky, H., "Modulation frequency features for phoneme recognition in noisy speech", J. Acoust. Soc. Am., vol. 125, no.1, pp. EL8-EL12 2009.Google ScholarGoogle ScholarCross RefCross Ref
  10. Thomas, S., Ganapathy, S., and Hermansky, H., "Phoneme recognition using spectral envelope and modulation frequency features", IEEE Int. Conf. on Acoustics, Speech and Signal Processing(ICASSP), pp. 4453--4456, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Ganapathy, S., Thomas, S., and Hermansky, H., "Temporal envelop compensation for robust phoneme recognition using modulation spectrum", J. Acoust. Soc. Am. Vol. 128, no. 6, pp. 3769--3780, 2000.Google ScholarGoogle ScholarCross RefCross Ref
  12. Atlas, L. and Shamma, A. S., "Joint Acoustic and Modulation frequency", EURASIP J. on Applied signal Proccessing, vol. 7, pp. 668--675, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Markaki, M. and Stylianou, Y., "Using modulation spectra for voice pathology detection and classification", IEEE Annual Int. Conf. on Engineering in Medicine and Biology Society (EMBC), Minneapolis, Minnesota, USA, pp. 2514--2517, 2009Google ScholarGoogle ScholarCross RefCross Ref
  14. Markaki, M. and Stylianou, Y., "Modulation spectral features for objective voice quality assessment", Proc. IEEE Int. Symp. On Comm., Control and Signal Proc., Limassol, Cyprus, pp 1--4, 2010Google ScholarGoogle ScholarCross RefCross Ref
  15. Markaki, M., and Stylianou, Y., "Voice Pathology Detection and Discrimination based on Modulation Spectral Features", IEEE Trans. on Audio, Speech, and Language Proc., vol. 19, no. 7, pp. 1938--1948, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Lathauwer, L. D., Moor, B. D. and Vandewalle, J., "A multilinear singular value decomposition", SIAM J. Matrix Anal. Appl., vol 21, no. 4, pp. 1253--1278, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Modulation Toolbox. Available online: http://www.ee.washington.edu/research/isdl/projects/modulationtoolbox (last accessed on 14th Sept 2014).Google ScholarGoogle Scholar

Index Terms

  1. Classification of Stop Consonants using Modulation Spectrogram-Based Features

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            PerMIn '15: Proceedings of the 2nd International Conference on Perception and Machine Intelligence
            February 2015
            269 pages
            ISBN:9781450320023
            DOI:10.1145/2708463

            Copyright © 2015 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 26 February 2015

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Research
            • Refereed limited

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader