skip to main content
10.1145/1352694.1352755acmconferencesArticle/Chapter ViewAbstractPublication Pageseatis-orgConference Proceedingsconference-collections
research-article

Use of Haar wavelet transform based multiple template matching for analyses of speech voice

Published: 14 May 2007 Publication History

Abstract

Technologies of wavelet transformation were used in JPEG2000 and those will be available for CODEC. Pivotal reminders for voice recognition were investigated by using multi-resolution of Haar wavelet representation (H-WR). Template of a phoneme differs from that of a syllable. Optimum accuracy of the feature depends on segmentation of template-matching (TM) analyses. 64 components of Haar wavelet coefficients (H-WC) for recognition of a phoneme are able to decrease to 15 components with lower frequency. Here, each set of data begins at peak value in each pitch. Sampling frequency is 10 kHz. The period of segment for a phoneme is 6.4msec. Segmentation of phoneme in speech can be checked by using the fact that ratio (r) between SWC (sum of absolute value of WC in a scale) becomes r=1, at a transition. SWC is available as a constituent in vector quantization for a syllable. Short syllables are decoded by means of 8 pieces of SWC, here the SWC was obtained from a set of data of 1024 pieces on a syllable (sampling frequency is 5 kHz, period of extraction for a syllable is 204.8msec).

References

[1]
S. Karasawa, "Brain Mechanism on Understanding of Information Explained by Concept of Activity", IEICE Technical Report, TL2007-2, ISSN 0913-5685. 2007.
[2]
S. Karasawa, "Attributes of Language Use Explained by Activities of Neuron", IEICE Technical Report, TL2006-11, ISSN 0913-5685, 2006, pp.31--36.
[3]
Y. C. Lee, S. S. Ahn, "Statistical Model-Based VAD Algorithm with Wavelet Transform", Proc. IEICE Transaction on Fundamentals of Electronics, Communications and Computer Sciences, E89-A (6) 2006, pp.1594--1600.
[4]
J. O. Kim, et al. "On the Extraction of the Valid Speech-Sound by the Merging Algorithm with the Discrete Wavelet Transform", Inter. Conference on Computational Science, 2003, pp.619--628.
[5]
B. Thipakom, B. Kaewkamnerdpong, "Thai Phoneme Segmentation using Discrete Wavelet Transform", International Journal of Smart Engineering System Design, Vol 5, No.4, 2003, 389--399.
[6]
C. J. Long, S. Datta, "Wavelet Based Feature Extraction for Phoneme Recognition", Inter. Conference on Spoken Language Processing, 1996.
[7]
B. T. Tan, M. Fu, A. Spray, F. Dermody, "The Use of Wavelet Transforms in Phoneme Recognition", Inter. Conference on Spoken Language Processing, 1996.

Cited By

View all
  • (2012)Adaptive threshold based video shot boundary detection framework2012 International Conference on Image Analysis and Signal Processing10.1109/IASP.2012.6425020(1-5)Online publication date: Nov-2012
  • (2012)Wavelet based shot boundary detection technique using between cluster distance2012 5th International Congress on Image and Signal Processing10.1109/CISP.2012.6469701(249-253)Online publication date: Oct-2012
  • (2009)Classification of Grasp Types through Wavelet Decomposition of EMG Signals2009 2nd International Conference on Biomedical Engineering and Informatics10.1109/BMEI.2009.5305493(1-5)Online publication date: Oct-2009
  1. Use of Haar wavelet transform based multiple template matching for analyses of speech voice

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    EATIS '07: Proceedings of the 2007 Euro American conference on Telematics and information systems
    May 2007
    498 pages
    ISBN:9781595935984
    DOI:10.1145/1352694
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 14 May 2007

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. CODEC
    2. Haar discrete wavelet transform
    3. data-compression
    4. template matching

    Qualifiers

    • Research-article

    Conference

    EATIS07

    Acceptance Rates

    Overall Acceptance Rate 17 of 64 submissions, 27%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 08 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2012)Adaptive threshold based video shot boundary detection framework2012 International Conference on Image Analysis and Signal Processing10.1109/IASP.2012.6425020(1-5)Online publication date: Nov-2012
    • (2012)Wavelet based shot boundary detection technique using between cluster distance2012 5th International Congress on Image and Signal Processing10.1109/CISP.2012.6469701(249-253)Online publication date: Oct-2012
    • (2009)Classification of Grasp Types through Wavelet Decomposition of EMG Signals2009 2nd International Conference on Biomedical Engineering and Informatics10.1109/BMEI.2009.5305493(1-5)Online publication date: Oct-2009

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media