research-article

Linear histogram equalization in the acoustic feature domain for speech recognition over Bluetooth™ channels

Authors:
Ke Peng

Shanghai Jiao Tong University, Shanghai, China

Shanghai Jiao Tong University, Shanghai, China
View Profile

,
Hongbin Cai

Motorola China Research Center, Shanghai, China

Motorola China Research Center, Shanghai, China
View Profile

,
Yaxin Zhang

Motorola China Research Center, Shanghai, China

Motorola China Research Center, Shanghai, China
View Profile

Mobility '07: Proceedings of the 4th international conference on mobile technology, applications, and systems and the 1st international symposium on Computer human interaction in mobile technologySeptember 2007Pages 427–430https://doi.org/10.1145/1378063.1378130

Published:10 September 2007Publication History

Mobility '07: Proceedings of the 4th international conference on mobile technology, applications, and systems and the 1st international symposium on Computer human interaction in mobile technology

Pages 427–430

ABSTRACT

This paper studies the improvement of speech recognition over Bluetooth™ wireless channels. Speech recognition over Bluetooth™ suffers from the low SNR due to the position of the Bluetooth™ microphone, Bluetooth™ codec distortion, packet loss over the wireless channel, and Bluetooth™ channel distortion. By transforming the MFCCs (Mel-Frequency Cepstral Coefficients) to make the cumulative density functions of the MFCC values in recognition match the ones that were estimated on the training data, the recognition can be improved. The cumulative density functions are approximated using a small number of quantiles. Recognition tests on a Bluetooth™ speech database showed significant increase of recognition accuracy in noisy environments.

References

Bawab, Z. A., et al. Speech recognition over Bluetooth wireless channels. In Proceedings of Eurospeech. Geneva, Switzerland, 2003, 1233--1236.Google Scholar
Bluetooth#8482; Specification Version 1.2, Nov. 2003.Google Scholar
Higler, F. Quantile Based Histogram Equalization for Noise Robust Speech Recognition. Ph. D. Dissertation, RWTH Aachen (University of Technology), Aachen, Germany, 2005.Google Scholar
Hilger, F., and Ney, H. Quantile Based Histogram Equalization for Noise Robust Large Vocabulary Speech Recognition. IEEE Transactions on Speech and Audio Processing, Vol. 14, No. 3 (May 2006), 845--854. Google ScholarDigital Library
Molau, S., Pitz, M., and Ney, H. Histogram based normalization in the acoustic feature space. In Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding. Madonna di Campiglio, Trento, Italy, Dec. 2001.Google ScholarCross Ref
Nour-Eldin, A. H., et al. Automatic recognition of Bluetooth speech in 802.11 interference and the effectiveness of insertion-based compensation techniques. In Proceedings of ICASSP. Montreal, Quebec, Canada, 2004, 1033--1036.Google Scholar

Index Terms

Linear histogram equalization in the acoustic feature domain for speech recognition over Bluetooth™ channels
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Speech recognition
2. Human-centered computing
  1. Human computer interaction (HCI)

Recommendations

Environmental robust speech and speaker recognition through multi-channel histogram equalization

Feature statistics normalization in the cepstral domain is one of the most performing approaches for robust automaticspeech and speaker recognition in noisy acoustic scenarios: feature coefficients are normalized by using suitable linear or nonlinear ...
Read More
Acoustic model adaptation based on pronunciation variability analysis for non-native speech recognition

In this paper, pronunciation variability between native and non-native speakers is investigated, and a novel acoustic model adaptation method is proposed based on pronunciation variability analysis in order to improve the performance of a speech ...
Read More
Slovenian spontaneous speech recognition and acoustic modeling of filled pauses and onomatopoeas

This paper is focused on acoustic modeling for spontaneous speech recognition. This topic is still a very challenging task for speech technology research community. The attributes of spontaneous speech can heavily degrade speech recognizer's accuracy ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
Mobility '07: Proceedings of the 4th international conference on mobile technology, applications, and systems and the 1st international symposium on Computer human interaction in mobile technology
September 2007
702 pages
ISBN:9781595938190
DOI:10.1145/1378063
General Chairs:
Peter H. J. Chong
Nanyang Technological University, Singapore
,
Adrian David Cheok
National University of Singapore, Singapore
Copyright © 2007 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 10 September 2007
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Bluetooth™ channel
linear histogram equalization
quantile
speech recognition
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 149
  Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Linear histogram equalization in the acoustic feature domain for speech recognition over Bluetooth™ channels

Mobility '07: Proceedings of the 4th international conference on mobile technology, applications, and systems and the 1st international symposium on Computer human interaction in mobile technology

ABSTRACT

References

Cited By

Index Terms

Recommendations

Environmental robust speech and speaker recognition through multi-channel histogram equalization

Acoustic model adaptation based on pronunciation variability analysis for non-native speech recognition

Slovenian spontaneous speech recognition and acoustic modeling of filled pauses and onomatopoeas

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Linear histogram equalization in the acoustic feature domain for speech recognition over Bluetooth™ channels

Mobility '07: Proceedings of the 4th international conference on mobile technology, applications, and systems and the 1st international symposium on Computer human interaction in mobile technology

ABSTRACT

References

Cited By

Index Terms

Recommendations

Environmental robust speech and speaker recognition through multi-channel histogram equalization

Acoustic model adaptation based on pronunciation variability analysis for non-native speech recognition

Slovenian spontaneous speech recognition and acoustic modeling of filled pauses and onomatopoeas

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media