NS-KWS: joint optimization of near-sensor processing architecture and low-precision GRU for always-on keyword spotting

Authors:

Qin Li,

Huazhong Yang

ISLPED '20: Proceedings of the ACM/IEEE International Symposium on Low Power Electronics and Design

Pages 97 - 102

https://doi.org/10.1145/3370748.3407001

Published: 10 August 2020

Abstract

Keyword spotting (KWS) is a crucial front-end module of a speech interaction system. The always-on KWS module monitors incoming speech and activates the energy-hungry, complex backend only when a keyword is detected. The KWS module therefore determines the standby performance of the whole system, yet conventional KWS modules are bottlenecked by the power consumed by data conversion next to the microphone sensor. In this paper, we propose an energy-efficient near-sensor processing architecture for always-on KWS that can enhance the continuous perception of the whole speech interaction system. By performing keyword detection in the analog domain directly after the microphone sensor, the architecture avoids the energy-consuming data converter and runs faster than conventional implementations. In addition, we propose a lightweight gated recurrent unit (GRU) with negligible accuracy loss to preserve recognition performance. We also implement and fabricate the proposed KWS system in a 0.18 μm CMOS process. In system-level evaluation, the hardware-software co-design achieves a 65.6% energy saving and a 71× speedup over the state of the art.

Supplementary Material

MP4 File (3370748.3407001.mp4)

This is the nearly 15-minute presentation of paper 18 at ISLPED 2020, titled "NS-KWS: Joint Optimization of Near-Sensor Processing Architecture and Low-Precision GRU for Always-On Keyword Spotting". We propose a near-sensor processing architecture for always-on keyword spotting that solves the ADC bottleneck of the conventional system. The presentation covers the processing architecture, network compression, NN evaluation, and the analog processing circuit. Questions and discussion are welcome.
