Poster Abstract: Towards Speaker Identification on Resource-Constrained Embedded Devices

Authors:
Markus Gallacher

Graz University of Technology, Graz, Austria

Graz University of Technology, Graz, Austria

https://orcid.org/0009-0000-9456-1092
View Profile

,
Carlo Alberto Boano

Graz University of Technology, Graz, Austria

Graz University of Technology, Graz, Austria

https://orcid.org/0000-0001-7647-3734
View Profile

,
M. S. Arun Sankar

University College Cork, Cork, Ireland

University College Cork, Cork, Ireland

https://orcid.org/0000-0002-2798-9846
View Profile

,
Utz Roedig

University College Cork, Cork, Ireland

University College Cork, Cork, Ireland

https://orcid.org/0000-0002-4020-0889
View Profile

,
Willian T. Lunardi

Technology Innovation Institute TII, Abu Dhabi, United Arab Emirates

Technology Innovation Institute TII, Abu Dhabi, United Arab Emirates

https://orcid.org/0000-0003-0718-0019
View Profile

,
Michael Baddeley

Technology Innovation Institute TII, Abu Dhabi, United Arab Emirates

Technology Innovation Institute TII, Abu Dhabi, United Arab Emirates

https://orcid.org/0000-0002-9202-8582
View Profile

SenSys '23: Proceedings of the 21st ACM Conference on Embedded Networked Sensor SystemsNovember 2023Pages 518–519https://doi.org/10.1145/3625687.3628387

Published:26 April 2024Publication History

SenSys '23: Proceedings of the 21st ACM Conference on Embedded Networked Sensor Systems

Pages 518–519

ABSTRACT

Voice is a convenient and popular way to interact with our digital world. Besides translating speech to text, it is also possible to identify speakers based on their voice profile. To date, speaker identification has predominantly been limited to high-performance computational platforms owing to the intricate nature of the underlying algorithms. In this work, we demonstrate that it is possible to reduce model complexity by the required factor of ~10, such that speaker identification can be made feasible for embedded devices with limited resources. We further describe and discuss novel use cases, such as voice-based presence detection and authentication, that become feasible on these class of devices.

References

A. Hajavi and A. Etemad. 2019. A Deep Neural Network for Short-Segment Speaker Recognition. In Proc. of Interspeech'19. Google ScholarCross Ref
M. Jakubec et al. 2021. Speaker Recognition with ResNet and VGG Networks. In Proc. of RADIOELEKTRONIKA'19. Google ScholarCross Ref
S. Koppula et al. 2018. Energy-Efficient Speaker Identification with Low-Precision Networks. In Proc. of ICASSP'18. Google ScholarDigital Library
C. Nunes et al. 2020. AM-MobileNet1D: A Portable Model for Speaker Recognition. In Proc. of IJCNN'20. Google ScholarCross Ref
S.S. Tirumala and S.R. Shahamiri. 2016. A Review on Deep Learning Approaches in Speaker Identification. In Proc. of ICSPS'16. Google ScholarDigital Library

Recommendations

Text-Independent Speaker Identification Using Vowel Formants

Automatic speaker identification has become a challenging research problem due to its wide variety of applications. Neural networks and audio-visual identification systems can be very powerful, but they have limitations related to the number of ...
Read More
Speaker Identification Using Whispered Speech
CSNT '13: Proceedings of the 2013 International Conference on Communication Systems and Network Technologies

The study of closed set text-independent speaker identification using whisper speech is presented in this paper. A new feature called temporal Teager energy based sub band cepstral coefficients (TTESBCC) is proposed. The work presented compares the ...
Read More
Text-Independent/Text-Prompted Speaker Recognition by Combining Speaker-Specific GMM with Speaker Adapted Syllable-Based HMM

We presented a new text-independent/text-prompted speaker recognition method by combining speaker-specific Gaussian Mixture Model (GMM) with syllable-based HMM adapted by MLLR or MAP. The robustness of this speaker recognition method for speaking style'...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
SenSys '23: Proceedings of the 21st ACM Conference on Embedded Networked Sensor Systems
November 2023
574 pages
ISBN:9798400704147
DOI:10.1145/3625687
General Chair:
Rasit Eskicioglu,
Program Chair:
Polly Huang,
Program Co-chair:
Neal Patwari
This work is licensed under a Creative Commons Attribution International 4.0 License.
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 26 April 2024
Check for updates
Author Tags
machine learning
speaker identification
embedded systems
Qualifiers
- short-paper
Conference

Acceptance Rates
Overall Acceptance Rate174of867submissions,20%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 17
  Total Downloads
- Downloads (Last 12 months)17
- Downloads (Last 6 weeks)17
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Poster Abstract: Towards Speaker Identification on Resource-Constrained Embedded Devices

SenSys '23: Proceedings of the 21st ACM Conference on Embedded Networked Sensor Systems

ABSTRACT

References

Cited By

Recommendations

Text-Independent Speaker Identification Using Vowel Formants

Speaker Identification Using Whispered Speech

Text-Independent/Text-Prompted Speaker Recognition by Combining Speaker-Specific GMM with Speaker Adapted Syllable-Based HMM