research-article

Public Access

Voice Fingerprinting for Indoor Localization with a Single Microphone Array and Deep Learning

Authors:
Shivenkumar Parmar

California State University, Sacramento, Sacramento, CA, USA

California State University, Sacramento, Sacramento, CA, USA
View Profile

,
Xuyu Wang

California State University, Sacramento, Sacramento, CA, USA

California State University, Sacramento, Sacramento, CA, USA
View Profile

,
Chao Yang

Auburn University, Auburn, AL, USA

Auburn University, Auburn, AL, USA
View Profile

,
Shiwen Mao

Auburn University, Auburn, AL, USA

Auburn University, Auburn, AL, USA
View Profile

WiseML '22: Proceedings of the 2022 ACM Workshop on Wireless Security and Machine LearningMay 2022Pages 21–26https://doi.org/10.1145/3522783.3529528

Published:16 May 2022Publication History

WiseML '22: Proceedings of the 2022 ACM Workshop on Wireless Security and Machine Learning

Pages 21–26

ABSTRACT

With the fast development of the Internet of Things (IoT), smart speakers for voice assistance have become increasingly important in smart homes, which offers a new type of human-machine interaction interface. Voice localization with microphone arrays can improve smart speaker's performance and enable many new IoT applications. To address the challenges of complex indoor environments, such as non-line-of-sight (NLOS) and multi-path propagation, we propose voice fingerprinting for indoor localization using a single microphone array. The proposed system consists of a ReSpeaker 6-mic circular array kit connected to a Raspberry Pi and a deep learning model, and operates in offline training and online test stages. In the offline stage, the models are trained with spectrogram images obtained from audio data using short-time Fourier transform (STFT). Transfer learning is used to speed up the training process. In the online stage, a top-K probabilistic method is used for location estimation. Our experimental results demonstrate that the Inception-ResNet-v2 model can achieve a satisfactory localization performance with small location errors in two typical home environments.

References

M. Wang, W. Sun, and L. Qiu, "MAVL: Multiresolution analysis of voice localization," in Proc. USENIX NSDI'21, Virtual Conference, Apr. 2021, pp. 845--858.Google Scholar
W. Wang, J. Li, Y. He, and Y. Liu, "Symphony: Localizing multiple acoustic sources with a single microphone array," in Proc. ACM SenSys'20, Virtual Conference, Nov. 2020, pp. 82--94.Google ScholarDigital Library
M. E. Epstein and L. Vasserman, "Generating language models," US Patent 9,437,189, Sept. 2016.Google Scholar
Q. Lin, Z. An, and L. Yang, "Rebooting ultrasonic positioning systems for ultrasound-incapable smart devices," in Proc. ACM MobiCom'19, Los Cabos, Mexico, Oct. 2019, pp. 1--16.Google Scholar
T. C. Collier, A. N. Kirschel, and C. E. Taylor, "Acoustic localization of antbirds in a Mexican rainforest using a wireless sensor network," J. Acoustical Soc. America, vol. 128, no. 1, pp. 182--189, July 2010.Google ScholarCross Ref
S. Shen, D. Chen, Y.-L. Wei, Z. Yang, and R. R. Choudhury, "Voice localization using nearby wall reflections," in Proc. ACM MobiCom'20, London, UK, Sept. 2020, pp. 1--14.Google Scholar
J. Purohit, X. Wang, S. Mao, X. Sun, and C. Yang, "Fingerprinting-based indoor and outdoor localization with LoRa and deep learning," in Proc. IEEE GLOBECOM'20, Taipei, Taiwan, Dec. 2020, pp. 1--6.Google Scholar
X. Wang, L. Gao, S. Mao, and S. Pandey, "CSI-based fingerprinting for indoor localization: A deep learning approach," IEEE Trans. Veh. Technol., vol. 66, no. 1, pp. 763--776, Jan. 2017.Google Scholar
X. Wang, L. Gao, and S. Mao, "BiLoc: Bi-modality deep learning for indoor localization with 5GHz commodity Wi-Fi," IEEE Access J., vol. 5, no. 1, pp. 4209--4220, Mar. 2017.Google ScholarCross Ref
X. Wang, X. Wang, and S. Mao, "Deep convolutional neural networks for indoor localization with CSI images," IEEE Trans. Netw. Sci. Eng., vol. 7, no. 1, pp. 316--327, Jan./Mar. 2020.Google ScholarCross Ref
C. Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens, and Z. Wojna, "Rethinking the inception architecture for computer vision," in Proc. IEEE CVPR'16, Las Vegas, NV, June-July 2016, pp. 2818--2826.Google Scholar
K. He, X. Zhang, S. Ren, and J. Sun, "Identity mappings in deep residual networks," in Proc. 2016 European Conference on Computer Vision, Amsterdam, The Netherlands, Oct. 2016, pp. 630--645.Google ScholarCross Ref
C. Szegedy, S. Ioffe, V. Vanhoucke, and A. Alemi, "Inception-v4, inception-resnet and the impact of residual connections on learning," in Proc. AAAI'17, San Francisco, CA, Feb. 2017, pp. 4278--4284.Google Scholar
Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proc. IEEE, vol. 86, no. 11, pp. 2278--2324, Nov. 1998.Google ScholarCross Ref
K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," in Proc. IEEE CVPR'16, Las Vegas, NV, June-July 2016, pp. 770--778.Google Scholar
X. Wang, X. Wang, and S. Mao, "Indoor fingerprinting with bimodal CSI tensors: A deep residual sharing learning approach," IEEE Internet of Things Journal, vol. 8, no. 6, pp. 4498--4513, Mar. 2021.Google ScholarCross Ref
M. Youssef and A. Agrawala, "The Horus WLAN location determination system," in Proc. ACM MobiSys'05, Seattle, WA, June 2005, pp. 205--218.Google ScholarDigital Library
X. Wang, Z. Yu, and S. Mao, "Indoor localization using magnetic and light sensors with smartphones: A deep LS™ approach," Springer Mobile Networks and Applications (MONET) J., vol. 25, no. 2, pp. 819--832, Apr. 2020.Google ScholarDigital Library
Seed Wiki, "ReSpeaker 6-Mic circular array kit for Raspberry Pi," Jan. 2019. [Online]. Available: https://wiki.seeedstudio.com/BIBentrySTDinterwordspacingGoogle Scholar

Index Terms

Voice Fingerprinting for Indoor Localization with a Single Microphone Array and Deep Learning

Index terms have been assigned to the content through auto-classification.

Recommendations

Speaker Tracking and Identifying Based on Indoor Localization System and Microphone Array
AINAW '07: Proceedings of the 21st International Conference on Advanced Information Networking and Applications Workshops - Volume 02

This paper presents a novel multimodal system to track the participants and identify the active speaker in the smart meeting room. Indoor localization system, Cicada, is used to obtain the location and identity information of the participants. Cicada, ...
Read More
Indoor human localization with orientation using WiFi fingerprinting
ICUIMC '14: Proceedings of the 8th International Conference on Ubiquitous Information Management and Communication

Localization in indoor environment poses a fundamental challenge in ubiquitous computing compared to its well-established GPS-based outdoor environment counterpart. This study investigated the feasibility of a WiFi-based indoor positioning system to ...
Read More
Deep belief networks for fingerprinting indoor localization using ultrawideband technology

With the increasing requirement of localization services in indoor environment, indoor localization techniques have drawn a lot of attention. In recent years, fingerprinting localization techniques have been proved to be effective in indoor localization ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
WiseML '22: Proceedings of the 2022 ACM Workshop on Wireless Security and Machine Learning
May 2022
93 pages
ISBN:9781450392778
DOI:10.1145/3522783
General Chair:
Murtuza Jadliwala
University of Texas at San Antonio, San Antonio, Texas, USA
Copyright © 2022 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 16 May 2022
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
deep learning
internet of things (iot)
microphone array.
transfer learning
voice localization
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 182
  Total Downloads
- Downloads (Last 12 months)85
- Downloads (Last 6 weeks)13
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Voice Fingerprinting for Indoor Localization with a Single Microphone Array and Deep Learning

WiseML '22: Proceedings of the 2022 ACM Workshop on Wireless Security and Machine Learning

ABSTRACT

References

Cited By

Index Terms

Recommendations

Speaker Tracking and Identifying Based on Indoor Localization System and Microphone Array

Indoor human localization with orientation using WiFi fingerprinting

Deep belief networks for fingerprinting indoor localization using ultrawideband technology

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Voice Fingerprinting for Indoor Localization with a Single Microphone Array and Deep Learning

WiseML '22: Proceedings of the 2022 ACM Workshop on Wireless Security and Machine Learning

ABSTRACT

References

Cited By

Index Terms

Recommendations

Speaker Tracking and Identifying Based on Indoor Localization System and Microphone Array

Indoor human localization with orientation using WiFi fingerprinting

Deep belief networks for fingerprinting indoor localization using ultrawideband technology

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media