Abstract
This paper presents a combined speech recognition/speaker identification system that can be efficiently used for personalized domotic control. The proposed system works as a distributed framework and it is designed to identify a speaker in home environments in order to provide user access to customized options. Human speech signals contain both language and speaker dependent information. Using this information the system realizes a personalized control in home environments and this approach can also be applied in more generic scenarios such as car customization settings. The system was optimized with the aim to allow an immediate use only with the addition of small and cheap audio front-ends that will capture commands spoken by the user. Meanwhile a remote server performs the speech recognition as well as user identification and combines these informations to provides user specific settings which are sent back to the desired actuator at home.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Alessandrini M, Biagetti G, Curzi A, Turchetti C (2011) Semi-automatic acoustic model generation from large unsynchronized audio and text chunks. In: 12th Annual conference of the international speech communication association (Interspeech 2011). Florence, Italy, pp 1681–1684
Alessandrini M, Biagetti G, Curzi A, Turchetti C (2013) A garbage model generation technique for embedded speech recognisers. In: Signal processing: algorithms, architectures, arrangements, and applications (SPA 2013). Poznan, Poland, pp 318–322
Bazzi I, Glass JR (2000) Modeling out-of-vocabulary words for robust speech recognition. In: 6th International conference on spoken language processing (ICSLP 2000/INTERSPEECH 2000). Beijing, China, pp 401–404
Bhardwaj S, Srivastava S, Hanmandlu M, Gupta J (2013) GFM-based methods for speaker identification. IEEE Trans Cybern 43(3):1047–1058
Biagetti G, Crippa P, Curzi A, Falaschetti L, Orcioni S, Turchetti C (2015) Distributed speech recognition for lighting system control. In: Intelligent decision technologies. Springer, Berlin, pp 101–111
Biagetti G, Crippa P, Curzi A, Orcioni S, Turchetti C (2015) Speaker identification with short sequences of speech frames. In: International conference on pattern recognition applications and methods (ICPRAM 2015). Lisbon, Portugal, pp 178–185
Bimbot F, Bonastre JF, Fredouille C, Gravier G, Magrin-Chagnolleau I, Meignier S, Merlin T, Ortega-Garca J, Petrovska-Delacrétaz D, Reynolds DA (2004) A tutorial on text-independent speaker verification. EURASIP J Appl Sig Process 2004:430–451
Campbell JPJ (1997) Speaker recognition: a tutorial. Proc IEEE 85(9):1437–1462
Chen K, Wang L, Chi H (1997) Methods of combining multiple classifiers with different features and their applications to text-independent speaker identification. Int J Pattern Recognit Artif Intell 11(03):417–445
Dobrowolski A, Majda E (2011) Cepstral analysis in the speakers recognition systems. In: Signal processing algorithms, architectures, arrangements, and applications conference proceedings (SPA 2011). Poznan, Poland, pp 1–6
ETSI ES 202 050 V1.1.5 (2007) Speech processing, transmission and quality aspects (STQ); distributed speech recognition; advanced front-end feature extraction algorithm; compression algorithms
Figueiredo MAT, Jain A (2002) Unsupervised learning of finite mixture models. IEEE Trans Pattern Anal Mach Intell 24(3):381–396
Gish H, Schmidt M (1994) Text-independent speaker identification. IEEE Signal Process Mag 11(4):18–32
Hirota S, Hayasaka N, Iiguni Y (2012) Experimental evaluation of structure of garbage model generated from in-vocabulary words. In: 2012 International symposium on communications and information technologies (ISCIT 2012). Gold Coast, Australia, pp 87–92
Huang X, Acero A, Hon HW (2001) Spoken language processing: a guide to theory, algorithm, and system development. Prentice Hall PTR, Upper Saddle River
Jain A, Duin RPW, Mao J (2000) Statistical pattern recognition: a review. IEEE Trans Pattern Anal Mach Intell 22(1):4–37
Jain A, Ross A, Prabhakar S (2004) An introduction to biometric recognition. IEEE Trans Circuits Syst Video Technol 14(1):4–20
Kinnunen T, Li H (2010) An overview of text-independent speaker recognition: from features to supervectors. Speech Commun 52(1):12–40
Levit M, Chang S, Buntschuh B (2009) Garbage modeling with decoys for a sequential recognition scenario. In: IEEE workshop on automatic speech recognition understanding (ASRU 2009). Merano, Italy, pp 468–473
Reynolds D (1994) Experimental evaluation of features for robust speaker identification. IEEE Trans Speech Audio Process 2(4):639–643
Reynolds D (2002) An overview of automatic speaker recognition technology. In: 2002 IEEE international conference on acoustics, speech, and signal processing (ICASSP 2002). Orlando, Florida, vol 4, pp IV–4072–IV–4075
Reynolds D, Rose R (1995) Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans Speech Audio Process 3(1):72–83
Togneri R, Pullella D (2011) An overview of speaker identification: accuracy and robustness issues. IEEE Circuits Syst Mag 11(2):23–61
Walker W, Lamere P, Kwok P, Raj B, Singh R, Gouvea E, Wolf P, Woelfel J (2007) Sphinx-4: a flexible open source framework for speech recognition
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Biagetti, G., Crippa, P., Falaschetti, L., Orcioni, S., Turchetti, C. (2016). Distributed Speech and Speaker Identification System for Personalized Domotic Control. In: Conti, M., Martínez Madrid, N., Seepold, R., Orcioni, S. (eds) Mobile Networks for Biometric Data Analysis. Lecture Notes in Electrical Engineering, vol 392. Springer, Cham. https://doi.org/10.1007/978-3-319-39700-9_13
Download citation
DOI: https://doi.org/10.1007/978-3-319-39700-9_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-39698-9
Online ISBN: 978-3-319-39700-9
eBook Packages: EngineeringEngineering (R0)