A Cancelable Biometric Approach for Efficient Identification of Speakers from Encrypted Speech

Abd El-Wahab, Basant S.; El-Khobby, Heba A.; Elnaby, Mustafa M. Abd; El-Samie, Fathi E. Abd

doi:10.1007/s11277-021-08384-5

A Cancelable Biometric Approach for Efficient Identification of Speakers from Encrypted Speech

Published: 25 April 2022

Volume 124, pages 1899–1921, (2022)
Cite this article

Wireless Personal Communications Aims and scope Submit manuscript

Basant S. Abd El-Wahab¹,
Heba A. El-Khobby¹,
Mustafa M. Abd Elnaby¹ &
…
Fathi E. Abd El-Samie^2,3

255 Accesses
2 Citations
Explore all metrics

Abstract

Biometric systems based on feature extraction from speech signals are extensively deployed in different security systems. This work presents a secure speaker identification system, namely cancelable speaker identification system, which can be further used for remote access applications. Two formulations for efficient cryptosystems based on a chaotic map and the empirical mode decomposition (EMD) with a single key are presented for this purpose. The encryption achieves a large degree of security in the speaker identification systems. In the proposed cancelable speaker identification scenario, cepstral features are extracted from encrypted speech signals. The matching in this proposed scenario is performed with artificial neural networks (ANNs). Furthermore, various enhancement methods are implemented at the receiver side to remove the noise effect prior to speaker identification. Consequently, the features of the speakers are protected from the attackers. The simulation results prove that the proposed cancelable biometric system has an outstanding performance level compared to the traditional biometric systems. It achieves a recognition rate up to 100% at a − 20 dB signal-to-noise ratio (SNR) and above, using the two proposed cryptosystems with a high level of security.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 6

Performance enhancement of speaker identification systems using speech encryption and cancelable features

Article 10 October 2017

Novel secured speech communication for person authentication

Article 06 December 2022

Efficient cancelable speaker identification system based on a hybrid structure of DWT and SVD

Article 28 January 2022

References

Reynolds, D. A. (2002). An overview of automatic speaker recognition technology. In 2002 IEEE international conference on in Acoustics, speech, and signal processing (ICASSP) (pp. IV-4072-IV-4075).
de Lara, J. R. C. (2005). A method of automatic speaker recognition using cepstral features and vectorial quantization. In Iberoamerican congress on pattern recognition (pp. 146–153).
Saksamudre, S.K., Shrishrimal, P.P., Deshmukh, R.R. A review on different approaches for speech recognition system. International Journal of Computer Applications, 115(22.)‏
Sotelo, E. E., Nakamura, T., Nagai, T., & Hernandez, E. E. (2012) Who said that? The crossmodal matching identity for inferring unfamiliar faces from voices. In 2012 Eighth international conference on signal image technology and internet based systems (SITIS) (pp. 97–104).
Chauhan, N., Isshiki, T., & Li, D. (2019) Speaker recognition using LPC, MFCC, ZCR features with ANN and SVM classifier for large input database. In IEEE 4th international conference on computer and communication systems (ICCCS). IEEE (pp. 130–133).
Abd El-Wahab, B. S., El-khobby, H. A., Abd Elnaby, M. M., & Abd El-Samie, F. E. (2011) Simultaneous speaker identification and watermarking. International Journal of Speech Technology, pp. 1–14.‏
Kekre, H., Athawale, A., Desai, M. (2011). Speaker identification using row mean vector of spectrogram. In Proceedings of the international conference & workshop on emerging trends in technology (pp. 171–174).
Verma, G. K. (2011) Multi-feature fusion for closed set text independent speaker identification. In International conference on information intelligence, systems, technology and management (pp. 170–179).
Dutta, M., Patgiri, C., Sarma, M., & Sarma, K. K. (2015). Closed-set text-independent speaker identification system using multiple ANN classifiers. In Proceedings of the 3_rd international conference on frontiers of intelligent computing: Theory and applications (FICTA) 2014 (pp. 377–385).
Kinnunen, T. (2003). Spectral features for automatic text-independent speaker recognition. Licentiate’s Thesis.
Kurzekar, P. K., Deshmukh, R. R., Waghmare, V. B., & Shrishrimal, P. P. (2014). A comparative study of feature extraction techniques for speech recognition system. International Journal of Innovative Research in Science, Engineering and Technology, 3, 18006–18016.
Article Google Scholar
Ratha, N. K., Connell, J. H., & Bolle, R. M. (2001). Enhancing security and privacy in biometrics-based authentication systems. IBM Systems Journal, 40, 614–634.
Article Google Scholar
Ratha, N. K., Chikkerur, S., Connell, J. H., & Bolle, R. M. (2007). Generating cancelable fingerprint templates. IEEE Transactions on Pattern Analysis and Machine Intelligence, 29, 561–572.
Article Google Scholar
Ying, C. L., & Jin, A. T. B. (2007). Probabilistic random projections and speaker verification. In International conference on biometrics (pp. 445–454).
Wang, Y., & Plataniotis, K. N. (2010). An analysis of random projection for changeable and privacy-preserving biometric verification. IEEE Transactions on Systems, Man, and Cybernetics Part B (Cybernetics), 40, 1280–1293.
Article Google Scholar
Teoh, A. B. J., & Chong, L.-Y. (2010). Secure speech template protection in speaker verification system. Speech Communication, 52, 150–163.
Article Google Scholar
Jin, A. T. B., Ling, D. N. C., & Goh, A. (2004). Biohashing: Two factor authentication featuring fingerprint data and tokenised random number. Pattern Recognition, 37(2004), 2245–2255.
Article Google Scholar
Teoh, A. B., Goh, A., & Ngo, D. C. (2006). Random multispace quantization as an analytic mechanism for biohashing of biometric and random identity inputs. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28, 1892–1901.
Article Google Scholar
Hu, H. T., Lin, S. J., & Hsu, L. Y. (2017). Effective blind speech watermarking via adaptive mean modulation and package synchronization in DWT domain. EURASIP Journal on Audio, Speech, and Music Processing, 1, 10.
Article Google Scholar
Hammad, M., Luo, G., & Wang, K. (2019). Cancelable biometric authentication system based on ECG. Multimedia Tools and Applications, 78, 1857–1887.
Article Google Scholar
Tulyakov, S., Farooq, F., & Govindaraju, V. (2005) Symmetric hash functions for fingerprint minutiae. In International conference on pattern recognition and image analysis (pp. 30–38).
Juels, A., & Sudan, M. (2006). A fuzzy vault scheme. Designs, Codes and Cryptography, 38, 237–257.
Article MathSciNet MATH Google Scholar
Xu, W., & Cheng, M. (2008). Cancelable voiceprint template based on chaff-points-mixture method. In CIS'08. International conference on computational intelligence and security, 2008 (pp. 263–266).
Sandyarani, K., & Kumar, P. N. (2018). Efficient substructure sharing methods for optimizing the composite s-box, mixcolumn and inverse mixcolumn in rijndael advanced encryption standard. Journal of Computational and Theoretical Nanoscience, 15(3), 798–810.
Article Google Scholar
Ds, A., & Minu, K. (2013). Frequency speech scrambler based on hartley transform and OFDM algorithm. International Journal of Computer Applications, 61(8), 36–40.
Article Google Scholar
Brasser, F., et al. (2018). VoiceGuard: Secure and private speech processing. In Interspeech (pp. 1303–1307)
Alroubaie, Z. M., Hashem, M. A., & Hasan, F. S. (2019). FPGA design of encryption speech system using synchronized fixed-point chaotic maps based stream ciphers. International Journal of Engineering and Advanced Technology, 8(6), 1534–1541.
Article Google Scholar
Milton, R. (1989). A time and frequency-domain speech scrambler. In COMSIG 1989. Proceedings Communications and Signal Processing, 1989, Southern African Conference on, 1989 (pp. 125–130).
Sadkhan, S., & Abbas, N. (2012). Speech scrambling based on wavelet transform. In Advances in wavelet theory and their applications in engineering physics and technology (pp. 41–58).‏
Manjunath, G., & Anand, G. (2002). Speech encryption using circulant transformations. In ICME'02. Proceedings. 2002 IEEE international conference on multimedia and expo, 2002 (pp. 553–556).
Wu, Y., Ng, B. P. (2002). Speech scrambling with Hadamard transform in frequency domain. In 2002 6_th International conference on signal processing (pp. 1560–1563).
Jiao, Ge., Li, L., & Zou, Yi. (2019). Improved security for android system based on multi-chaotic maps using a novel image encryption algorithm. International Journal of Performability Engineering, 15(6), 1692.
Google Scholar
Al Saad, S. N., & Hato, E. (2014). A speech encryption based on chaotic maps. International Journal of Computer Applications, 93, 19–28.
Google Scholar
Zhai, Y., Lin, S., & Zhang, Q. (2008) Improving image encryption using multi-chaotic map. In Workshop on power electronics and intelligent transportation system, 2008. PEITS'08 (pp. 143–148).
Ratnavelu, K., et al. (2017). Image encryption method based on chaotic fuzzy cellular neural networks. Signal Processing, 140, 87–96.
Article Google Scholar
Wu, X., Zhu, B., Hu, Y., & Ran, Y. (2017). A novel color image encryption scheme using rectangular transform-enhanced chaotic tent maps. IEEE Access, 5, 6429–6436.
Google Scholar
Hassan, E. S., Zhu, X., El-Khamy, S. E., Dessouky, M. I., El-Dolil, S. A., & El-Samie, F. E. A. (2012). A chaotic interleaving scheme for the continuous phase modulation based single-carrier frequency-domain equalization system. Wireless Personal Communications, 62, 183–199.
Article MATH Google Scholar
Khaldi, K., & Boudraa, A.-O. (2012). On signals compression by EMD. Electronics Letters, 48(21), 1329–1331.
Article Google Scholar
Khaldi, K., Turki-Hadj Alouane, M. O. N. I. A., & Boudraa, A. O. (2010). Voiced speech enhancement based on adaptive filtering of selected intrinsic mode functions. Advances in Adaptive Data Analysis, 2(01), 65–80.
Article MathSciNet Google Scholar
Tirumala, S. S., Shahamiri, S. R., Garhwal, A. S., & Wang, R. (2017). Speaker identification features extraction methods: A systematic review. Expert Systems With Applications, 90, 250–271.
Article Google Scholar
Neville, K. L., & Hussain, Z. M. (2009). Effects of wavelet compression of speech on its Mel-Cepstral coefficients. In International conference on communication, computer and power (ICCCP’09), Muscat (pp. 387–390).
Gupta, S., Jaafar, J., Ahmad, W. F. W., & Bansal, A. (2013). Feature extraction using MFCC. Signal & Image Processing, 4, 101–108.
Google Scholar
Childers, D. G., Skinner, D. P., & Kemerait, R. C. (1977). The cepstrum: A guide to processing. Proceedings of the IEEE, 65, 1428–1443.
Article Google Scholar
Muda, L., Begam, M., & Elamvazuthi, I. (2010). Voice recognition algorithms using mel frequency cepstral coefficient (MFCC) and dynamic time warping (DTW) techniques. arXiv preprint https://arxiv.org/abs/1003.4083.
Yu, H., Tan, Z. H., Ma, Z., Martin, R., & Guo, J. (2017). Spoofing detection in automatic speaker verification systems using DNN classifiers and dynamic acoustic features. IEEE Transactions on Neural Networks and Learning Systems, 29(10), 4633–4644.
Article Google Scholar
Galushkin, A. I. (2007). Neural networks theory. . Springer.
MATH Google Scholar
Suzuki, K. (Ed.). (2013). Artificial neural networks: Architectures and applications. BoD–Books on Demand
Evans, N. W., Mason, J. S., Liu, W. M., & Fauve, B. (2006). An assessment on the fundamental limitations of spectral subtraction. In 2006 IEEE international conference on acoustics speech and signal processing proceedings (Vol. 1, pp. 1–1). IEEE.
Purushotham, U., & Suresh, K. (2018). Implementation of spectral subtraction using sub-band filtering in DSP C6748 processor for enhancing speech signal. In Advances in machine learning and data science (pp. 259–267). Springer, Singapore.
Abd El-Fattah, M., Dessouky, M. I., Diab, S. M., & AbdEl-Samie, F.E.-S. (2008). Speech enhancement using an adaptive wiener filtering approach. Progress in Electromagnetics Research, 4, 167–184.
Article Google Scholar
Macq, B., Dittmann, J., & Delp, E. J. (2004). Benchmarking of image watermarking algorithms for digital rights management. Proceedings of the IEEE, 92, 971–984.
Article Google Scholar
Et-Khamy, S. E., Hadhoud, M. M., Dessouky, M. L., Salam, B. M., & Abd El-Sarnie, F. E. (2003). Sectioned implementation of regularized image interpolation. In 2003 46th midwest symposium on circuits and systems (Vol. 2, pp. 656–659). IEEE.
Rahimizadeh, N., Hasanzadeh, R. P., & Janabi-Sharifi, F. (2020). An optimized non-local LMMSE approach for speckle noise reduction of medical ultrasound images. Multimedia Tools and Applications, 1–23.

Download references

Author information

Authors and Affiliations

Department of Electronics and Electrical Communications Engineering, Faculty of Engineering, Tanta University, Tanta, Egypt
Basant S. Abd El-Wahab, Heba A. El-Khobby & Mustafa M. Abd Elnaby
Department of Electronics and Electrical Communications Engineering, Faculty of Electronic Engineering, Menoufia University, Menoufia, Egypt
Fathi E. Abd El-Samie
Department of Information Technology, College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University, P.O. Box 84428, Riyadh, 11671, Saudi Arabia
Fathi E. Abd El-Samie

Authors

Basant S. Abd El-Wahab
View author publications
You can also search for this author in PubMed Google Scholar
Heba A. El-Khobby
View author publications
You can also search for this author in PubMed Google Scholar
Mustafa M. Abd Elnaby
View author publications
You can also search for this author in PubMed Google Scholar
Fathi E. Abd El-Samie
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Heba A. El-Khobby.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Abd El-Wahab, B.S., El-Khobby, H.A., Elnaby, M.M.A. et al. A Cancelable Biometric Approach for Efficient Identification of Speakers from Encrypted Speech. Wireless Pers Commun 124, 1899–1921 (2022). https://doi.org/10.1007/s11277-021-08384-5

Download citation

Accepted: 15 March 2021
Published: 25 April 2022
Issue Date: June 2022
DOI: https://doi.org/10.1007/s11277-021-08384-5

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Cancelable Biometric Approach for Efficient Identification of Speakers from Encrypted Speech

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Performance enhancement of speaker identification systems using speech encryption and cancelable features

Novel secured speech communication for person authentication

Efficient cancelable speaker identification system based on a hybrid structure of DWT and SVD

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

A Cancelable Biometric Approach for Efficient Identification of Speakers from Encrypted Speech

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Performance enhancement of speaker identification systems using speech encryption and cancelable features

Novel secured speech communication for person authentication

Efficient cancelable speaker identification system based on a hybrid structure of DWT and SVD

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation