Abstract
The popularity and portability of high-fidelity recording devices and playback devices pose severe challenges for speaker recognition systems against replay voice attacks. In this paper, the signal of audio is transformed into the frequency domain through the Fourier trans-form and constant Q transform. Compared with genuine voice, the mean and standard deviation of the replay voice at each frequency bin has changed slightly. And through the coefficient of variation to further analyze the difference between genuine voice and replay voice. A detection algorithm based on fusion feature is proposed. The algorithm uses two kinds of time-frequency transform coefficients and their cepstrum characteristics to train the GMM model and calculate the likelihood ratio score. Finally, the replay voice is detected by the fusion of scores. The experimental results show that the algorithm is about 13% lower than the baseline EER provided by The ASV Spoof 2017.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Zhu, D., Ma, B., Li, H.: Speaker verification with feature-space MAPLR parameters. IEEE Trans. Audio Speech Lang. Process. 19(3), 505–515 (2010)
Wu, Z., Kinnunen, T., Evans, N., et al.: ASV spoof 2015: The first automatic speaker verification spoofing and countermeasures challenge. In: 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015, vol. 11, pp. 588–604. INTERSPEECH, Dresden (2015)
Alegre, F., Janicki, A., Evans, N.: Re-assessing the threat of replay spoofing attacks against automatic speaker verification. biometrics special interest group. In: Proceedings of the 2014 Biometrics Special Interest Group, pp. 1–6. IEEE, Piscataway (2014)
Shang, W., Stevensin, M.: A playback attack detector for speaker verification systems. In: International Symposium on Communications Control and Signal Processing, ISCCSP 2008, pp. 1144–1149. IEEE, Piscataway (2008)
Jakub, G., Marcin, G., Rafal, S.: Playback attack detection for text-dependent speaker verification over telephone channels. Speech Commun. 67, 143–153 (2015)
Todisco, M., Delgado, H., Evans, N.: A new feature for automatic speaker verification anti-spoofing: constant Q cepstral coefficients. In: Odyssey 2016-The Speaker and Language Recognition Workshop, pp. 283–290. IEEE, Piscataway (2016)
Wu, Z., Yamagishi, J., Kinnunen, T., et al.: ASV spoof: the automatic speaker verification spoofing and countermeasures challenge. IEEE J. Sel. Top. Signal Process. 11, 588–604 (2017)
Ji, Z., Li, Z.Y., Li, P., et al.: Ensemble learning for countermeasure of audio replay spoofing attack in ASVspoof2017. In: INTERSPEECH 2017, pp. 87–91. INTERSPEECH, Stockholm (2017)
Lantian, L., Yixiang, C., Dong, W.: A study on replay attack and anti-spoofing for automatic speaker verification. In: INTERSPEECH 2017, pp. 92–96. INTERSPEECH, Stockholm (2017)
Evans, N.W.D., Kinnunen, T., Yamagishi, J.: Spoofing and countermeasures for automatic speaker verification. In: Proceedings of the 2013 Conference of the International Speech Communication Association, INTERSPEECH 2013, pp. 925–929. INTERSPEECH, Lyon (2013)
Lee, K.A., Larcher, A., Wang, G., et al.: The reddots data collection for speaker recognition. In: 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015, pp. 2996–3000. INTERSPEECH, Dresden (2015)
Kinnunen, T., Sahidullah, M., Delgado, H., et al.: The ASV spoof 2017 challenge: assessing the limits of replay spoofing attack detection. In: INTERSPEECH 2017, pp. 1–6. INTERSPEECH, Stockholm (2017)
Acknowledgments
This work is supported by the National Natural Science Foundation of China (Grant No. U1736215, 61672302), Zhejiang Natural Science Foundation (Grant No. LZ15F020002, LY17F020010), Ningbo Natural Science Foundation (Grant No. 2017A610123), Ningbo University Fund (Grant No. XKXL1509, XKXL1503).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Lin, L., Wang, R., Yan, D., Li, C. (2018). A Replay Voice Detection Algorithm Based on Multi-feature Fusion. In: Sun, X., Pan, Z., Bertino, E. (eds) Cloud Computing and Security. ICCCS 2018. Lecture Notes in Computer Science(), vol 11068. Springer, Cham. https://doi.org/10.1007/978-3-030-00021-9_27
Download citation
DOI: https://doi.org/10.1007/978-3-030-00021-9_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00020-2
Online ISBN: 978-3-030-00021-9
eBook Packages: Computer ScienceComputer Science (R0)