A Replay Voice Detection Algorithm Based on Multi-feature Fusion

Lin, Lang; Wang, Rangding; Yan, Diqun; Li, Can

doi:10.1007/978-3-030-00021-9_27

Lang Lin¹⁶,
Rangding Wang¹⁶,
Diqun Yan¹⁶ &
…
Can Li¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11068))

Included in the following conference series:

International Conference on Cloud Computing and Security

1486 Accesses

Abstract

The popularity and portability of high-fidelity recording devices and playback devices pose severe challenges for speaker recognition systems against replay voice attacks. In this paper, the signal of audio is transformed into the frequency domain through the Fourier trans-form and constant Q transform. Compared with genuine voice, the mean and standard deviation of the replay voice at each frequency bin has changed slightly. And through the coefficient of variation to further analyze the difference between genuine voice and replay voice. A detection algorithm based on fusion feature is proposed. The algorithm uses two kinds of time-frequency transform coefficients and their cepstrum characteristics to train the GMM model and calculate the likelihood ratio score. Finally, the replay voice is detected by the fusion of scores. The experimental results show that the algorithm is about 13% lower than the baseline EER provided by The ASV Spoof 2017.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Zhu, D., Ma, B., Li, H.: Speaker verification with feature-space MAPLR parameters. IEEE Trans. Audio Speech Lang. Process. 19(3), 505–515 (2010)
Article Google Scholar
Wu, Z., Kinnunen, T., Evans, N., et al.: ASV spoof 2015: The first automatic speaker verification spoofing and countermeasures challenge. In: 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015, vol. 11, pp. 588–604. INTERSPEECH, Dresden (2015)
Google Scholar
Alegre, F., Janicki, A., Evans, N.: Re-assessing the threat of replay spoofing attacks against automatic speaker verification. biometrics special interest group. In: Proceedings of the 2014 Biometrics Special Interest Group, pp. 1–6. IEEE, Piscataway (2014)
Google Scholar
Shang, W., Stevensin, M.: A playback attack detector for speaker verification systems. In: International Symposium on Communications Control and Signal Processing, ISCCSP 2008, pp. 1144–1149. IEEE, Piscataway (2008)
Google Scholar
Jakub, G., Marcin, G., Rafal, S.: Playback attack detection for text-dependent speaker verification over telephone channels. Speech Commun. 67, 143–153 (2015)
Article Google Scholar
Todisco, M., Delgado, H., Evans, N.: A new feature for automatic speaker verification anti-spoofing: constant Q cepstral coefficients. In: Odyssey 2016-The Speaker and Language Recognition Workshop, pp. 283–290. IEEE, Piscataway (2016)
Google Scholar
Wu, Z., Yamagishi, J., Kinnunen, T., et al.: ASV spoof: the automatic speaker verification spoofing and countermeasures challenge. IEEE J. Sel. Top. Signal Process. 11, 588–604 (2017)
Article Google Scholar
Ji, Z., Li, Z.Y., Li, P., et al.: Ensemble learning for countermeasure of audio replay spoofing attack in ASVspoof2017. In: INTERSPEECH 2017, pp. 87–91. INTERSPEECH, Stockholm (2017)
Google Scholar
Lantian, L., Yixiang, C., Dong, W.: A study on replay attack and anti-spoofing for automatic speaker verification. In: INTERSPEECH 2017, pp. 92–96. INTERSPEECH, Stockholm (2017)
Google Scholar
Evans, N.W.D., Kinnunen, T., Yamagishi, J.: Spoofing and countermeasures for automatic speaker verification. In: Proceedings of the 2013 Conference of the International Speech Communication Association, INTERSPEECH 2013, pp. 925–929. INTERSPEECH, Lyon (2013)
Google Scholar
Lee, K.A., Larcher, A., Wang, G., et al.: The reddots data collection for speaker recognition. In: 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015, pp. 2996–3000. INTERSPEECH, Dresden (2015)
Google Scholar
Kinnunen, T., Sahidullah, M., Delgado, H., et al.: The ASV spoof 2017 challenge: assessing the limits of replay spoofing attack detection. In: INTERSPEECH 2017, pp. 1–6. INTERSPEECH, Stockholm (2017)
Google Scholar

Download references

Acknowledgments

This work is supported by the National Natural Science Foundation of China (Grant No. U1736215, 61672302), Zhejiang Natural Science Foundation (Grant No. LZ15F020002, LY17F020010), Ningbo Natural Science Foundation (Grant No. 2017A610123), Ningbo University Fund (Grant No. XKXL1509, XKXL1503).

Author information

Authors and Affiliations

College of Information Science and Engineering of Ningbo University, Ningbo, 315211, China
Lang Lin, Rangding Wang, Diqun Yan & Can Li

Authors

Lang Lin
View author publications
You can also search for this author in PubMed Google Scholar
Rangding Wang
View author publications
You can also search for this author in PubMed Google Scholar
Diqun Yan
View author publications
You can also search for this author in PubMed Google Scholar
Can Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lang Lin .

Editor information

Editors and Affiliations

Nanjing University of Information Science and Technology, Nanjing, China
Xingming Sun
Nanjing University of Information Science and Technology, Nanjing, China
Zhaoqing Pan
Department of Computer Science, Purdue University, West Lafayette, IN, USA
Elisa Bertino

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lin, L., Wang, R., Yan, D., Li, C. (2018). A Replay Voice Detection Algorithm Based on Multi-feature Fusion. In: Sun, X., Pan, Z., Bertino, E. (eds) Cloud Computing and Security. ICCCS 2018. Lecture Notes in Computer Science(), vol 11068. Springer, Cham. https://doi.org/10.1007/978-3-030-00021-9_27

Download citation

DOI: https://doi.org/10.1007/978-3-030-00021-9_27
Published: 26 September 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00020-2
Online ISBN: 978-3-030-00021-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics