Skip to main content

A Replay Voice Detection Algorithm Based on Multi-feature Fusion

  • Conference paper
  • First Online:
Cloud Computing and Security (ICCCS 2018)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11068))

Included in the following conference series:

  • 1486 Accesses

Abstract

The popularity and portability of high-fidelity recording devices and playback devices pose severe challenges for speaker recognition systems against replay voice attacks. In this paper, the signal of audio is transformed into the frequency domain through the Fourier trans-form and constant Q transform. Compared with genuine voice, the mean and standard deviation of the replay voice at each frequency bin has changed slightly. And through the coefficient of variation to further analyze the difference between genuine voice and replay voice. A detection algorithm based on fusion feature is proposed. The algorithm uses two kinds of time-frequency transform coefficients and their cepstrum characteristics to train the GMM model and calculate the likelihood ratio score. Finally, the replay voice is detected by the fusion of scores. The experimental results show that the algorithm is about 13% lower than the baseline EER provided by The ASV Spoof 2017.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Zhu, D., Ma, B., Li, H.: Speaker verification with feature-space MAPLR parameters. IEEE Trans. Audio Speech Lang. Process. 19(3), 505–515 (2010)

    Article  Google Scholar 

  2. Wu, Z., Kinnunen, T., Evans, N., et al.: ASV spoof 2015: The first automatic speaker verification spoofing and countermeasures challenge. In: 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015, vol. 11, pp. 588–604. INTERSPEECH, Dresden (2015)

    Google Scholar 

  3. Alegre, F., Janicki, A., Evans, N.: Re-assessing the threat of replay spoofing attacks against automatic speaker verification. biometrics special interest group. In: Proceedings of the 2014 Biometrics Special Interest Group, pp. 1–6. IEEE, Piscataway (2014)

    Google Scholar 

  4. Shang, W., Stevensin, M.: A playback attack detector for speaker verification systems. In: International Symposium on Communications Control and Signal Processing, ISCCSP 2008, pp. 1144–1149. IEEE, Piscataway (2008)

    Google Scholar 

  5. Jakub, G., Marcin, G., Rafal, S.: Playback attack detection for text-dependent speaker verification over telephone channels. Speech Commun. 67, 143–153 (2015)

    Article  Google Scholar 

  6. Todisco, M., Delgado, H., Evans, N.: A new feature for automatic speaker verification anti-spoofing: constant Q cepstral coefficients. In: Odyssey 2016-The Speaker and Language Recognition Workshop, pp. 283–290. IEEE, Piscataway (2016)

    Google Scholar 

  7. Wu, Z., Yamagishi, J., Kinnunen, T., et al.: ASV spoof: the automatic speaker verification spoofing and countermeasures challenge. IEEE J. Sel. Top. Signal Process. 11, 588–604 (2017)

    Article  Google Scholar 

  8. Ji, Z., Li, Z.Y., Li, P., et al.: Ensemble learning for countermeasure of audio replay spoofing attack in ASVspoof2017. In: INTERSPEECH 2017, pp. 87–91. INTERSPEECH, Stockholm (2017)

    Google Scholar 

  9. Lantian, L., Yixiang, C., Dong, W.: A study on replay attack and anti-spoofing for automatic speaker verification. In: INTERSPEECH 2017, pp. 92–96. INTERSPEECH, Stockholm (2017)

    Google Scholar 

  10. Evans, N.W.D., Kinnunen, T., Yamagishi, J.: Spoofing and countermeasures for automatic speaker verification. In: Proceedings of the 2013 Conference of the International Speech Communication Association, INTERSPEECH 2013, pp. 925–929. INTERSPEECH, Lyon (2013)

    Google Scholar 

  11. Lee, K.A., Larcher, A., Wang, G., et al.: The reddots data collection for speaker recognition. In: 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015, pp. 2996–3000. INTERSPEECH, Dresden (2015)

    Google Scholar 

  12. Kinnunen, T., Sahidullah, M., Delgado, H., et al.: The ASV spoof 2017 challenge: assessing the limits of replay spoofing attack detection. In: INTERSPEECH 2017, pp. 1–6. INTERSPEECH, Stockholm (2017)

    Google Scholar 

Download references

Acknowledgments

This work is supported by the National Natural Science Foundation of China (Grant No. U1736215, 61672302), Zhejiang Natural Science Foundation (Grant No. LZ15F020002, LY17F020010), Ningbo Natural Science Foundation (Grant No. 2017A610123), Ningbo University Fund (Grant No. XKXL1509, XKXL1503).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lang Lin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Lin, L., Wang, R., Yan, D., Li, C. (2018). A Replay Voice Detection Algorithm Based on Multi-feature Fusion. In: Sun, X., Pan, Z., Bertino, E. (eds) Cloud Computing and Security. ICCCS 2018. Lecture Notes in Computer Science(), vol 11068. Springer, Cham. https://doi.org/10.1007/978-3-030-00021-9_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-00021-9_27

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-00020-2

  • Online ISBN: 978-3-030-00021-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics