ISCA Archive Odyssey 2022
ISCA Archive Odyssey 2022

A Novel Feature Based on Graph Signal Processing for Detection of Physical Access Attacks

Longting Xu, Mianxin Tian, Xing Guo, Zhiyong Shan, Jie Jia, Yiyuan Peng, Jichen Yang, Rohan Kumar Das

Speaker verification systems face threat from various spoofing attacks and particularly, the physical access attacks or replay attacks that are most common show an imminent threat. Literature shows that graph signal processing (GSP) shows a better correlation between speech samples and explore more hidden information from speech than the traditional digital signal processing methods. With this motivation, we propose a novel feature based on GSP, namely, graph frequency cepstral coefficient (GFCC). We use the combined shift operator to construct the graph signal, and then carry out the graph Fourier analysis to extract GFCC features. It is observed that compared to fast Fourier transform, the GFT can more accurately represent the structural relationship of speech samples, which makes the real and replay speech very distinguishable in the frequency domain. We use the GFCC features with a light convolutional neural network system in our studies. The results on ASVspoof 2019 physical access corpus show that the proposed GFCC feature based system outperforms the challenge baselines by a large margin and emerge as one of the best performing state-of-the-art single systems.


doi: 10.21437/Odyssey.2022-15

Cite as: Xu, L., Tian, M., Guo, X., Shan, Z., Jia, J., Peng, Y., Yang, J., Das, R.K. (2022) A Novel Feature Based on Graph Signal Processing for Detection of Physical Access Attacks. Proc. The Speaker and Language Recognition Workshop (Odyssey 2022), 107-111, doi: 10.21437/Odyssey.2022-15

@inproceedings{xu22_odyssey,
  author={Longting Xu and Mianxin Tian and Xing Guo and Zhiyong Shan and Jie Jia and Yiyuan Peng and Jichen Yang and Rohan Kumar Das},
  title={{A Novel Feature Based on Graph Signal Processing for Detection of Physical Access Attacks}},
  year=2022,
  booktitle={Proc. The Speaker and Language Recognition Workshop (Odyssey 2022)},
  pages={107--111},
  doi={10.21437/Odyssey.2022-15}
}