Abstract:
This study deals with the problem of voice liveness detection (VLD) in order to increase biometric security. We propose Linear Frequency Residual Cepstral Coefficients (L...Show MoreMetadata
Abstract:
This study deals with the problem of voice liveness detection (VLD) in order to increase biometric security. We propose Linear Frequency Residual Cepstral Coefficients (LFRCC) features, and Bi-directional Long Short-Term Memory (BiLSTM) classifier to ascertain whether the speech is live or not. We employed the POp COrpus (POCO) dataset, which is a standard dataset for the VLD problem. We compare our work with various other well known spectral features, such as Mel Frequency Cepstral Coefficients (MFCC), and Linear Frequency Cepstral Coefficients (LFCC) using BiLSTM as pattern classifier. Our optimal results indicate 83.14 % accuracy; which is 20.85 % greater than the existing STFT-based approach. To investigate practical suitability of VLD system using the proposed LFRCC feature set, analysis of latency period, feature-fusion, and comparison with pre-trained models is also presented in this study.
Published in: 2024 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
Date of Conference: 03-06 December 2024
Date Added to IEEE Xplore: 27 January 2025
ISBN Information: