Loading [a11y]/accessibility-menu.js
Unusable Spoken Response Detection with BLSTM Neural Networks | IEEE Conference Publication | IEEE Xplore

Unusable Spoken Response Detection with BLSTM Neural Networks


Abstract:

Voice biometrics has been applied to enhance the security of spoken language proficiency tests and ensure valid test scores by detecting fraudulent activity. These method...Show More

Abstract:

Voice biometrics has been applied to enhance the security of spoken language proficiency tests and ensure valid test scores by detecting fraudulent activity. These methods can, however, be triggered by certain distortions, including background noise and adjacent test-takers, resulting in false positive alarms. In this paper, a two-layer bi-directional LSTM RNN model is employed to detect these distorted (unusable) responses and a sub-sampling method is applied to reduce the difficulties of model training caused by very long input sequence and imbalanced training data. The system is evaluated on a corpus that was collected from an assessment of English language proficiency around the world. Results show that our approach significantly outperforms two baselines: a Gaussian mixture model (GMM) classifying frame-level features and an AdaBoost classifier operating on i-vectors. Our system's F-score in unusable response detection is 0.60 compared to 0.43 and 0.49 for the two baseline systems.
Date of Conference: 26-29 November 2018
Date Added to IEEE Xplore: 06 May 2019
ISBN Information:
Conference Location: Taipei, Taiwan

Contact IEEE to Subscribe

References

References is not available for this document.