Chunking Defense for Adversarial Attacks on ASR

Shao, Yiwen; Villalba, Jesus; Joshi, Sonal; Kataria, Saurabh; Khudanpur, Sanjeev; Dehak, Najim

doi:10.21437/Interspeech.2022-11096

While deep learning has led to dramatic improvements in automatic speech recognition (ASR) systems in the past few years, it has also made them vulnerable to adversarial attacks. These attacks may be designed to make ASR either fail to produce the correct transcription or, worse, output an adversary-chosen sentence. In this work, we propose a defense based on independently processing random or fixed-size chunks of the speech input in the hope of "containing" the cumulative effect of the adversarial perturbations. This approach does not require any additional training of the ASR system, or any defensive preprocessing of the input. It can be easily applied to any ASR system with little loss in performance under benign conditions, while improving adversarial robustness. We perform experiments on the Librispeech data set with different adversarial attack budgets, and show that the proposed defense achieves consistent improvement on two different ASR systems/models.
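
The chunking idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the names `split_into_chunks` and `chunked_transcribe`, the `transcribe` callable standing in for an ASR system, the chunk sizes measured in samples, and the choice to join per-chunk outputs with spaces are all assumptions made for illustration. The abstract does not state the chunk lengths or how per-chunk transcripts are merged.

```python
import random
from typing import Callable, List, Optional, Sequence


def split_into_chunks(
    samples: Sequence[float],
    min_len: int,
    max_len: Optional[int] = None,
    rng: Optional[random.Random] = None,
) -> List[Sequence[float]]:
    """Split a waveform into consecutive, non-overlapping chunks.

    With max_len omitted (or equal to min_len) every chunk has a fixed size;
    otherwise each chunk size is drawn uniformly from [min_len, max_len].
    The final chunk may be shorter than min_len.
    """
    if min_len <= 0:
        raise ValueError("min_len must be positive")
    if max_len is None:
        max_len = min_len
    if max_len < min_len:
        raise ValueError("max_len must be >= min_len")
    rng = rng or random.Random()

    chunks = []
    start = 0
    while start < len(samples):
        size = min_len if max_len == min_len else rng.randint(min_len, max_len)
        chunks.append(samples[start:start + size])
        start += size
    return chunks


def chunked_transcribe(
    samples: Sequence[float],
    transcribe: Callable[[Sequence[float]], str],
    min_len: int,
    max_len: Optional[int] = None,
    rng: Optional[random.Random] = None,
) -> str:
    """Transcribe each chunk independently and join the partial outputs.

    `transcribe` is a hypothetical stand-in for any ASR system; it sees one
    chunk at a time and no context from neighbouring chunks.
    """
    pieces = [transcribe(c) for c in split_into_chunks(samples, min_len, max_len, rng)]
    # Assumption: partial transcripts are simply concatenated with spaces.
    return " ".join(p.strip() for p in pieces if p.strip())
```

Because each chunk is decoded without context from its neighbours, a perturbation in one chunk cannot directly influence how the others are decoded, which is the "containing" effect the abstract refers to. Passing a seeded `random.Random` makes the random-size variant reproducible.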

Cite as: Shao, Y., Villalba, J., Joshi, S., Kataria, S., Khudanpur, S., Dehak, N. (2022) Chunking Defense for Adversarial Attacks on ASR. Proc. Interspeech 2022, 5045-5049, doi: 10.21437/Interspeech.2022-11096

@inproceedings{shao22c_interspeech,
  author={Yiwen Shao and Jesus Villalba and Sonal Joshi and Saurabh Kataria and Sanjeev Khudanpur and Najim Dehak},
  title={{Chunking Defense for Adversarial Attacks on ASR}},
  year=2022,
  booktitle={Proc. Interspeech 2022},
  pages={5045--5049},
  doi={10.21437/Interspeech.2022-11096}
}