Abstract:
In this work we adapt and evaluate different solutions for automatic speech recognition (ASR) to be used as an HMI for the assistant robot. Two on-device solutions: Kaldi...Show MoreMetadata
Abstract:
In this work we adapt and evaluate different solutions for automatic speech recognition (ASR) to be used as an HMI for the assistant robot. Two on-device solutions: Kaldi (DNN-HMM) and Mozilla's DeepSpeech (end-to-end), and three internet service APIs: IBM Watson, Microsoft Azure and Google Speech to Text are evaluated. The systems are adapted to the domain of robot commands and evaluated on a set of expected inputs. As the goal is to retain the ability to recognise general language, the systems are also evaluated on out of domain data.
Published in: 2020 16th International Conference on Control, Automation, Robotics and Vision (ICARCV)
Date of Conference: 13-15 December 2020
Date Added to IEEE Xplore: 08 January 2021
ISBN Information: