ISCA Archive Interspeech 2016
ISCA Archive Interspeech 2016

Real-Time Presentation Tracking Using Semantic Keyword Spotting

Reza Asadi, Harriet J. Fell, Timothy Bickmore, Ha Trinh

Given presentation slides with detailed written speaking notes, automatic tracking of oral presentations can help speakers ensure they cover their planned content, and can reduce their anxiety during the speech. Tracking is a more complex problem than speech-to-text alignment, since presenters rarely follow their exact presentation notes, and it must be performed in real-time. In this paper, we propose a novel system that can track the current degree of coverage of each slide’s contents. To do this, the presentation notes for each slide are segmented into sentences, and the words are filtered into keyword candidates. These candidates are then scored based on word specificity and semantic similarity measures to find the most useful keywords for the tracking task. Real-time automatic speech recognition results are matched against the keywords and their synonyms. Sentences are scored based on detected keywords, and the ones with scores higher than a threshold are tagged as covered. We manually and automatically annotated 150 slide presentation recordings to evaluate the system. A simple tracking method, matching speech recognition results against the notes, was used as the baseline. The results show that our approach led to higher accuracy measures compared to the baseline method.


doi: 10.21437/Interspeech.2016-617

Cite as: Asadi, R., Fell, H.J., Bickmore, T., Trinh, H. (2016) Real-Time Presentation Tracking Using Semantic Keyword Spotting. Proc. Interspeech 2016, 3081-3085, doi: 10.21437/Interspeech.2016-617

@inproceedings{asadi16_interspeech,
  author={Reza Asadi and Harriet J. Fell and Timothy Bickmore and Ha Trinh},
  title={{Real-Time Presentation Tracking Using Semantic Keyword Spotting}},
  year=2016,
  booktitle={Proc. Interspeech 2016},
  pages={3081--3085},
  doi={10.21437/Interspeech.2016-617}
}