Editorial for the special issue on spoken content retrieval

doi:10.1016/j.csl.2014.05.002

Computer Speech & Language

Volume 28, Issue 5, September 2014, Pages 1019-1020

https://doi.org/10.1016/j.csl.2014.05.002 Get rights and content

Abstract

A typical spoken content retrieval solution integrates multiple technologies that belong to the areas of automatic speech recognition and information retrieval. Due to the rich set of challenges – many of them language specific – as well as widespread impact, numerous research sites in the world are actively engaged in this research area. This special issue highlights some of the recent advances in spoken content retrieval.

Introduction

Increasing computing power, storage space and connection bandwidth make large amounts of multimedia data widely available. In addition to text and metadata-based search, content-based retrieval is emerging as a means to provide easy access to multimedia, especially for data containing speech. However, so far retrieval of spoken content has been restricted mostly to the accompanying text meta-data. As research in speech recognition has matured, there has been a shift from development of core technology to applications such as retrieval of spoken content. Unlocking the spoken content holds great promise for improving such applications. This special issue brings together some of the current approaches to spoken content retrieval with the goal of offering a glimpse into the future of the area.

Over the last decade, significant advances have resulted from a combination of advances in automatic speech recognition (ASR), and information retrieval (IR). Innovative methods for tighter coupling of the component technologies together with novel work in query-by-example have been crucial for robustness to errors in speech retrieval. Moving towards open vocabulary search is necessitated by the increased presence of many heterogenous sources. This requires detection and recovery of out-of-vocabulary (OOV) terms as a first step in the discovery of relations to known terms and topics in information extraction. OOV query search is a critical component of the keyword search systems used in DARPA RATS and IARPA Babel programs, as well as the NIST OpenKWS evaluations. Summary of the previous work in the field can be found in Chelba et al., 2008, Chelba et al., 2011.

Section snippets

Summary of the articles in this special issue

This special issue consists of four articles that explore various aspects of spoken content retrieval.

In Eskevich and Jones (2014), the authors explore retrieval of informal speech from meetings assuming a recall-focused search scenario where a user is potentially looking for all relevant topical segments. Search of meetings is an interesting task for speech retrieval since it incorporates many challenges: (i) the content may be spoken in a wide range of often informal spontaneous styles (ii)

References (6)

M. Eskevich et al.
Exploring speech retrieval from meetings using the AMI corpus
Comput. Speech Lang.
(2014)
H.-Y. Lee et al.
Improved open-vocabulary spoken content retrieval with word and subword lattices using acoustic feature similarity
Comput. Speech Lang.
(2014)
F. Metze et al.
Language independent search in MediaEval's spoken web search task
Comput. Speech Lang.
(2014)

There are more references available in the full text version of this article.

Cited by (0)

View full text

EditorialEditorial for the special issue on spoken content retrieval

Abstract

Introduction

Section snippets

Summary of the articles in this special issue

Comput. Speech Lang.

Comput. Speech Lang.

Comput. Speech Lang.

Editorial
Editorial for the special issue on spoken content retrieval