Abstract:
In the paper we present a complex platform for automatic processing of Czech TV news programmes. Its audio processing module provides text transcription in form of metada...Show MoreMetadata
Abstract:
In the paper we present a complex platform for automatic processing of Czech TV news programmes. Its audio processing module provides text transcription in form of metadata that contain information about spoken content, speaker identities, used pronunciation, word positions and intonation. The video processing module provides pictures representing individual video scenes and information about detected and possibly recognized human faces. The audio and video data are merged into single XML files that are indexed and stored in a searchable database. A simple Web-based search engine can be used to retrieve information from the database that recently contain more than 1800 hours of transcribed programmes from Czech CT24 station.
Published in: 2008 IEEE 10th Workshop on Multimedia Signal Processing
Date of Conference: 08-10 October 2008
Date Added to IEEE Xplore: 05 November 2008
ISBN Information: