To read this content please select one of the options below:

Fine‐granularity semantic video annotation: An approach based on automatic shot level concept detection and object recognition

Vanessa El‐Khoury (Chair of Distributed Information Systems, University of Passau, Passau, Germany)

Martin Jergler (Chair of Distributed Information Systems, University of Passau, Passau, Germany)

Getnet Abebe Bayou (Chair of Distributed Information Systems, University of Passau, Passau, Germany)

David Coquil (Chair of Distributed Information Systems, University of Passau, Passau, Germany)

Harald Kosch (Chair of Distributed Information Systems, University of Passau, Passau, Germany)

International Journal of Pervasive Computing and Communications

ISSN: 1742-7371

Article publication date: 30 August 2013

Downloads

185

Abstract

Purpose

–

A fine‐grained video content indexing, retrieval, and adaptation requires accurate metadata describing the video structure and semantics to the lowest granularity, i.e. to the object level. The authors address these requirements by proposing semantic video content annotation tool (SVCAT) for structural and high‐level semantic video annotation. SVCAT is a semi‐automatic MPEG‐7 standard compliant annotation tool, which produces metadata according to a new object‐based video content model introduced in this work. Videos are temporally segmented into shots and shots level concepts are detected automatically using ImageNet as background knowledge. These concepts are used as a guide to easily locate and select objects of interest which are then tracked automatically to generate an object level metadata. The integration of shot based concept detection with object localization and tracking drastically alleviates the task of an annotator. The paper aims to discuss these issues.

Design/methodology/approach

–

A systematic keyframes classification into ImageNet categories is used as the basis for automatic concept detection in temporal units. This is then followed by an object tracking algorithm to get exact spatial information about objects.

Findings

–

Experimental results showed that SVCAT is able to provide accurate object level video metadata.

Originality/value

–

The new contribution in this paper introduces an approach of using ImageNet to get shot level annotations automatically. This approach assists video annotators significantly by minimizing the effort required to locate salient objects in the video.

Keywords

Citation

El‐Khoury, V., Jergler, M., Abebe Bayou, G., Coquil, D. and Kosch, H. (2013), "Fine‐granularity semantic video annotation: An approach based on automatic shot level concept detection and object recognition", International Journal of Pervasive Computing and Communications, Vol. 9 No. 3, pp. 243-269. https://doi.org/10.1108/IJPCC-07-2013-0019

Publisher

:

Emerald Group Publishing Limited

To read this content please select one of the options below:

Please note you do not have access to teaching notes

Fine‐granularity semantic video annotation: An approach based on automatic shot level concept detection and object recognition

Abstract

Purpose

Design/methodology/approach

Findings

Originality/value

Keywords

Citation

Publisher

Related articles

Something didn’t work…

All feedback is valuable

Platform update page

Questions & More Information

To read this content please select one of the options below:

Please note you do not have access to teaching notes

Abstract

Purpose

Design/methodology/approach

Findings

Originality/value

Keywords

Citation

Publisher

Related articles

We’re listening — tell us what you think

Something didn’t work…

All feedback is valuable

Join us on our journey

Platform update page

Questions & More Information