Abstract.
The study presented in this paper analyses the visual MPEG-7 descriptors from a statistical point of view. A statistical analysis is able to reveal the properties and qualities of the used descriptors: redundancies, sensitivity to media content, etc. These aspects were not considered in the MPEG-7 design process where the major goal was optimising the retrieval rate. For the statistical analysis eight basic visual descriptors were applied to three media collections: the Brodatz dataset, a selection of the Corel photo dataset and a set of coats-of-arms images. The resulting feature vectors were analysed with four statistical methods: mean and variance of description elements, distribution of elements, cluster analysis (hierarchical and topological) and factor analysis. The analysis revealed that, for example, most MPEG-7 descriptions are highly redundant and sensitive to the presence of colour shades.
Article PDF
Similar content being viewed by others
References
Bober M (2001) MPEG-7 visual shape descriptors. IEEE Trans Circuits Sys Video Technol 11/6:716-719
Breiteneder C, Eidenberger H (1999) Content-based image retrieval of coats of arms. In: Proc IEEE international workshop on multimedia signal processing, Helsingör, Denmark, pp 91-96
Chang SF, Sikora T, Puri A (2001) Overview of the MPEG-7 standard. IEEE Trans Circuits Sys Video Technol 11/6:688-695
Del Bimbo A (1999) Visual information retrieval. Morgan Kaufmann, San Francisco
Eidenberger H (2003) How good are the visual MPEG-7 features? In: Proc SPIE conference on visual communications and image processing, Lugano SPIE vol 5150, pp 476-488. http://www.ims.tuwien.ac.at/\(\sim \)hme/papers/vcip2003-mpeg7.pdf, last visited 2004-04-05)
Eidenberger H, Breiteneder C (2003) VizIR - a framework for visual information retrieval. J Vis Lang Comput 14:443-469
Eidenberger H, Breiteneder C, Hitz M (2002) A framework for visual information retrieval. In: Proc conference on visual information systems, HsinChu, Taiwan. Lecture notes in computer science, vol 2314. Springer, Berlin Heidelberg New York, pp 105-116
Fuhr N (2001) Information retrieval methods for multimedia objects. In: Veltkamp RC, Burkhardt H, Kriegel HP (eds) State-of-the-art in content-based image and video retrieval. Kluwer, Boston, pp 191-212
Guo J, Kuo JCC (2001) Semantic video object segmentation for content-based multimedia applications. Kluwer, Boston
Jain AK, Murty MN, Flynn PJ (1999) Data clustering: a review. ACM Comput Surv 31/3:264-323
Kohonen T (1990) The Self-Organizing Map. Proc IEEE 78/9:1464-1480
Kohonen T, Hynninen J, Kangas J, Laaksonen J (1995) SOM-PAK: The Self-Organizing Map program package. Technical report, Helsinki University of Technology
Kohonen T, Oja E, Simula O, Visa A (1996) Engineering applications of the Self-Organizing Map. Proc IEEE 84/10:1358-1384
Loehlin JC (1998) Latent variable models: An introduction to factor, path, and structural analysis, 3rd edn. Erlbaum, Mahwah, NJ
Manjunath BS, Ohm JR, Vasudevan VV, Yamada A (2001) Color and texture descriptors. IEEE Trans Circuits Sys Video Technol 11/6:703-715
Manjunath BS, Salembier P, Sikora T (2002) Introduction to MPEG-7. Wiley, San Francisco
MPEG-7 experimentation model website. http://www.lis.e-technik.tu-muenchen.de/research/bv/topics/ mmdb/e\_mpeg7.html (hosted by TU Munich, last visited 2004-04-05)
Smeulders AWM, Worring M, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early years. IEEE Trans Pattern Anal Mach Intell 22/12:1349-1380
Author information
Authors and Affiliations
Corresponding author
Additional information
Published online: 6 October 2004
Rights and permissions
About this article
Cite this article
Eidenberger, H. Statistical analysis of content-based MPEG-7 descriptors for image retrieval. Multimedia Systems 10, 84–97 (2004). https://doi.org/10.1007/s00530-004-0141-8
Issue Date:
DOI: https://doi.org/10.1007/s00530-004-0141-8