Abstract
Metadata on multimedia documents can help describe their content and ease their processing, for example by identifying events in temporal media and by carrying descriptive information about the resource as a whole. Metadata is essentially static and may be associated with, or embedded in, the multimedia content. This paper presents a proposal for multimedia document annotation based on modeling and unifying features elicited from content and structure mining. Our approach relies on the availability of annotated metadata representing segment content and structure, as well as segment transcripts. Temporal and spatial operators are also taken into account when annotating documents. Each feature is recorded in a descriptor called a “meta-document”. These meta-documents form the basis for querying with adapted query languages.
Author information
Additional information
Ikram Amous received her Bachelor's degree in business data processing from Sfax University, Tunisia, in 1998. She received her Master's degree in data processing and telecommunication from Paul Sabatier University (Toulouse III), France, in June 1999, and her doctorate in Informatics from the same university in December 2003. She is currently a member of the LARIM laboratory at Sfax University, Tunisia. Her research interests include semi-structured document modeling, multimedia document personalization, and multimedia document annotation and querying.
Anis Jedidi received his Bachelor's degree in business data processing from Sfax University, Tunisia, in 1999. He received his Master's degree in data processing and telecommunication from Paul Sabatier University (Toulouse III), France, in 2000. He is currently a Ph.D. candidate in the Generalized Information Systems team at the IRIT laboratory, Paul Sabatier University. His research interests include semi-structured document modeling and multimedia document annotation and querying.
Florence M. Sèdes is a professor of computing in the Department of Computing at Paul Sabatier University (Toulouse III). She received a Ph.D. in Computer Science from Paul Sabatier University. She has more than ten years of research experience in databases and information systems. Her research interests include multimedia documents, indexing and retrieval, semi-structured data, and flexible querying.
About this article
Cite this article
Amous, I., Jedidi, A. & Sèdes, F. A Contribution to Multimedia Document Modeling and Querying. Multimed Tools Appl 25, 391–404 (2005). https://doi.org/10.1007/s11042-005-6542-7
DOI: https://doi.org/10.1007/s11042-005-6542-7