FiM’s DE - the communication package for the creative pipeline

Castro, H.; Andrade, M. T.; Viana, P.

doi:10.1007/s11042-020-10282-0

FiM’s DE - the communication package for the creative pipeline

Published: 16 February 2021

Volume 80, pages 18151–18180, (2021)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

137 Accesses
Explore all metrics

Abstract

The FotoInMotion (FiM) project is building a novel media creation platform, leveraging the use of semi-automated analysis and editing tools to empower creators to easily transform static visual acquisitions of real-world events into rich, animated and engaging objects, distributable through common channels. FiM transforms the content creative chain into an integrated pipeline across which media and metadata seamlessly flow and are exploited to produce more complex media objects. One of the addressed challenges consists the need for a seamless and efficient communication across such pipeline and on how to preserve, in a structured manner, all of the involved media and metadata. Existing standardized metadata tools and content wrappers are limited in expressivity and scope and incapable of fully supporting the needs of the content creative pipeline. This paper describes FiM’s new structured data object, i.e. the Digital Event (DE), which acts as a universal vehicle for media and metadata. It builds on well-established and emergent MPEG standards (MPEG-21, MPEG-V, MPEG-7 and MPEG HEIF), to support data diversity, interoperability, packaging and sharing, within complex, Machine Learning enhanced, creative pipelines. Our solution has been validated by creative professionals (photojournalism, fashion marketing and festivals), who have conducted experiments within the context of different creative workflows in real world scenarios. DE’s employment revealed to be advantageous, particularly in the homogenization of the media and metadata representation and packaging and in the normalization of the interaction between different pipeline components.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Picturemarks: Changes in Mining Media and Digital Storytelling

Creative Data Ontology: ‘Russian Doll’ Metadata Versioning in Film and TV Post-Production Workflows

Latent Spaces: A Creative Approach

Data availability

Not applicable.

Notes

The metadata content for each DE mentioned in this use case scenario, along with their validating schemas is available at [46]

References

Adobe (n.d.) XMP Specification, https://www.adobe.com/products/xmp.html. Accessed 02 Feb 2021
Advanced Authoring Format page at the Advanced Media Workflow Association Official Website (n.d.) https://www.amwa.tv/aaf. Accessed 02 Feb 2021
Arndt R, Troncy R, Staab S, Hardman L (2009) COMM: a core ontology for multimedia annotation. In: Staab S, Studer R (eds) Handbook on ontologies. Springer, Heidelberg, pp 403–421. https://doi.org/10.1007/978-3-540-92673-3_18
Chapter Google Scholar
Athanasiadis T, Tzouvaras V, Petridis K, Precioso F, Avrithis Y, Kompatsiaris Y (2005) Using a multimedia ontology infrastructure for semantic annotation of multimedia content, 5th International Workshop on knowledge markup and Semantic Annotation, located at the 4rd International Semantic Web Conference ISWC 2005, Galway Ireland, November 7 2005
Baca M (2016)Introduction to metadata: Third Edition, Getty Publications
Baca M, Harpring P (2014) Categories for the Description of Works of Art: Describe and catalogue works of art, architecture, and cultural heritage (CDWA), http://www.getty.edu/research/publications/electronic_publications/cdwa/. Accessed 02 Feb 2021
Blöhdorn S, Petridis K, Saathoff C, Simou N, Tzouvaras V, Avrithis Y, Handschuh S, Kompatsiaris Y, Staab S and Strintzis MG (2005) Semantic annotation of images and videos for multimedia analysis. In: European semantic web conference, Heraklion, May–June 2005. Springer, Berlin, Heidelberg, pp 592–607
Burnett I, Davis S, Drury GM (2005) MPEG-21 digital item declaration and identification – principles and compression. IEEE Trans Multimed 7(3):400–407
Camera & Imaging Products Association (n.d.) Exchangeable image file format for digital still cameras v2.3, http://www.cipa.jp/std/documents/e/DC-008-2012_E.pdf. Accessed 02 Feb 2021
Castro H, Monteiro J, Pereira A, Silva D, Coelho G, Carvalho P (2016) Cognition inspired format for the expression of computer vision metadata. Multimed Tools Appl 75(24):17035–170572015. https://doi.org/10.1007/s11042-015-2974-x
Article Google Scholar
Everingham M, Van Gool L, Williams CK, Winn J, Zisserman A (2010) The pascal visual object classes (voc) challenge. Int J Comput Vis 88(2):303–338
Article Google Scholar
FotoInMotion Website (n.d.) https://fotoinmotion.eu/. Accessed 02 Feb 2021
Hare JS, Lewis PH, Enser PG, Sandom CJ (2006) Mind the gap: another look at the problem of the semantic gap in image retrieval, in multimedia content analysis, management, and retrieval 2006. Int Soc Optics Photon 6073:607309
HDF5 Group (n.d.) HDF5 Support Page, http://portal.hdfgroup.org/display/HDF5/HDF5. Accessed 02 Feb 2021
Hunter J (2001) Adding multimedia to the semantic web—building an MPEG-7 ontology, Proceedings of the First International Conference on Semantic Web Working, July 2001, pp 261–283
Hunter J, Lagoze C (2001) Combining RDF and XML schemas to enhance interoperability between metadata application profiles. World Wide Web 1:457–466
Google Scholar
IPTC ORG Site (n.d.) NewsML-G2 Guidelines, https://www.iptc.org/std/NewsML-G2/guidelines/#quick-start-guide-to-newsml-g2-basics. Accessed 02 Feb 2021
IPTC ORG Site (n.d.) PTC Photo Metadata Standard 2017.1, http://www.iptc.org/std/photometadata/specification/IPTC-PhotoMetadata. Accessed 02 Feb 2021
Isaac A, Troncy R (2004) Designing and using an audio-visual description core ontology, Workshop on Core Ontologies in Ontology Engineering, Northamptonshire UK, 8 October 2004, vol. 118
MPEG Group (2003) Information technology — Multimedia content description interface — Part 5: Multimedia description schemes, International Standard ISO/IEC 15938-5:2003, Geneva Switzerland
Lin TY, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL (2014) Microsoft coco: Common objects in context. In: European conference on computer vision, Zurich Switzerland, 6-12 September 2004. Springer, Cham, pp 740–755
List T, Fisher RB (2004) CVML – An XML-based Computer Vision Markup Language, In Proceedings of the 17th International Conference on Pattern Recognition (ICPR 2004), Cambridge UK, August 23-26 2004, IEEE Computer Society
Makadia A, Pavlovic V, Kumar S (2010) Baselines for Image Annotation. Int J Comput Vis 90(1):88–105
Article Google Scholar
Miller E (1998) An introduction to the resource description framework. Bull Am Soc Inf Sci Technol 25(1):15–19
Article Google Scholar
MPEG Group (2017) Information technology — High efficiency coding and media delivery in heterogeneous environments — Part 12: Image File Format, International Standard ISO/IEC 23008-12:2017, Geneva Switzerland
MPEG Group (n.d.) MPEG-21, https://mpeg.chiariglione.org/standards/mpeg-21. Accessed 02 Feb 2021
MPEG Group (n.d.) MPEG-7, https://mpeg.chiariglione.org/standards/mpeg-7. Accessed 02 Feb 2021
MPEG Group (2002) Information technology — Multimedia content description interface — Part 3: Visual, International Standard ISO/IEC 15938-3:2002, Geneva Switzerland
MPEG Group (n.d.) MPEG-V, https://mpeg.chiariglione.org/standards/mpeg-v. Accessed 02 Feb 2021
MPEG Group (n.d.) MPEG-V Sensory Information, https://mpeg.chiariglione.org/standards/mpeg-v/sensory-information. Accessed 02 Feb 2021
MPEG Group (n.d.) MPEG-V Data formats for interaction devices, https://mpeg.chiariglione.org/standards/mpeg-v/data-formats-interaction. Accessed 02 Feb 2021
MPEG Group (n.d.) MPEG-H, https://mpeg.chiariglione.org/standards/mpeg-h. Accessed 02 Feb 2021
MPEG Group (n.d.) MPEG-H Part 12 Image File Format, https://mpeg.chiariglione.org/standards/mpeg-h/image-file-format. Accessed 02 Feb 2021
MPEG Group (n.d.) MPEG-H Home Site, https://mpeg.chiariglione.org/standards/mpeg-h. Accessed 02 Feb 2021
MPEG Group (2003) Information technology — Multimedia framework (MPEG-21) — Part 3: Digital Item Identification, International Standard ISO/IEC 21000-3:2003, Geneva Switzerland
MPEG Group (2004) Information technology — Multimedia framework (MPEG-21) — Part 5: Rights Expression Language, International Standard ISO/IEC 21000-5:2004, Geneva Switzerland
MPEG Group (2004), Information technology — Multimedia framework (MPEG-21) — Part 7: Digital Item Adaptation, International Standard ISO/IEC 21000-7:2004, Geneva Switzerland
MPEG Group (2005) Information technology — Multimedia framework (MPEG-21) — Part 2: Digital Item Declaration, International Standard ISO/IEC 21000-2:2005, Geneva Switzerland
MPEG Group (2006) Information technology — Multimedia framework (MPEG-21) — Part 4: Intellectual Property Management and Protection Components, International Standard ISO/IEC 21000-4:2006, Geneva Switzerland
MPEG Group (2007) Information technology — Multimedia framework (MPEG-21) — Part 3: Digital Item Identification — Amendment 1: Related identifier types, International Standard Amendment ISO/IEC 21000-3:2003/Amd 1:2007, Geneva Switzerland
MPEG Group (2013) Information technology — Multimedia framework (MPEG-21) — Part 3: Digital Item Identification — Amendment 2: Digital item semantic relationships, International Standard Amendment ISO/IEC 21000-3:2003/Amd 2:2013, Geneva Switzerland
Oberle D, Ankolekar A, Hitzler P et al (2007) DOLCE ergo SUMO: on foundational and domain models in the SmartWeb integrated ontology (SWIntO). J Web Semant Sci Serv Agents World Wide Web 5(3):156–174
Article Google Scholar
Pereira F, Koenen R (2001) MPEG-7: a standard for multimedia content description. Int J Image Graph 1(3):527–547
Article Google Scholar
Project CAVIAR (n.d.) website, http://homepages.inf.ed.ac.uk/rbf/CAVIAR. Accessed 02 Feb 2021
Project ViPER (n.d.) website, http://viper-toolkit.sourceforge.net. Accessed 02 Feb 2021
Repository of FiM DE metadata schemas and samples (n.d.) https://drive.google.com/drive/folders/1QUxCzKk62Z9H-b6X8UXB9MXx9y0f_fE9?usp=sharing. Accessed 02 Feb 2021
SMPTE (2011) Material Exchange Format (MXF) – File Format Specification, SMPTE ST 377-1, 2011 Edition
Thomee B, Shamma DA, Friedland G, Elizalde B, Ni K, Poland D, Borth D, Li LJ (2015) The new data and new challenges in multimedia research, arXiv preprint arXiv:1503.01817, no. 8, March 2015
Tsinaraki C, Polydoros P, Moumoutzis N, Christodoulakis S (2004) Integration of OWL ontologies in MPEG-7 and TV-anytime compliant semantic indexing. 16th International conference on advanced information systems engineering, Riga, June 2004. Lecture notes in computer science, vol. 3084, pp. 398–413. Springer, Heidelberg. https://doi.org/10.1007/978-3-540-25975-6_29
Vellido A, Martín-Guerrero JD, Lisboa PJ (2012) Making machine learning models interpretable. ESANN 12:163–172
Google Scholar
Vembu S, Kiesel M, Sintek M and Baumann S (2006) Towards bridging the semantic gap in multimedia annotation and retrieval. In 1st International Workshop on Semantic Web Annotations for Multimedia (SWAMM), as part of the 15th World Wide Web Conference, Edinburgh Scotland, 22 May 2006
Viana P, Alves AP (2010) A semantic management model to enable the integrated management of media and devices. Multimed Tools Appl 49(1):37–62
Article Google Scholar
Weibel S, Kunze J, Lagoze C, Wolf M Dublin core metadata for resource discovery. Internet Eng Task Force RFC 2413(222):132

Download references

Funding

This work was developed with the financial support of the Fundação para a Ciência e Tecnologia (FCT), Portugal, within the scope of the post-Doctoral grant with the reference number SFRH/BPD/108329/2015.

It was also partially funded by the European Union’s Horizon 2020 research and innovation programme under the grant H2020-ICT-20-2017-1-RIA-780612.

Author information

Authors and Affiliations

INESC TEC, Campus da FEUP, Rua Dr. Roberto Frias, 4200 - 465, Porto, Portugal
H. Castro, M. T. Andrade & P. Viana
FEUP, Rua Dr. Roberto Frias, s/n, 4200-465, Porto, Portugal
M. T. Andrade
ISEP, R. Dr. António Bernardino de Almeida 431, 4249-015, Porto, Portugal
P. Viana

Authors

H. Castro
View author publications
You can also search for this author in PubMed Google Scholar
M. T. Andrade
View author publications
You can also search for this author in PubMed Google Scholar
P. Viana
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to H. Castro.

Ethics declarations

Conflict of interest

Not applicable.

Code availability

Not applicable.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Castro, H., Andrade, M.T. & Viana, P. FiM’s DE - the communication package for the creative pipeline. Multimed Tools Appl 80, 18151–18180 (2021). https://doi.org/10.1007/s11042-020-10282-0

Download citation

Received: 08 November 2019
Revised: 25 September 2020
Accepted: 09 December 2020
Published: 16 February 2021
Issue Date: May 2021
DOI: https://doi.org/10.1007/s11042-020-10282-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

FiM’s DE - the communication package for the creative pipeline

Abstract

Access this article

Similar content being viewed by others

Picturemarks: Changes in Mining Media and Digital Storytelling

Creative Data Ontology: ‘Russian Doll’ Metadata Versioning in Film and TV Post-Production Workflows

Latent Spaces: A Creative Approach

Data availability

Notes

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Code availability

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

FiM’s DE - the communication package for the creative pipeline

Abstract

Access this article

Similar content being viewed by others

Picturemarks: Changes in Mining Media and Digital Storytelling

Creative Data Ontology: ‘Russian Doll’ Metadata Versioning in Film and TV Post-Production Workflows

Latent Spaces: A Creative Approach

Data availability

Notes

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Code availability

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation