Abstract
The ever increasing need for extracting knowledge from heterogeneous data has become a major concern. This is particularly observed in many application domains where several actors, with different expertise, exchange a great amount of information at any stage of a large-scale project. In this paper, we propose LinkedMDR: a novel ontology for Linked Multimedia Document Representation that describes the knowledge of a heterogeneous document corpus in a semantic data network. LinkedMDR combines existing standards and introduces new components that handle the connections between these standards and augment their capabilities. It is generic and offers a pluggable layer that makes it adaptable to different domain-specific knowledge. Experiments conducted on construction projects show that LinkedMDR is applicable in real-world scenarios.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
Inter-document links are relations between documents. Intra-document links are relations between elements of the same document.
- 2.
- 3.
For the sake of simplicity, we only present 6 documents. However, other documents could be also involved such as videos, audios and 3D drawings.
- 4.
LinkedMDR is an OWL ontology created on Protégé. Details on the LinkedMDR ontology, the overall concepts and relations are available at http://spider.sigappfr.org/linkedmdr/.
- 5.
Available at http://ifcowl.openbimstandards.org/IFC4_ADD2.owl.
- 6.
HIT2GAP (Highly Innovative Building Control Tools) is a large-scale project that involves 21 partners and provides an energy management platform for managing building energy behavior. Further details are available at: http://www.hit2gap.eu/.
- 7.
- 8.
The number of XML tags in the XML annotation files that we generated based on the existing standards and the number of RDF triples that we generated in the LinkedMDR ontology.
- 9.
\(F_2\)-measure: (5 \(\times \) P \(\times \) R) /(4 \(\times \) P+ R)
Recall: No. of covered relevant criteria/Total No. of expected criteria.
Precision: No. of covered relevant criteria/Total No. of annotated criteria.
References
Arndt, R., Troncy, R., Staab, S., Hardman, L., Vacura, M.: COMM: designing a well-founded multimedia ontology for the web. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC/ISWC -2007. LNCS, vol. 4825, pp. 30–43. Springer, Heidelberg (2007). doi:10.1007/978-3-540-76298-0_3
Bloechle, J.-L., Rigamonti, M., Hadjar, K., Lalanne, D., Ingold, R.: XCDF: a canonical and structured document format. In: Bunke, H., Spitz, A.L. (eds.) DAS 2006. LNCS, vol. 3872, pp. 141–152. Springer, Heidelberg (2006). doi:10.1007/11669487_13
Brut, M., Laborie, S., Manzat, A.M., Sedes, F.: Integrating heterogeneous metadata into a distributed multimedia information system. In: COGnitive systems with Interactive Sensors (2009)
buildingSMART: IFC-Industry Foundation Classes, IFC4 Add2 Release (2016). http://www.buildingsmart-tech.org/specifications/ifc-releases/ifc4-add2-release
Charbel, N., Tekli, J., Chbeir, R., Tekli, G.: Resolving XML semantic ambiguity. In: EDBT, pp. 277–288 (2015)
Dublin Core Metadata Initiative: DCMI Metadata Terms (2012). http://dublincore.org/documents/dcmi-terms/
EXIF: Exchangeable Image File Format for digital still cameras (2002). http://www.exif.org/Exif2-2.PDF
Garcia, R., Celma, O.: Semantic integration and retrieval of multimedia metadata. In: 5th International Workshop on Knowledge Markup and Semantic Annotation, pp. 69–80 (2005)
Guo, K., Liang, Z., Tang, Y., Chi, T.: SOR: an optimized semantic ontology retrieval algorithm for heterogeneous multimedia big data. J. Comput. Sci. (2017)
Hunter, J.: An overview of the MPEG-7 description definition language (DDL). IEEE Trans. Circuits Syst. Video Technol. 11(6), 765–772 (2001)
Huovila, P.: Linking IFCs and BIM to sustainability assessment of buildings. In: Proceedings of the CIB W78 2012: 29th International Conference (2012)
ITEA: LINDO-Large scale distributed INDexation of multimedia Objects (2010). https://itea3.org/project/lindo.html
Klinger, M., Susong, M.: Chapter, phases of the contruction project. In: The Construction Project: Phases, People, Terms, Paperwork, Processes. American Bar Association (2006)
OpenCV: Open Source Computer Vision Library (2011). http://opencv.org
Pankowski, T., Brzykcy, G.: Data access based on faceted queries over ontologies. In: Hartmann, S., Ma, H. (eds.) DEXA 2016 Part II. LNCS, vol. 9828, pp. 275–286. Springer, Cham (2016). doi:10.1007/978-3-319-44406-2_21
Saathoff, C., Scherp, A.: Unlocking the semantics of multimedia presentations in the web with the multimedia metadata ontology. In: Proceedings of the 19th International Conference on World Wide Web, pp. 831–840. ACM (2010)
Salembier, P., Smith, J.R.: MPEG-7 multimedia description schemes. IEEE Trans. Circuits Syst. Video Technol. 11(6), 748–759 (2001)
Scherp, A., Eissing, D., Saathoff, C.: A method for integrating multimedia metadata standards and metadata formats with the multimedia metadata ontology. Int. J. Semant. Comput. 6(01), 25–49 (2012)
Suarez-Figueroa, M.C., Atemezing, G.A., Corcho, O.: The landscape of multimedia ontologies in the last decade. Multimed. Tools Appl. 62(2), 377–399 (2013)
Tekli, J., Charbel, N., Chbeir, R.: Building semantic trees from XML documents. Web Semant.: Sci. Serv. Agents World Wide Web 37, 1–24 (2016)
The Moving Picture Experts Group: MPEG7-Multimedia Content Description Interface (2001). http://mpeg.chiariglione.org/standards/mpeg-7
The Text Encoding Initiative Consortium: TEI-Text Encoding Initiative (1994). http://www.tei-c.org/release/doc/tei-p5-doc/en/Guidelines.pdf
W3C: Resource Description Framework (2004). https://www.w3.org/RDF/
W3C: Ontology for Media Resources 1.0 (2012). http://www.w3.org/TR/mediaont-10/
Weibel, S., Kunze, J., Lagoze, C., Wolf, M.: Dublin Core metadata for resource discovery. Technical report 2070-1721 (1998)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Charbel, N., Sallaberry, C., Laborie, S., Tekli, G., Chbeir, R. (2017). LinkedMDR: A Collective Knowledge Representation of a Heterogeneous Document Corpus. In: Benslimane, D., Damiani, E., Grosky, W., Hameurlain, A., Sheth, A., Wagner, R. (eds) Database and Expert Systems Applications. DEXA 2017. Lecture Notes in Computer Science(), vol 10438. Springer, Cham. https://doi.org/10.1007/978-3-319-64468-4_28
Download citation
DOI: https://doi.org/10.1007/978-3-319-64468-4_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-64467-7
Online ISBN: 978-3-319-64468-4
eBook Packages: Computer ScienceComputer Science (R0)