Abstract
Collections of topics composed with the Darwin Information Typing Architecture (DITA) depend on annotations for improved search and retrieval. Through a set of metadata capabilities embedded in the DITA markup authors can define controlled values to identify and classify the subject matter of the content as well as to express the hierarchy and relationships between DITA elements and non-DITA resources. However, while these mechanisms typically provide both structural and semantic markup of DITA topics, it is difficult to manage, extend, and integrate a growing volume of DITA-based content and make them available for more intelligent Semantic Web services. Rather, the search and retrieval of DITA topics can benefit if combined with annotations captured in the Resource Description Framework (RDF). The paper addresses the issue of making the semantics of DITA XML documents explicit by using RDF for annotating existing documents. It reviews options for lifting DITA XML data into RDF for ease of processing. The paper shows that enriching DITA topics with semantic annotations helps to make DITA content more comprehensible and accessible, and improves the semantic interoperability among DITA topic instances. It concludes with general observations and an outlook on future work on exploiting the mapping and linking of DITA topics with RDF for improved sharing of data across collections as well as Linked Data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Baker, T., Keizer, J.: Linked Data for fighting global hunger: Experiences in setting standards for agricultural information management. In: Wood, D. (ed.) Linking Enterprise Data, pp. 177–201. Springer, Heidelberg (2010)
Breitling, F.: A standard transformation from XML to RDF via XSLT. Astron Nachr. 330, 755–760 (2009)
Closs, S.: Single Source Publishing. Modularer Content für EPUB & Co (Single source publishing. Modular content for EPUB & co). entwickler.press, Frankfurt/M (2011)
Decker, S., Meinik, S., Van Hermelen, F., Fensel, D., Klein, M., Broekstra, J., Erdmann, M., Horrocks, I.: Semantic Web: The Roles of XML and RDF. IEEE Internet Comput. Mag. 4, 63–74 (2000)
Dix, A., Lepouras, G., Katifori, A., Vassilakisc, C., Catarci, T., Poggie, A., Ioannidis, Y., Mora, M., Daradimos, I., Md. Akima, N., Humayoun, S.K., Terella, F.: From the web of data to a world of action. Web Semant Sci. Serv. Agents World Wide Web 8, 394–408 (2010)
Duval, E., Hodgins, W., Sutton, S., Weibel, S.L.: Metadata Principles and Practicalities. D-Lib Mag. 8, http://dlib.org/dlib/april02/weibel/04weibel.html (retrieved September 16, 2012)
Eberlein, K.J., Anderson, R.D., Joseph, G. (eds.): Darwin Information Typing Architecture (DITA) Version 1.2. Organization for the Advancement of Structured Information Standards (OASIS), Burlington, MA (December 2010), http://docs.oasis-open.org/dita/v1.2/spec/DITA1.2-spec.html (retrieved September 16, 2012)
Ferdinand, M., Zirpins, C., Trastour, D.: Lifting XML Schema to OWL. In: Koch, N., Fraternali, P., Wirsing, M. (eds.) ICWE 2004. LNCS, vol. 3140, pp. 354–358. Springer, Heidelberg (2004)
Gibbins, N., Shadbolt, N.: Resource Description Framework (RDF). In: Bates, M.J., Maack, M.N. (eds.) Encyclopedia of Library and Information Science, 3rd edn., vol. 5, pp. 4539–4547. CRC Press, Boca Raton (2010)
Greenberg, J.: Metadata and Digital Information. In: Bates, M.J., Maack, M.N. (eds.) Encyclopedia of Library and Information Science, 3rd edn., vol. 6, pp. 3610–3623. CRC Press, Boca Raton (2010)
IEEE Std 1484.12.1TM-2002:IEEE Standard for Learning Object Metadata. Institute of Electrical and Electronics Engineers (IEEE), New York, NY (June 2002)
ISO 15836:2009: Information and Documentation—The Dublin Core Metadata Element Set (2nd ed.). International Organization for Standardization (ISO), Geneva (February 15, 2009)
Kiryakov, A., Popov, B., Terziev, I., Manov, D., Ognyanoff, D.: Semantic annotation, indexing, and retrieval. Web Semant. Sci. Serv. Agents World Wide Web 2, 49–79 (2004)
Klein, M.: Using RDF Schema to interpret XML documents meaningfully. In: Handschuh, S., Staab, S. (eds.) Annotation for the Semantic Web, pp. 79–89. IOS Press, Amsterdam (2003)
Levine, D.B.: Cost-benefit analysis of a bridge to integrate the management of technical information for producing technical manuals and training courses. Institute for Defense Analyses, Alexandria (2010), http://oai.dtic.mil/oai/oai?verb=getRecord&metadataPrefix=html&identifier=ADA542429 (retrieved September 16, 2012 )
Morshed, A., Caracciolo, C., Johannsen, G., Keizer, J.: Thesaurus alignment for Linked Data publishing. In: Proceedings of the International Conference on Dublin Core and Metadata Applications (DC 2011), pp. 37–46. Dublin Core Metadata Initiative (DCMI), Dublin (2011), http://dcevents.dublincore.org/index.php/IntConf/dc-2011/paper/view/59 (retrieved September 16, 2012)
Meena, E., Kumar, A., Romary, L.: An extensible framework for efficient document management using RDF and OWL. In: Ide, N., Romary, L. (eds.) Workshop on NLP and XML (NLPXML-2004): RDF/RDFS and OWL in Language Technology (NLPXML 2004), pp. 51–58. Association for Computational Linguistic, Stroudsburg (2004), http://acl.ldc.upenn.edu/acl2004/nlpxml/pdf/meena-etal.pdf (retrieved September 16, 2012 )
Nešić, S., Jazayeri, M., Crestani, F., Gaševic: Concept-based semantic annotation, indexing and retrieval of office-like document units. Technical report, USI-INF-TR-2010-1, Faculty of Informatics, Università della Svizzeria italiana (2010), http://www.inf.usi.ch/research_publication.htm?id=56 (retrieved September 16, 2012 )
NISO: Understanding Metadata. National Information Standards Organization (NISO) Press, Bethesda, MD (2004), http://www.niso.org/ (retrieved September 16, 2012)
Patel-Schneider, P.F., Simeon, A.: The Yin/Yang Web: a unified model for XML syntax and RDF semantics. IEEE Trans. Knowl. Data Eng. 15, 797–812 (2003)
Powell, J.E., Collins, L.M., Martinez, M.L.B.: Semantically enhancing collections of library and non-library content. D-Lib Magazine 16 (2010), http://www.dlib.org/dlib/july10/powell/07powell.html (retrieved September 16, 2012)
Priestley, M., Hargis, G., Carpenter, S.: DITA: An XML-based Technical Documentation Authoring and Publishing Architecture. Tech. Comm. 48, 352–367 (2001)
Salminen, A., Tompa, F.: Communicating with XML. Springer, Heidelberg (2011)
Sánchez-Alonso, S., Sicilia, M.-Á.: Using an AGROVOC-based ontology for the description of learning resources on organic agriculture. In: Sicilia, M.-Á., Lytras, M.D. (eds.) Metadata and Semantics, pp. 481–492. Springer, Heidelberg (2009)
Schönberg, C., Freitag, B.: Evaluating RDF querying frameworks for document metadata. Technical report, MIP-0903, Fakultät für Informatik und Mathematik, Universität Passau (2003), http://www.fim.uni-passau.de/wissenschaftler/forschungsberichte/mip-0903.html (retrieved September 16, 2012 )
Schönberg, C., Weitl, F., Freitag, B.: Verifying the consistency of web-based technical documentation. J. Symbolic Comput. 46, 183–206 (2011)
Schreiber, G., Amin, A., Aroyo, L., van Assem, M., de Boer, V., Hardman, L., Hildebrand, M., Omelayenko, B., van Osenbruggen, J., Tordai, A., Wielemaker, J., Wielinga, B.: Semantic annotation and search of cultural-heritage collections: The MultimediaN E-Culture demonstrator. Web Semant. Sci. Serv. Agents World Wide Web 6, 243–249 (2008)
Shadboldt, N., Berners-Lee, T., Hall, W.: The Semantic Web revisited. IEEE Intell. Syst. 21, 96–101 (2006)
Shreve, G.M., Zeng, M.L.: Integrating resource metadata and domain markup in an NSDL collection. In: Proceedings of the International Conference on Dublin Core and Metadata Applications (DC 2003), pp. 223–229. Dublin Core Metadata Initiative (DCMI), Dublin (2003), http://dcpapers.dublincore.org/ojs/pubs/article/viewArticle/750 (retrieved September 16, 2012)
Sperberg-McQueen, C.M., Miller E.: On mapping from colloquial XML to RDF using XSLT. In: Extreme Markup Languages 2004 (2004), http://conferences.idealliance.org/extreme/html/2004/Sperberg-McQueen01/EML2004Sperberg-McQueen01.html (retrieved September 16, 2012)
Streich, R.: Techniques for managing collections of interrelated text modules. Markup Languages: Theory and Practice 1, 77–94 (1999)
Uren, V., Cimiano, P., Iria, J., Handschuh, S., Vargas-Vera, M., Motta, E., Ciravegna, F.: Semantic annotation for knowledge management: Requirements and a survey of the state of the art. Web Semant. Sci. Serv. Agents World Wide Web 4, 14–28 (2006)
Van Deursen, D., Poppe, C., Martens, G., Mannens, E., Walle, R.: XML to RDF conversion: A generic approach. In: Nesi, P., Ng, K., Delgado, J. (eds.) International Conference on Automated Solutions for Cross Media Content and Multi-Channel Distribution (AXMEDIS 2008), pp. 138–144. University of Florence, Florence (2008)
Wild, P.J., Giess, M.D., McMahon, C.A.: Describing engineering documents with faceted approaches. J. Doc. 66, 420–445 (2009)
Zeng, M.L.: Domain-specific markup languages and descriptive metadata: Their functions in scientific resource discovery. Revista Eletrônica de Biblioteconomia e Ciência da Informação 15, 164–176 (2010), http://www.periodicos.ufsc.br/index.php/eb/article/view/16890 (retrieved September 16, 2012)
Zeng, M.L., Chan, L.M.: Semantic Interoperability. In: Bates, M.J., Maack, N. (eds.) Encyclopedia of Library and Information Sciences, 3rd edn., vol. 6, pp. 4645–4662. CRC Press, Boca Raton (2010)
Zschocke, T.: Resolving controlled vocabulary in DITA markup: A case example in agroforestry. Program. 46, 321–340 (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zschocke, T., Closs, S. (2012). Preliminary Exploration of Using RDF for Annotating DITA Topics on Agroforestry. In: Dodero, J.M., Palomo-Duarte, M., Karampiperis, P. (eds) Metadata and Semantics Research. MTSR 2012. Communications in Computer and Information Science, vol 343. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35233-1_27
Download citation
DOI: https://doi.org/10.1007/978-3-642-35233-1_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35232-4
Online ISBN: 978-3-642-35233-1
eBook Packages: Computer ScienceComputer Science (R0)