Skip to main content

Unification of XML DTD for XML Documents with Similar Structure

  • Conference paper
Computational Science and Its Applications – ICCSA 2005 (ICCSA 2005)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3482))

Included in the following conference series:

Abstract

There are many cases that XML documents have different DTDs in spite of having a similar structure and being logically the same kind of document. For this reason, a problem may occur in which these XML documents will have different database schema and are stored in different databases, and we have to access all database that is concerned to process the queries of users. Consequently, it decreases seriously the efficiency of retrieval. To solve this problem, we propose an algorithm that unifies DTDs of these XML documents using the finite automata and the tree structure. The finite automata are suitable for representing repetition operators and connectors of DTD, and are simple representation method for DTD. By using the finite automata, we are able to reduce the complexity of algorithm. And we apply a proposed algorithm to unify DTDs of science journals.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ahonen, H.: Generating Grammars for Structured Documents Using Grammatical Inference Methods. University of Helsinki. Ph. D Thesis (1996)

    Google Scholar 

  2. Chidlovskii, B.: Using Regular Automata as XML schemas. In: 4’th IEEE Advances in Digital Libraries Conference(ADL 2000), Washington, USA, pp. 1–10 (2000)

    Google Scholar 

  3. Jeong, E., Hsu, C.-N.: Veiw Inference for Heterogeneous XML Information Integration. Journal of Intelligent Information Systems 20(1), 81–99 (2003)

    Article  Google Scholar 

  4. Mello, R.d.S., Castano, S., Heuser, C.A.: A method for unification of XML schemata. Information and Software Technology 44(4), 241–249 (2002)

    Article  Google Scholar 

  5. OmniMark, OmniMark: Content Model Algebra, http://www.exoterica.com/white/cma/cma.htm

  6. Reynaud, C., Sirot, J.-P., Vodislav, D.: Semantic Integration of XML Heterogeneous Data Sources. In: International Database Engineering & Application Symposium (IDEAS 2001), Grenoble, France, pp. 199–208 (2001)

    Google Scholar 

  7. Rodriguez-Gianolli, P., Mylopoulos, J.: A Semantic Approach to XML-based Data Integration. In: Kunii, H.S., Jajodia, S., Sølvberg, A. (eds.) ER 2001. LNCS, vol. 2224, pp. 117–132. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  8. Yoo, C.-S., Woo, S.-M., Kim, Y.-S.: Automatic Generation Algorithm of Uniform DTD for Structured Documents. In: Proc. of IEEE Region 10 Conf. TENCON 1999, vol. II, pp. 1095–1098 (1999)

    Google Scholar 

  9. XML 1.0(Third Edition), W3C Recommendation (2004), http://www.w3.org/TR/2004/REC-xml-20040204

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Yoo, CS., Woo, SM., Kim, YS. (2005). Unification of XML DTD for XML Documents with Similar Structure. In: Gervasi, O., et al. Computational Science and Its Applications – ICCSA 2005. ICCSA 2005. Lecture Notes in Computer Science, vol 3482. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11424857_103

Download citation

  • DOI: https://doi.org/10.1007/11424857_103

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-25862-9

  • Online ISBN: 978-3-540-32045-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics