Abstract
XML documents often have different DTDs even though they have a similar structure and are logically the same kind of document. As a result, such documents may be given different database schemas and stored in different databases, so every relevant database must be accessed to process a user's query. This seriously reduces retrieval efficiency. To solve this problem, we propose an algorithm that unifies the DTDs of such XML documents using finite automata and a tree structure. Finite automata are well suited to representing the repetition operators and connectors of a DTD, and they provide a simple representation of a DTD. Using finite automata reduces the complexity of the algorithm. We apply the proposed algorithm to unify the DTDs of science journals.
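The abstract does not give the algorithm itself, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a simplified content model written without parentheses (a comma-separated sequence of element names, each with an optional `?`, `*` or `+`), assumes each element name occurs at most once per model, and uses a greedy alignment chosen purely for illustration. The function names (`parse`, `to_automaton`, `accepts`, `unify`, `render`) and the two journal-like content models are hypothetical.

```python
import re

# Occurrence indicator -> (minimum count, repeatable?)
OPS = {"": (1, False), "?": (0, False), "+": (1, True), "*": (0, True)}
SYM = {v: k for k, v in OPS.items()}


def parse(model):
    """Parse 'title, author+, abstract?' into [(name, (min, repeatable)), ...]."""
    items = []
    for token in model.split(","):
        name, op = re.fullmatch(r"\s*([\w.-]+)([?*+]?)\s*", token).groups()
        items.append((name, OPS[op]))
    return items


def to_automaton(items):
    """Build a chain automaton: state i sits just before item i."""
    delta = set()  # (source, label, target); label None is an epsilon move
    for i, (name, (minimum, repeatable)) in enumerate(items):
        delta.add((i, name, i + 1))
        if minimum == 0:
            delta.add((i, None, i + 1))      # the item may be skipped
        if repeatable:
            delta.add((i + 1, name, i + 1))  # the item may repeat
    return delta, len(items)  # transitions and the single accepting state


def accepts(automaton, children):
    """Simulate the automaton on a list of child element names."""
    delta, final = automaton

    def closure(states):
        stack, seen = list(states), set(states)
        while stack:
            s = stack.pop()
            for src, label, dst in delta:
                if src == s and label is None and dst not in seen:
                    seen.add(dst)
                    stack.append(dst)
        return seen

    current = closure({0})
    for child in children:
        current = closure({d for s, l, d in delta if s in current and l == child})
    return final in current


def unify(a, b):
    """Greedy merge of two item lists; items found in only one become optional."""
    out, i, j = [], 0, 0
    while i < len(a) and j < len(b):
        (na, (ma, ra)), (nb, (mb, rb)) = a[i], b[j]
        if na == nb:
            out.append((na, (min(ma, mb), ra or rb)))  # most permissive indicator
            i, j = i + 1, j + 1
        elif any(n == na for n, _ in b[j:]):
            out.append((nb, (0, rb)))  # b-only item before the next shared name
            j += 1
        else:
            out.append((na, (0, ra)))  # a-only item
            i += 1
    out += [(n, (0, r)) for n, (_, r) in a[i:] + b[j:]]
    return out


def render(items):
    """Turn an item list back into a content-model string."""
    return ", ".join(name + SYM[op] for name, op in items)


journal_a = parse("title, author+, abstract, section+")
journal_b = parse("title, author, keyword*, section*")
unified = unify(journal_a, journal_b)
print(render(unified))
```

In this sketch a child sequence that is valid under either input model is also accepted by the automaton built from the merged model, which can be checked with `accepts(to_automaton(unified), [...])`. The merged model is more permissive than both inputs, so it may also accept sequences that neither original DTD allows.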
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yoo, CS., Woo, SM., Kim, YS. (2005). Unification of XML DTD for XML Documents with Similar Structure. In: Gervasi, O., et al. Computational Science and Its Applications – ICCSA 2005. ICCSA 2005. Lecture Notes in Computer Science, vol 3482. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11424857_103
DOI: https://doi.org/10.1007/11424857_103
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25862-9
Online ISBN: 978-3-540-32045-6
eBook Packages: Computer Science (R0)