Designing Functional Dependencies for XML

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2287)

Abstract

Functional dependencies are an integral part of database theory, and they form the basis for normalizing relational tables up to BCNF. With the increasing relevance of the data-centric aspects of XML, it is pertinent to study functional dependencies in the context of XML, which will form the basis for further studies into XML keys and normalization. In this work, we investigate the design of functional dependencies in XML databases. We propose FD_XML, a notation and DTD for representing functional dependencies in XML. We observe that many databases are hierarchical in nature and that the corresponding nested XML data may inevitably contain redundancy. We develop a model based on FD_XML to estimate the amount of data replication in XML data. We show how functional dependencies in XML can be verified with a single pass through the XML data, and present supporting experimental results. A platform-independent framework is also drawn up to demonstrate how the techniques proposed in this work can enrich the semantics of XML.

This work was done while the author was on a research scholarship from the National University of Singapore.

For this paper, XML data refers to data represented in XML. It is not to be confused with the W3C Note XML-Data.
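
The abstract's claim that a functional dependency can be checked in one pass over the data can be illustrated with a minimal sketch. This is not the paper's algorithm and does not use the FD_XML notation, neither of which is reproduced on this page. It assumes a flat, hypothetical layout in which each `supply` record carries `part`, `supplier`, and `price` child elements, and it checks the hypothetical dependency {part, supplier} → price by streaming the document and remembering the first right-hand-side value seen for each left-hand-side combination. The function name `check_fd` and all element names are illustrative assumptions.

```python
import io
import xml.etree.ElementTree as ET


def check_fd(xml_source, record_tag, lhs_tags, rhs_tag):
    """Stream the document once and collect violations of lhs_tags -> rhs_tag.

    Each violation is a tuple (lhs_values, first_rhs_seen, conflicting_rhs).
    The record layout (direct children named by tag) is an assumption made
    for this sketch, not something taken from the paper.
    """
    seen = {}        # LHS value tuple -> first RHS value observed
    violations = []
    for _event, elem in ET.iterparse(xml_source, events=("end",)):
        if elem.tag != record_tag:
            continue
        lhs = tuple(elem.findtext(tag) for tag in lhs_tags)
        rhs = elem.findtext(rhs_tag)
        if lhs in seen and seen[lhs] != rhs:
            violations.append((lhs, seen[lhs], rhs))
        else:
            seen.setdefault(lhs, rhs)
        elem.clear()  # drop the record's children once it has been checked
    return violations


# Hypothetical sample data: the third record contradicts the first two.
SAMPLE = b"""<db>
  <supply><part>p1</part><supplier>s1</supplier><price>10</price></supply>
  <supply><part>p1</part><supplier>s1</supplier><price>10</price></supply>
  <supply><part>p1</part><supplier>s1</supplier><price>12</price></supply>
  <supply><part>p2</part><supplier>s1</supplier><price>7</price></supply>
</db>"""

violations = check_fd(io.BytesIO(SAMPLE), "supply", ["part", "supplier"], "price")
print(violations)  # [(('p1', 's1'), '10', '12')]
```

The dictionary grows with the number of distinct left-hand-side combinations, so the single pass trades memory for avoiding a second scan; a real implementation over nested data would also need to resolve element paths rather than direct child tags.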

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lee, M.L., Ling, T.W., Low, W.L. (2002). Designing Functional Dependencies for XML. In: Jensen, C.S., et al. Advances in Database Technology — EDBT 2002. EDBT 2002. Lecture Notes in Computer Science, vol 2287. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45876-X_10

  • DOI: https://doi.org/10.1007/3-540-45876-X_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43324-8

  • Online ISBN: 978-3-540-45876-0

  • eBook Packages: Springer Book Archive
