skip to main content
10.1145/375551.375569acmconferencesArticle/Chapter ViewAbstractPublication PagespodsConference Proceedingsconference-collections
Article

Extended path expressions of XML

Authors Info & Claims
Published:01 May 2001Publication History

ABSTRACT

Query languages for XML often use path expressions to locate elements in XML documents. Path expressions are regular expressions such that underlying alphabets represent conditions on nodes. Path expressions represent conditions on paths from the root, but do not represent conditions on siblings, siblings of ancestors, and descendants of such siblings. In order to capture such conditions, we propose to extend underlying alphabets. Each symbol in an extended alphabet is a triplet (e1, a, e2), where a is a condition on nodes, and e1 (e2) is a condition on elder (resp. younger) siblings and their descendants; e1 and e2 are represented by hedge regular expressions, which are as expressive as hedge automata (hedges are ordered sequences of trees). Nodes matching such an extended path expression can be located by traversing the XML document twice. Furthermore, given an input schema and a query operation controlled by an extended path expression, it is possible to construct an output schema. This is done by identifying, where in the input schema the given extended path expression is satisfied.

References

  1. 1.S. Abiteboul, P. Buneman, and D. Suciu. Data on the Web: From Relations to Semistructured Dataand XML. Morgan Kaufmann, 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. 2.S. Abiteboul and V. Vianu. Regular path queries with constraints. In PODS 97, 1997.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. 3.T. Arnold-Moore, M. Fuller, and R. Sacks-Davis. System architecture of a content management server for XML document applications. Markup Languages, 2(1), 2000.]]Google ScholarGoogle Scholar
  4. 4.R. Baeza-Yates and G. Navarro. Integrating contents and structure in text retrieval. SIGMOD Record, 25(1):67-79, Mar. 1996.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. 5.T. Bray, J. Paoli, and C. M. Sperberg-McQueen. Extensible Markup Language (XML) 1.0. W3C Recommendation. http://www.w3.org/TR/REC-xml, Feburary 1998.]]Google ScholarGoogle Scholar
  6. 6.A. Br. uggemann-Klein and D. Wood. Caterpillars: A context specification technique. Markup Languages: Theory and Practice, 2(1):81-106, Winter 2000.]]Google ScholarGoogle ScholarCross RefCross Ref
  7. 7.J. Buchi. Weak second-order arithmetic and finite automata, 1960.]]Google ScholarGoogle ScholarCross RefCross Ref
  8. 8.P. Buneman,W. Fan, and S. Weinsten. Path constraints on semistructured and structured data. In PODS 98, 1998.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. 9.J. Clark. TREX - tree regular expressions for XML, 2001. http://www.thaiopensource.com/trex/.]]Google ScholarGoogle Scholar
  10. 10.J. Clark and S. DeRose. XML Path Language (XPath) version 1.0. W3C Recommendation. http://www.w3.org/TR/xpath, November 1999.]]Google ScholarGoogle Scholar
  11. 11.H. Comon, M. Dauchet, R. Gilleron, F. Jacquemard, D. Lugiez, S. Tison, and M. Tommasi. "Tree Automata Techniques and Applications", 1997. http://www.grappa.univ-lille3.fr/tata.]]Google ScholarGoogle Scholar
  12. 12.B. Courcelle. On recognizable sets and tree automata. In M. Nivat and H. A . it-Kaci, editors, Resolution of Equations in Algebraic Structures. Academic Press, 1989.]]Google ScholarGoogle ScholarCross RefCross Ref
  13. 13.H.-D. Ebbinghaus and J. Flum. Finite Model Theory. Springer-Verlag, Berlin, 1995.]]Google ScholarGoogle ScholarCross RefCross Ref
  14. 14.M. Fernandez, J. Simeon, and P. Wadler. XML query languages: Experiences and exemplars. http://wwwdb.research.bell-labs.com/user/simeon/xquery.html.]]Google ScholarGoogle Scholar
  15. 15.M. Fernandez and D. Suciu. Optimizing regular path expressions using graph schemas. In ICDE 98, 1998.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. 16.F. Gecseg and M. Steinby. Tree languages. In A. Salomaa and G. Rozenberg, editors, Handbook of Formal Languages, Volume 3, Beyond Words, volume 3, pages 1-68. Springer-Verlag, 1997.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. 17.H. Hosoya and B. C. Pierce. Regular expression pattern matching for XML. In POPL 01, 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. 18.ISO/IEC. Information Technology - Document Description and Processing Languages - Regular Language Description for XML (RELAX) - Part 1: RELAX Core, 2000. DTR 22250-1.]]Google ScholarGoogle Scholar
  19. 19.A. Mendelzon and P. Wood. Finding regular simple paths in graph databases. SIAM Journal on Computing, 24(6), 1995.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. 20.A. R. Meyer. Weak monadic second order theory of successor is Not elementary-recursive. In LOGCOLLOQ: Logic Colloquium, volume 453 of LNM. Springer-Verlag, 1975.]]Google ScholarGoogle ScholarCross RefCross Ref
  21. 21.T. Milo, D. Suciu, and V. Vianu. Typechecking for XML transformers. In PODS 00, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. 22.M. Murata. Transformation of documents and schemas by patterns and contextual conditions. In PODP 96, volume 1293 of LNCS. Springer-Verlag, 1997.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. 23.A. Neumann and H. Seidl. Locating matches of tree patterns in forests. In Foundations of Software Technology and Theoretical Computer Science, volume 1530 of Lecture Notes in Computer Science, pages 134-145. Springer-Verlag, 1998.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. 24.F. Neven. Extension of attribute grammars for structured document queries. In DBPL 99, 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. 25.F. Neven and J. V. den Bussche. Expressiveness of structured document query languages based on attribute grammars. In PODS 98, pages 11-17, 1998.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. 26.F. Neven and T. Schwentick. Query automata. In PODS 99, 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. 27.F. Neven and T. Schwentick. Expressive and efficient pattern languages for tree-structured data. In PODS 00, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. 28.M. Nivat and A. Podelski. Another variation on the common subexpression problem. Discrete Mathematics, 114:379-401, 1993.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. 29.C. Pair and A. Quere. Definition et etude des bilangages reguliers. Information and Control, 13(6):565-593, Dec. 1968.]]Google ScholarGoogle ScholarCross RefCross Ref
  30. 30.Y. Papakonstantinou and V. Vianu. DTD inference for views of XML data. In PODS 00, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. 31.A. Podelski. A monoid approach to tree automata. In M. Nivat and A. Podelski, editors, Tree Automata and Languages, pages 41-56. North-Holland, 1992.]]Google ScholarGoogle Scholar
  32. 32.M. Takahashi. Generalizations of regular sets and their application to a study of context-free languages. Information and Control, 27:1-36, 1975.]]Google ScholarGoogle ScholarCross RefCross Ref
  33. 33.W. Thomas. Languages, automata and logic. In A. Salomaa and G. Rozenberg, editors, Handbook of Formal Languages, Volume 3, Beyond Words, volume 3, pages 389-449. Springer-Verlag, 1997.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. 34.H. S. Thompson, D. Beech, M. Maloney, and N. Mendelsohn. "XML Schema Part 1: Structures", 2001. http://www.w3.org/TR/xmlschema-1/.]]Google ScholarGoogle Scholar
  35. 35.P. Wadler. A formal semantics of patterns in XSLT. In Markup Technologies 99, 1999.]]Google ScholarGoogle Scholar

Index Terms

  1. Extended path expressions of XML

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            PODS '01: Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
            May 2001
            301 pages
            ISBN:1581133618
            DOI:10.1145/375551

            Copyright © 2001 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 1 May 2001

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • Article

            Acceptance Rates

            PODS '01 Paper Acceptance Rate26of99submissions,26%Overall Acceptance Rate642of2,707submissions,24%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader