skip to main content
10.1145/1166160.1166199acmconferencesArticle/Chapter ViewAbstractPublication PagesdocengConference Proceedingsconference-collections
Article

Describing and querying hierarchical XML structures defined over the same textual data

Published: 10 October 2006 Publication History

Abstract

Our work aims at representing and querying hierarchical XML structures defined over the same textual data. We call such data "multistructured textual documents".Our objectives are twofold. First, we shall define a suitable - XML compatible - data model enabling (1) to describe several independent hierarchical structures over the same textual data (represented by several XML structured documents) (2) to consider user annotations added in each structured document. Our proposal is based on the use of hedges (the foundation of the grammar language RelaxNG). Secondly, we shall propose an extension of XQuery in order to query structures and content in a concurrent way. We shall apply our proposals using a literary text written in old French.

References

[1]
A. Le Hors et al. Document object model (dom) level 3 core specification. Rec., W3C, 2004.
[2]
J. Allen. Time and time again: The many ways to represent time. International Journal of Intelligent Systems, 6(4):341--355, july 1991.
[3]
P.-V. Biron and A. Malhotra. XML Schema Part 2: Datatypes. Rec., W3C, 2001.
[4]
S. Boag. XQuery 1.0: An XML Query Language. Draft, W3C, 2003.
[5]
T. Bray. Namespaces in XML 1.1. Rec., W3C, 2004.
[6]
T. Bray, J. Paoli, and C.-M. Sperberg-McQueen. Extensible Markup Language (XML) 1.0. Rec., W3C, 1998.
[7]
E. Bruno and E. Murisasco. MSXD: a formal model for concurrent structures defined over the same textual data. In Proceedings of DEXA 2006, pages 172--181. LNCS, 2006.
[8]
J. Clark and M. Murata. RELAX NG Specification. Technical report, OASIS, 2001.
[9]
D. Draper et al. XQuery 1.0 and XPath 2.0 Formal Semantics. Candidate rec., W3C, 2005.
[10]
A. Dekhtyar and I.-E. Iacob. A framework for management of concurrent xml markup. Data and Knowledge Engineering, 52(2):185--208, 2005.
[11]
S. DeRose. Markup overlap: a review and a horse. In Extreme markup language 2004 Conference Proceedings, 2004.
[12]
P. Durusau and M. Brook O'Donnell. Concurrent markup for xml documents. In Proceedings of XML Europe Atlanta 2002., 2002.
[13]
M. Fernandez, A. Malhotra, J. Marsh, M. Nagy, and N. Walsh. XQuery 1.0 and XPath 2.0 Data Model. Draft, W3C, 2003.
[14]
C.-F. Goldfarb and Y. Rubinsky. The SGML handbook. Clarendon Press, Oxford, 1990.
[15]
M. Hilbert, O. Schonefeld, and A. Witt. Making concur work. In Extreme Markup Languages 2005, August 2005.
[16]
I.-E. Iacob and A. Dekhtyar. Towards a query language for multihierarchical xml: Revisiting xpath,. In Proceedings, Eighth International Workshop on the Web and Databases, WebDB'05, pages 43--48, 2005.
[17]
H.-V. Jagadish, L.-V.-S. Lakshmanan, M. Scannapieco, D. Srivastava, and N. Wiwatwattana. Colorful XML: One Hierarchy Isn't Enough. In SIGMOD Conference, pages 251--262, 2004.
[18]
M. Murata. Hedge automata: a formal model for XML schemata. Web page, 2000.
[19]
C.-M. Sperberg-McQueen and L. Burnard. Tei p4 guidelines for electronic text encoding and interchange, 2001.
[20]
C.-M. Sperberg-McQueen and C. Huitfeldt. Goddag: A data structure for overlapping hierarchies. In DDEP/PODDP, pages 139--160, 2000.
[21]
Jeni Tennison and Wendell Piez. Layered markup and annotation language (lmnl). In The Late breaking paper presented at Extreme Markup, 2002.
[22]
A. Witt. multiple hierarchies : news aspects of an old solution. In Extreme markup language 2004 Conference Proceedings, 2004.

Cited By

View all
  • (2016)Schema-aware Extended Annotation GraphsProceedings of the 2016 ACM Symposium on Document Engineering10.1145/2960811.2960816(45-54)Online publication date: 13-Sep-2016
  • (2010)Multimodal annotation of conversational dataProceedings of the Fourth Linguistic Annotation Workshop10.5555/1868720.1868749(186-191)Online publication date: 15-Jul-2010
  • (2007)An XML environment for multistructured textual documents2007 2nd International Conference on Digital Information Management10.1109/ICDIM.2007.4444228(230-235)Online publication date: Oct-2007

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
DocEng '06: Proceedings of the 2006 ACM symposium on Document engineering
October 2006
232 pages
ISBN:1595935150
DOI:10.1145/1166160
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 October 2006

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. XML
  2. XQuery
  3. multistructure
  4. textual documents
  5. tree-like structure

Qualifiers

  • Article

Conference

DocEng06
Sponsor:
DocEng06: ACM Symposium on Document Engineering
October 10 - 13, 2006
Amsterdam, The Netherlands

Acceptance Rates

Overall Acceptance Rate 194 of 564 submissions, 34%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 23 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2016)Schema-aware Extended Annotation GraphsProceedings of the 2016 ACM Symposium on Document Engineering10.1145/2960811.2960816(45-54)Online publication date: 13-Sep-2016
  • (2010)Multimodal annotation of conversational dataProceedings of the Fourth Linguistic Annotation Workshop10.5555/1868720.1868749(186-191)Online publication date: 15-Jul-2010
  • (2007)An XML environment for multistructured textual documents2007 2nd International Conference on Digital Information Management10.1109/ICDIM.2007.4444228(230-235)Online publication date: Oct-2007

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media