Abstract:
XML is the de facto standard to describe structured data. Several applications in the context of information systems are based on its use: electronic publishing, technica...Show MoreMetadata
Abstract:
XML is the de facto standard to describe structured data. Several applications in the context of information systems are based on its use: electronic publishing, technical documentation, digital libraries, web, etc. An XML document is mainly hierarchical. But, in some applications, several concurrent hierarchical structures could be associated to the same textual data. This paper presents an XML environment dedicated to the representation and the querying of such documents that we call multistructured textual documents. Our work aims at proposing a method for a compact representation of multiple trees over a single text based on segmentation. Segmentation encoding allows querying overlap/containment relations of markups belonging to different structures. This paper particularly focuses on the architecture of the XML environment implementing our proposals.
Date of Conference: 28-31 October 2007
Date Added to IEEE Xplore: 31 January 2008
ISBN Information: