skip to main content
10.1145/1367497.1367735acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
poster

Extracting XML schema from multiple implicit xml documents based on inductive reasoning

Published: 21 April 2008 Publication History

Abstract

We propose a method of classifying XML documents and extracting XML schema from XML by inductive inference based on constraint logic programming. The goal of this work is to type a large collection of XML approximately but efficiently. This can also process XML code written in a different schema or even code which is schema-less. Our approach is intended to achieve identification based on the syntax and semantics of the XML documents by information extraction using ontology, and to support retrieval and data management. Our approach has three steps. The first step is XML to predicates, the second step is to compare predicates and classifies structures which represent similar meanings in different structures, and the last step is predicates to rules by using ontology and to maintain XML Schema. We evaluate similarity of data type and data range by using an ontology dictionary, and XML Schema is made from results of second and last step.

References

[1]
Masaya Eki, Tadachika Ozono, Toramatsu Shintani, 'On an XML Database System Based on Constraint Logic Programming', WorldComp ICAI'07, pages 859-865, 2007.
[2]
Wen-Syan Li, Chris Clifton, 'SEMINT: a tool for identifying attribute correspondences in heterogeneous databases using neural networks', Data & Knowledge Engineering, Volume 33, Issue 1, Pages 49-84, Apr 2000.
[3]
Fumio Mizoguchi, Hayato Ohwada, 'Constraint relative least general generalization for inducing constraint logic programs', New Generation Computing, pages 335-368, 1995.
[4]
Svetlozar Nestorov, Serge Abiteboul, Rajeev Motwani, 'Extracting Schema from Semistructured Data', SIGMOD'98, pages 295-306, 1998.

Cited By

View all
  • (2012)User profile integration made easyProceedings of the 21st International Conference on World Wide Web10.1145/2187980.2188227(939-948)Online publication date: 16-Apr-2012

Index Terms

  1. Extracting XML schema from multiple implicit xml documents based on inductive reasoning

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      WWW '08: Proceedings of the 17th international conference on World Wide Web
      April 2008
      1326 pages
      ISBN:9781605580852
      DOI:10.1145/1367497
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      In-Cooperation

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 21 April 2008

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. inductive reasoning
      2. predicate logic
      3. xml

      Qualifiers

      • Poster

      Conference

      WWW '08
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)1
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 20 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2012)User profile integration made easyProceedings of the 21st International Conference on World Wide Web10.1145/2187980.2188227(939-948)Online publication date: 16-Apr-2012

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media