Abstract
This paper puts forward the concept of an OEM model with weights on its edges, develops a new approach to extracting a schema from semistructured data with weighted edges, and gives two theorems on computing the target set and the supporting degree of a label path. Using a breadth-first, top-down traversal strategy, the algorithm computes the target set and supporting degree of every label in a label path, and decides whether the label is retained in the schema model according to the magnitude of its supporting degree and the weight of the label. Finally, we test the validity and efficiency of the algorithm. The schema extracted from the same OEM database is smaller in this paper than in other papers.
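The traversal described above can be illustrated with a minimal sketch. The abstract does not give the formal definitions, so everything here is an assumption made for illustration: the target set of a label path is taken to be the objects reached from the root along that path, the supporting degree is taken to be the share of the parent path's target set that carries the label, and the label weight is taken to be the mean weight of the matching edges. The names `EDGES`, `extract_schema`, `min_support`, `min_weight`, and `max_depth` are hypothetical and do not come from the paper.

```python
from collections import deque

# Hypothetical weighted OEM graph: object id -> list of (label, weight, child id).
# The ids, labels, and weights are illustrative, not taken from the paper.
EDGES = {
    "root": [("book", 1.0, "b1"), ("book", 1.0, "b2"), ("note", 0.2, "n1")],
    "b1": [("title", 1.0, "t1"), ("author", 0.8, "a1")],
    "b2": [("title", 1.0, "t2"), ("errata", 0.1, "e1")],
}


def extract_schema(edges, root, min_support=0.5, min_weight=0.5, max_depth=10):
    """Breadth-first, top-down pass over label paths.

    Assumed definitions (not the paper's theorems):
      target set  = objects reached from the root by following the label path
      support     = share of the parent path's target set that has the label
      weight      = mean weight of the edges carrying the label
    Returns a dict: label path (tuple) -> (target set, support).
    """
    schema = {}
    queue = deque([((), {root})])  # (label path, target set)
    while queue:
        path, targets = queue.popleft()
        if len(path) >= max_depth:  # guard against cycles in the graph
            continue
        reached, sources, weights = {}, {}, {}
        for obj in targets:
            for label, weight, child in edges.get(obj, []):
                reached.setdefault(label, set()).add(child)
                sources.setdefault(label, set()).add(obj)
                weights.setdefault(label, []).append(weight)
        for label in sorted(reached):
            support = len(sources[label]) / len(targets)
            mean_weight = sum(weights[label]) / len(weights[label])
            # Retain the label only if both thresholds are met.
            if support >= min_support and mean_weight >= min_weight:
                new_path = path + (label,)
                schema[new_path] = (reached[label], support)
                queue.append((new_path, reached[label]))
    return schema


if __name__ == "__main__":
    for path, (targets, support) in extract_schema(EDGES, "root").items():
        print(".".join(path), sorted(targets), support)
```

In this toy graph the low-weight labels `note` and `errata` are pruned, which is how weighting could shrink the resulting schema; the actual retention rule in the paper may combine supporting degree and weight differently.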
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, J., Shi, S. (2009). Extracting Schema from Semistructured Data with Weight Tag. In: Yu, W., He, H., Zhang, N. (eds) Advances in Neural Networks – ISNN 2009. ISNN 2009. Lecture Notes in Computer Science, vol 5553. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01513-7_126
DOI: https://doi.org/10.1007/978-3-642-01513-7_126
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01512-0
Online ISBN: 978-3-642-01513-7
eBook Packages: Computer Science, Computer Science (R0)