Abstract
The recently proposed notion of a Semantic Web requires XML/RDF processing techniques ablt to locate, extract and organise heterogeneous information contained in XML documents coming from different sites, dealing flexibly with differences in structure and tag vocabulary. Such techniques shouls operate even when tagging is done in accordance with non-informative schemata, and even when no schema is available at all. In this paper, we review the main problems related to the processing and restructuring of large amounts of XML-based data, and propose some soultions in the framework of a flexible query and processing model for well-formed CML documents.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
G. Bordogna, D. Lucarella, G. Pasi, “A Fuzzy Object Oriented Data Model”, IEEE International Conference on Fuzzy Systems, Vol.1, 1994, pp.313–317.
B. Bouchon-Meunier, M. Rifqi, S. Bothorel, “Towards General Measures of Comparison of Objects”, Fuzzy Sets and Systems, vol.84, 1996.
P. Burchard, “Cut-Paste Metrics: String Matching”, http://www/pobox.com/burchard/bio/fuzzymatch/, 2000
S. Ceri, A. Bonifati, “Comparison of XML Query Languages”, SIGMOD Record 21(1) (2000).
R. Cohem, G. Di Battista, A. Kanevsky, R. Tamassia. “Reinventing the Wheel: An Optimal Data Structure for Connectivity Queries”. Proc. of ACM-TOC Symp. on the Theory of Computing. S.Diego, CA, (US) (1993).
S. Comai, E. Damiani, R. Posenato, L. Tanca. “A Schema-Based Approach to Modelling and Querying WWW Data”. In H. Christiansen, ed., Proceedings of Flexible Query Answering Systems (FQAS’ 98), Roskilde (Denmark), Lecture Notes in Artificial Intelligence 1495, Springer (1998).
S. Comai, E. Damiani, R. Posenato, L. Tanca. “XML=-GL: a Graphical Language for Querying and Restructuring XML Documents”, Computer Networks, Vol. 31, (1999), pp. 1171–1187.
S. Cohen, Y. Kogan, W. Nutt, Y. Sagiv, A. Serebrenik. “EquiX: Easy Querying in XML Databases in XML Databases” WebDB (Informal Proceedings), 2000.
E. Damiani, M.G. Fugini, C. Bellettini, “A Hierarchy Aware Approach to Faceted Classification of Object-Oriented Components”. ACM Transactions on Software Engineering Methodologies, vol.8 n.3 pp.215–262, 1999.
E. Damiani, L. Tanca. “Blin Queries to XML Data”. Proceedings of DEXA 2000, London, UK, September 4–8, 2000. Lecture Notes in Computer Science, Vol. 1873, Springer, 2000, Pages, 345–356.
E. Damiani, L. Tanca, F. Arcelli Fontana. “Fuzzy XML Queries via Context-based Choice of Aggregations”, Kybernetika n.16 vol.4, 2000.
D. Dubois, F. Esteva, P. Garcia, L. Godo, R. Lopez de Mantaras, H. Prade. “Fuzzy Set Modelling in Case-Based Reasoning”. Int. Jour. of Intelligent Systems 13(1) (1998).
D. Dubois, H. Prade, F. Sedes. “Fuzzy Logic Techniques in Multimedia Database Querying: A Preliminary Investigations of the Potentials”. In D. Meersman, Z. Tari, and S. Stevens, eds., Database Semantics: Semantic Issues in Multimedia Systems, Kluwer Academic Publisher (1999).
W3C. Extensible Stylesheet Language (XSL) Version 1.0. October 2000. http://www.w3C.org/TR/xsl/
W3C. Extensible Markup Language (XML) 1.0. Feb. 1998. http://www.w3C.org/TR/REC-xml/
W3C. Resource Description Framework (RDF) Model and Syntax Specification. W3C Recommendation 22 February 1999. http://www.w3.org/TR/REC-rdf-syntax/
W3C. Namespaces in XML W3C Recommendation 14 January 1999. http://www.w3.org/TR/1999/REC-xml-names-19990114/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Damiani, E., Olibini, B., Tanca, L. (2001). Fuzzy Techniques for XML Data Smushing. In: Reusch, B. (eds) Computational Intelligence. Theory and Applications. Fuzzy Days 2001. Lecture Notes in Computer Science, vol 2206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45493-4_65
Download citation
DOI: https://doi.org/10.1007/3-540-45493-4_65
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42732-2
Online ISBN: 978-3-540-45493-9
eBook Packages: Springer Book Archive