Abstract
With the recent popularity of the World Wide Web, an enormous amount of heterogeneous information is now available online. As a result, information about real world objects may spread over different data sources and may be partial and inconsistent. How to manipulate such semistructured data is thus a challenge. Previous work on semistructured data mainly focuses on developing query languages and systems to retrieve semistructured data. Relatively less attention has been paid to the manipulation of such data. In order to manipulate such semistructured data, we need a data model that is more expressive than the existing graph-based and tree-based ones to account for the existence of partial and inconsistent information from different data sources. In this paper, we propose such a data model for semistructured data that allows partial and inconsistent information and discuss how to manipulate such semistructured data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
S. Abiteboul. Querying Semistructured Data. In Proceedings of the International Conference on Data Base Theory, pages 1–18. Springer-Verlag LNCS 1186, 1997.
S. Abiteboul, D. Quass, J. McHugh, J. Widom, and J. L. Wiener. The Lorel Query Language for Semistructured Data. Intl. Journal of Digital Libraries, 1(1):68–88, 1997.
J. L. Ambite, N. Ashish, G. Barish, G.A. Knoblock, S. Minton, P.J. Modi, I. Muslea, A. Philpot, and S. Tejada. ARIADNE: A system for constructing mediators for internet sources. In Proceedings of the ACM SIGMOD International Conference on Management of Data, 1998.
G. Arocena and A. Mendelzon. WebOQL: Restructuring Documents, Databases and Webs. In Proceedings of the International Conference on Data Engineering, pages 24–33. IEEE Computer Society, 1998.
F. Bancilhon and S. Khoshafian. A Calculus for Complex Objects. J. Computer and System Sciences, 38(2):326–340, 1989.
C. Beeri, G. Elber, T. Milo, Y. Sagiv, O. Shmueli, N. Tishby, Y. Kogan, D. Konopnicki, P. Mogilevski, and N. Slonim. Websuite — A tool suite for harnessing web data. In Proceedings of the International Workshop on the Web and Databases, 1998.
O. P. Buneman, S. B. Davidson, and A. Watters. A Semantics for Complex Objects and Approximate Answers. J. Computer and System Sciences, 43(1):170–218, 1991.
P. Buneman, S. Davidson, M. Fernandez, and D. Suciu. Adding Structure to Unstructured Data. In Proceedings of the International Conference on Data Base Theory, pages 336–350. Springer-Verlag LNCS 1186, 1997.
P. Buneman, S. Davidson, G. Hilebrand, and D. Suciu. A Query Language and Optimization Techniques for Unstructured Data. In Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 505–516, 1996.
S. S. Chawathe, H. Garcia-Molina, J. Hammer, K. Ireland, Y. Papakonstantinou, J. D. Ullman, and J. Widom. The TSIMMIS Project: Integration of Heterogeneous Information Sources. In Proceedings of the 10th Meeting of the Information Processing Society of Japan, pages 7–18, 1994.
W. W. Cohen. Integration of Heterogeneous Databases without Common Domains Using Queries Based on Textual Similarity. In Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 201–212, 1998.
O. Shmueli D. Konopnicki. W3QS: A Query System for the World-Wide Web. In Proceedings of the International Conference on Very Large Data Bases, pages 54–65, Zurich, Switzerland, 1995. Morgan Kaufmann Publishers, Inc.
L.G. Demichiel. Resolving Database Incompatibility: An Approach to Performing Relational Operations over Mismatched Domains. IEEE Transactions on Knowledge and Data Engineering, 1(4):485–493, 1989.
D. Florescu, A. Levy, and A. Mendelzon. Database Techniques for the World-Wide Web: A Survey. SIGMOD Record, 26(3), 1997.
R. Himmeroder, G. Lausen, B. Ludascher, and C. Schlepphorst. On a declarative semantics for web queries. In Proceedings of the International Conference on Deductive and Object-Oriented Databases, pages 386–398, Switzerland, 1997. Springer-Verlag LNCS.
R. Hull and G. Zhou. A Framework for Supporting Data Integration Using the Materialized and Virtual Approaches. In Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 481–492, 1996.
T. Imielinski and W. L. Jr. Incomplete Information in Relational Databases. Journal of ACM, 31(4):761–791, 1984.
W. L. Jr. On Databases with Incomplete Information. Journal of ACM, 28(1):41–70, 1981.
L. V. S. Lakshmanan, F. Sadri, and I. N. Subramanian. A Declarative Language for Querying and Restructuring the Web. In Proceedings of the 6th International Workshop on Research Issues in Data Engineering, 1996.
L. Lamport. Latex User Guide and Reference Manual. Addison Wesley, 2 edition, 1994.
A. Y. Levy, A. Rajaraman, and J. J. Ordille. Querying heterogeneous information sources using source descriptions. In Proceedings of the International Conference on Very Large Data Bases, pages 251–262. Morgan Kaufmann Publishers, Inc., 1996.
L. Libkin. A Relational Algebra for Complex Objects based on Partial Information. In Proceedings of the Conference on Mathematical Foundations of Programming Semantics, pages 26–41, Rostock, Germany, 1991. Springer-Verlag LNCS 495.
L. Libkin. Normalizing Incomplete Databases. In Proceedings of the ACM Symposium on Principles of Database Systems, pages 219–230, San Jose, California, 1995.
M. Liu. ROL: A Deductive Object Base Language. Information Systems, 21(5):431–457, 1996.
M. Liu. Relationlog: A Typed Extension to Datalog with Sets and Tuples. Journal of Logic Programming, 36(3):271–299, 1998.
A. Mendelzon, G. Mihaila, and T. Milo. Querying the World Wide Web. In Proceedings of the First International Conference on Parellel and Distributed Information System, pages 80–91, 1996.
A. Motro and I. Rakov. Estimating the Quality of Data in Relational Databases. In Proceedings of the 1996 Conference on Information Quality, pages 94–106, 1996.
K. Munakata. Integration of Semistructured Data Using Outer Joins. In Proceedings of the Workshop on Management of Semistructured Data, 1997.
Y. Papakonstantinou, H. Garcia-Molina, and J. Widom. Object Exchange across Heterogeneous Information. In Proceedings of the International Conference on Data Engineering, pages 251–260. IEEE Computer Society, 1995.
F. S. C. Tseng, A. L. P. Chen, and W. P. Yang. Answering Heterogeneous Databases Queries with Degrees of Uncertainty. Distributed and Parallel Databases, 1(3):281–302, 1993.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, M., Ling, T.W. (2000). A Data Model for Semistructured Data with Partial and Inconsistent Information. In: Zaniolo, C., Lockemann, P.C., Scholl, M.H., Grust, T. (eds) Advances in Database Technology — EDBT 2000. EDBT 2000. Lecture Notes in Computer Science, vol 1777. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46439-5_22
Download citation
DOI: https://doi.org/10.1007/3-540-46439-5_22
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67227-2
Online ISBN: 978-3-540-46439-6
eBook Packages: Springer Book Archive