Abstract
In this paper, we look at Web data that comes from multiple sources, as in the Web 2.0. We argue that Web data is more than just its content. Rather, a piece of Web data carries along different facets, such the transformations that data underwent, the different perspectives that users have on the content, and the context in which a statement is made. We put forward the idea that provenance, i.e. the tracing of where data comes from, can help us model these phenomena. We study how far existing approaches address the issue of provenance for Web data, and identify gaps and open problems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Green, T.J., Karvounarakis, G., Tannen, V.: Provenance semirings. In: PODS (2007)
Cui, Y., Widom, J., Wiener, J.L.: Tracing the lineage of view data in a warehousing environment. ACM Transactions on Database Systems 25(2) (2000)
Buneman, P., Cheney, J., Vansummeren, S.: On the expressiveness of implicit provenance in query and update languages. ACM Trans. Database Syst. 33(4) (2008)
Buneman, P., Khanna, S., Tan, W.-C.: Why and Where: A Characterization of Data Provenance. In: Van den Bussche, J., Vianu, V. (eds.) ICDT 2001. LNCS, vol. 1973, pp. 316–330. Springer, Heidelberg (2000)
Benjelloun, O., Sarma, A., Halevy, A., Theobald, M., Widom, J.: Databases with uncertainty and lineage. VLDB J. 17, 243–264 (2008)
Abiteboul, S., Duschka, O.M.: Complexity of answering queries using materialized views. In: Mendelzon, A.O., Paredaens, J. (eds.) PODS, pp. 254–263 (1998)
Green, T.: Containment of conjunctive queries on annotated relations. In: ICDT (2009)
Foster, J., Green, T., Tannen, V.: Annotated XML: queries and provenance. In: PODS (2008)
Amsterdamer, Y., Deutch, D., Tannen, V.: Provenance for aggregate queries. In: PODS (2011)
Carroll, J.J., Bizer, C., Hayes, P., Stickler, P.: Named graphs, provenance and trust. In: WWW 2005: Proceedings of the 14th International Conference on World Wide Web, pp. 613–622. ACM, New York (2005)
Theoharis, Y., Fundulaki, I., Karvounarakis, G., Christophides, V.: On provenance of queries on semantic web data. IEEE Internet Computing 15(1), 31–39 (2011)
Cheney, J., Chong, S., Foster, N., Seltzer, M.I., Vansummeren, S.: Provenance: a future history. In: Proc. of OOPSLA (2009)
Theoharis, Y., Fundulaki, I., Karvounarakis, G., Christophides, V.: On provenance of queries on semantic web data. IEEE Internet Computing 99(preprints) (2010)
Suchanek, F.M., Gross-Amblard, D.: Adding fake facts to ontologies. In: Demo at the International World Wide Web Conference. ACM (2010)
Suchanek, F.M., Gross-Amblard, D., Abiteboul, S.: Watermarking for Ontologies. In: Aroyo, L., Welty, C., Alani, H., Taylor, J., Bernstein, A., Kagal, L., Noy, N., Blomqvist, E. (eds.) ISWC 2011, Part I. LNCS, vol. 7031, pp. 697–713. Springer, Heidelberg (2011)
Etzioni, O., Cafarella, M., Downey, D., Kok, S., Popescu, A.M., Shaked, T., Soderland, S., Weld, D.S., Yates, A.: Web-scale information extraction in knowitall (preliminary results). In: World Wide Web Conference (2004)
Suchanek, F.M., Sozio, M., Weikum, G.: SOFIE: A Self-Organizing Framework for Information Extraction. In: International World Wide Web conference (WWW 2009). ACM Press, New York (2009)
Nakashole, N., Theobald, M., Weikum, G.: Scalable knowledge harvesting with high precision and high recall. In: WSDM (2011)
Carlson, A., Betteridge, J., Wang, R.C., Hruschka Jr., E.R., Mitchell, T.M.: Coupled semi-supervised learning for information extraction. In: Proceedings of the Third ACM International Conference on Web Search and Data Mining, WSDM 2010 (2010)
Suchanek, F.M., Kasneci, G., Weikum, G.: YAGO: A core of semantic knowledge - unifying WordNet and Wikipedia. In: Williamson, C.L., Zurko, M.E., Patel-Schneider, P.F., Shenoy, P.J. (eds.) World Wide Web Conference, Banff, Canada, pp. 697–706. ACM (2007)
Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.G.: DBpedia: A Nucleus for a Web of Open Data. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L.J.B., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC 2007 and ISWC 2007. LNCS, vol. 4825, pp. 722–735. Springer, Heidelberg (2007)
Hoffart, J., Suchanek, F.M., Berberich, K., Weikum, G.: Yago2: a spatially and temporally enhanced knowledge base from wikipedia. Artificial Intelligence Journal (2012)
McCarthy, J.: Generality in artificial intelligence. Communications of the ACM 30(12), 1029–1035 (1987)
McCarthy, J.: Notes on formalizing context. In: IJCAI, pp. 555–562 (1993)
Buvac, S., Mason, I.A.: Propositional logic of context. In: Proc. of AAAI, pp. 412–419 (1993)
Buvac, S.: Quantificational logic of context. In: Proc. of AAAI, pp. 600–606 (1996)
Nossum, R.: A decidable multi-modal logic of context. J. Applied Logic 1(1-2), 119–133 (2003)
Klarman, S., Gutiérrez-Basulto, V.: Two-dimensional description logics for context-based semantic interoperability. In: AAAI (2011)
Giunchiglia, F., Serafini, L.: Multilanguage hierarchical logics, or: How we can do without modal logics. Artificial Intelligence 65(1), 29–70 (1994)
Serafini, L., Bouquet, P.: Comparing formal theories of context in ai. Artificial Intelligence 155(1-2), 41–67 (2004)
Hintikka, J.: Knowledge and Belief. Cornell University Press (1962)
Fagin, R., Halpern, J.Y., Moses, Y., Vardi, M.Y.: Reasoning About Knowledge. MIT Press (1995)
van Ditmarsch, H., van der Hoek, W., Kooi, B.: Dynamic Epistemic Logic. Springer (2007)
Halpern, J.Y., Moses, Y.: A guide to completeness and complexity for modal logics of knowledge and belief. Artif. Intell. 54(2), 319–379 (1992)
Foster, J.N., Green, T.J., Tannen, V.: Annotated xml: queries and provenance. In: PODS, pp. 271–280 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bienvenu, M., Deutch, D., Suchanek, F.M. (2012). Provenance for Web 2.0 Data. In: Jonker, W., Petković, M. (eds) Secure Data Management. SDM 2012. Lecture Notes in Computer Science, vol 7482. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32873-2_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-32873-2_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32872-5
Online ISBN: 978-3-642-32873-2
eBook Packages: Computer ScienceComputer Science (R0)