Provenance for Web 2.0 Data

  • Conference paper
Secure Data Management (SDM 2012)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 7482)

Abstract

In this paper, we look at Web data that comes from multiple sources, as in the Web 2.0. We argue that Web data is more than just its content. Rather, a piece of Web data carries along different facets, such as the transformations that the data underwent, the different perspectives that users have on the content, and the context in which a statement is made. We put forward the idea that provenance, i.e., the tracing of where data comes from, can help us model these phenomena. We study how far existing approaches address the issue of provenance for Web data, and identify gaps and open problems.

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bienvenu, M., Deutch, D., Suchanek, F.M. (2012). Provenance for Web 2.0 Data. In: Jonker, W., Petković, M. (eds) Secure Data Management. SDM 2012. Lecture Notes in Computer Science, vol 7482. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32873-2_10

  • DOI: https://doi.org/10.1007/978-3-642-32873-2_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-32872-5

  • Online ISBN: 978-3-642-32873-2

  • eBook Packages: Computer Science (R0)
