Abstract
Annotations can attach to data both information about its origin or processing and metadata that helps in understanding it. Provenance knowledge preserved in annotations is managed by continuously propagating the annotations through the workflow. Annotation-based provenance management generally uses models that associate annotations explicitly, and techniques for propagating such annotations have been proposed. There is also a model that associates annotations implicitly: annotations are associated with data at arbitrary granularity by using queries. We call this implicit model the “multi-granularity annotation” model. Multi-granularity annotation enables flexible association of information. However, no provenance management methods using multi-granularity annotations have been reported. We have developed a method for propagating multi-granularity annotations. We define annotation propagation rules for each relational algebra operation and use them to recalculate the scopes of the annotations associated with data. We also address two problems, the loss of information needed to preserve annotation associations during data derivation and the lack of static data annotations, by extending the operations and the association method. Experiments showed that our method requires less space and execution time than conventional annotation management methods.
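To make the idea of query-defined annotation scopes more concrete, the following is a minimal illustrative sketch and not the authors' method. It assumes that an annotation's scope can be modelled as a row predicate plus a set of covered attributes, and it shows how selection and projection might recalculate that scope. All names here (`Annotation`, `row_scope`, `select`, `project`, `annotations_for`) are hypothetical and do not come from the paper.

```python
from dataclasses import dataclass
from typing import Callable, Dict, FrozenSet, List, Sequence, Tuple

Row = Dict[str, object]
Predicate = Callable[[Row], bool]


@dataclass(frozen=True)
class Annotation:
    """Hypothetical annotation whose scope is a query, not a list of cells."""
    text: str
    row_scope: Predicate          # assumption: selects the rows it covers
    columns: FrozenSet[str]       # assumption: the attributes it covers


def annotations_for(row: Row, column: str,
                    annotations: Sequence[Annotation]) -> List[str]:
    """Resolve the implicit association for a single cell."""
    return [a.text for a in annotations
            if column in a.columns and a.row_scope(row)]


def select(rows: Sequence[Row], predicate: Predicate) -> List[Row]:
    """Selection: surviving rows still satisfy (or fail) each row scope,
    so in this simplified model the annotations need no rewriting."""
    return [r for r in rows if predicate(r)]


def project(rows: Sequence[Row], columns: Sequence[str],
            annotations: Sequence[Annotation]
            ) -> Tuple[List[Row], List[Annotation]]:
    """Projection: narrow each annotation to the kept attributes and
    drop annotations that no longer cover any attribute."""
    kept = frozenset(columns)
    new_rows = [{c: r[c] for c in columns} for r in rows]
    new_notes = [Annotation(a.text, a.row_scope, a.columns & kept)
                 for a in annotations if a.columns & kept]
    return new_rows, new_notes


# Made-up example data, for illustration only.
rows = [
    {"id": 1, "price": 120, "source": "US"},
    {"id": 2, "price": 80, "source": "US"},
    {"id": 3, "price": 150, "source": "EU"},
]
note = Annotation("converted from USD",
                  lambda r: r["source"] == "US",
                  frozenset({"price"}))

selected = select(rows, lambda r: r["price"] > 100)
projected_rows, projected_notes = project(selected, ["price", "source"], [note])
```

In this sketch the projection deliberately keeps `source`, because the row predicate reads it. Projecting that attribute away would leave the scope impossible to evaluate, which is one way to picture the loss of association information that the abstract mentions.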
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Aoto, R., Shimizu, T., Yoshikawa, M. (2011). Propagation of Multi-granularity Annotations. In: Hameurlain, A., Liddle, S.W., Schewe, KD., Zhou, X. (eds) Database and Expert Systems Applications. DEXA 2011. Lecture Notes in Computer Science, vol 6861. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23091-2_51
DOI: https://doi.org/10.1007/978-3-642-23091-2_51
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23090-5
Online ISBN: 978-3-642-23091-2
eBook Packages: Computer Science (R0)