Abstract
This article describes SciProv, an architecture that aims to interact with Scientific Workflow Management Systems in order to capture and manipulate provenance metadata. For this purpose, SciProv adopts an approach based on an abstract model for representing the lineage. This model, called Open Provenance Model (OPM), allows that SciProv can set up a homogeneous and interoperable infrastructure for handling provenance metadata. As a result, SciProv is able to provide a framework for query metadata provenance generated in an e-Science scenario. Moreover, the architecture uses semantic web technology in order to process provenance queries. In this context, using ontologies and inference engines, SciProv can make inferences about lineage and, based on these inferences, obtain important results based on extraction of information beyond those that are registered explicitly from the data managed.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Barga, R.S., Digiampietri, L.A.: Automatic capture and efficient storage of e-science experiment provenance. Concur. Comput.: Pract. Exper. 20(5), 419–429 (2008)
Biton, O., Cohen-Boulakia, S., Davidson, S.B., Hara, C.S.: Querying and managing provenance through user views in scientific workflows. In: ICDE 2008: Proceedings of the 2008 IEEE 24th International Conference on Data Engineering, pp. 1072–1081. IEEE Computer Society, Washington, DC, USA (2008)
Buneman, P., Khanna, S., Tan, W.C.: Why and where: A characterization of data provenance. In: Van den Bussche, J., Vianu, V. (eds.) ICDT 2001. LNCS, vol. 1973, pp. 316–330. Springer, Heidelberg (2000)
Cavalier-Smith, T.: Only six kingdoms of life. Proceedings of the Royal Society B Biological Sciences 271(1545), 1251–1262 (2004)
Cruz, S.M.S.d., Campos, M.L.M., Mattoso, M.: Towards a taxonomy of provenance in scientific workflow management systems. In: SERVICES 2009: Proceedings of the 2009 Congress on Services - I. IEEE Computer Society, Washington, DC, USA (2009)
Davidson, S.B., Freire, J.: Provenance and scientific workflows: challenges and opportunities. In: SIGMOD 2008: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, pp. 1345–1350. ACM, New York (2008)
Digiampietri, L., Medeiros, C., Setúbal, J.: A framework based in web services orchestration bioinformatics workflow management. Genetics and Molecular Research 4(3), 535–542 (2005)
Factor, M., Henis, E., Naor, D., Rabinovici-Cohen, S., Reshef, P., Ronen, S., Michetti, G., Guercio, M.: Authenticity and provenance in long term digital preservation: modeling and implementation in preservation aware storage. In: First Workshop on Theory and Practice of Provenance, pp. 6:1–6:10. USENIX Association, Berkeley (2009)
Freire, J., Koop, D., Santos, E., Silva, C.T.: Provenance for computational tasks: A survey. Computing in Science and Enginnering 10(3), 11–21 (2008)
Freire, J., Silva, C.T.: Towards enabling social analysis of scientific data. In: CHI Social Data Analysis Workshop, Florence, Italy (2008)
Freire, J., Silva, C.T., Callahan, S.P., Santos, E., Scheidegger, C.E., Vo, H.T.: Managing rapidly-evolving scientific workflows. In: Moreau, L., Foster, I. (eds.) IPAW 2006. LNCS, vol. 4145, pp. 10–18. Springer, Heidelberg (2006)
Golbeck, J., Hendler, J.: A semantic web approach to the provenance challenge. Concurr. Comput.: Pract. Exper. 20(5), 431–439 (2008)
Groth, P., Deelman, E., Juve, G., Mehta, G., Berriman, B.: Pipeline-centric provenance model. In: WORKS 2009: Proceedings of the 4th Workshop on Workflows in Support of Large-Scale Science, pp. 1–8. ACM, New York (2009)
Groth, P., Jiang, S., Miles, S., Munroe, S., Tan, V., Tsasakou, S., Moreau, L.: An architecture for provenance systems. Tech. Rep. D3.1.1 Final Architecture v.0.6, EU Provenance Project, Southampton, UK (2006)
Ludäscher, B., Altintas, I., Berkley, C., Higgins, D., Jaeger, E., Jones, M., Lee, E.A., Tao, J., Zhao, Y.: Scientific workflow management and the kepler system: Research articles. Concurr. Comput.: Pract. Exper. 18(10), 1039–1065 (2006)
Marinho, A., Murta, L., Werner, C., Braganholo, V., da Cruz, S.M.S., Ogasawara, E., Mattoso, M.: Managing provenance in scientific workflows with provmanager. In: 22nd International Symposium on Computer Architecture and High Performance Computing. LNCC (2010)
Moreau, L., Clifford, B., Freire, J., Gil, Y., Groth, P., Futrelle, J., Kwasnikowska, N., Miles, S., Missier, P., Myers, J., Simmhan, Y., Stephan, E., Bussche, J.: The open provenance model core specification (v1.1). Future Generation Computer System (2010) (in press, corrected proof)
Moreau, L., Freire, J., Futrelle, J., Mcgrath, R.E., Myers, J., Paulson, P.: The open provenance model: An overview, pp. 323–326 (2008)
Munroe, S., Miles, S., Moreau, L., Vázquez-Salceda, J.: Prime: a software engineering methodology for developing provenance-aware applications. In: SEM 2006: Proceedings of the 6th International Workshop on Software Engineering and Middleware, pp. 39–46. ACM, New York (2006)
Oinn, T., Li, P., Kell, D.B., Goble, C., Goderis, A., Greenwood, M., Hull, D., Stevens, R., Turi, D., Zhao, J.: Taverna/mygrid: Aligning a workflow system with the life sciences community. In: Workflows for e-Science: Scientific Workflows for Grids, ch. Part III, pp. 300–319. Springer, London (2007)
Olson, G.M.: The next generation of science collaboratories. In: CTS 2009: Proceedings of the 2009 International Symposium on Collaborative Technologies and Systems, pp. xv–xvi.. IEEE Computer Society, Washington, DC, USA (2009)
Simmhan, Y.L., Plale, B., Gannon, D.: A framework for collecting provenance in data-centric scientific workflows. In: ICWS 2006: Proceedings of the IEEE International Conference on Web Services, pp. 427–436. IEEE Computer Society, Washington, DC, USA (2006)
Tateno, Y., Imanishi, T., Miyazaki, S., Fukami-Kobayashi, K., Saitou, N., Sugawara, H., Gojobori, T.: Dna data bank of japan (ddbj) for genome scale research in life science. Nucleic Acids Research 30(1), 27–30 (2002)
Zhao, J., Goble, C., Stevens, R., Turi, D.: Mining taverna’s semantic web of provenance. Concurr. Comput.: Pract. Exper. 20(5), 463–472 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gaspar, W., Braga, R., Campos, F. (2011). SciProv: An Architecture for Semantic Query in Provenance Metadata on e-Science Context. In: Böhm, C., Khuri, S., Lhotská, L., Pisanti, N. (eds) Information Technology in Bio- and Medical Informatics. ITBAM 2011. Lecture Notes in Computer Science, vol 6865. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23208-4_7
Download citation
DOI: https://doi.org/10.1007/978-3-642-23208-4_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23207-7
Online ISBN: 978-3-642-23208-4
eBook Packages: Computer ScienceComputer Science (R0)