SciProv: An Architecture for Semantic Query in Provenance Metadata on e-Science Context

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 6865)

Abstract

This article describes SciProv, an architecture that aims to interact with Scientific Workflow Management Systems in order to capture and manipulate provenance metadata. To this end, SciProv adopts an abstract model for representing lineage, the Open Provenance Model (OPM), which allows it to set up a homogeneous and interoperable infrastructure for handling provenance metadata. As a result, SciProv provides a framework for querying provenance metadata generated in an e-Science scenario. Moreover, the architecture uses Semantic Web technology to process provenance queries. Using ontologies and inference engines, SciProv can make inferences about lineage and, based on them, return information beyond what is explicitly recorded in the managed data.
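The idea of returning lineage information that was never recorded explicitly can be illustrated with a minimal sketch. It is not SciProv's implementation: the abstract describes ontologies and an inference engine, whereas this example substitutes plain Python and a transitive closure over one OPM-style relation. The artifact names and the `infer_lineage` helper are hypothetical, chosen only for illustration.

```python
from collections import defaultdict

# Hypothetical explicit assertions in the style of OPM's wasDerivedFrom edge:
# (derived artifact, source artifact). These names are not from the paper.
WAS_DERIVED_FROM = [
    ("alignment", "sequences"),
    ("tree", "alignment"),
    ("report", "tree"),
]


def infer_lineage(edges):
    """Map each derived artifact to every artifact it derives from,
    following one or more wasDerivedFrom steps (transitive closure)."""
    direct = defaultdict(set)
    for derived, source in edges:
        direct[derived].add(source)

    def ancestors(artifact, seen):
        # Depth-first walk; `seen` also guards against cycles in bad data.
        for source in direct.get(artifact, ()):
            if source not in seen:
                seen.add(source)
                ancestors(source, seen)
        return seen

    return {artifact: ancestors(artifact, set()) for artifact in list(direct)}


lineage = infer_lineage(WAS_DERIVED_FROM)

# Sources recorded explicitly for "report" versus those found only by inference.
explicit = {src for derived, src in WAS_DERIVED_FROM if derived == "report"}
inferred_only = lineage["report"] - explicit
print(sorted(inferred_only))
```

Here only the edge from `report` to `tree` is asserted directly; the dependence of `report` on `alignment` and `sequences` is obtained by inference. An ontology-based system would typically express the same rule by declaring the relation transitive and leaving the closure to the reasoner.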

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Gaspar, W., Braga, R., Campos, F. (2011). SciProv: An Architecture for Semantic Query in Provenance Metadata on e-Science Context. In: Böhm, C., Khuri, S., Lhotská, L., Pisanti, N. (eds) Information Technology in Bio- and Medical Informatics. ITBAM 2011. Lecture Notes in Computer Science, vol 6865. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23208-4_7

  • DOI: https://doi.org/10.1007/978-3-642-23208-4_7

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-23207-7

  • Online ISBN: 978-3-642-23208-4

  • eBook Packages: Computer Science, Computer Science (R0)
