The Back-End of a Two-Layer Model for a Federated National Datastore for Academic Research VOs that Integrates EGEE Data Management

Journal of Grid Computing

Abstract

This paper proposes an architecture for the back-end of a federated national datastore for academic research communities, developed by the e-INIS (Irish National e-InfraStructure) project, and describes in detail one member of the federation: the regional datastore at Trinity College Dublin. The architecture builds on existing infrastructure and services, including Grid-Ireland, the National Grid Initiative and EGEE, Europe’s leading Grid infrastructure. It assumes that users belong to distinct research communities and that their data access patterns can be described by two properties, mutability and frequency of access. The architecture covers only the back-end, because individual academic communities are best qualified to define their own front-end services and user interfaces. The proposal is designed to ease front-end development by placing minimal restrictions on how a front-end is implemented and on each community’s internal security policies. It also seeks to insulate the communities from the back-end and from each other, both to ensure quality of service and to decouple their front-end implementations from site-specific back-end implementations.

Corresponding author

Correspondence to Geoff Quigley.

About this article

Cite this article

Coghlan, B., Walsh, J., Childs, S. et al. The Back-End of a Two-Layer Model for a Federated National Datastore for Academic Research VOs that Integrates EGEE Data Management. J Grid Computing 8, 341–364 (2010). https://doi.org/10.1007/s10723-010-9149-9

  • DOI: https://doi.org/10.1007/s10723-010-9149-9
