Abstract
This paper proposes an architecture for the back-end of a federated national datastore for use by academic research communities, developed by the e-INIS (Irish National e-InfraStructure) project, and describes in detail one member of the federation, the regional datastore at Trinity College Dublin. It builds upon existing infrastructure and services, including Grid-Ireland, the National Grid Initiative and EGEE, Europe’s leading Grid infrastructure. It assumes users are in distinct research communities and that their data access patterns can be described via two properties, denoted as mutability and frequency-of-access. The architecture is for a back-end—individual academic communities are best qualified to define their own front-end services and user interfaces. The proposal is designed to facilitate front-end development by placing minimal restrictions on how the front-end is implemented and on the internal community security policies. The proposal also seeks to ensure that the communities are insulated from the back-end and from each other in order to ensure quality of service and to decouple their front-end implementation from site-specific back-end implementations.
Similar content being viewed by others
References
CMIP5 - Coupled Model Intercomparison Project Phase 5. http://cmip-pcmdi.llnl.gov/cmip5/index.html
Digital Humanities Observatory. http://dho.ie
Heliophysics Integrated Observatory (HELIO). http://www.helio-vo.eu
Irish Centre for High End Computing (ICHEC). http://www.ichec.ie/services
ESFRI Working Group About Digital Repositories, ESFRI Position Paper. ESFRI (2007). Also available as ftp://ftp.cordis.europa.eu/pub/esfri/docs/digital_repositories_working_group.pdf (2007)
NSF07-28 Cyberinfrastructure Vision for 21st Century Discovery. “National Science Foundation” (2007). Also available as http://www.nsf.gov/pubs/2007/nsf0728/index.jsp (2007)
OECD Principles and Guidelines for Access to Research Data from Public Funding. OECD (2007). Also available as http://www.oecd.org/dataoecd/9/61/38500813.pdf (2007)
Enabling Inter-Repository Access Management between iRODS and Fedora. In: 4th International Conference on Open Repositories, OR09. Conference Presentations. Georgia Institute of Technology. http://hdl.handle.net/1853/28494 (2009)
Blanke, T., Hedges, M., Dunn, S.: Arts and humanities e-science—current practices and future challenges. Future Gener. Comput. Syst. 25(4), 474–480. doi:10.1016/j.future.2008.10.004 (2009)
Cassidy, K., McCandless, J., Childs, S., Walsh, J., Coghlan, B., Dagger, D.: Combining a virtual Grid testbed and Grid elearning courseware. In: Proc. Cracow Grid Workshop 2006 (CGW06). Academic Computer Centre CYFRONET AGH, Cracow, Poland (2006)
Childs, S., Coghlan, B., O’Callaghan, D., Quigley, G., Walsh, J.: Centralised fabric management for a national Grid infrastructure. In: Cracow Grid Workshop (CGW’05). Cracow, Poland (2005)
Colarelli, D., Grunwald, D., Neufeld, M.: The case for massive arrays of idle disks (maid). In: In the 2002 Conference on File and Storage Technologies, p. 2002. On (2002)
Cornillon, P., Gallagher, J., Sgouros, T.: Opendap: accessing data in a distributed, heterogeneous environment. Data Sci. J. 2, 164–174. doi:10.2481/dsj.2.164 (2003)
Data Management Task Force: e-IRG Report on Data Management. e-Infrastructure Reflection Group (2009)
Foster, I., Kesselman, C., Tsudik, G., Tuecke, S.: A security architecture for computational Grids. In: Proc. 5th ACM Conference on Computer and Communications Security Conference, pp. 83–92 (1998)
Hedges, M., Blanke, T., Hasan, A.: Rule-based curation and preservation of data: a data Grid approach using iRODS. In: Future Generation Computer Systems-the International Journal of Grid Computing-Theory Methods and Applications, vol. 25(4), pp. 446–452. 3rd IEEE International Conference on e-Science and Grid Computing, Bangalore, India, 10–13 Dec 2007. doi:10.1016/j.future.2008.10.003 (2009)
Hertel, C.R.: Implementing CIFS: the Common Internet File System. Prentice Hall PTR. http://www.ubiqx.org/cifs/index.html (2003)
Klous, S., Frey, J., Son, S.C., Thain, D., Roy, A., Livny, M., van den Brand, J.: Transparent access to Grid resources for user software. Concurr. Comput.: Practice and Experience 18(7), 787–801. doi:10.1002/cpe.961 (2006)
Knopper, S., Koning, R., Roodhart, J., Grosso, P., de Laat, C.: Amsterdam cinegrid exchange—a distributed high-quality digital media solution. SNE Technical Report SNE-UVA-2009-01. Available at http://www.science.uva.nl/research/sne/reports/ (2009)
Koblitz, B., Santos, N., Pose, V.: The AMGA metadata service. Journal of Grid Computing 6(1), 61–76. doi:10.1007/s10723-007-9084-6 (2008)
Laure, E., Gr, C., Fisher, S., Frohner, A., Kunszt, P., et al.: Programming the Grid with glite. In: Computational Methods in Science and Technology, vol. 12, pp. 33–45. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.93.1312 (2006)
Mandrichenko, I., Allcock, W., Perelmutov, T.: GridFTP v2 Protocol Description. GGF Document Series GFD.47. Also available as http://www.ogf.org/documents/GFD.47.pdf (2005)
Oinn, T., Addis, M., Ferris, J., Marvin, D., Senger, M., Greenwood, M., Carver, T., Glover, K., Pocock, M.R., Wipat, A., Li, P.: Taverna: a tool for the composition and enactment of bioinformatics workflows. Bioinformatics 20(17), 3045–3054. doi:10.1093/bioinformatics/bth361 (2004)
Saklikar, S., Saha, S.: Next steps for security assertion markup language (SAML). In: SWS’07: Proceedings of the 2007 ACM Workshop on Secure Web Services, pp. 52–65. Assoc Computing Machinery, 1515 Broadway, New York, NY 10036-9998 USA. ACM Workshop on Secure Web Services, Fairfax, VA, 02 NOV 2007 (2007)
Satran, J., Meth, K., Sapuntzakis, C., Chadalapaka, M., Zeidner, E.: Internet Small Computer Systems Interface (iSCSI). RFC 3720 (Proposed Standard). http://www.ietf.org/rfc/rfc3720.txt. Updated by RFCs 3980, 4850, 5048 (2004)
Schulz, G.: MAID 2.0: Energy Savings without Performance Compromises. http://www.storageio.com/Reports/StorageIO_WP_Jan02_2008.pdf
Sim, A., Shoshani, A., Badino, P., Barring, O., Baud, J., Corso, E., Witt, S.D., Donno, F., Gu, J., Haddox-Schatz, M., Hess, B., Jensen, J., Kowalski, A., Litmaath, M., Magnoni, L., Perelmutov, T., Petravick, D., Watson, C.: The storage resource manager interface specification version 2.2. Also available as http://www.ogf.org/documents/GFD.129.pdf. GGF Document Series GFD.129 (2008)
Thain, D., Livny, M.: Parrot: Transparent user-level middleware for data-intensive computing. In: In Workshop on Adaptive Grid Middleware. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.9.8435 (2003)
Whitehead, J.E., Wiggins, M.: Webdav: IETF standard for collaborative authoring on the web. IEEE Internet Computing 2(5), 34–40. doi:10.1109/4236.722228 (1998)
Witzig, C.: Shibboleth Interoperability Through a Short Lived Credential Service. EGEE-II. Report EGEE-II-MJRA1.4-770102-v0.96.doc (2006)
Zhang, S., Coddington, P., Wendelborn, A.: Davis: a generic interface for SRB and iRODS. DHPC Technical Report DHPC-197 (2009)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Coghlan, B., Walsh, J., Childs, S. et al. The Back-End of a Two-Layer Model for a Federated National Datastore for Academic Research VOs that Integrates EGEE Data Management. J Grid Computing 8, 341–364 (2010). https://doi.org/10.1007/s10723-010-9149-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10723-010-9149-9