ABSTRACT
In this paper, we introduce Apache Airavata, a software framework to compose, manage, execute, and monitor distributed applications and workflows on computational resources ranging from local resources to computational grids and clouds. Airavata builds on general concepts of service-oriented computing, distributed messaging, and workflow composition and orchestration. This paper discusses the architecture of Airavata and its modules, and illustrates how the software can be used as individual components or as an integrated solution to build science gateways or general-purpose distributed application and workflow management systems.
- Apache. Tika. http://tika.apache.org/.Google Scholar
- M. Armbrust, A. Fox, R. Griffith, A. Joseph, R. Katz, A. Konwinski, G. Lee, D. Patterson, A. Rabkin, I. Stoica, et al. Above the clouds: A berkeley view of cloud computing. EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2009--28, 2009.Google Scholar
- ASF. Apache software foundation. http://incubator.apache.org/airavata/.Google Scholar
- D. Bernholdt, S. Bharathi, D. Brown, K. Chanchio, M. Chen, A. Chervenak, L. Cinquini, B. Drach, I. Foster, P. Fox, et al. The earth system grid: Supporting the next generation of climate modeling research. Proceedings of the IEEE, 93(3):485--495, 2005.Google ScholarCross Ref
- D. Borthakur. The hadoop distributed file system: Architecture and design. Hadoop Project Website, 2007.Google Scholar
- K. Brewster, D. Weber, K. Thomas, K. Droegemeier, Y. Wang, M. Xue, S. Marru, D. Gannon, J. Alameda, B. Jewett, et al. Use of the lead portal for on-demand severe weather prediction. In Sixth Conference on Artificial Intelligence Applications to Environmental Science, 88th Annual Meeting of the American Meteorological Society, New Orleans, 2008.Google Scholar
- M. Christie and S. Marru. The lead portal: a teragrid gateway and application service architecture: Research articles. Concurr. Comput. : Pract. Exper., April 2007. Google ScholarDigital Library
- R. Clark, S. Marru, M. Christie, D. Gannon, B. Illston, K. Droegemeier, and T. Baltzer. The lead-wxchallenge pilot project: enabling the community. In 24th Conference on IIPS, 2008.Google Scholar
- F. Curbera, F. Leymann, T. Storey, D. Ferguson, and S. Weerawarana. Web services platform architecture: SOAP, WSDL, WS-policy, WS-addressing, WS-BPEL, WS-reliable messaging and more. Prentice Hall PTR, 2005. Google ScholarDigital Library
- E. Deelman, J. Blythe, Y. Gil, C. Kesselman, G. Mehta, S. Patil, M. Su, K. Vahi, and M. Livny. Pegasus: Mapping scientific workflows onto the grid. In Grid Computing, pages 131--140. Springer, 2004.Google Scholar
- B. Demeler. UltraScan: a comprehensive data analysis software package for analytical ultracentrifugation experiments. Analytical Ultracentrifugation: Techniques And Methods, pages 210--229, 2005.Google Scholar
- R. Dooley, K. Milfeld, C. Guiang, S. Pamidighantam, and G. Allen. From proposal to production: Lessons learned developing the computational chemistry grid cyberinfrastructure. Journal of Grid Computing, 2006.Google ScholarCross Ref
- Extreme. Computing lab. http://extreme.indiana.edu.Google Scholar
- I. Foster, C. Kesselman, and S. Tuecke. The anatomy of the grid: Enabling scalable virtual organizations. International Journal of High Performance Computing Applications, 2001. Google ScholarDigital Library
- D. Gannon, B. Plale, S. Marru, G. Kandaswamy, Y. Simmhan, and S. Shirasuna. Dynamic, adaptive workflows for mesoscale meteorology. Workflows for e-Science, pages 126--142, 2007.Google Scholar
- J. Ghosh, N. Singh, Y. Fan, S. Marru, K. Vanomesslaeghe, and S. Pamidighantam. Molecular Parameter Optimization Gateway (ParamChem). In Proceedings of the 2011 TeraGrid Conference. ACM, 2011. Google ScholarDigital Library
- C. Goble and D. De Roure. myexperiment: social networking for workflow-using e-scientists. In Proceedings of the 2nd workshop on Workflows in support of large-scale science, pages 1--2. ACM, 2007. Google ScholarDigital Library
- T. Gunarathne, C. Herath, E. Chinthaka, and S. Marru. Experience with adapting a ws-bpel runtime for escience workflows. In Proceedings of the 5th Grid Computing Environments Workshop, pages 7:1--7:10. ACM, 2009. Google ScholarDigital Library
- Y. Huang, A. Slominski, C. Herath, and D. Gannon. Ws-messenger: A web services-based messaging system for service-oriented grid computing. In Cluster Computing and the Grid, 2006. CCGRID 06. Sixth IEEE International Symposium on, volume 1, pages 8--pp. IEEE, 2006. Google ScholarDigital Library
- Java. Content repository. http://jcp.org/aboutJava/communityprocess/review/jsr170/.Google Scholar
- R. Jayasinghe, D. Gamage, and S. Perera. Towards improved data dissemination of publish-subscribe systems. In 2010 IEEE International Conference on Web Services, pages 520--525. IEEE, 2010. Google ScholarDigital Library
- G. Klimeck, M. McLennan, S. Brophy, G. Adams III, and M. Lundstrom. nanohub. org: Advancing education and research in nanotechnology. | Computing in Science & Engineering, 2008. Google ScholarDigital Library
- B. Lud scher, I. Altintas, C. Berkley, D. Higgins, E. Jaeger, M. Jones, E. Lee, J. Tao, and Y. Zhao. Scientific workflow management and the kepler system. Concurrency and Computation: Practice and Experience, 2006.Google Scholar
- C. Mattmann, D. Crichton, N. Medvidovic, and S. Hughes. A software architecture-based framework for highly distributed and data intensive scientific applications. In Proceedings of the 28th international conference on Software engineering, pages 721--730. ACM, 2006. Google ScholarDigital Library
- M. Miller, W. Pfeiffer, and T. Schwartz. Creating the cipres science gateway for inference of large phylogenetic trees. In Gateway Computing Environments Workshop (GCE), 2010, pages 1--8. IEEE, 2010.Google ScholarCross Ref
- T. Oinn, M. Addis, J. Ferris, D. Marvin, M. Senger, M. Greenwood, T. Carver, K. Glover, M. Pocock, A. Wipat, et al. Taverna: a tool for the composition and enactment of bioinformatics workflows. Bioinformatics, 20(17):3045, 2004. Google ScholarDigital Library
- S. Perera, S. Marru, and C. Herath. Workflow Infrastructure for Multi-scale Science Gateways. In TeraGrid Conference, June, 2008.Google Scholar
- M. Pierce, S. Marru, R. Singh, A. Kulshrestha, and K. Muthuraman. Open grid computing environments: advanced gateway support activities. In Proceedings of the 2010 TeraGrid Conference. ACM, 2010. Google ScholarDigital Library
- M. Pierce, S. Marru, W. Wu, G. Kandaswami, G. von Laszewski, R. Dooley, M. Dahan, N. Wilkins-Diehr, and M. Thomas. Open grid computing environments. In Proceedings of the Fourth Annual TeraGrid Conference. Citeseer, 2009. Google ScholarDigital Library
- J. Saltz, S. Oster, S. Hastings, S. Langella, T. Kurc, W. Sanchez, M. Kher, A. Manisundaram, K. Shanbhag, and P. Covitz. cagrid: design and implementation of the core architecture of the cancer biomedical informatics grid. Bioinformatics, 2006. Google ScholarDigital Library
- A. Slominski. Adapting bpel to scientific workflows. Workflows for e-Science, pages 208--226, 2007.Google Scholar
- P. Srinath, H. Chathura, E. Jaliya, C. Eran, R. Ajith, J. Deepal, W. Sanjiva, and D. Glen. Axis2, middleware for next generation web services. In Web Services, 2006. ICWS'06. International Conference on, pages 833--840, 2006. Google ScholarDigital Library
- I. Taylor, M. Shields, I. Wang, and A. Harrison. The triana workflow environment: Architecture and applications. Workflows for e-Science, pages 320--339, 2007.Google Scholar
- S. Vinoski. Chain of responsibility. Internet Computing, IEEE, 6(6):80--83, 2002. Google ScholarDigital Library
- N. Wilkins-Diehr, D. Gannon, G. Klimeck, S. Oster, and S. Pamidighantam. Teragrid science gateways and their impact on science. Computer, 41(11):32--41, 2008. Google ScholarDigital Library
- Y. Yang, J. Choi, C. Herath, S. Marru, and S. Kim. Biovlab: Bioinformatics data analysis using cloud computing and graphical workflow composers. Cloud Computing and Software Services, page 309, 2010.Google Scholar
Index Terms
- Apache airavata: a framework for distributed applications and computational workflows
Recommendations
Apache Airavata as a Laboratory: Architecture and Case Study for Component-Based Gateway Middleware
SCREAM '15: Proceedings of the 1st Workshop on The Science of Cyberinfrastructure: Research, Experience, Applications and ModelsScience gateways are more than user interfaces to computational grids and clouds. Gateways are middleware in their own right, providing flexible, lightweight federations of heterogenous collections of computing resources (such as campus clusters, ...
Supporting Science Gateways Using Apache Airavata and SciGaP Services
PEARC '18: Proceedings of the Practice and Experience on Advanced Research ComputingThe Science Gateways Platform as a service (SciGaP.org) project provides a rapid development and stable hosting platform for a wide range of science gateways that focus on software as a service. Based on the open source Apache Airavata project, SciGaP ...
Managing Faults for Distributed Workflows over Grids
Grid applications composed of multiple, distributed jobs are common areas for applying Web-scale workflows. Workflows over grid infrastructures are inherently complicated due to the need to both functionally assure the entire process and coordinate the ...
Comments