Abstract
We present Pyrus, a domain-specific online modeling environment for building graphical processes for data analysis, machine learning and artificial intelligence. Pyrus aims at bridging the gap between de facto (often Python-based) standards as established by the Jupyter platform, and the tradition to model data analysis workflows in a dataflow-driven fashion. Technically, Pyrus integrates established online IDEs like Jupyter and allows users to graphically combine available functional components to dataflow-oriented workflows in a collaborative fashion without writing a single line of code. Following a controlflow/dataflow conversion and compilation, the execution is then delegated to the underlying platforms. Both the inputs to a modeled workflow and the results of its execution can be specified and viewed without leaving Pyrus which supports a seamless cooperation between data science experts and programmers. The paper illustrates the fundamental concepts, the employed domain-specific language, and, in particular, the role of the integrated IDE’s in an example-driven fashion which can be reproduced in the available online modeling environment.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
Project: https://gitlab.com/scce/ml-process, Demo: https://ls5vs023.cs.tu-dortmund.de, (For reviewing only) Login:isola21 Password:isola21.
- 2.
- 3.
- 4.
Demo: https://ls5vs026.cs.tu-dortmund.de/. (For reviewing only) Login:isola21 Password:isola21.
- 5.
References
Eclipse Che. https://www.eclipse.org/che
Gitpod. https://www.gitpod.io
Jupyter. https://jupyter.org
rpy2 - R in Python. https://rpy2.github.io/
Baldini, I., et al.: Serverless computing: current trends and open problems. In: Chaudhary, S., Somani, G., Buyya, R. (eds.) Research Advances in Cloud Computing, pp. 1–20. Springer, Singapore (2017). https://doi.org/10.1007/978-981-10-5026-8_1
Bholowalia, P., Kumar, A.: EBK-means: a clustering technique based on elbow method and k-means in WSN. Int. J. Comput. Appl. 105(9) (2014)
Boßelmann, S., et al.: DIME: a programming-less modeling environment for web applications. In: Margaria, T., Steffen, B. (eds.) ISoLA 2016. LNCS, vol. 9953, pp. 809–832. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-47169-3_60
Chisholm, A.: Exploring Data with Rapidminer. Packt Publishing Ltd, Birmingham (2013)
Damas, L., Milner, R.: Principal type-schemes for functional programs. In: Proceedings of the 9th ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, pp. 207–212 (1982)
Eker, J., et al.: Taming heterogeneity-the ptolemy approach. Proc. IEEE 91(1), 127–144 (2003)
Fowler, M., Parsons, R.: Domain-Specific Languages. Addison-Wesley/ACM Press (2011)
Khalajzadeh, H., Abdelrazek, M., Grundy, J., Hosking, J.G., He, Q.: Survey and analysis of current end-user data analytics tool support. IEEE Trans. Big Data (2019)
Khalajzadeh, H., Simmons, A.J., Abdelrazek, M., Grundy, J., Hosking, J.G., He, Q.: Visual languages for supporting big data analytics development. In: ENASE, pp. 15–26 (2020)
Kim, C.H., Grundy, J., Hosking, J.: A suite of visual languages for model-driven development of statistical surveys and services. J. Vis. Lang. Comput. 26, 99–125 (2015)
Kopetzki, D.: Generation of domain-specific language-to-language transformation languages (2019)
Lamprecht, A.L.: User-Level Workflow Design. LNCS, vol. 8311. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-45389-2
Lamprecht, A.L., Margaria, T., Neubauer, J.: On the use of XMDD in software development education. In: 2015 IEEE 39th Annual Computer Software and Applications Conference, vol. 2, pp. 835–844. IEEE (2015)
Lamprecht, A.-L., Margaria, T., Steffen, B.: Seven variations of an alignment workflow - an illustration of agile process design and management in Bio-jETI. In: Măndoiu, I., Sunderraman, R., Zelikovsky, A. (eds.) ISBRA 2008. LNCS, vol. 4983, pp. 445–456. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-79450-9_42
Lamprecht, A.L., Margaria, T., Steffen, B.: Bio-jETI: a framework for semantics-based service composition. BMC Bioinformatics 10(10), 1–19 (2009)
Li, L., Hosking, J., Grundy, J.: EML: a tree overlay-based visual language for business process modelling, pp. 131–137 (2007)
Li, L., Hosking, J., Grundy, J.: MaramaEML: an integrated multi-view business process modelling environment with tree-overlays, zoomable interfaces and code generation. In: 2008 23rd IEEE/ACM International Conference on Automated Software Engineering, pp. 477–478 (2008)
Lloyd, S.: Least squares quantization in PCM. IEEE Trans. Inf. Theory 28(2), 129–137 (1982)
Ludäscher, B., et al.: Scientific workflow management and the Kepler system. Concurr. Comput. Pract. Exp. 18(10), 1039–1065 (2006)
Lybecait, M., Kopetzki, D., Zweihoff, P., Fuhge, A., Naujokat, S., Steffen, B.: A tutorial introduction to graphical modeling and metamodeling with CINCO. In: Margaria, T., Steffen, B. (eds.) ISoLA 2018. LNCS, vol. 11244, pp. 519–538. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-03418-4_31
Margaria, T., Kubczak, C., Steffen, B.: The XMDD approach to the semantic web services challenge. In: Blake, B., Cabral, L., König-Ries, B., Küster, U., Martin, D. (eds.) Semantic Web Services, pp. 233–248. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-28735-0_15
Margaria, T., Nagel, R., Steffen, B.: jETI: a tool for remote tool integration. In: Halbwachs, N., Zuck, L.D. (eds.) TACAS 2005. LNCS, vol. 3440, pp. 557–562. Springer, Heidelberg (2005). https://doi.org/10.1007/978-3-540-31980-1_38
Margaria, T., Steffen, B.: Business process modeling in the jABC: the one-thing approach. In: Handbook of Research on Business Process Modeling, pp. 1–26. IGI Global (2009)
Margaria, T., Steffen, B.: Continuous model-driven engineering. Computer 42(10), 106–109 (2009)
Margaria, T., Steffen, B.: Service-orientation: conquering complexity with XMDD. In: Hinchey, M., Coyle, L. (eds.) Conquering Complexity, pp. 217–236. Springer, London (2012). https://doi.org/10.1007/978-1-4471-2297-5_10
Margaria, T., Steffen, B.: eXtreme model-driven development (XMDD) technologies as a hands-on approach to software development without coding. In: Encyclopedia of Education and Information Technologies, pp. 732–750 (2020)
Margaria, T., Steffen, B., Reitenspieß, M.: Service-oriented design: the roots. In: Benatallah, B., Casati, F., Traverso, P. (eds.) ICSOC 2005. LNCS, vol. 3826, pp. 450–464. Springer, Heidelberg (2005). https://doi.org/10.1007/11596141_34
Mernik, M., Heering, J., Sloane, A.M.: When and how to develop domain-specific languages. ACM Comput. Surv. 37(4), 316–344 (2005)
Missier, P., et al.: Taverna, reloaded. In: Gertz, M., Ludäscher, B. (eds.) SSDBM 2010. LNCS, vol. 6187, pp. 471–481. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-13818-8_33
Naujokat, S., Lybecait, M., Kopetzki, D., Steffen, B.: Cinco: a simplicity-driven approach to full generation of domain-specific graphical modeling tools. Int. J. Softw. Tools Technol. Transfer 20(3), 327–354 (2018)
Naur, P., Randell, B. (eds.): Software Engineering: Report of a Conference Sponsored by the NATO Science Committee, Garmisch, Germany, 7–11 October 1968. Scientific Affairs Division, NATO, Brussels 39 Belgium (1969)
Nelson, B.J.: Remote procedure call (1982)
O’Donovan, P., Leahy, K., Bruton, K., O’Sullivan, D.T.J.: An industrial big data pipeline for data-driven analytics maintenance applications in large-scale smart manufacturing facilities. J. Big Data 2(1), 1–26 (2015). https://doi.org/10.1186/s40537-015-0034-z
Pierce, B.C., Benjamin, C.: Types and Programming Languages. MIT Press, Cambridge (2002)
Plotkin, G.D.: A structural approach to operational semantics. Aarhus university (1981)
Reisig, W.: Petri Nets: An Introduction, vol. 4. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-69968-9
Robinson, J.A.: A machine-oriented logic based on the resolution principle. J. ACM (JACM) 12(1), 23–41 (1965)
Rumbaugh, J., Jacobson, I., Booch, G.: The unified modeling language. Reference manual (1999)
Saay, S., Margaria, T.: XMDD as key enabling technology for integration of large scale elearning based on NRENs. In: 2020 IEEE 20th International Conference on Advanced Learning Technologies (ICALT), pp. 45–46. IEEE (2020)
Spinellis, D.: Version control systems. IEEE Softw. 22(5), 108–109 (2005)
Standard, O.: Web services business process execution language version 2.0 (2007)
Steffen, B., Gossen, F., Naujokat, S., Margaria, T.: Language-driven engineering: from general-purpose to purpose-specific languages. In: Steffen, B., Woeginger, G. (eds.) Computing and Software Science. LNCS, vol. 10000, pp. 311–344. Springer, Cham (2019). https://doi.org/10.1007/978-3-319-91908-9_17
Steffen, B., Margaria, T., Braun, V.: The electronic tool integration platform: concepts and design. Int. J. Softw. Tools Technol. Transfer 1(1–2), 9–30 (1997)
Steffen, B., Margaria, T., Nagel, R., Jörges, S., Kubczak, C.: Model-driven development with the jABC. In: Bin, E., Ziv, A., Ur, S. (eds.) HVC 2006. LNCS, vol. 4383, pp. 92–108. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-70889-6_7
Steinberg, D., Budinsky, F., Merks, E., Paternostro, M.: EMF: Eclipse Modeling Framework. Pearson Education, London (2008)
Tarr, P., Ossher, H., Harrison, W., Sutton, S.M.: N degrees of separation: multi-dimensional separation of concerns. In: Proceedings of the 1999 International Conference on Software Engineering (IEEE Cat. No. 99CB37002), pp. 107–119. IEEE (1999)
White, S.A.: Introduction to BPMN. IBM Cooperation 2 (2004)
Zweihoff, P., Naujokat, S., Steffen, B.: Pyro: generating domain-specific collaborative online modeling environments. In: Hähnle, R., van der Aalst, W. (eds.) FASE 2019. LNCS, vol. 11424, pp. 101–115. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-16722-6_6
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Zweihoff, P., Steffen, B. (2021). Pyrus: An Online Modeling Environment for No-Code Data-Analytics Service Composition. In: Margaria, T., Steffen, B. (eds) Leveraging Applications of Formal Methods, Verification and Validation. ISoLA 2021. Lecture Notes in Computer Science(), vol 13036. Springer, Cham. https://doi.org/10.1007/978-3-030-89159-6_2
Download citation
DOI: https://doi.org/10.1007/978-3-030-89159-6_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-89158-9
Online ISBN: 978-3-030-89159-6
eBook Packages: Computer ScienceComputer Science (R0)