Abstract
ETL systems continue to suffer from a lack of a simple and rigorous approach for modelling and validation of populating processes for data warehouses. In spite of the efforts that researchers have been done, there is not yet a convinced and simply approach for modelling (conceptual and logical views), validating and testing ETL processes before conduct them to implementation and roll out. In this paper we explored the use of BPMN for ETL conceptual modelling and validation. Basically, we intended to provide a set of BPMN meta-models (patterns) especially designed to map standard data warehousing ETL processes and testing them before constructing the final system. We think this is a practical approach to reduce significantly the inadequacy of an ETL system in its real world scenario. A surrogate key pipelining process was selected as a case study to demonstrate the use and utility of the ETL modelling approach presented here.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Akkaoui, Z., Zimanyi, E.: Defining ETL worfklows using BPMN and BPEL. In: Proceedings of the ACM Twelfth International Workshop on Data Warehousing and OLAP, Hong Kong, China (2009)
Akkaoui, Z., Zimà nyi, E., Mazón, J., Trujillo, J.: A model-driven framework for ETL process development. In: Proceedings of the ACM 14th International Workshop on Data Warehousing and OLAP, pp. 45–52. ACM, Glasgow (2011)
Bizagi: Bizagi BPM Suite workflow patterns, Bizagi (2012), http://www.ibm.com/developerworks/library/specification/ws-bpel/ (accessed on June 17, 2012)
BPEL, Business Process Execution Language for Web Services (2012), http://www.bizagi.com/docs/Workflow%20Patterns%20using%20BizAgi%20Process%20Modeler.pdf (accessed on June 15, 2012)
BPMN, Object Management Group Business Process Model and Notation (2012), http://www.bpmn.org/ (accessed on June 15, 2012)
Dayal, U., Wilkinson, K., Simitsis, A., Castellanos, M.: Business Processes Meet Operational Business Intelligence (2010)
Kimball, R., Caserta, J.: The Data Warehouse ETL Toolkit - Pratical Techniques for Extracting, Cleaning, Conforming, and Delivering Data. Wiley Publishing, Inc. (2004)
Leymann, F.: BPEL vs. BPMN 2.0: Should You Care? In: Mendling, J., Weidlich, M., Weske, M. (eds.) BPMN 2010. LNBIP, vol. 67, pp. 8–13. Springer, Heidelberg (2010)
Muñoz, L., Mazón, J.-N., Pardillo, J., Trujillo, J.: Modelling ETL Processes of Data Warehouses with UML Activity Diagrams. In: Meersman, R., Tari, Z., Herrero, P. (eds.) OTM 2008 Workshops. LNCS, vol. 5333, pp. 44–53. Springer, Heidelberg (2008)
Scacchi, W.: Process Models in Software Engineering. In: Marciniak, J.J. (ed.) Encyclopedia of Software Engineering (2001)
Silver, B.: BPMN Method and Style: A levels-based methodology for BPM process modeling and improvement using BPMN 2.0. Cody-Cassidy Press (2009)
Simitsis, A., Vassiliadis, P.: A Methodology for the Conceptual Modeling of ETL Processes. In: Eder, J., Missikoff, M. (eds.) CAiSE 2003. LNCS, vol. 2681, pp. 305–316. Springer, Heidelberg (2003)
Sommerville, I.: Software Engineering, 8th edn. Pearson Education, Addison Wesley (2006)
Trujillo, J., Luján-Mora, S.: A UML Based Approach for Modeling ETL Processes in Data Warehouses. In: Song, I.-Y., Liddle, S.W., Ling, T.-W., Scheuermann, P. (eds.) ER 2003. LNCS, vol. 2813, pp. 307–320. Springer, Heidelberg (2003)
Weske, M., van der Aalst, W., Verbeek, H.: Advances in Business Process Management. Data & Knowledge Engineering 50(1) (2004)
Wilkinson, K., Simitsis, A., Castellanos, M., Dayal, U.: Leveraging Business Process Models for ETL Design. In: Parsons, J., Saeki, M., Shoval, P., Woo, C., Wand, Y. (eds.) ER 2010. LNCS, vol. 6412, pp. 15–30. Springer, Heidelberg (2010)
Vassiliadis, P., Simitsis, A., Skiadopoulos, S.: On the Logical Modeling of ETL Processes. In: Pidduck, A.B., Mylopoulos, J., Woo, C.C., Ozsu, M.T. (eds.) CAiSE 2002. LNCS, vol. 2348, pp. 782–786. Springer, Heidelberg (2002)
Vassiliadis, P., Simitsis, A., Skiadopoulos, S.: Modeling ETL activities as graphs. In: Proceedings of the 4th Intl. Workshop on Design and Management of Data Warehouses 2002, DMDW 2002, Toronto, Canada, pp. 52–61 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Oliveira, B., Belo, O. (2012). BPMN Patterns for ETL Conceptual Modelling and Validation. In: Chen, L., Felfernig, A., Liu, J., RaÅ›, Z.W. (eds) Foundations of Intelligent Systems. ISMIS 2012. Lecture Notes in Computer Science(), vol 7661. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34624-8_50
Download citation
DOI: https://doi.org/10.1007/978-3-642-34624-8_50
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34623-1
Online ISBN: 978-3-642-34624-8
eBook Packages: Computer ScienceComputer Science (R0)