Standardizing Process-Data Exploitation by Means of a Process-Instance Metamodel

Cancela, Antonio; Quintero, Antonia M. Reina; Gómez-López, María Teresa; García-García, Alejandro

doi:10.1007/978-3-030-46633-6_3

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 379))

Included in the following conference series:

494 Accesses

Abstract

The analysis of data produced by enterprises during business-process executions is crucial in ascertaining how these processes work and how they can be optimized, despite heterogeneous nature of these data structures. This data may also be used for various types of analysis, such as reasoning, process querying and process mining, which consume different data formats. However, all these structures and formats share a common ground: the business-process model and its instantiation are in each of their kernels. In this paper, we propose the use of a Business-Process Instance Metamodel, which serves as a common interface to perform an independent exploitation of data from the applications that produce the data and those which consume the data. A tool has been implemented as a proof of concept to illustrate the ease of matching the data with the proposed metamodel.

You have full access to this open access chapter, Download conference paper PDF

Enhancing Process Models to Improve Business Performance: A Methodology and Case Studies

Purpose: Identifying the Right Use Cases

Process Mining in a Nutshell

Keywords

1 Introduction

Companies today produce a great amount of data in a daily basis that accurately reflects their business processes. This data holds special interest for the study and optimization of these business processes. However, increasingly, the data comes from heterogeneous sources with different formats and structures (relational databases, NoSQL, APIs, data warehouses...). This causes the analysis and exploitation of data to become highly time-consuming, since many solutions are ad-hoc solutions, and, as a consequence, they have to be adapted depending on the techniques to be applied. This data-preparation process constitutes the most significant barrier to be improved and one of the highest time-consuming tasks in data-analysis projects [38].

In this paper, we propose the use of a Business-Process Instance Metamodel as an intermediate layer to specify the relation between the domain-specific data produced and its meaning in a business process, thereby facilitating how it can be exploited by business analysis techniques. Our research goal is the simplification of the data analysis by making independent the structures of data production from data consumption. The approach is based on the definition of mappings between data sources and the business process concepts specified in the Business-Process Instance Metamodel. The benefits obtained by using an intermediate metamodel include the reduction of the analysis time and the exploitation of data in a more appropriate way [24]. In fact, the use of the intermediate metamodel is a benefit itself, since it provides a standard way to access business-process data and also improves the interoperability among organizations.

The paper is organized as follows: Firstly, Sect. 2 gives a general overview of the approach and introduces how the proposed Business Instance Metamodel may be employed in different contexts. Secondly, Sect. 3 describes a case study that has been used to test the approach. Thirdly, the metamodel is detailed in Sect. 4. Section 5 presents the tool implemented as a proof of concept. The tool helps to define the matching between an Oracle\(^{TM}\) database and the Process Instance Metamodel. Section 6 then surveys other existing approaches that exploit data in different contexts. Finally, conclusions are drawn and further work is outlined in Sect. 7.

2 Approach Overview and Contributions

Business process data exploitation depends highly on the technology that supports the business data storage as well as how the data is structured. As a consequence, no standard approach exists that can exploit any type of source, and it is therefore necessary to develop ad-hoc data analysis mechanisms adapted to both data technology and model. Thus, for example, the necessary data preparation to generate an event log to be employed by a process-mining tool is totally different depending on whether data is stored in a relational database or whether it comes from a cloud data source or a data warehouse. Moreover, this generation process also depends on the specific data model.

In order to render data exploitation as technology-agnostic regarding its data structure [31], our approach is inspired by the guidelines provided by Model-Driven Architecture in such a way that we propose a Business Process Instance Metamodel that allows us to separate produced data structure from data analysis solutions. In other words, the Business-Process Instance Metamodel can be seen as an intermediate artifact that allows applications that produce business process data to become independent from those applications that consume such data [15]. Figure 1 depicts how the process instance metamodel acts as an interface between data producers and consumers.

The following subsections describe how the metamodel proposed in the paper could be used in different contexts under the previous viewpoints.

2.1 Data Production Viewpoint

This viewpoint represents those contexts in which business-process data is produced. This data is mapped into the metamodel in order to be analysed. The left-hand side of Fig. 1 depicts three different contexts of use related to formatting the produced data: APIs located in cloud systems; data warehouses; and relational databases.

Regarding the context of cloud systems and APIs, it should be borne in mind that companies use more and more cloud data sources which rely on complex structures such as JSON, whose objects might have different properties. This data usually complements the specific company data with data from payments, geolocation, etc. Furthermore, APIs can be used to cross-reference information (such as weather and macroeconomics) and cloud systems usually perform some computation over data, which results in new data sources.

Data warehouse based applications are an especially important context of use, above all when the business process data produced is used as input for process-mining and process-discovery techniques, since data warehouses commonly store historical information of the companies as well as many details regarding the timing of that information. Note that process-mining techniques need historical information in order to rebuild a consistent process model.

Finally, a common context of use from the data production viewpoint is related to applications that use relational databases. In fact, the case study introduced in this paper is based on a relational database. Relational databases also provide one of the most widely used scenarios for process querying, as detailed in Sect. 6.

2.2 Data Consumption Viewpoint

This viewpoint represents those contexts in which data obtained from business processes is exploited. The right-hand side of Fig. 1 depicts three different contexts of use related to data exploitation: reasoning by using defined ontologies; process queries for the creation of dashboards that improve decision-making; and event log generation for process mining.

In the reasoning context of use, applications use semantization techniques, which are based on an ontology as a formal specification. Many business process semantization approaches link concepts from domain ontologies with business process elements that are grounded in a business process ontology [19]. Thus, reasoning is used to derive facts from the ontology, which are not expressed explicitly. The elements of this business process ontology can be mapped to the concepts defined in our metamodel, since ontologies and metamodels are closely related [29].

Process queries improve decision-making, for example, by creating dashboards to exploit the business process data [35]. As far as our metamodel covers information related to process definitions, process instances, activities and activity instances and their attributes, we can ask for durations, sequences of activities, frequencies of executions, and can identify bottleneck activities, study deviated instances of activities/processes, etc. As a consequence, this information can be used to infer Key Performance Indicators which facilitate the monitoring of the process [32].

Finally, in the context of event logs for process-mining techniques, applications may not be able to produce event logs or may fail to produce them in the correct format [2]. Obtaining logs from the instances of our metamodel implies listing the activity instances ordered in terms of execution time, grouping them by process instance, and producing files with XES-formatted data. Note that the process to perform this transformation must be adapted to the data source in order to obtain the correct data output for processing. Moreover, it must be considered that the company systems work with various data sources at the same time. From the consumer viewpoint, all these details must be transparent by means of an appropriate transformation.

3 Case Study

This section presents the case study carried out to test the validity of the proposal. Data has been obtained from the execution of a business process within a prominent aerospace company. Although the company has no Business Process Management System, it does have a proprietary system that is supported by a relational database. The core business of the company consists of the assembly of aircraft and their modules. An aircraft undergoes a huge process of engineering, components designing, components construction that must be followed to assemble the final product to be ready to fly. When a new aircraft is about to be released, it must be tested several times. Figure 2 depicts the testing process of the aircraft modules.

When the aircraft testing process starts, the New aircraft order arrives activity begins, and, as a consequence, its data is introduced in the AIRCRAFT table (Fig. 3). Bear in mind that an aircraft passes through different stations, and that in each station, the aircraft modules must pass a set of tests that are composed of different sections. Thereby the execution of each test brings about the execution of every section in that test.

The Configuration of the test sections that the aircraft must pass activity consists of scheduling the different test sections that must be executed on the modules of each aircraft. This information is stored in the TEST_SECTIONS table. After the test configuration, the aircraft is driven to the first station to start the test execution (Move to the next station activity) and the set of tests are launched (Launch next test activity).

Every time a test is launched, a row is inserted in the TEST_EXECUTION table and another row is inserted in TEST_SECTION_EXECUTIONS (one for each test section executed). Furthermore, if the test fails, usually due to some kind of incidence, the Incidence Registration activity starts and the Troubleshooting subprocess is triggered. As a consequence: first, the original row is modified in order to register both the moment when the test failed and the status of the test after being executed; and second, a new row is inserted in the TEST_INCIDENCES table. Note that if an incidence appears during the execution of a section, it must be solved successfully before the airplane is released. As a consequence, the full test needs to be repeated, regardless of which section the error appeared, since the success of some parts of a test may depend on other parts of the test. Thus, when the Troubleshooting subprocess finishes, then the whole test is relaunched (Relaunch test activity).

Due to the lack of a Business Process Management System, every test execution is stored in detail in the database, whereby information related to aircraft, tests, stations, etc. is held. Thus, each time a new test is launched, the data involved is stored, such as timestamps related to every action, the status of the test, when the test has finished, and which sections were executed. The data model which supports this process is composed of the following tables (Fig. 3):

AIRCRAFT: This stores information about the tested aircraft. As a consequence, a row is inserted in this table each time an airplane is going to be tested. The table stores: the type of the airplane, the model of the airplane, the name, the start date, and the end date scheduled.
TESTS: This stores information about the collection of tests defined in the system: the name of the test, the creation date, and its type.
TEST_SECTIONS: This stores the sections of each test that each airplane should pass and the order in which the sections should be executed.
TEST_EXECUTIONS: This table stores information about a test execution. Thus, a row is inserted in this table each time a test is launched. The table stores: the station in which the test was executed, the time when the test execution started, the time when the test execution ended, and the test status after finishing.
TEST_SECTION_EXECUTIONS: This stores information about the execution of a test section. Note that each test is split into different sections that are in charge of preparing the execution or checking certain variables. The table stores the timestamp when the test section execution started, the timestamp when the test section execution ended, and the final status.
TEST_INCIDENCES: This stores information about the incidences produced during test executions. As a consequence, a row is inserted in this table when an error appears while running a test. The table stores the time when the incidence appeared, the incidence type, the status of the incidence, and the error that caused the incidence.

4 Process Instance Metamodel

The Business Process Instance Metamodel is detailed in Fig. 4. The metamodel has been specified with EMF [37]. Note that it is a very simple model which is mainly centred on the most basic entities related to business process instances together with their attributes. A previous extension of this metamodel was published in [15].

The root of the metamodel is the ProcessEngine metaclass and represents the BPMS or software application that is in charge of process execution. The process engine can be in charge of several processes. The ProcessDefinition metaclass represents the formal definition of the process, that is, what we call the Business Process Model. The attributes are:

id. Key identifier of the process.
name. Name of the process model.
description. Description of the process model.
suspended. This attribute represents whether a process is suspended (temporarily disabled). While it is suspended, the process is not instantiated.

A business process is composed of different activities and the Activity metaclass models these activities. The attributes are:

id. Key identifier of the activity.
name. Name of the activity.
description. Description of the activity.

One business process can be executed many times and the ProcessInstance metaclass models these executions or instances. The attributes are:

id. Key identifier of the process instance.
ended. A flag (Boolean) indicating whether the instance is still running.
suspended. A flag (Boolean) indicating whether the instance is suspended.
startUser. The user who started the instance process.
duration. Time spent on process execution. This information is recovered when the process has ended.
startTime. This represents when the instance process started.
endTime. This represents when the instance process ended.

Finally, the ActivityInstance metaclass represents the execution of an activity and is related to the Activity metaclass (note that an activity may be executed many times) and to the ProcessInstance metaclass (an activity may be executed in the context of different business processes). The attributes are:

id. Key identifier of the activity instance.
startTime. This represents when the instance activity started.
endTime. This represents when the instance activity ended.
duration. Time spent on activity execution. This information is recovered when the activity ends.
cancelled. A flag (Boolean) indicating whether the instance is cancelled.
assignee. The user assigned to the execution of the activity.

Note that this metamodel allows us to exploit business data in different contexts, independently of the storage technology and how the information is structured. We only need to define mappings from the concrete technology to the Process Instance Metamodel. Therefore, the information stored as instances of the Business Process Instance Metamodel may be used to generate event log traces (both in XES or MXML format), to be queried for decision-making or to be semantized thereby enabling the application of reasoning techniques.

4.1 Mapping the Metamodel and the Case Study Models

This section explains how the Process Instance Metamodel is used, from the data production viewpoint, in the context of the case study introduced in Sect. 3.

The Process Definition metaclass is related to the Testing Aircraft Process shown in Fig. 2. Each instance of that process is mapped into the Process Instance metaclass (see Fig. 5). As a consequence, when a new row is inserted into the TEST_EXECUTIONS table, then an instance of the Process Instance metaclass is created. It should be also noted that the startTime and endTime attributes are mapped to the startTime and endTime fields of that table.

Since there are different activities, such as the Launch next test or the Incidence registration activities, there are mappings between the Activity Instance metaclass and different tables (see Fig. 5). Thus, an instance of the Activity Instance metaclass is created each time a new row is inserted into the TEST_SECTION_EXECUTIONS, TEST_INCIDENCES, or TEST_EXECUTIONS tables. The expected startDate of an assembly process of an airplane is stored in the AIRCRAFT table. However, the real start time is represented by the oldest startTime of the TEST_EXECUTION related to a specific idAircraft that indicates the true beginning of the process.

Finally note that, although in this case study every mapping is related to the insertion of a row into a table, this is not the only possible scenario. The mappings of the Activity Instance metaclass could also be related to editions of rows. Thus, for example, the incidenceType field could be mapped to different activities if the various types of incidences lead to different subprocess executions.

5 Proof of Concept Implementation

In order to support our proposal, a proof-of-concept has been implemented to illustrate the mapping process between the business data repository and our Process Instance Metamodel. One of the main benefits of our proposal is that the mapping itself is that which remains after performing the matching between data repositories and the metamodel, instead of the mapped data, as in other approaches [7]. Thus, every new item of data registered in the repository is automatically available in the Process Instance Metamodel, and it is able to perform business process analysis not just after the process execution, but also whilst the execution is happening, which is key in some cases. This provides agility and the opportunity of making decisions during the business process instance. Another considerable benefit of this approach, since it is not tied to any specific data consumption context (process mining, process querying, reasoning over processes...), is the ability to exploit the business process data simultaneously in different contexts. We could use it to generate event logs while visualizing statistic data on a dashboard, as is showed in the demo video recorded using the proof-of-concept tool. This provides versatility to the way business process data can be used by the companies without the necessity of performing specific ad-hoc applications or data transformations for each context or goal. The proof of concept has been developed as a web application and implements a simple dashboard where we can compare visually different instances, cross-check statistics information related to our instances, and watch the evolution of our process data over time. Furthermore, a video demo shows how the tool is able to automatically analyse the structure of the data repository and how the mapping process can be executed in an easy way. Figure 6 shows a screenshot that captures the mapping definition process. The proof-of-concept has been developed as a result of collaboration with a company whose data could never be publicly available. However, the software is available for application to other cases. Moreover, a video demo using the tool has been recorded to facilitate the use of the tool. For any further details regarding the tool, check the website http://www.idea.us.es/portfolio-item/process-data-matching-tool/.

6 Related Work

We will limit the scope of this section to the approaches related to business processes whose focus is on the exploitation of data generated during process execution. The approaches can be classified into: approaches whose goal is the semantization of process data in order to use ontology-based reasoning; approaches whose goal involves the querying of process data to aid in decision-making in business process scenarios; and approaches whose goal is the creation of execution traces that are used as input for process discovery algorithms. Bear in mind that these different scenarios consume data in different formats, and certain conversion and formatting tasks can be tedious and complex since data can be stored in heterogeneous repositories [7]. The following subsections give a general overview of the state-of-the-art of the aforementioned contexts.

6.1 Approaches that Consume Data for Reasoning

This group introduces the incorporation of data ontology in order to support functionalities of a more intelligent nature, such as process reasoning. In general, these approaches augment existing processes with semantic annotations, so that formal reasoning techniques can be applied. There are several techniques of semantization of Business Processes [20]:

The SUPER project [40] formally represents business process concepts by means of a stack of five ontologies and provides a modelling environment for the enrichment of existing processes with semantic annotations.
The SAP AG system [6] integrates semantic descriptions and business process artifacts by linking concepts from an ontology and elements of business process models.
The Prosecco project [28] provides a unified dictionary of business concepts to help with the systems integration and takes into account semantic dependencies between business process models and rule models.
Finally, there is also a group of techniques that could be used for semantization of business processes that are not process specific, for example, since many business process execution environments use REST interfaces, certain techniques for semantization of REST interfaces could be used. However, these kinds of techniques remain out of the scope of this study.

The approach that is most closely related to ours is the SAP AG system in the sense that the domain ontology and the business process model are integrated by means of links; however, that system is focused on semantic sources.

6.2 Approaches that Consume Data for Querying

The approaches in this group query process data to help in decision-making in business process scenarios [35]. There are many different approaches to query process data. According to [33], these approaches can be classified depending on the type of behaviour models they can take as input:

Methods that operate over event logs. This group includes approaches such as CRG [23], eCRG [21], DAPOQ-Lang [27], FPSPARQL [5], and PIQL [30]. The approach most closely related to ours is DAPOQ-Lang because it is built on top of the metamodel proposed in [26]. The main difference is that their metamodel subsumes two different viewpoints, (process, and data), while in our approach the viewpoints are defined with different metamodels, in such a way that we have applied the principle of the separation of concerns.
Methods that operate over process model specifications. This group includes a set of approaches that were originally conceived for querying conceptual models, and, as a consequence, they are also useful for querying process models, and another set of approaches that were originally conceived for querying process model collections. The first subgroup includes approaches such as DMQL [11], GMQL [10], and VMQL [39]. The second subgroup includes approaches such as BPMN-Q [3], BPMN VQL [12], BPSL [22], CRL [13], Descriptive PQL [18], IPM-PQL [9], and PPSL [14]. The approaches most closely related to ours are DMQL and GMQL in the sense that they define a generic metamodel to cover all types of modelling languages. This metamodel can be seen as a way to decouple query languages from modelling languages.
Methods that operate over behaviours encoded in process models. This group includes approaches such as APQL [16], BQL [17], QuBPAL [36], and PQL [34]. All of these approaches are based on the definition of semantic relations between tasks. The most closely related to our approach is that of APQL, in the sense that the proposed language is independent of the notation used to specify process models.
Methods that operate over collections that may include process models and/or event logs. This group includes approaches such as BPQL [25] and NP-QL [4]. This group is the least related to our approach.

6.3 Approaches that Consume Data for the Creation of Execution Traces

There are several approaches that consume data to create execution traces. Thus, in [7], a conversion from a data source in table format to an event log is proposed. The approach is tested by means of two case studies: an SAP system, and a set of CSV files that are the result of exporting a database.

In [8], a framework to extract XES event log information from legacy relational databases is proposed. The extraction is made by defining two ontologies, one that represents the domain of interest and another one that represents event logs. The domain ontology is linked to the legacy data by using the ontology-based data access paradigm (OBDA), and the concepts defined in the event log ontology are mapped into the concepts defined in the domain ontology by means of annotations.

In [1], a framework to unify existing approaches of process discovery from event logs is introduced. The framework is based on event log and process model abstractions, and, as a consequence, it only includes concepts from event log and process viewpoints.

In [26], a metamodel is proposed to query the data from different sources in a standardized way. Thus, the metamodel allows the decoupling of the application of the data analysis techniques. The proposed metamodel includes concepts related to two different viewpoints: process and data. Furthermore, in order to be compatible with the XES metamodel, the proposed metamodel also includes events and cases. Mappings from data sources of three different scenarios (database redo logs, in-table version storage, and SAP-style change tables) to the proposed metamodel are formalized.

Note that all these approaches share at least one of the following two weak points covered by our approach: (1) different data sources are considered, but relational databases and/or tabular formats are taken for granted [7, 26]; and (2) the focus is on the results (event logs) instead of on the means (relations between stored data and events data), which forces the process of mapping to be repeated each time a new log needs to be generated [1, 7, 8].

7 Conclusions and Further Work

Due to the existence of multiple techniques based on Business Process Analysis, this paper introduces the necessity of the utilization of a Business Process Instance Metamodel as a bridge between data sources and data exploitation techniques.

As we have seen, this metamodel provides the first step towards isolating the process data produced and the objective of its analysis. This is especially relevant in scenarios where different types of business process data exploitation are going to be applied and/or scenarios where different data sources with various formats are working together. Thereby, this paper shows how the use of an intermediate metamodel can help to standardize the exploitation of business process data by defining a common infrastructure that may be used in various contexts of business process analytics.

In terms of further work, how to query the metamodel in order to extract the required information in the correct format constitutes the next challenge to tackle. This challenge brings about the extension of the metamodel to encapsulate other existing proposals of consumers and producers, while it maintains the abstraction level to ensure adaptability to any business regardless of its sector or domain knowledge.

Finally, we consider it interesting to enrich the way of defining the matching, by making the tool more flexible and by allowing the building of processes of a more complex nature and the exploitation of more complex data sources.

References

van der Aalst, W.M.P.: Process discovery from event data: relating models and logs through abstractions. Wiley Interdisc. Rew.: Data Min. Knowl. Discov. 8(3) (2018). https://doi.org/10.1002/widm.1244
van der Aalst, W.M.P.: Process mining: the missing link. Process Mining, pp. 25–52. Springer, Heidelberg (2016). https://doi.org/10.1007/978-3-662-49851-4_2
Chapter Google Scholar
Awad, A.: BPMN-Q: a language to query business processes. In: Enterprise Modelling and Information Systems Architectures, pp. 115–128 (2007)
Google Scholar
Beeri, C., Eyal, A., Kamenkovich, S., Milo, T.: Querying business processes. In: Proceedings of the 32nd International Conference on Very Large Data Bases. VLDB 2006, pp. 343–354. VLDB Endowment (2006). http://dl.acm.org/citation.cfm?id=1182635.1164158
Beheshti, S.-M.-R., Benatallah, B., Motahari-Nezhad, H.R., Sakr, S.: A query language for analyzing business processes execution. In: Rinderle-Ma, S., Toumani, F., Wolf, K. (eds.) BPM 2011. LNCS, vol. 6896, pp. 281–297. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-23059-2_22
Chapter Google Scholar
Born, M., Dörr, F., Weber, I.: User-friendly semantic annotation in business process modeling. In: Weske, M., Hacid, M.-S., Godart, C. (eds.) WISE 2007. LNCS, vol. 4832, pp. 260–271. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-77010-7_25
Chapter Google Scholar
Buijs, J.: Mapping data sources to XES in a generic way. Department of Mathematics and Computer Science, Eindhoven University of Technology (2010)
Google Scholar
Calvanese, D., Montali, M., Syamsiyah, A., van der Aalst, W.M.P.: Ontology-driven extraction of event logs from relational databases. In: Reichert, M., Reijers, H.A. (eds.) BPM 2015. LNBIP, vol. 256, pp. 140–153. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-42887-1_12
Chapter Google Scholar
Choi, I., Kim, K., Jang, M.: An XML-based process repository and process query language for integrated process management. Knowl. Process Manag. 14(4), 303–316. https://onlinelibrary.wiley.com/doi/abs/10.1002/kpm.290
Delfmann, P., Breuker, D., Matzner, M., Becker, J.: Supporting information systems analysis through conceptual model query. The diagramed model query language (DMQL). Commun. Assoc. Inf. Syst. 37, 24 (2015)
Google Scholar
Delfmann, P., Steinhorst, M., Dietrich, H.A., Becker, J.: The generic model query language GMQL. Conceptual specification, implementation, and runtime evaluation. Inf. Syst. 47, 129–177 (2015)
Article Google Scholar
Di Francescomarino, C., Tonella, P.: Crosscutting concern documentation by visual query of business processes. In: Ardagna, D., Mecella, M., Yang, J. (eds.) BPM 2008. LNBIP, vol. 17, pp. 18–31. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-00328-8_3
Chapter Google Scholar
Elgammal, A., Turetken, O., van den Heuvel, W.-J., Papazoglou, M.: Formalizing and appling compliance patterns for business process compliance. Softw. Syst. Model. 15(1), 119–146 (2014). https://doi.org/10.1007/s10270-014-0395-3
Article Google Scholar
Foerster, A., Engels, G., Schattkowsky, T.: Activity diagram patterns for modeling quality constraints in business processes. In: Briand, L., Williams, C. (eds.) MODELS 2005. LNCS, vol. 3713, pp. 2–16. Springer, Heidelberg (2005). https://doi.org/10.1007/11557432_2
Chapter Google Scholar
Gómez-López, M.T., Reina Quintero, A.M., Parody, L., Pérez Álvarez, J.M., Reichert, M.: An architecture for querying business process, business process instances, and business data models. In: Teniente, E., Weidlich, M. (eds.) BPM 2017. LNBIP, vol. 308, pp. 757–769. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-74030-0_60
Chapter Google Scholar
ter Hofstede, A.H.M., Ouyang, C., La Rosa, M., Song, L., Wang, J., Polyvyanyy, A.: APQL: a process-model query language. In: Song, M., Wynn, M.T., Liu, J. (eds.) AP-BPM 2013. LNBIP, vol. 159, pp. 23–38. Springer, Cham (2013). https://doi.org/10.1007/978-3-319-02922-1_2
Chapter Google Scholar
Jin, T., Wang, J., Wen, L.: Querying business process models based on semantics. In: Yu, J.X., Kim, M.H., Unland, R. (eds.) DASFAA 2011. LNCS, vol. 6588, pp. 164–178. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-20152-3_13
Chapter Google Scholar
Kammerer, K., Kolb, J., Reichert, M.: PQL - a descriptive language for querying, abstracting and changing process models. In: Gaaloul, K., Schmidt, R., Nurcan, S., Guerreiro, S., Ma, Q. (eds.) CAISE 2015. LNBIP, vol. 214, pp. 135–150. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-19237-6_9
Chapter Google Scholar
Kluza, K., Kaczor, K., Nalepa, G.J., Ślażyński, M.: Opportunities for business process semantization in open-source process execution environments. In: 2015 Federated Conference on Computer Science and Information Systems (FedCSIS), vol. 5, pp. 1307–1314, September 2015
Google Scholar
Kluza, K., et al.: Overview of selected business process semantization techniques. In: Pełech-Pilichowski, T., Mach-Król, M., Olszak, C.M. (eds.) Advances in Business ICT: New Ideas from Ongoing Research. SCI, vol. 658, pp. 45–64. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-47208-9_4
Chapter Google Scholar
Knuplesch, D., Reichert, M.: A visual language for modeling multiple perspectives of business process compliance rules. Softw. Syst. Model. 16(3), 715–736 (2016). https://doi.org/10.1007/s10270-016-0526-0
Article Google Scholar
Liu, Y., Muller, S., Xu, K.: A static compliance-checking framework for business process models. IBM Syst. J. 46(2), 335–361 (2007)
Article Google Scholar
Ly, L.T., Rinderle-Ma, S., Dadam, P.: Design and verification of instantiable compliance rule graphs in process-aware information systems. In: Pernici, B. (ed.) CAiSE 2010. LNCS, vol. 6051, pp. 9–23. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-13094-6_3
Chapter Google Scholar
Mannhardt, F., de Leoni, M., Reijers, H.A., van der Aalst, W.M.P., Toussaint, P.J.: Guided process discovery - a pattern-based approach. Inf. Syst. 76, 1–18 (2018). https://doi.org/10.1016/j.is.2018.01.009
Article Google Scholar
Momotko, M., Subieta, K.: Process query language: a way to make workflow processes more flexible. In: Benczúr, A., Demetrovics, J., Gottlob, G. (eds.) ADBIS 2004. LNCS, vol. 3255, pp. 306–321. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-30204-9_21
Chapter Google Scholar
González López de Murillas, E., Reijers, H.A., van der Aalst, W.M.P.: Connecting databases with process mining: a meta model and toolset. Softw. Syst. Model. 18(2), 1209–1247 (2018). https://doi.org/10.1007/s10270-018-0664-7
Article Google Scholar
González López de Murillas, E., Reijers, H.A., van der Aalst, W.M.P.: Everything you always wanted to know about your process, but did not know how to ask. In: Dumas, M., Fantinato, M. (eds.) BPM 2016. LNBIP, vol. 281, pp. 296–309. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-58457-7_22
Chapter Google Scholar
Nalepa, G.J., Ślażyński, M., Kutt, K., Kucharska, E., Łuszpaj, A.: Unifying business concepts for SMEs with prosecco ontology. In: 2015 Federated Conference on Computer Science and Information Systems (FedCSIS), pp. 1321–1326, September 2015
Google Scholar
Parreiras, F.S.: Marrying ontology and model-driven engineering (chap.). Wiley and Sons (2012)
Google Scholar
Pérez-Álvarez, J.M., Gómez-López, M.T., Parody, L., Gasca, R.M.: Process instance query language to include process performance indicators in DMN. In: 2016 IEEE 20th International Enterprise Distributed Object Computing Workshop (EDOCW), pp. 1–8, September 2016
Google Scholar
Pérez-Álvarez, J.M., Gómez-López, M.T., Eshuis, R., Montali, M., Gasca, R.M.: Verifying the manipulation of data objects according to business process and data models. Knowl. Inf. Syst., 1–31 (2020). https://doi.org/10.1007/s10115-019-01431-5
Pérez-Álvarez, J.M., Maté, A., Gómez-López, M.T., Trujillo, J.: Tactical business-process-decision support based on KPIs monitoring and validation. Comput. Ind. 102, 23–39 (2018). https://doi.org/10.1016/j.compind.2018.08.001
Article Google Scholar
Polyvyanyy, A.: Business process querying. In: Sakr, S., Zomaya, A. (eds.) Encyclopedia of Big Data Technologies, pp. 1–9. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-63962-8_108-1
Chapter Google Scholar
Polyvyanyy, A., Corno, L., Conforti, R., Raboczi, S., Rosa, M.L., Fortino, G.: Process querying in Apromore. In: BPM 2015 Demo Track (2015)
Google Scholar
Polyvyanyy, A., Ouyang, C., Barros, A., van der Aalst, W.M.P.: Process querying: enabling business intelligence through query-based process analytics. Decis. Support Syst. 100, 41–56 (2017). https://doi.org/10.1016/j.dss.2017.04.011
Article Google Scholar
Smith, F., Missikoff, M., Proietti, M.: Ontology-based querying of composite services. In: Ardagna, C.A., Damiani, E., Maciaszek, L.A., Missikoff, M., Parkin, M. (eds.) Business System Management and Engineering. LNCS, vol. 7350, pp. 159–180. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-32439-0_10
Chapter Google Scholar
Steinberg, D., Budinsky, F., Paternostro, M., Merks, E.: EMF: Eclipse Modeling Framework 2.0, 2nd edn. Addison-Wesley Professional, Boston (2009)
Google Scholar
Stodder, D.: Improving data preparation for business analytics. Applying technologies and methods for establishing trusted data assets for more productive users. Best Practices Report Q3 2016, pp. 19–21 (2016)
Google Scholar
Störrle, H.: VMQL: a visual language for ad-hoc model querying. J. Vis. Lang. Comput. 22(1), 3–29 (2011). Special Issue on Visual Languages and Logic
Article Google Scholar
Wetzstein, B., et al.: Semantic business process management: a lifecycle based requirements analysis. In: Proceedings of the Workshop on Semantic Business Process and Product Lifecycle Management, vol. 251. CEUR Workshop Proceedings (2007)
Google Scholar

Download references

Acknowledgements

This work has been partially funded by the Ministry of Science and Technology of Spain by the Project ECLIPSE (RTI2018-094283-B-C33 and TIN2016-75394-R) and the European Regional Development Fund (ERDF/FEDER).

Author information

Authors and Affiliations

Universidad de Sevilla, Sevilla, Spain
Antonio Cancela, Antonia M. Reina Quintero, María Teresa Gómez-López & Alejandro García-García

Authors

Antonio Cancela
View author publications
You can also search for this author in PubMed Google Scholar
Antonia M. Reina Quintero
View author publications
You can also search for this author in PubMed Google Scholar
María Teresa Gómez-López
View author publications
You can also search for this author in PubMed Google Scholar
Alejandro García-García
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Antonio Cancela .

Editor information

Editors and Affiliations

Università degli Studi di Milano, Milan, Italy
Paolo Ceravolo
University of Twente, Enschede, The Netherlands
Maurice van Keulen
University of Seville, Seville, Spain
María Teresa Gómez-López

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cancela, A., Quintero, A.M.R., Gómez-López, M.T., García-García, A. (2020). Standardizing Process-Data Exploitation by Means of a Process-Instance Metamodel. In: Ceravolo, P., van Keulen, M., Gómez-López, M. (eds) Data-Driven Process Discovery and Analysis. SIMPDA SIMPDA 2018 2019. Lecture Notes in Business Information Processing, vol 379. Springer, Cham. https://doi.org/10.1007/978-3-030-46633-6_3

Download citation

DOI: https://doi.org/10.1007/978-3-030-46633-6_3
Published: 25 April 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-46632-9
Online ISBN: 978-3-030-46633-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Federation for Information Processing (opens in a new tab)