Conformance Testing and Inference of Embedded Components

Petrenko, Alexandre; Avellaneda, Florent

doi:10.1007/978-3-319-99927-2_10

Alexandre Petrenko¹⁸ &
Florent Avellaneda¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 11146))

Included in the following conference series:

IFIP International Conference on Testing Software and Systems

752 Accesses
1 Citations

Abstract

The problems of active inference (learning) and conformance testing of a system modelled by an automaton have actively been studied for decades, however, much less attention has been paid to modular systems, modelled by communicating automata. In this paper, we consider a system of two communicating FSMs, one machine represents an embedded component and another the remaining part of the system, the context. Assuming that the context FSM is known, we want to learn the embedded FSM without directly interacting with it. This problem can be viewed as a generalization of the classical automata inference in isolation, i.e., it is the grey box learning problem. The proposed approach to solve this problem relies on a SAT-solving method for FSM inference from traces. It does not depend on the composition topology and allows at the same time to solve a related problem of conformance testing in context. The latter is to test whether an embedded implementation FSM composed with the given context is equivalent to the embedded specification FSM also composed with the context. The novelty of the conformance testing method is that it directly generates a complete test suite for the embedded machine and avoids using nondeterministic approximations with their tests, eliminating thus several sources of test redundancy inherent in the existing methods.

You have full access to this open access chapter, Download conference paper PDF

Applying Automata Learning to Embedded Control Software

Learning Communicating State Machines

Complete Property-Oriented Module Testing

Keywords

1 Introduction

Componentization is an important engineering principle. Top-down design approaches brake down large systems into smaller parts, components, and bottom-up approaches compose existing components into larger systems. While practitioners are mostly using ad hoc techniques, model-based software engineering is investigating formal approaches which can offer automation to various phases of modular system development, including testing and using legacy components and components of the shelf, COTS.

The existing model-based testing approaches focus mostly on a holistic view of a modular system, based on a single state-oriented model, see, e.g., [7]. Conformance tests are then generated from a state machine, which models either a component in isolation or a whole system as observed on external interfaces [4, 5, 7]. On the other hand, when a system is built using existing components, testing efforts should be focused only on new components [17]. This motivates research in conformance testing in context aka embedded testing, which aims to check whether an embedded implementation FSM composed with the given context is equivalent to the embedded specification FSM also composed with the context.

All the known methods for complete tests generation for testing in context first construct from the context and embedded machine an embedded equivalent or the largest solution to the appropriate FSM equation, which represents the behavior of the embedded machine as can be controlled and observed via context [2, 11, 12, 14]. The resulting partial machine is a nondeterministic approximation of the embedded deterministic machine, it is then used to derive complete internal tests, which are finally translated into external ones executed on external interfaces of a modular system.

Conformance testing is closely related to active inference, aka query learning, as already been understood, see, e.g., [10], for the case when a system is considered “as a whole”, i.e., modelled as an FSM.

Model inference helps in dealing with legacy components and COTS. Once a model is reengineered it can be used to perform verification with model checkers, regression and integration testing or redesign. Automata inference is an important topic addressed in many works, see, e.g., [1, 3, 8, 9, 13, 15], which treat a system as a single black box, even if it contains components with known models and only some need to be learned.

We propose to generalize the FSM inference problem to the case when an FSM to learn is a part of a modular system. Indeed, the traditional automaton/FSM inference problem statement is a particular case of this general situation, namely, when the rest of the system is a single state machine performing just a bijection of external and internal inputs. We know the only work [18] addressing the grey box learning problem, where the goal is to learn a tail FSM in the serial composition with the context FSM. We propose an approach for solving the grey box learning problem that does not depend on the composition topology, as opposed to the previous work [18].

To simplify the discussions, we model a system with two communicating FSMs, one machine represents an embedded component and another the remaining part of the system, the context. Assuming that the context FSM is known, we elaborate an approach to learn the embedded FSM without directly interacting with it. The proposed approach relies on a SAT-solving method for FSM inference from sample traces. The approach also allows to solve the problem of conformance testing in context, which is to check whether an embedded implementation FSM composed with the given context is equivalent to the embedded specification FSM also composed with the context. The novelty of the conformance testing method is that it directly generates a complete test suite for the embedded machine and avoids using nondeterministic approximations with their tests, eliminating thus several sources of test redundancy inherent in the existing methods.

The paper is organized as follows. Section 2 provides definitions related to state machines and automata needed to formalize the approach. Communicating FSMs are formally defined and illustrated on a working example in Sect. 3. A SAT-solving method for FSM inference from traces, which allows to obtain different conjectures [10] as required by the proposed methods, is recalled in Sect. 4. Section 5 presents our method for complete test generation for embedded components. In Sect. 6 we present some experimental results concerning test generation. Section 7 describes the method for embedded FSM inference and Sect. 8 concludes.

2 Definitions

A Finite State Machine or simple machine M is a 5-tuple (S, s₀, I, O, T), where S is a finite set of states with an initial state s₀; I and O are finite non-empty disjoint sets of inputs and outputs, respectively; T is a transition relation $ T\, \subseteq \,S \times I \times O \times S,(s,a,o,s^{{\prime }} )\, \in \,T $ is a transition. When we need to refer to the machine being in state s ∈ S, we write M/s.

M is complete (completely specified) if for each tuple (s, a) ∈ S × I there exists transition (s, a, o, s′) ∈ T, otherwise it is partial. It is deterministic if for each (s, a) ∈ S × I there exists at most one transition (s, a, o, s′) ∈ T, otherwise it is nondeterministic. FSM M is a submachine of M′ = (S′, s₀, I, O, T′) if $ S\, \subseteq \,S^{{\prime }} $ and $ T\, \subseteq \,T^{{\prime }} $.

An execution of M/s is a finite sequence of transitions forming a path from s in the state transition diagram of M. The machine M is initially connected, if for any state s ∈ S there exists an execution from s₀ to s. Henceforth, we consider only deterministic initially connected machines.

A trace of M/s is a string in (IO)* which labels an execution from s. Let Tr(s) denote the set of all traces of M/s and Tr_M denote the set of traces of M/s₀. For trace $ \upomega \in Tr\left( s \right) $, we use s-after-ω to denote the state M reached after the execution of ω, for an empty trace ε, s-after-ε = s. When s is the initial state we write M-after-ω instead of s₀-after-ω.

Given a string ω ∈ (IO)*, the I-restriction of ω is a string obtained by deleting from ω all symbols that are not in I, denoted $ \upomega_{ \downarrow I} $.

The I-restriction of a trace ω ∈ Tr(s) is said to be a transfer sequence from state s to state s-after-ω. The length of a trace is defined as the length of its I-restriction. A prefix of trace $ \upomega \in Tr\left( s \right) $ is a trace $ \upomega^{{\prime }} \in Tr\left( s \right) $ such that the I-restriction of the latter is a prefix of the former.

Given an input sequence α, we let out(s, α) denote the O-restriction of the trace that has α as its I-restriction. States s, s′ ∈ S are equivalent w.r.t. α, if out(s, α) = out(s′, α), denoted s ≅ _α s′; they are distinguishable by α, if out(s, α) ≠ out(s′, α), denoted s ≇ _α s′ or simply s ≇ s′. States s and s′ are equivalent if they are equivalent w.r.t. all input sequences, i.e., Tr(s) = Tr(s′), denoted s ≅ s′. The equivalence and distinguishability relations between FSMs is similarly defined. Two FSMs are equivalent if their initial states are equivalent.

Given two FSMs M = (S, s₀, I, O, T) and $ M^{{\prime }} = \left( {S^{{\prime }} ,\,s_{0}^{{\prime }} ,\,I,\,O,\,T^{{\prime }} } \right) $, their product M × M′ is the FSM (P, p₀, I, O, H), where $ p_{0} = \left( {s_{0} ,\,s_{0}^{{\prime }} } \right) $ such that P and H are the smallest sets satisfying the following rule: If $ \left( {s,\,s^{{\prime }} } \right) \in P,\,\left( {s,x,o,t} \right) \in T,\,\left( {s^{{\prime }} ,x,o^{{\prime }} ,t^{{\prime }} } \right) \in T^{{\prime }} $, and o = o′, then (t, t′) ∈ P and ((s, s′), x, o, (t, t′)) ∈ H. It is known that if M and M′ are complete machines then they are equivalent if and only if the product M × M′ is complete.

Two complete FSMs M = (S, s₀, I, O, T) and $ M^{{\prime }} = \left( {S^{{\prime }} ,s_{0}^{{\prime }} ,I,O,T^{{\prime }} } \right) $ are called isomorphic if there exists a bijection f: S → S′ such that $ f\left( {s_{0} } \right) = s_{0}^{{\prime }} $₀ and for all a ∈ I, o ∈ O, and s ∈ S, f(s-after-ao) = f(s)-after-ao. Isomorphic FSMs are equivalent, but the converse does not necessarily hold.

Given a string ω ∈ (IO)* of length |ω|, let Pref(ω) be the set of all prefixes of ω. We define a (linear) FSM W(ω) = (X, x₀, I, O, D_ω), where D_ω is a transition relation, such that |X| = |ω| + 1, and there exists a bijection f: X → Pref(ω), such that f(x₀) = ε, (x_i, a, o, x_i+1) ∈ D_ω if f(x_i)ao = f(x_i+1) for all i = 0, …, |ω|−1, in other words, W(ω) has the set of traces Pref(ω). We call it the ω-machine. Similarly, given a finite prefix-closed set of traces Ω ⊂ (IO)* of some deterministic FSM, let W(Ω) = (X, x₀, I, O, D_Ω) be the acyclic deterministic FSM such that Ω is the set of its traces, called an Ω-machine. The bijection f relates states of this machine to traces in Ω.

While the set of traces of the Ω-machine is Ω, there are many FSMs which contain the set Ω among their traces. We restrict our attention to the set of all FSMs with at most n states and alphabets I and O, denoted $ {\mathfrak{J}} \left( {n,\,I,\,O} \right) $. An $ {\text{FSM}}\,C = \left( {S,\,s_{0} ,\,I,\,O,\,T} \right),\,C \in {\mathfrak{J}} \left( {n, I, O} \right) $ is called an Ω-conjecture, if Ω ⊆ Tr_C.

The states of the Ω-machine W(Ω) = (X, x₀, I, O, D_Ω) and an Ω-conjecture C = (S, s₀, I, O, T) are closely related to each other. Formally, there exists a mapping μ: X → S, such that μ(x) = s₀-after-f(x), the state reached by C with the trace f(x) ∈ Ω. The mapping μ is unique and induces a partition π_C on the set X such that x and x′ belong to the same block of the partition π_C, denoted $ x =_{{\uppi_{C} }} x^{{\prime }} $, if μ(x) = μ(x′).

Given an Ω-conjecture C with the partition π_C, let D be an Ω′-conjecture with the partition π_D, such that Ω′ ⊆ Ω, we say that the partition π_C is an expansion of the partition π_D, if its projection onto states of Ω′ coincides with the partition π_D.

A finite set of input sequences L ⊂ I* is a checking experiment for a complete FSM M with n states if for each $ {\text{FSM}}\,N \in {\mathfrak{J}}\,\left( {n,\,I,\,O} \right) $, such that N ≅ _L M, it holds that N ≅ M. Checking experiments are also called complete (i.e., sound and exhaustive) tests.

We also use the classical automaton model. A Finite Automaton A is a 5-tuple (P, p₀, X, T, F), where P is a finite set of states with the initial state p₀; X is a finite alphabet; T is a transition relation $ T \subseteq S \times X \cup \left\{ \varepsilon \right\} \times S $, where ε represents an internal action, and F is a set of final or accepting states. We shall use several operations over automata, namely, expansion, restriction, and intersection, following [16].

Given an automaton A and a finite alphabet $ U,\,U \cap X = \varnothing $, the U-expansion of automaton A is the automaton denoted A_↑U obtained by adding at each state a self-looping transition labeled with each action in U.

For an automaton A and an alphabet U ⊆ X, the U-restriction of automaton A is the automaton denoted A_↓U obtained by replacing each transition with the symbol in X\U by an ε-transition between the same states.

Given automata A = (P, p₀, X, T, F_A) and B = (R, r₀, Y, Z, F_B), such that $ X \cap Y \ne \varnothing $, the intersection A ∩ B is the largest initially connected submachine of the automaton $ (P \times R,\left( {p_{0} ,r_{0} } \right),X \cap Y,Q,F_{A} \times F_{B} ) $, where for each symbol a ∈ X ∩ Y and each state (p, r) ∈ P × R, ((p, r), a, (p′, r′)) ∈ Q, if (p, a, p′) ∈ T and (r, a, r′) ∈ Z.

We also define an automaton corresponding to a given FSM M. The automaton, denoted by A(M), is obtained by splitting each transition of M labeled by input/output into two transitions labeled by input and output, respectively, and connecting them with an auxiliary non-final state. The original states of M are only final states of A(M), hence the language of A(M) coincides with the set of traces of M.

3 Communicating FSMs

The behavior of a modular system composed of FSM components depends on its environment. The two together constitute a closed system. Communications in it can either be via messages or by method calls, using no queues. We restrict our attention to the case when queues are not used, which is possible with a so-called slow environment assuming that the closed system operates with a single message in transit [12]. This is a sufficient condition for the existence of a deterministic FSM modelling the external behavior of a modular system [12, 16]. Such an environment can be modelled by a (chaos) automaton Env with two states, out of which the initial state is the final state, shown in Fig. 1. After issuing an external input in X to the system it waits until an external output in O is produced before executing a next input. Its language is the set (XO)*.

We further consider only two deterministic FSMs, one of them representing an embedded component E and another the remaining part of the modular system, aka context K, as shown in Fig. 1.

We assume that the sets X, O, U, and V are pairwise disjoint. The context FSM K assumed to be a complete machine interacts with the environment and can process an external input after it produces an internal output even before an external output is emitted. Since this violates the restriction of having a single message in transit, we constrain its behavior by composing it with the slow environment. Let A(K) be the automaton of the context FSM K. Then the intersection of automata $ A\left( K \right) \cap Env_{ \uparrow U \cup V} = A(K)_{\text{slow}} $ represents the behavior of the context constrained by the slow environment. Then the intersection $ A\left( K \right)_{\text{slow}} \cap A\left( E \right)_{ \uparrow X \cup O} $ denoted by A(K) ◊ A(E) describes the behavior of the closed system, called the composite automaton of the modular system.

The language of A(K) ◊ A(E) is the set of all strings labelling all the executions of the system L(A(K) ◊ A(E)). Restricting a string to the alphabets of a component FSM we obtain a trace of the context or embedded FSM. The external behavior of the system is expressed in terms of external inputs X and outputs O, so it is the set of (X ∪ O)-restrictions of A(K) ◊ A(E), i.e., external traces of the system. They are traces of an FSM, provided that A(K) ◊ A(E) has no livelocks, i.e., cycles labelled by symbols in U ∪ V [16], the machine can be obtained by removing ε-transitions in A(K) ◊ A(E)_↓X∪O and pairing each input with a subsequent output, if it exists, to an FSM transition’s label. If some external input is not followed by an external output it is deleted from the corresponding state of (A(K) ◊ A(E))_↓X∪O, as a result, the FSM becomes partial. If all inputs are deleted from the initial state then the machine has a single state and no transition. We let K ◊ E denote the resulting FSM, called the composite FSM of the modular system.

Given two FSMs E and E′ over the alphabets U and V, such that the composite machines K ◊ E and K ◊ E′ are complete FSMs, we say that E and E′ are externally equivalent (or equivalent in context) if K ◊ E ≅ K ◊ E. Clearly, E ≅ E′ implies K ◊ E ≅ K, but the converse does not hold. Testing in context uses external equivalence as a conformance relation between implementations of a component embedded in a modular system and its specification.

A finite set of input sequences L ⊂ X* is an external checking experiment (complete test suite) for the embedded FSM E w.r.t. $ {\mathfrak{J}} \left( {n,\,U,\,V} \right), $ if for each $ {\text{FSM}}\,N \in {\mathfrak{J}} \left( {n,\,U,\,V} \right), $ where n is the number of states in E such that K ◊ N ≅_L K ◊ E, it holds that K ◊ N ≅ K ◊ E.

Example. Consider the context FSM K and embedded FSM E shown in Fig. 2 together with the composite FSM K ◊ E. The composite automaton A(K) ◊ A(E) is shown in Fig. 3.

4 Passive Inference with SAT-Solving

Henceforth, we first provide a brief overview of the SAT-solving based method for conjecture generation from a given set of traces avoiding regeneration of already considered conjectures which is the basic step of testing and learning an FSM in isolation [10] and an embedded FSM, as we show in Sect. 5. For a detailed presentation, the reader is referred to [10].

The basic step of conjecture inference from a given set of traces Ω is state merging of the Ω-machine. SAT-solving approaches [6, 9] encode the problem into Boolean constraints, a solution if it exists is a conjecture with a given number of states. We use an existing encoding of a set of traces Ω into a Boolean formula formula [6]. Let W(Ω) = (X, x₀, I, O, D_Ω) be the Ω-machine. Each state of the Ω-machine is represented by a variable x, so x_i ∈ {0, …, n − 1}. Since the Ω-machine is deterministic, the state variables satisfy the constraint [1]:

$$ \begin{aligned} \begin{array}{*{20}c} {\forall_{xi} ,\,x_{j} \in :\,if\,x_{i} \,{\not\cong}\,x_{j} \,{\text{then}}\,x_{i} \ne x_{j} \,{\text{and}}} \\ {{\text{if}}\,\exists a\, \in \,I\,{\text{s}} . {\text{t}} .\,out\left( {x_{i} ,\,a} \right) = out\left( {x_{j} ,\,a} \right) = o\,{\text{then}}\,x_{i} = x_{j} \Rightarrow x_{i}{\text{-after-}}ao = x_{j}{\text{-after-}}ao} \\ \end{array} \hfill \\ \hfill \\ \end{aligned} $$

(1)

An assignment of values to variables such that the formula (1) is satisfied defines a mapping μ: X → S, where S is the set of states of an Ω-conjecture, i.e., the mapping μ defines a partition of X into n blocks.

These CSP (constraint satisfaction problem) formulas are then translated to SAT using unary coding for integer variables, represented by n Boolean variables v_x,0, …, v_x,n−1. For each x ∈ X, we have the clause:

$$ v_{x,0} { \vee } \ldots { \vee }v_{x,n - 1} $$

(2)

For each state x ∈ X and all i, j ∈ {0, …, n − 1} such that i ≠ j, we have the clauses:

$$ \neg v_{x,i} \vee \neg v_{x,j} $$

(3)

We use auxiliary variables e_{x, y} [6]. For every x, y ∈ X such that x ≇ y we have

$$ \neg e_{x,\,y} $$

(4)

For all x, y ∈ X such that out(x, a) = out(y, a) = o, we have

$$ e_{x,y} \Rightarrow e_{{x - {\text{after - }}ao,y - {\text{after - }}ao}} $$

(5)

For every x, y ∈ X and all i ∈ {0, …, n − 1}

$$ e_{x,y} \wedge v_{x,i} \Rightarrow v_{y,i} $$

(6)

$$ \neg e_{x,y} \wedge v_{x,i} \Rightarrow \neg v_{y,i} $$

(7)

The resulting Boolean formula is the conjunction of clauses (2)–(7).

The traditional use of SAT solvers for state minimization aims at obtaining a single conjecture, while the problems of conformance testing and learning require that constraints should allow a solver to check, once a conjecture is found, whether another non-equivalent conjecture exists. Absence of a conjecture proves that a checking experiment is constructed and the machine is identified.

This is achieved by using the following procedure to infer a conjecture that differs from already considered conjectures. Isomorphic conjectures are identified by their common partition, encoded into an additional constraint. Recall that states of an Ω-machine form a partition defined by an Ω-conjecture. We let Π denote a set of partitions of states of Ω′-machines, where Ω′ ⊆ Ω.

To check the satisfiability of a formula one can use any of the existing solvers, calling the function call-solver(formula). If a solution exists then we have an Ω-conjecture with n or fewer states. The latter is obtained from the determined partition on X.

5 External Checking Experiment Construction

Solving the problem of external checking experiment generation, we use as in our previous work [10] Algorithm 1 for conjecture inference from a current set of traces. The difference, however, is that traces now no longer belong to a machine considered in isolation (this becomes even more crucial for active inference), they are produced by an embedded component. Accordingly, instead of checking the equivalence of a conjecture to the specification machine, we must check their external equivalence. To this end, we need to compose a conjecture C and the context K. As discussed above, if the resulting composite automaton A(K) ◊ A(C) has a livelock, its external behavior cannot be specified by an FSM, since an external input triggering livelock cannot be paired with any output. To deal with this issue we formulate a new constraint (in the form of a partition, as before) avoiding its regeneration by a solver. Once the current conjecture composed with the context yields a composite FSM K ◊ C an external input sequence distinguishing it from the given composite machine K ◊ E can be determined, if they are not equivalent. The found sequence is added to a current set of input sequences. The distinguishing external input sequence is the X-restriction of the word of the automaton A(K) ◊ A(C) from which a trace of the embedded component is obtained as the (U ∪ V)-restriction and used to generate a next conjecture. If, however, no new trace of the embedded component is obtained and the state partition of the conjecture distinguishable from the specification machine is added as a constraint to avoid its regeneration. The process iterates until the constraints are no longer satisfiable. The procedure is implemented in Algorithm 2.

Algorithm 2 calls Infer_conjecture(Ω, n, Π), which in turn calls a SAT solver constraining it to avoid solutions of already considered conjectures.

Note that the Boolean formula used by the SAT solver is built incrementally; a current formula is saved and new clauses are added when a set Ω or Π is augmented.

Example.

We illustrate Algorithm 2 using the context and embedded machines in Fig. 2. We assume n = 2. Initially, the set of external input sequences Ψ is empty, so is the set of internal input sequences Ω. The function Infer_conjecture(Ω, n, Π) for the empty set Π returns a Ω-conjecture C₀ as an FSM with a single state and no transitions. The composite machine K ◊ C has no transitions either. We choose the external input x₁, so Ψ = {x₁}. This input is the X-restriction of the word σ = x₁u₁v₁u₁v₁o₁ in A(K) ◊ A(E). Its restriction onto the alphabets of the embedded component is $ \upsigma_{{ \downarrow \left( {U \cup V} \right)}} = u_{1} v_{1} u_{1} v_{1} .\,\Omega \,\text{ = }\left\{ {u_{1} v_{1} u_{1} v_{1} } \right\} $. The function Infer_conjecture(Ω, n, Π) for the empty set Π returns a Ω-conjecture C₁ as an FSM with a single state and transition labelled u₁/v₁. The composite FSM K ◊ C₁ is shown in Fig. 4(a).

The FSM K ◊ C₁ has an undefined input x₁ in the second state, we take the external input sequence x₁x₁, so now Ψ = {x₁x₁}. It is the X-restriction of the word σ = x₁u₁v₁u₁v₁o₁x₁u₂v₁o₁ in A(K) ◊ A(E). Its restriction onto the alphabets of the embedded component is $ \upsigma_{{ \downarrow \left( {U \cup V} \right)}} = u_{1} v_{1} u_{1} v_{1} u_{2} v_{1} .\,\Omega \, = \left\{ {u_{1} v_{1} u_{1} v_{1} u_{2} v_{1} } \right\} $. The function Infer_conjecture(Ω, n, Π) for the empty set Π returns a Ω-conjecture C₂ as an FSM with a single state and two transitions labelled u₁/v₁ and u₂/v₁. The composite FSM K ◊ C₂ is shown in Fig. 4(b). It is a complete machine, however, the product (K ◊ C) × (K ◊ E) is a partial machine, since its behavior is not specified for the input sequence x₂x₁x₂. In fact, this sequence demonstrates that K ◊ C₂ ≅ K ◊ E, since K ◊ C₂ reacts with the o₁o₁o₂, while K ◊ E with o₁o₁o₁.

The input sequence x₂x₁x₂ is added to Ψ, which becomes {x₁x₁, x₂x₁x₂}. The sequence x₂x₁x₂ is the X-restriction of the word σ = x₂u₁v₁o₁ x₁u₂v₂o₁x₂u₁v₁o₁ in A(K) ◊ A(E). Its restriction onto the alphabets of the embedded component is $ \upsigma_{{ \downarrow \left( {U \cup V} \right)}} = u_{1} v_{1} u_{2} v_{2} u_{1} v_{1} .\,\Omega \, = \left\{ {u_{1} v_{1} u_{1} v_{1} u_{2} v_{1} ,\,u_{1} v_{1} u_{2} v_{2} u_{1} v_{1} } \right\} $. Next Ω-conjecture C₃ is shown in Fig. 4(c). The composite FSM K ◊ C₃ is isomorphic to K ◊ E. Now the set of partitions Π should include the following partition of prefixes of Ω, as each of them is a state of the Ω-machine:

$$ \uppi_{1} \, = \,\left\{ {\upvarepsilon,\,u_{1} v_{1} u_{1} v_{1} ,\,u_{1} v_{1} u_{1} v_{1} u_{2} v_{1} ,\,u_{1} v_{1} u_{2} v_{2} u_{1} v_{1} ;\,u_{1} v_{1} ,\,u_{1} v_{1} u_{2} v_{2} } \right\}. $$

In the next iteration, Infer_conjecture(Ω, n, Π) returns the Ω-conjecture C₄ shown in Fig. 4(d). The composite FSM K ◊ C₄ is shown in Fig. 4(e). It is not equivalent to K ◊ E, and the shortest input sequence distinguishing them is x₁x₁x₁x₂, it extends the existing sequence x₁x₁.

The input sequence x₁x₁x₁x₂ is added to Ψ, which becomes {x₁x₁x₁x₂, x₂x₁x₂}. It is the X-restriction of the word σ = x₁u₁v₁u₁v₁o₁x₁u₂v₁o₁x₁u₂v₁o₁x₂o₂ in A(K) ◊ A(E). We have $ \upsigma_{{ \downarrow \left( {U \cup V} \right)}} = u_{1} v_{1} u_{1} v_{1} u_{2} v_{1} u_{2} v_{1} .\,\Omega \, = \left\{ {u_{1} v_{1} u_{1} v_{1} u_{2} v_{1} u_{2} v_{1} ,\,u_{1} v_{1} u_{2} v_{2} u_{1} v_{1} } \right\} $. Next Ω-conjecture C₅ is shown in Fig. 4(f). The composite FSM K ◊ C₅ is shown in Fig. 4(g). It is not equivalent to K ◊ E, and the shortest input sequence distinguishing them is x₂x₁x₂x₁x₂, it extends the existing sequence x₂x₁x₂.

The input sequence x₂x₁x₂x₁x₂ is added to Ψ, which becomes {x₁x₁x₁x₂, x₂x₁x₂x₁x₂}. It is the X-restriction of the word σ = x₂u₁v₁o₁ x₁u₂v₂o₁x₂u₁v₁o₁x₁u₂v₁o₁x₂o₂. We have $ \upsigma_{{ \downarrow \left( {U \cup V} \right)}} = u_{1} v_{1} u_{2} v_{2} u_{1} v_{1} u_{2} v_{1} .\,\Omega \, = \left\{ {u_{1} v_{1} u_{1} v_{1} u_{2} v_{1} u_{2} v_{1} ,\,u_{1} v_{1} u_{2} v_{2} u_{1} u_{2} v_{1} } \right\} $. The function Infer_conjecture(Ω, n, Π) returns False, since there is no solution which does not extend the partition π₁. Algorithm 2 terminates with the external checking experiment Ψ = {x₁x₁x₁x₂, x₂x₁x₂x₁x₂}.

This example was used in the previous work [12] to illustrate a number of various approaches to construct complete tests for the embedded component, compared to them the SAT solving approach elaborated here generates a much smaller test suite. For comparison, we construct the same experiment assuming this time that n = 3. The prototype tool presented in Sect. 7 returns just seven tests.

Notice that the algorithm not only delivers an external checking experiment for the embedded component, but also infers an FSM that is externally equivalent to the given embedded FSM. In our example, the Ω-conjecture C₃ in Fig. 4(c) is externally equivalent to the FSM E in Fig. 2. This observation indicates that the approach should work for active inference of an embedded component. We elaborate a corresponding algorithm in Sect. 7.

Theorem 1.

Given complete deterministic FSMs K and E such that the composite machine K ◊ E is a complete FSM, Algorithm 2 returns an external checking experiment for the embedded FSM E w.r.t. $ {\mathfrak{J}} \left( {n,\,U,\,V} \right) $.

Sketch of Proof.

When Algorithm 2 terminates the resulting set of external input sequence Ψ is indeed a checking experiment, since by the post-condition of Infer_conjecture no conjecture exists that is not externally equivalent to the given embedded FSM E. Note that all complete conjectures externally equivalent to E are excluded because as soon as one if found (including E itself), its partition is added to Π. Algorithm 2 will not generate the same conjecture all over again and always terminates because the set of all possible conjectures with at most n states is finite.

6 Preliminary Experiments

The complexity of checking experiments for complete deterministic FSMs is well understood, however, no result exists yet on estimating complexity of external checking experiments for embedded FSMs. Considering a system of two communicating machines, context and embedded FSMs, the question arises which of them contribute more to the complexity of external checking experiments. We decided to perform experiments aiming at shedding some light on this.

Both machines are generated randomly for |X| = |O| = |U| = |V| = 2. In the first experiment, we fix the number of states of an embedded FSM to six and vary that of the context; in the second experiment, we fix the number of states of a context FSM to six and vary that of the embedded FSM. For each pair of values, the average of ten instances obtained with a prototype tool implementing Algorithm 2 is calculated and the results are illustrated in Fig. 5. They indicate that the length of experiments grows with the number of states in an embedded machine similar to an FSM considered in isolation, but the complexity of the context seems not to be a significant contributor. More experiments are needed to check this conclusion.

7 Active Inference of Embedded FSM

Given a composition of two complete deterministic FSMs K and E with the topology in Fig. 1, called a grey box, GB, where the context FSM K is known, while the embedded FSM E is not, we want to learn the machine E by applying external inputs I and observing external as well as internal outputs O, U, V, assuming that the embedded FSM E has at most n states.

External input sequences are applied to the grey box obeying the property of a slow environment Env (Fig. 1), i.e., inputs are interleaved with outputs. We assume that livelocks are removed from the grey box. The learning procedure is implemented in Algorithm 3. It is an enhancement of Algorithm 2 replacing the FSM E by a current conjecture.

Example.

We illustrate Algorithm 3 using the context and embedded machines in Fig. 2. The embedded FSM is the one to be inferred and we use the composite automaton in Fig. 3 as the grey box. We assume n = 2. Initially, the set of external input sequences Ψ is empty, so is the set of internal input sequences Ω. The function Infer_conjecture(Ω, n, Π) for the empty set Π returns a Ω-conjecture C₀ as an FSM with a single state and no transitions. Its second execution yields D₀ which has no transition. (K ◊ C₀) × (K ◊ D₀) has no transitions either. We choose the external input x₁, so Ψ = {x₁}. When this input is applied to the grey box, the trace σ = x₁u₁v₁u₁v₁o₁ is observed. Its restriction onto the alphabets of the embedded component is $ \upsigma_{{ \downarrow \left( {U \cup V} \right)}} = u_{1} v_{1} u_{1} v_{1} .\,\Omega = \left\{ {u_{1} v_{1} u_{1} v_{1} } \right\} $. The function Infer_conjecture(Ω, n, Π) for the empty set Π returns a Ω-conjecture with a single state and transition labelled u₁/v₁. This machine becomes now the conjecture C₁. The composite FSM K ◊ C₁ is shown in Fig. 4(a). Next execution of the loop yields the conjecture D₁ equivalent to C₁.

The product (K ◊ C₁) × (K ◊ D₁) has an undefined input x₁ in the second state, we take the external input sequence x₁x₁, so now Ψ = {x₁x₁}. When this input sequence is applied to the grey box, the trace σ = x₁u₁v₁u₁v₁o₁x₁u₂v₁o₁ is observed. Its restriction onto the alphabets of the embedded component is $ \upsigma_{{ \downarrow \left( {U \cup V} \right)}} = u_{1} v_{1} u_{1} v_{1} u_{2} v_{1} .\,\Omega \, = \left\{ {u_{1} v_{1} u_{1} v_{1} u_{2} v_{1} } \right\} $. The function Infer_conjecture(Ω, n, Π) for the empty set Π returns a Ω-conjecture D₂ with a single state and two transitions labelled u₁/v₁ and u₂/v₁. This machine becomes now the conjecture C₂. The composite FSM K ◊ C₂ is shown in Fig. 4(b). The product (K ◊ C₂) × (K ◊ D₂) is a complete machine, $ \Pi : = \Pi \cup \left\{ {\uppi\,_{{D_{2} }} } \right\} $ is executed, where π_D2 = {ε, u₁v₁, u₁v₁u₁v₁, u₁v₁u₁v₁u₂v₁}. The function Infer_conjecture(Ω, n, Π) for the set Π returns a Ω-conjecture D₃ shown in Fig. 6(a). The composite FSM K ◊ D₃ is shown in Fig. 6(b). Its behavior is not specified for the input sequence x₂x₁.

The input sequence x₂x₁ is added to Ψ, which becomes {x₁x₁, x₂x₁}. The grey box produces the trace σ = x₂u₁v₁o₁ x₁u₂v₂o₁ when the sequence x₂x₁ is applied. Its restriction onto the alphabets of the embedded component is $ \upsigma_{{ \downarrow \left( {U \cup V} \right)}} = u_{1} v_{1} u_{2} v_{2} .\,\Omega = \left\{ {u_{1} v_{1} u_{1} v_{1} u_{2} v_{1} ,\,u_{1} v_{1} u_{2} v_{2} } \right\} $. Next Ω-conjecture D₄ is shown in Fig. 4(c). This machine becomes now the conjecture C₃. The composite FSM K ◊ C₃ is isomorphic to the FSM in Fig. 2(c). Then the set of partitions Π should include the following partition: $ \pi_{{D_{4} }} = \{\upvarepsilon,u_{1} v_{1} u_{1} v_{1} ,u_{1} v_{1} u_{1} v_{1} u_{2} v_{1} ;u_{1} v_{1} ,u_{1} v_{1} u_{2} v_{2} \} $.

In the next iteration, Infer_conjecture(Ω, n, Π) returns the Ω-conjecture D₅ shown in Fig. 4(d). The composite FSM K ◊ D₅ is shown in Fig. 4(e). It is not equivalent to K ◊ C₃, and the shortest input sequence distinguishing them is x₁x₁x₁x₂, it extends the sequence x₁x₁.

The input sequence x₁x₁x₁x₂ is added to Ψ, which becomes {x₁x₁x₁x₂, x₂x₁}. The sequence applied to the grey box produces the trace σ = x₁u₁v₁u₁v₁o₁x₁u₂v₁o₁x₁u₂v₁o₁x₂o₂. We have $ \upsigma_{{ \downarrow \left( {U \cup V} \right)}} = u_{1} v_{1} u_{1} v_{1} u_{2} v_{1} u_{2} v_{1} .\,\Omega \, = \left\{ {u_{1} v_{1} u_{1} v_{1} u_{2} v_{1} u_{2} v_{1} ,\,u_{1} v_{1} u_{2} v_{2} } \right\} $. Next Ω-conjecture D₅ is shown in Fig. 4(f). The composite FSM K ◊ D₅ is shown in Fig. 4(g). It is not equivalent to K ◊ E, and the shortest sequence distinguishing them is x₂x₁x₂x₁x₂, it extends the sequence x₂x₁.

The input sequence x₂x₁x₂x₁x₂ is added to Ψ, which becomes {x₁x₁x₁x₂, x₂x₁x₂x₁x₂}. Applied to the grey box it produces the trace σ = x₂u₁v₁o₁ x₁u₂v₂o₁x₂u₁v₁o₁x₁u₂v₁o₁x₂o₂. We have $ \upsigma_{{ \downarrow \left( {U \cup V} \right)}} = u_{1} v_{1} u_{2} v_{2} u_{1} v_{1} u_{2} v_{1} .\,\Omega \, = \left\{ {u_{1} v_{1} u_{1} v_{1} u_{2} v_{1} u_{2} v_{1} ,\,u_{1} v_{1} u_{2} v_{2} u_{1} v_{1} u_{2} v_{1} } \right\} $. The function Infer_conjecture(Ω, n, Π) returns False, since no solution with a state partition which does not extend the partition $ \uppi_{{D_{4} }} $ can be found. Algorithm 3 terminates with the conjecture C₃ (Fig. 4(c)) that is externally equivalent to the embedded FSM E in Fig. 3 and its external checking experiment Ψ = {x₁x₁x₁x₂, x₂x₁x₂x₁x₂}. In this example, both algorithms give the same experiment, though this should not be expected for other systems, since the function call-solver(formula) can make nondeterministic choices in solving constraints. Moreover, various input sequences can be chosen to deal with partial FSM products (see line 13 in Algorithm 3).

Theorem 2.

If a grey box behaves as a complete FSM and the embedded FSM E has n states, Algorithm 3 infers a conjecture with at most n states that is externally equivalent to E and constructs an external checking experiment for it.

Sketch of Proof.

Algorithm 3 follows the steps of Algorithm 2, just replacing the FSM E by a current conjecture. This does not influence its termination since it only occurs when no more externally distinguishable conjecture can be found. At some point, because the grey box behaves as a composite FSM of the known context FSM and the embedded machine with n states, an FSM that is externally equivalent to E will be returned by Infer_conjecture. The resulting set of external input sequences is an external checking experiment for the resulting FSM, as in Theorem 1.

8 Conclusions

We considered a system of communicating FSMs and investigated possibilities for active learning and testing of an embedded FSM without disassembling the system. The contribution of this paper is the generalization of the isolated FSM inference problem to that of an FSM embedded in a modular system (grey box learning) and an approach for solving this problem that does not depend on the composition topology. The approach also offers a novel solution to embedded testing by generating a complete test suite directly for the embedded machine that avoids intermediate testing of nondeterministic approximations, eliminating thus several sources of test redundancy inherent in the existing methods. We plan to perform more experiments to assess the proposed methods, especially for learning embedded components.

References

Biermann, A.W., Feldman, J.A.: On the synthesis of finite-state machines from samples of their behavior. IEEE Trans. Comput. 100(6), 592–597 (1972)
Article MathSciNet Google Scholar
El-Fakih, K., Petrenko, A., Yevtushenko, N.: FSM test translation through context. In: Uyar, M.Ü., Duale, A.Y., Fecko, M.A. (eds.) TestCom 2006. LNCS, vol. 3964, pp. 245–258. Springer, Heidelberg (2006). https://doi.org/10.1007/11754008_16
Chapter Google Scholar
Gold, E.M.: Complexity of automaton identification from given data. Inf. Control 37(3), 302–320 (1978)
Article MathSciNet Google Scholar
Groz, R., Li, K., Petrenko, A., Shahbaz, M.: Modular system verification by inference, testing and reachability analysis. In: Suzuki, K., Higashino, T., Ulrich, A., Hasegawa, T. (eds.) FATES/TestCom-2008. LNCS, vol. 5047, pp. 216–233. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-68524-1_16
Chapter Google Scholar
Groz, R., Li, K., Petrenko, A.: Integration testing of communicating systems with unknown components. Ann. Telecommun. 70(3), 107–125 (2015)
Article Google Scholar
Heule, Marijn J.H., Verwer, S.: Exact DFA identification using SAT solvers. In: Sempere, J.M., García, P. (eds.) ICGI 2010. LNCS (LNAI), vol. 6339, pp. 66–79. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15488-1_7
Chapter Google Scholar
Luo, G., Bochmann, G.V., Petrenko, A.: Test selection based on communicating nondeterministic finite-state machines using a generalized Wp-method. IEEE Trans. Softw. Eng. 20(2), 149–162 (1994)
Google Scholar
Meinke, K.: CGE: a sequential learning algorithm for Mealy automata. In: Sempere, J.M., García, P. (eds.) ICGI 2010. LNCS (LNAI), vol. 6339, pp. 148–162. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15488-1_13
Chapter Google Scholar
Oliveira, A.L., Silva, J.P.M.: Efficient algorithms for the inference of minimum size DFAs. Mach. Learn. 4(1), 93–119 (2001)
Article Google Scholar
Petrenko, A., Avellaneda, F., Groz, R., Oriat, C.: From passive to active FSM inference via checking sequence construction. In: Yevtushenko, N., Cavalli, A.R., Yenigün, H. (eds.) ICTSS 2017. LNCS, vol. 10533, pp. 126–141. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-67549-7_8
Chapter Google Scholar
Petrenko, A., Yevtushenko, N., Bochmann, G.V., Dssouli, R.: Testing in context: framework and test derivation. Comput. Commun. 19(14), 1236–1249 (1996)
Article Google Scholar
Petrenko, A., Yevtushenko, N., Bochmann, G.V.: Fault models for testing in context. In: Gotzhein, R., Bredereke, J. (eds.) Formal Description Techniques IX. IFIP Advances in Information and Communication Technology. Springer, Heidelberg (1996). https://doi.org/10.1007/978-0-387-35079-0_10
Chapter Google Scholar
Rivest, R.L., Schapire, R.E.: Inference of finite automata using homing sequences. In: Hanson, S.J., Remmele, W., Rivest, R.L. (eds.) Machine Learning: From Theory to Applications. LNCS, vol. 661, pp. 51–73. Springer, Heidelberg (1993). https://doi.org/10.1007/3-540-56483-7_22
Chapter Google Scholar
Petrenko, A., Yevtushenko, N.: Testing faults in embedded components. In: 10th International Workshop on Testing of Communicating Systems, pp. 272–287 (1997)
Chapter Google Scholar
Steffen, B., et al.: Active automata learning: from DFAs to interface programs and beyond. In: ICGI, pp. 195–209 (2012)
Google Scholar
Villa, T., Petrenko, A., Yevtushenko, N., Mishchenko, A., Brayton, R.: Component based design by solving language equations. Proc. IEEE 103(11), 2152–2167 (2015)
Article Google Scholar
Jaffar-ur Rehman, M., Jabeen, F., Bertolino, A., Polini, A.: Testing software components for integration: a survey of issues and techniques. Softw. Test. Verif. Reliab. 17, 95–133 (2007)
Article Google Scholar
Abel, A., Reineke, J.: Gray-Box learning of serial compositions of mealy machines. In: Rayadurgam, S., Tkachuk, O. (eds.) NFM 2016. LNCS, vol. 9690, pp. 272–287. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-40648-0_21
Chapter Google Scholar

Download references

Acknowledgements

This work was partially supported by MESI (Ministère de l’Économie, Science et Innovation) of Gouvernement du Québec, NSERC of Canada and CAE.

Author information

Authors and Affiliations

CRIM, Montreal, Canada
Alexandre Petrenko & Florent Avellaneda

Authors

Alexandre Petrenko
View author publications
You can also search for this author in PubMed Google Scholar
Florent Avellaneda
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alexandre Petrenko .

Editor information

Editors and Affiliations

University of Cádiz, Cadiz, Spain
Inmaculada Medina-Bulo
Complutense University of Madrid, Madrid, Spain
Mercedes G. Merayo
Brunel University London, Uxbridge, UK
Robert Hierons

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Petrenko, A., Avellaneda, F. (2018). Conformance Testing and Inference of Embedded Components. In: Medina-Bulo, I., Merayo, M., Hierons, R. (eds) Testing Software and Systems. ICTSS 2018. Lecture Notes in Computer Science(), vol 11146. Springer, Cham. https://doi.org/10.1007/978-3-319-99927-2_10

Download citation

DOI: https://doi.org/10.1007/978-3-319-99927-2_10
Published: 07 September 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-99926-5
Online ISBN: 978-3-319-99927-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Federation for Information Processing (opens in a new tab)

Conformance Testing and Inference of Embedded Components

Abstract

Similar content being viewed by others