Para2: parameterized path reduction, acceleration, and SMT for reachability in threshold-guarded distributed algorithms

Igor Konnov; Marijana Lazić; Helmut Veith; Josef Widder

doi:10.1007/s10703-017-0297-4

2017 Sep 20;51(2):270–307. doi: 10.1007/s10703-017-0297-4

PMCID: PMC6959411 PMID: 32009739

Abstract

Automatic verification of threshold-based fault-tolerant distributed algorithms (FTDA) is challenging: FTDAs have multiple parameters that are restricted by arithmetic conditions, the number of processes and faults is parameterized, and the algorithm code is parameterized due to conditions counting the number of received messages. Recently, we introduced a technique that first applies data and counter abstraction and then runs bounded model checking (BMC). Given an FTDA, our technique computes an upper bound on the diameter of the system. This makes BMC complete for reachability properties: it always finds a counterexample, if there is an actual error. To verify state-of-the-art FTDAs, further improvement is needed. In contrast to encoding bounded executions of a counter system over an abstract finite domain in SAT, in this paper, we encode bounded executions over integer counters in SMT. In addition, we introduce a new form of reduction that exploits acceleration and the structure of the FTDAs. This aggressively prunes the execution space to be explored by the solver. In this way, we verified safety of seven FTDAs that were out of reach before.

Keywords: Parameterized verification, Bounded model checking, Completeness, Partial orders in distributed systems, Reduction, Fault-tolerant distributed algorithms, Byzantine faults

Introduction

Replication is a classic approach to make computing systems more reliable. In order to avoid a single point of failure, one uses multiple processes in a distributed system. Then, if some of these processes fail (e.g., by crashing or deviating from their expected behavior) the distributed system as a whole should stay operational. For this purpose one uses fault-tolerant distributed algorithms (FTDAs). These algorithms have been extensively studied in distributed computing theory [1, 50], and found application in safety critical systems (automotive or aeronautic industry). With the recent advent of data centers and cloud computing we observe growing interest in fault-tolerant distributed algorithms, and their correctness, also for more mainstream computer science applications [19, 20, 31, 47, 52, 54, 60].

We consider automatic verification techniques specifically for threshold-based fault-tolerant distributed algorithms. In these algorithms, processes collect messages from their peers, and check whether the number of received messages reaches a threshold, e.g., a threshold may ensure that acknowledgments from a majority of processes have been received. Waiting for majorities, or more generally waiting for quorums, is a key pattern of many fault-tolerant algorithms, e.g., consensus, replicated state machine, and atomic commit. In [34] we introduced an efficient encoding of these algorithms, which we used in [33] for abstraction-based parameterized model checking of safety and liveness of several case study algorithms, which are parameterized in the number of processes n and the fraction of faults t, e.g., $n > 3 t$ . In [41] we were able to verify reachability properties of more involved algorithms by applying bounded model checking. We showed how to make bounded model checking complete in the parameterized case. In particular, we considered counter systems where we record for each local state, how many processes are in this state. We have one counter per local state $ℓ$ , denoted by $κ [ℓ]$ . A process step from local state $ℓ$ to local state $ℓ^{'}$ is modeled by decrementing $κ [ℓ]$ and incrementing $κ [ℓ^{'}]$ . When $δ$ processes perform the same step one after the other, we allow the processes to do the accelerated step that instantaneously changes the two affected counters by $δ$ . The number $δ$ is called acceleration factor, which can vary in a single run.

As we focus on threshold-based FTDAs, we consider counter systems defined by threshold automata. Here, transitions are guarded by threshold guards that compare a shared integer variable to a linear combination of parameters, e.g., $x \geq n - t$ or $x < t$ , where x is a shared variable and n and t are parameters.

Completeness of the method [41] with respect to reachability is shown by proving a bound on the diameter of the accelerated system. Inspired by Lamport’s view of distributed computation as partial order on events [43], our method uses a reduction similar to Lipton’s [48]. Instead of pruning executions that are “similar” to ones explored before as in partial order reduction [28, 53, 59], we use the partial order to show (offline) that every run has a similar run of bounded length. Interestingly, the bound is independent of the parameters. In [41], we introduced the following automated method, which combines this idea with data abstraction [33]:

1. Apply a parametric data abstraction to the process code to get a finite state process description, and construct the threshold automaton (TA) [33, 36].
2. Compute the diameter bound, based on the control flow of the TA.
3. Construct a system with abstract counters, i.e., a counter abstraction [33, 55].
4. Perform SAT-based bounded model checking [6, 16] up to the diameter bound, to check whether bad states are reached in the counter abstraction.
5. If a counterexample is found, check its feasibility and refine, if needed [13, 33].

The top part of Fig. 1 shows a diagram [40] of the technique based on counter abstraction. While this allowed us to automatically verify several FTDAs not verified before, there remained two bottlenecks for scalability to larger and more complex protocols. First, counter abstraction can lead to spurious counterexamples. As counters range over a finite abstract domain, the semantics of abstract increments and decrements on the counters introduces non-determinism. For instance, the value of a counter can remain unchanged after applying an increment. Intuitively, processes or messages can be “added” or “lost”, so that, for example, in the abstract model the number of messages sent can be smaller than the number of processes that have sent a message, which is obviously spurious behavior. Second, counter abstraction works well in practice only for processes with a few dozen local states. It has been observed in [4] that counter abstraction does not scale to hundreds of local states. We had a similar experience with counter abstraction in our experiments in [41]. We conjecture that this is partly due to the many different interleavings, which result in a large search space.

Fig. 1 — Tool chain with counter abstraction [27, 33, 41] on top, and with SMT-based bounded model checking on bottom

To address these bottlenecks, we make two crucial contributions in this paper:

To eliminate one of the two sources of spurious counterexamples, namely, the non-determinism added by abstract counters, we do bounded model checking using SMT solvers with linear integer arithmetic on the accelerated system, instead of SAT-based bounded model checking on the counter abstraction.
We reduce the search space dramatically: we introduce the notion of an execution schema that is defined as a sequence of local rules of the TA. By assigning to each rule of a schema an acceleration factor (possibly 0, which models that no process executes the rule), one obtains a run of the counter system. Hence, due to parameterization, each schema represents infinitely many runs. We show how to construct a set of schemas whose set of reachable states coincides with the set of reachable states of the accelerated counter system.

The resulting method is depicted at the bottom of Fig. 1. Our construction can be seen as an aggressive form of reduction, where each run has a similar run generated by a schema from the set. To show this, we capture the guards that are locked and unlocked in a context. Our key insight is that a bounded number of transitions changes the context in each run. For example, of all transitions increasing a variable x, at most one makes $x \geq n - t$ true, and at most one makes $x < t + 1$ false (the parameters n and t are fixed in a run, and shared variables can only be increased). We fix those transitions that change the context, and apply the ideas of reduction to the subexecutions between these transitions.

Our experiments show that SMT solvers and schemas outperform SAT solvers and counter abstraction in parameterized verification of threshold-based FTDAs. We verified safety of FTDAs [10, 18, 29, 51, 56, 57] that have not been automatically verified before. In addition we achieved dramatic speedup and reduced memory footprint for FTDAs [9, 12, 58] which previously were verified in [41].

In this article we focus on parameterized reachability properties. Recently, we extended this approach to safety and liveness, for which we used the reachability technique of this article as a black box [37].

Our approach at a glance

For modeling threshold-based FTDAs, we use threshold automata that were introduced in [38, 41] and are discussed in more detail in [40]. We use Fig. 2 to describe our contributions in this section. The figure presents a threshold automaton TA over two shared variables x and y and parameters n, t, and f, which is inspired by the distributed asynchronous broadcast protocol from [9]. There, $n - f$ correct processes concurrently follow the control flow of TA, and f processes are Byzantine faulty. As is typical for FTDAs, the parameters must satisfy a resilience condition, e.g., $n > 3 t \land t \geq f \geq 0$ , that is, less than a third of the processes are faulty. The circles depict the local states $ℓ_{1}, \dots, ℓ_{5}$ , two of them are the initial states $ℓ_{1}$ and $ℓ_{2}$ . The edges depict the rules $r_{1}, \dots, r_{6}$ labeled with guarded commands $φ \mapsto act$ , where $φ$ is one of the threshold guards “ $φ_{1} : x \geq ⌈ (n + t) / 2 ⌉ - f$ ”, “ $φ_{2} : y \geq (t + 1) - f$ ”, and “ $φ_{3} : y \geq (2 t + 1) - f$ ”, and an action $act$ increases the shared variables (x and y) by one, or zero (as in rule $r_{6}$ ).

We associate with every local state $ℓ_{i}$ a non-negative counter $κ [ℓ_{i}]$ that represents the number of processes in $ℓ_{i}$ . Together with the values of x, y, n, t, and f, the values of the counters constitute a configuration of the system. In the initial configuration there are $n - f$ processes in initial states, i.e., $κ [ℓ_{1}] + κ [ℓ_{2}] = n - f$ , and the other counters and the shared variables x and y are zero.

The rules define the transitions of the counter system. For example, according to the rule $r_{2}$ , if in the current configuration the guard $y \geq t + 1 - f$ holds true and $κ [ℓ_{1}] \geq 5$ , then five processes can instantaneously move out of the local state $ℓ_{1}$ to the local state $ℓ_{3}$ , and increment x as prescribed by the action of $r_{2}$ (since the evaluation of the guard $y \geq t + 1 - f$ cannot change from true to false). This results in increasing x and $κ [ℓ_{3}]$ by five, and decreasing the counter $κ [ℓ_{1}]$ by five. When, as in our example, rule $r_{2}$ is conceptually executed by 5 processes, we denote this transition by $(r_{2}, 5)$ , where 5 is the acceleration factor. A sequence of transitions forms a schedule, e.g., $(r_{1}, 2), (r_{3}, 1), (r_{1}, 1)$ .
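The accelerated transition $(r_{2}, 5)$ described above can be sketched in a few lines of Python. This is only an illustration: the dictionary layout, the function names, and the concrete configuration (with $n = 7$, $t = f = 1$ and $y = 1$) are assumptions chosen so that the guard holds; the configuration is not claimed to be reachable in the automaton of Fig. 2.

```python
# Hypothetical configuration: 6 = n - f processes, guard phi_2 already true.
params = {"n": 7, "t": 1, "f": 1}
config = {"kappa": {"l1": 6, "l2": 0, "l3": 0, "l4": 0, "l5": 0}, "x": 0, "y": 1}

def guard_phi2(cfg, p):
    # phi_2 : y >= (t + 1) - f
    return cfg["y"] >= (p["t"] + 1) - p["f"]

def apply_r2(cfg, p, factor):
    """Apply the accelerated transition (r2, factor): l1 -> l3 and x += factor."""
    if not guard_phi2(cfg, p):
        raise ValueError("guard phi_2 is locked")
    if cfg["kappa"]["l1"] < factor:
        raise ValueError("fewer than `factor` processes in l1")
    kappa = dict(cfg["kappa"])
    kappa["l1"] -= factor   # `factor` processes leave l1 ...
    kappa["l3"] += factor   # ... and arrive in l3
    # r2 only increases x, so phi_2 (a guard on y) cannot become false meanwhile.
    return {"kappa": kappa, "x": cfg["x"] + factor, "y": cfg["y"]}

after = apply_r2(config, params, 5)
```

Because $r_{2}$ updates only x while its guard refers only to y, it suffices here to check the guard once; the general definition checks the guards for every intermediate value of the shared variables.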

In this paper, we address a parameterized reachability problem, e.g., can at least one correct process reach the local state $ℓ_{5}$ , when $n - f$ correct processes start in the local state $ℓ_{1}$ ? Or, in terms of counter systems, is a configuration with $κ [ℓ_{5}] \neq 0$ reachable from an initial configuration with $κ [ℓ_{1}] = n - f \land κ [ℓ_{2}] = 0$ ? As discussed in [41], acceleration does not affect reachability, and precise treatment of the resilience condition and threshold guards is crucial for solving this problem.

Schemas

When applied to a configuration, a schedule generates a path, that is, an alternating sequence of configurations and transitions. As initially x and y are zero, threshold guards $φ_{1}$, $φ_{2}$, and $φ_{3}$ evaluate to false. As rules may increase variables, these guards may eventually become true. In our example we do not consider guards like $x < t + 1$ that are initially true and become false, although we formally treat them in our technique. In fact, initially only $r_{1}$ is unlocked. Because $r_{1}$ increases x, it may unlock $φ_{1}$. Thus $r_{4}$ becomes unlocked. Rule $r_{4}$ increases y and thus repeated application of $r_{4}$ (by different processes) first unlocks $φ_{2}$ and then $φ_{3}$. We introduce a notion of a context that is the set of threshold guards that evaluate to true in a configuration. For our example we observe that each path goes through the following sequence of contexts $\{\}$, $\{φ_{1}\}$, $\{φ_{1}, φ_{2}\}$, and $\{φ_{1}, φ_{2}, φ_{3}\}$. In fact, the sequence of contexts in a path is always monotonic, as the shared variables can only be increased.
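To make the notion of a context concrete, the following sketch computes the set of guards that hold for given values of x and y, using the three guards of Fig. 2 with $n = 5$ and $t = f = 1$. The function name and the sequence of shared-variable values are illustrative assumptions; the values are merely non-decreasing and are not derived from a particular run.

```python
def context(x, y, n, t, f):
    """Return the set of threshold guards of Fig. 2 that hold for (x, y)."""
    guards = {
        "phi1": x >= -(-(n + t) // 2) - f,  # x >= ceil((n + t) / 2) - f
        "phi2": y >= (t + 1) - f,
        "phi3": y >= (2 * t + 1) - f,
    }
    return frozenset(name for name, holds in guards.items() if holds)

n, t, f = 5, 1, 1
# Hypothetical non-decreasing values of the shared variables (x, y).
values = [(0, 0), (1, 0), (2, 0), (3, 1), (3, 2)]
contexts = [context(x, y, n, t, f) for x, y in values]

# Monotonicity: each context is a subset of the next one.
is_monotonic = all(a <= b for a, b in zip(contexts, contexts[1:]))
```

Since all three guards are lower bounds on variables that never decrease, a guard that has become true stays true, which is why the subset check succeeds.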

The conjunction of the guards in the context $\{φ_{1}, φ_{2}\}$ implies the guards of the rules $r_{1}, r_{2}, r_{3}, r_{4}, r_{5}$; we call these rules unlocked in the context. This motivates our definition of a schema: a sequence of contexts and rules. We give an example of a schema below, where inside the curly brackets we give the contexts, and fixed sequences of rules in between. (We discuss the underlined rules below.)

2.1

Given a schema, we can generate a schedule by attaching to each rule an acceleration factor, which can possibly be 0. For instance, if we attach non-zero factors to the underlined rules in S, and a zero factor to the other rules, we generate the following schedule $τ^{'}$ (we omit the transitions with 0 factors here).

2.2

It can easily be checked that $τ^{'}$ is generated by schema S, because the sequence of the underlined rules in S matches the sequence of rules appearing in $τ^{'}$ .
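The relation between a schema and the schedules it generates can be sketched as follows. The sketch ignores contexts and looks only at the rule sequence; the rule sequence used in the example is hypothetical and is not the schema S of Eq. (2.1).

```python
def generate_schedule(schema_rules, factors):
    """Attach one acceleration factor to each rule; drop zero-factor transitions."""
    if len(schema_rules) != len(factors):
        raise ValueError("need exactly one factor per rule of the schema")
    return [(rule, k) for rule, k in zip(schema_rules, factors) if k > 0]

def is_generated_by(schedule, schema_rules):
    """Check that the rules of `schedule` occur, in order, within `schema_rules`."""
    remaining = iter(schema_rules)
    # Each `any` consumes the iterator up to the first match, so order is respected.
    return all(any(rule == s for s in remaining) for rule, _ in schedule)

# Hypothetical rule sequence of a schema (contexts omitted).
rules = ["r1", "r1", "r3", "r4", "r4", "r5"]
schedule = generate_schedule(rules, [2, 1, 0, 3, 0, 1])
```

Here `schedule` contains only the four transitions with non-zero factors, and `is_generated_by(schedule, rules)` confirms the subsequence match that the text describes for the underlined rules.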

In this paper, we show that the schedules generated by a few schemas (one per monotonic sequence of contexts) span the set of all reachable configurations. To this end, we apply reduction and acceleration to relate arbitrary schedules to their representatives, which are generated by schemas.

Reduction and acceleration

In this section we show what we mean by a schedule being “related” to its representative. Consider, e.g., the following schedule $τ$ from the initial state $σ_{0}$ with $n = 5$ , $t = f = 1$ , $κ [ℓ_{1}] = 1$ , and $κ [ℓ_{2}] = 3$ :

Observe that after $(r_{1}, 1), (r_{1}, 1)$, variable $x = 2$, and thus $φ_{1}$ is true. Hence transition $t_{1}$ changes the context from $\{\}$ to $\{φ_{1}\}$. Similarly $t_{2}$ and $t_{3}$ change the context. Context changing transitions are marked with curly brackets. Between them we have the subschedules $τ_{1}, \dots, τ_{4}$ ($τ_{3}$ is empty) marked with square brackets.

To show that this schedule is captured by the schema (2.1), we apply partial order arguments (that is, a mover analysis [48]) regarding distributed computations: As the guards $φ_{2}$ and $φ_{3}$ evaluate to true in $τ_{4}$, and $r_{5}$ precedes $r_{6}$ in the control flow of the TA, all transitions $(r_{5}, 1)$ can be moved to the left in $τ_{4}$. Similarly, $(r_{1}, 1)$ can be moved to the left in $τ_{2}$. The resulting schedule is applicable and leads to the same configuration as the original one. Further, we can accelerate the adjacent transitions with the same rule, e.g., the sequence $(r_{5}, 1), (r_{5}, 1)$ can be transformed into $(r_{5}, 2)$. Thus, we transform subschedules $τ_{i}$ into $τ_{i}^{'}$, and arrive at the schedule $τ^{'}$ from Eq. (2.2), which we call the representative schedule of $τ$. As the representative schedule $τ^{'}$ is generated from the schema in (2.1), we say that the schema captures schedule $τ$. (It also captures $τ^{'}$.) Importantly for reachability checking, if $τ$ and $τ^{'}$ are applied to the same configuration, they end in the same configuration. These arguments are formalized in Sects. 5, 6, and 7.
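The acceleration step, which merges adjacent transitions of the same rule, can be sketched as below. The function name is an assumption, and the sketch performs only the syntactic merge; that the merged schedule stays applicable and reaches the same configuration is what the formal arguments of the paper establish.

```python
from itertools import groupby

def accelerate(schedule):
    """Merge runs of adjacent transitions with the same rule by summing factors."""
    return [
        (rule, sum(factor for _, factor in run))
        for rule, run in groupby(schedule, key=lambda transition: transition[0])
    ]

merged = accelerate([("r5", 1), ("r5", 1), ("r6", 1)])
```

Only adjacent occurrences are merged, so transitions first have to be reordered by the mover argument before acceleration can shorten the schedule.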

Encoding a schema in SMT

One of the key insights in this paper is that reachability checking via schemas can be encoded efficiently as SMT queries in linear integer arithmetic. In more detail, finite paths of counter systems can be expressed with inequalities over counters such as $κ [ℓ_{2}]$ and $κ [ℓ_{3}]$ , shared variables such as x and y, parameters such as n, t, and f, and acceleration factors. Also, threshold guards and resilience conditions are expressions in linear integer arithmetic.

We give an example of reachability checking with SMT using the simple schema $\{\} \; r_{1}, r_{1} \; \{φ_{1}\}$ which is contained in the schema S in Eq. (2.1). To obtain a complete encoding for S, one can similarly encode the other simple schemas and combine them.

To this end, we have to express constraints on three configurations $σ_{0}$ , $σ_{1}$ , and $σ_{2}$ . For the initial configuration $σ_{0}$ , we introduce integer variables: $κ_{1}^{0}, \dots, κ_{5}^{0}$ for local state counters, $x^{0}$ and $y^{0}$ for shared variables, and n, t, and f for parameters. As is written in Eq. (2.3), the configuration $σ_{0}$ should satisfy the initial constraints, and its context should be empty (that is, all guards evaluate to false):

$$\begin{aligned} & κ_{1}^{0} + κ_{2}^{0} = n - f \land κ_{3}^{0} = κ_{4}^{0} = κ_{5}^{0} = 0 \land x^{0} = y^{0} = 0 \\ & \land n > 3 t \land t \geq f \geq 0 \land (\neg φ_{1} \land \neg φ_{2} \land \neg φ_{3}) [x^{0} / x, y^{0} / y] \end{aligned} \tag{2.3}$$

The configuration $σ_{1}$ is reached from $σ_{0}$ by applying a transition with the rule $r_{1}$ and an acceleration factor $δ^{1}$, and the configuration $σ_{2}$ is reached from $σ_{1}$ by applying a transition with the rule $r_{1}$ and an acceleration factor $δ^{2}$. Applying a transition with the rule $r_{1}$ to $σ_{0}$ just means to increase both $κ [ℓ_{3}]$ and x by $δ^{1}$ and decrease $κ [ℓ_{2}]$ by $δ^{1}$. Hence, we introduce four fresh variables per transition and add the arithmetic operations. According to the schema, configuration $σ_{2}$ has the context $\{φ_{1}\}$. The following equations express these constraints:

$$κ_{3}^{1} = κ_{3}^{0} + δ^{1} \land κ_{2}^{1} = κ_{2}^{0} - δ^{1} \land x^{1} = x^{0} + δ^{1} \tag{2.4}$$

$$\begin{aligned} & κ_{3}^{2} = κ_{3}^{1} + δ^{2} \land κ_{2}^{2} = κ_{2}^{1} - δ^{2} \land x^{2} = x^{1} + δ^{2} \\ & \land (φ_{1} \land \neg φ_{2} \land \neg φ_{3}) [x^{2} / x, y^{0} / y] \end{aligned} \tag{2.5}$$

Finally, we express the reachability question for all paths generated by the simple schema $\{\} \; r_{1}, r_{1} \; \{φ_{1}\}$. Whether there is a configuration with $κ [ℓ_{5}] \neq 0$ reachable from an initial configuration with $κ [ℓ_{1}] = n - f$ and $κ [ℓ_{2}] = 0$ can then be encoded as:

$$κ_{1}^{0} = n - f \land κ_{2}^{0} = 0 \land κ_{5}^{0} \neq 0 \tag{2.6}$$

Note that we check only $κ_{5}^{0}$ against zero, as the local state $ℓ_{5}$ is never updated by the rule $r_{1}$. It is easy to see that the conjunction of Eqs. (2.3)–(2.6) does not have a solution, and thus all paths generated by the schema $\{\} \; r_{1}, r_{1} \; \{φ_{1}\}$ do not reach a configuration with $κ [ℓ_{5}] \neq 0$. By writing down constraints for the other three simple schemas in Eq. (2.1), we can check reachability for the paths generated by the whole schema as well. As discussed in Sect. 2.1, our results also imply reachability on all paths whose representatives are generated by the schema. More details on the SMT encoding can be found in Sect. 9.
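As a sanity check of the constraints in Eqs. (2.3)–(2.6), one can enumerate all assignments up to a small bound. The sketch below is a brute-force stand-in and not an SMT encoding: it explores only parameter values up to `bound`, whereas an SMT solver reasons about all integers. The function names and the explicit requirement that $κ_{2}$ stays non-negative (which follows from counters being non-negative) are assumptions of this sketch.

```python
from itertools import product

def phi1(x, n, t, f):
    return x >= -(-(n + t) // 2) - f  # x >= ceil((n + t) / 2) - f

def phi2(y, t, f):
    return y >= (t + 1) - f

def phi3(y, t, f):
    return y >= (2 * t + 1) - f

def solutions(bound, with_goal):
    """Assignments satisfying Eqs. (2.3)-(2.5), and also (2.6) if with_goal."""
    found = []
    for n, t, f in product(range(bound + 1), repeat=3):
        if not (n > 3 * t and t >= f):
            continue  # resilience condition
        x0 = y0 = 0
        if phi1(x0, n, t, f) or phi2(y0, t, f) or phi3(y0, t, f):
            continue  # the context of sigma_0 must be empty
        for k1 in range(n - f + 1):
            k2 = n - f - k1  # kappa_1^0 + kappa_2^0 = n - f
            k5 = 0           # kappa_5^0 = 0 by Eq. (2.3)
            if with_goal and not (k1 == n - f and k2 == 0 and k5 != 0):
                continue     # Eq. (2.6)
            for d1 in range(k2 + 1):
                for d2 in range(k2 - d1 + 1):  # kappa_2 must stay non-negative
                    x2 = x0 + d1 + d2          # Eqs. (2.4) and (2.5)
                    if phi1(x2, n, t, f) and not phi2(y0, t, f) and not phi3(y0, t, f):
                        found.append((n, t, f, k1, k2, d1, d2))
    return found
```

Within the bound, `solutions(5, True)` is empty because Eq. (2.3) forces $κ_{5}^{0} = 0$ while Eq. (2.6) requires $κ_{5}^{0} \neq 0$; dropping Eq. (2.6) yields satisfying assignments, so the path constraints themselves are consistent.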

Parameterized counter systems

We recall the framework of [41] to the extent necessary, and extend it with the notion of a context in Sect. 3.2. A threshold automaton describes a process in a concurrent system, and is a tuple $TA = (L, I, Γ, Π, R, RC)$ defined below.

The finite set $L$ contains the local states, and $I \subseteq L$ is the set of initial states. The finite set $Γ$ contains the shared variables that range over the natural numbers $N_{0}$. The finite set $Π$ is a set of parameter variables that range over $N_{0}$, and the resilience condition $RC$ is a formula over parameter variables in linear integer arithmetic, e.g., $n > 3 t$. The set of admissible parameters is $P_{R C} = \{ p \in N_{0}^{| Π |} : p ⊧ RC \}$.

A key ingredient of threshold automata are threshold guards (or, just guards):

Definition 3.1

A threshold guard is an inequality of one of the following two forms:

(R): $x \geq a_{0} + a_{1} \cdot p_{1} + \dots + a_{| Π |} \cdot p_{| Π |}$ , or
(F): $x < a_{0} + a_{1} \cdot p_{1} + \dots + a_{| Π |} \cdot p_{| Π |}$ ,

where $x \in Γ$ is a shared variable, $a_{0}, \dots, a_{| Π |} \in Z$ are integer coefficients, and $p_{1}, \dots, p_{| Π |} \in Π$ are parameters. We denote the set of all guards of the form (R) by $Φ^{rise}$ , and the set of all guards of the form (F) by $Φ^{fall}$ .

A rule defines a conditional transition between local states that may update the shared variables. Formally, a rule is a tuple $(f r o m, t o, φ^{rise}, φ^{fall}, u)$ : the local states $f r o m$ and $t o$ are from $L$ . (Intuitively, they capture from which local state to which a process moves.) A rule is only executed if the conditions $φ^{rise}$ and $φ^{fall}$ evaluate to true. Condition $φ^{rise}$ is a conjunction of guards from $Φ^{rise}$ , and $φ^{fall}$ is a conjunction of guards from $Φ^{fall}$ (cf. Definition 3.1). We denote the set of guards used in $φ^{rise}$ by $guard (φ^{rise})$ , and $guard (φ^{fall})$ is the set of guards used in $φ^{fall}$ .

Rules may increase shared variables using an update vector $u \in N_{0}^{| Γ |}$ that is added to the vector of shared variables. As $u \in N_{0}^{| Γ |}$ , global variables can only be increased or left unchanged. As will be later formalized in Proposition 3.1, guards from $Φ^{rise}$ can only change from false to true (rise), and guards from $Φ^{fall}$ can change from true to false (fall). Finally, $R$ is the finite set of rules. We use the dot notation to refer to components of rules, e.g., $r . f r o m$ or $r . u$ .

Example 3.1

In Fig. 2, the rule $r_{2} : φ_{2} \mapsto x + +$ that describes a transition from $ℓ_{1}$ to $ℓ_{3}$ , can formally be written as $(ℓ_{1}, ℓ_{3}, φ_{2}, ⊤, (1, 0))$ . Its intuitive meaning is as follows. If the guard $φ_{2} : y \geq (t + 1) - f$ evaluates to true, a process can move from the local state $ℓ_{1}$ to the local state $ℓ_{3}$ , and the global variable x is incremented, while y remains unchanged. We formalize the semantics as counter systems in Sect. 3.1.
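The tuple form of guards and rules maps naturally onto small immutable records. The sketch below is one possible representation, not the data structures of the authors' tool; the class and field names are assumptions (`source` and `target` stand for $from$ and $to$, since `from` is a reserved word in Python).

```python
from dataclasses import dataclass
from typing import Dict, Tuple

@dataclass(frozen=True)
class Guard:
    var: str                             # shared variable on the left-hand side
    kind: str                            # "rise" for >=, "fall" for <
    a0: int                              # constant coefficient a_0
    coeffs: Tuple[Tuple[str, int], ...]  # (parameter, coefficient) pairs

    def holds(self, shared: Dict[str, int], params: Dict[str, int]) -> bool:
        bound = self.a0 + sum(a * params[p] for p, a in self.coeffs)
        if self.kind == "rise":
            return shared[self.var] >= bound
        return shared[self.var] < bound

@dataclass(frozen=True)
class Rule:
    source: str                          # local state `from`
    target: str                          # local state `to`
    rise: Tuple[Guard, ...]              # conjunction of rising guards
    fall: Tuple[Guard, ...]              # conjunction of falling guards (empty = true)
    update: Tuple[Tuple[str, int], ...]  # update vector u, one entry per shared variable

# phi_2 : y >= (t + 1) - f, i.e. a_0 = 1 with coefficients 1 for t and -1 for f.
phi2 = Guard("y", "rise", 1, (("t", 1), ("f", -1)))
# r_2 = (l1, l3, phi_2, true, (1, 0))
r2 = Rule("l1", "l3", (phi2,), (), (("x", 1), ("y", 0)))
```

With $t = f = 1$, the guard `phi2` holds as soon as $y \geq 1$, matching the reading of $φ_{2}$ in the example.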

Definition 3.2

Given a threshold automaton $(L, I, Γ, Π, R, RC)$, we define the precedence relation $≺_{P}$: for a pair of rules $r_{1}, r_{2} \in R$, it holds that $r_{1} ≺_{P} r_{2}$ if and only if $r_{1} . t o = r_{2} . f r o m$. We denote by $≺_{P}^{+}$ the transitive closure of $≺_{P}$. Further, we say that $r_{1} \sim_{P} r_{2}$, if $r_{1} ≺_{P}^{+} r_{2} \land r_{2} ≺_{P}^{+} r_{1}$, or $r_{1} = r_{2}$.

Assumption 3.3

We limit ourselves to threshold automata relevant for FTDAs, i.e., those where $r . u = 0$ for all rules $r \in R$ that satisfy $r ≺_{P}^{+} r$ . Such automata were called canonical in [41].

Remark 3.1

We use threshold automata to model fault-tolerant distributed algorithms that count messages from distinct senders. These algorithms are based on an “idealistic” reliable communication assumption (no message loss); these assumptions are typically expected to be ensured by “lower level bookkeeping code”, e.g., communication protocols. As a result, the algorithms we consider here do not gain from sending the same message (that is, increasing a variable) inside a loop, so that we can focus on threshold automata that do not increase shared variables in loops.

Example 3.2

In the threshold automaton from Fig. 3 we have that $r_{2} ≺_{P} r_{3} ≺_{P} r_{4} ≺_{P} r_{5} ≺_{P} r_{6} ≺_{P} r_{8} ≺_{P} r_{2}$ . Thus, we have that $r_{2} ≺_{P}^{+} r_{2}$ . In our case this implies that $r_{2} . u = 0$ by definition. Similarly we can conclude that $r_{3} . u = r_{4} . u = r_{5} . u = r_{6} . u = r_{7} . u = r_{8} . u = 0$ .

Looplets. The relation $\sim_{P}$ defines equivalence classes of rules. An equivalence class corresponds to a loop or to a single rule that is not part of a loop. Hence, we use the term looplet for one such equivalence class. For a given set of rules $R$ let $R / \sim$ be the set of equivalence classes defined by $\sim_{P}$. We denote by $[r]$ the equivalence class of rule r. For two classes $c_{1}$ and $c_{2}$ from $R / \sim$ we write $c_{1} ≺_{C} c_{2}$ iff there are two rules $r_{1}$ and $r_{2}$ in $R$ satisfying $[r_{1}] = c_{1}$ and $[r_{2}] = c_{2}$ and $r_{1} ≺_{P}^{+} r_{2}$ and $r_{1} ≁_{P} r_{2}$. As the relation $≺_{C}$ is a strict partial order, there are linear extensions of $≺_{C}$. Below, we fix an arbitrary one of these linear extensions to sort transitions in a schedule: we denote by $≺_{C}^{lin}$ a linear extension of $≺_{C}$.

Example 3.3

Consider Fig. 3. The threshold automaton has five looplets: $c_{1} = \{r_{1}\}$, $c_{2} = \{r_{2}, \dots, r_{8}\}$, $c_{3} = \{r_{9}\}$, $c_{4} = \{r_{10}\}$, and $c_{5} = \{r_{11}\}$. From $r_{9} ≺_{P} r_{10}$, it follows that $c_{3} ≺_{C} c_{4}$ (that is, $c_{3}$ precedes $c_{4}$ in the order on looplets), and from $r_{4} ≺_{P}^{+} r_{10}$, it follows that $c_{2} ≺_{C} c_{4}$. This order on looplets admits more than one linear extension. In this paper we always fix one linear extension.
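The precedence relation, its transitive closure, and the resulting looplets can be computed directly from the definitions. The sketch below uses a small hypothetical automaton with four rules (it is not the automaton of Fig. 3), and all function names are assumptions.

```python
def precedence(rules):
    """Pairs (r1, r2) with r1.to == r2.from; `rules` maps a name to (from, to)."""
    return {
        (a, b)
        for a, (_, to_a) in rules.items()
        for b, (from_b, _) in rules.items()
        if to_a == from_b
    }

def transitive_closure(relation):
    closure = set(relation)
    while True:
        new = {(a, d) for a, b in closure for c, d in closure if b == c} - closure
        if not new:
            return closure
        closure |= new

def looplets(rules):
    """Equivalence classes of rules: mutually reachable rules, or a single rule."""
    plus = transitive_closure(precedence(rules))
    classes = []
    for r in rules:
        cls = frozenset({r} | {s for s in rules if (r, s) in plus and (s, r) in plus})
        if cls not in classes:
            classes.append(cls)
    return classes

# Hypothetical automaton: rules b and c form a loop between l2 and l3.
rules = {"a": ("l1", "l2"), "b": ("l2", "l3"), "c": ("l3", "l2"), "d": ("l3", "l4")}
classes = looplets(rules)
```

For this example the looplets are `{a}`, `{b, c}`, and `{d}`; under Assumption 3.3 the rules b and c, which lie on a loop, would have to carry a zero update vector.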

Remark 3.2

It may seem natural to collapse such loops into singleton local states. In our case studies, e.g., [29], non-trivial loops are used to express non-deterministic choice due to failure detectors [12], as shown in Fig. 4. Importantly, some local states inside the loops appear in the specifications. Thus, one would have to use arguments from distributed computing to characterize when collapsing states is sound. In this paper, we present a technique that deals with the loops without the need for additional modelling arguments.

Fig. 4 — A typical structure found in threshold automata that model fault-tolerant algorithms with a failure detector [12]. The gray circles depict those local states, where the failure detector reports a crash. The local states $ℓ_{i}$ and $ℓ_{i}^{'}$ differ only in the output of the failure detector. As the failure detector output changes non-deterministically, the threshold automaton contains loops of size two

Counter systems

We use a function $N : P_{R C} \to N_{0}$ to capture the number of processes for each combination of parameters. As we use SMT, we assume that $N$ can be expressed in linear integer arithmetic. For instance, if only correct processes are explicitly modeled we typically have $N (n, t, f) = n - f$, and the respective SMT expression is $n - f$. Given $N$, a threshold automaton $TA$, and admissible parameter values $p \in P_{R C}$, we define a counter system as a transition system $(Σ, I, R)$. It consists of the set of configurations $Σ$, which contain evaluations of the counters and variables, the set of initial configurations $I$, and the transition relation $R$:

Configurations $Σ$ and $I$. A configuration $σ = (κ, g, p)$ consists of a vector of counter values $σ . κ \in N_{0}^{| L |}$ (for simplicity we use the convention that $L = \{1, \dots, | L |\}$), a vector of shared variable values $σ . g \in N_{0}^{| Γ |}$, and a vector of parameter values $σ . p = p$. The set $Σ$ is the set of all configurations. The set of initial configurations $I$ contains the configurations that satisfy $σ . g = 0$, $\sum_{i \in I} σ . κ [i] = N (p)$, and $\sum_{i \notin I} σ . κ [i] = 0$. This means that in every initial configuration all global variables have zero values, and all $N (p)$ modeled processes are located only in the initial local states.

Example 3.4

Consider the threshold automaton from Fig. 2 with the initial states $ℓ_{1}$ and $ℓ_{2}$. Let us consider a system of five processes, one of them being Byzantine faulty. Thus, $n = 5$, $t = f = 1$, and we explicitly model $N (5, 1, 1) = n - f = 4$ correct processes. One of the initial configurations is $σ = (κ, g, p)$, where $σ . κ = (1, 3, 0, 0, 0)$, $σ . g = (0, 0)$, and $σ . p = (5, 1, 1)$. In other words, there is one process in $ℓ_{1}$, three processes in $ℓ_{2}$, and global variables are initially $x = y = 0$. Note that $\sum_{i \in I} σ . κ [i] = κ [ℓ_{1}] + κ [ℓ_{2}] = 1 + 3 = 4 = N (5, 1, 1)$.
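For fixed parameters, the initial configurations of this example can be listed explicitly. The sketch below assumes the shape of Fig. 2 as described in the text (five local states, initial states $ℓ_{1}$ and $ℓ_{2}$, two shared variables) and $N (n, t, f) = n - f$; the function name is an assumption.

```python
def initial_configurations(n, t, f):
    """All initial configurations (kappa, g, p) for the automaton of Fig. 2."""
    total = n - f  # N(n, t, f) = n - f modeled processes
    return [
        ((k1, total - k1, 0, 0, 0), (0, 0), (n, t, f))  # all processes in l1 or l2
        for k1 in range(total + 1)
    ]

configs = initial_configurations(5, 1, 1)
```

For $n = 5$ and $t = f = 1$ this yields five initial configurations, one of which is the configuration with $κ = (1, 3, 0, 0, 0)$ from the example.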

Transition relation $R$. A transition is a pair $t = (r u l e, f a c t o r)$ of a rule of the $TA$ and a non-negative integer called the acceleration factor, or just factor for short. (As already discussed in Sect. 2.1, we will use the zero factors when generating schedules from schemas.) For a transition $t = (r u l e, f a c t o r)$ we refer by $t . u$ to $r u l e . u$, and by $t . φ^{fall}$ to $r u l e . φ^{fall}$, etc. We say a transition t is unlocked in configuration $σ$ if $(σ . κ, σ . g + k \cdot t . u, σ . p) ⊧ t . φ^{rise} \land t . φ^{fall}$ holds for all $k \in \{0, \dots, t . f a c t o r - 1\}$. Note that here we use the notation that a configuration satisfies a formula, which is considered true if and only if the formula becomes true when all free variables of the formula are evaluated as in the configuration.

We say that transition t is applicable (or enabled) in configuration $σ$ , if it is unlocked, and $σ . κ [t . f r o m] \geq t . f a c t o r$ . (As all counters are non-negative, a transition with the zero $f a c t o r$ is always applicable to all configurations provided that the guards are unlocked.) We say that $σ^{'}$ is the result of applying the enabled transition t to $σ$ , and write $σ^{'} = t (σ)$ , if

$σ^{'} . g = σ . g + t . f a c t o r \cdot t . u$ and $σ^{'} . p = σ . p$
if $t . f r o m \neq t . t o$ then
- $σ^{'} . κ [t . f r o m] = σ . κ [t . f r o m] - t . f a c t o r$ ,
- $σ^{'} . κ [t . t o] = σ . κ [t . t o] + t . f a c t o r$ , and
- $\forall ℓ \in L \setminus \{t . f r o m, t . t o\}$ it holds that $σ^{'} . κ [ℓ] = σ . κ [ℓ]$
if $t . f r o m = t . t o$ then $σ^{'} . κ = σ . κ$
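The role of the intermediate values $σ . g + k \cdot t . u$ in the definition of an unlocked transition becomes visible with a falling guard. The sketch below restricts itself to a single shared variable x and a hypothetical rule with update 1 that is guarded by $x \geq 0$ and $x < 3$; the function names and the pre-evaluated bounds are assumptions of this sketch.

```python
def unlocked(x, update, factor, rise_bound, fall_bound):
    """Guards x >= rise_bound and x < fall_bound must hold for x + k * update,
    for every k in 0 .. factor - 1."""
    return all(rise_bound <= x + k * update < fall_bound for k in range(factor))

def applicable(counter_from, x, update, factor, rise_bound, fall_bound):
    """Unlocked, and enough processes in the source local state."""
    return unlocked(x, update, factor, rise_bound, fall_bound) and counter_from >= factor
```

Starting from $x = 0$, a factor of 3 is unlocked because the guard holds for $x = 0, 1, 2$, whereas a factor of 4 is not, since the fourth process would see $x = 3$. With factor 0 the range of k is empty, so the transition is unlocked regardless of the guards.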

The transition relation $R \subseteq Σ \times Σ$ of the counter system is defined as follows: $(σ, σ^{'}) \in R$ iff there is a rule $r \in R$ and a factor $k \in N_{0}$ such that $σ^{'} = t (σ)$ for $t = (r, k)$ . Updates do not decrease the values of shared variables, and thus the following proposition was introduced in [41]:

Proposition 3.1

[41] For all configurations $σ$ , all rules r, and all transitions t applicable to $σ$ , the following holds:

1. If $σ ⊧ r . φ^{rise}$ then $t (σ) ⊧ r . φ^{rise}$
2. If $t (σ) ⊭ r . φ^{rise}$ then $σ ⊭ r . φ^{rise}$
3. If $σ ⊭ r . φ^{fall}$ then $t (σ) ⊭ r . φ^{fall}$
4. If $t (σ) ⊧ r . φ^{fall}$ then $σ ⊧ r . φ^{fall}$

Schedules and paths A schedule is a (finite) sequence of transitions. For a schedule $τ$ and an index $i : 1 \leq i \leq | τ |$ , by $τ [i]$ we denote the ith transition of $τ$ , and by $τ^{i}$ we denote the prefix $τ [1], \dots, τ [i]$ of $τ$ . A schedule $τ = t_{1}, \dots, t_{m}$ is applicable to configuration $σ_{0}$ , if there is a sequence of configurations $σ_{1}, \dots, σ_{m}$ with $σ_{i} = t_{i} (σ_{i - 1})$ for $1 \leq i \leq m$ . If there is a $t_{i} . f a c t o r > 1$ , then a schedule is accelerated.

By $τ \cdot τ^{'}$ we denote the concatenation of two schedules $τ$ and $τ^{'}$ . A sequence $σ_{0}, t_{1}, σ_{1}, \dots, σ_{k - 1}, t_{k}, σ_{k}$ of alternating configurations and transitions is called a (finite) path, if transition $t_{i}$ is enabled in $σ_{i - 1}$ and $σ_{i} = t_{i} (σ_{i - 1})$ , for $1 \leq i \leq k$ . For a configuration $σ_{0}$ and a schedule $τ$ applicable to $σ_{0}$ , by $path (σ_{0}, τ)$ we denote the path $σ_{0}, t_{1}, \dots, t_{| τ |}, σ_{| τ |}$ with $t_{i} = τ [i]$ and $σ_{i} = t_{i} (σ_{i - 1})$ , for $1 \leq i \leq | τ |$ .

Contexts and slices

The evaluation of the guards in the sets $Φ^{rise}$ and $Φ^{fall}$ in a configuration solely defines whether certain transitions are unlocked (but not necessarily enabled). From Proposition 3.1, one can see that after a transition has been applied, more guards from $Φ^{rise}$ may get unlocked and more guards from $Φ^{fall}$ may get locked. In other words, more guards from $Φ^{rise}$ may evaluate to true and more guards from $Φ^{fall}$ may evaluate to false. To capture this intuition, we define:

Definition 3.4

A context $Ω$ is a pair $(Ω^{rise}, Ω^{fall})$ with $Ω^{rise} \subseteq Φ^{rise}$ and $Ω^{fall} \subseteq Φ^{fall}$ . We denote by $| Ω | = | Ω^{rise} | + | Ω^{fall} |$ .

For two contexts $(Ω_{1}^{rise}, Ω_{1}^{fall})$ and $(Ω_{2}^{rise}, Ω_{2}^{fall})$ , we define that $(Ω_{1}^{rise}, Ω_{1}^{fall}) ⊏ (Ω_{2}^{rise}, Ω_{2}^{fall})$ if and only if $Ω_{1}^{rise} \cup Ω_{1}^{fall} \subset Ω_{2}^{rise} \cup Ω_{2}^{fall}$ . Then, a sequence of contexts $Ω_{1}, \dots, Ω_{m}$ is monotonically increasing, if $Ω_{i} ⊏ Ω_{i + 1}$ , for $1 \leq i < m$ . Further, a monotonically increasing sequence of contexts $Ω_{1}, \dots, Ω_{m}$ is maximal, if $Ω_{1} = (\emptyset, \emptyset)$ and $Ω_{m} = (Φ^{rise}, Φ^{fall})$ and $| Ω_{i + 1} | = | Ω_{i} | + 1$ , for $1 \leq i < m$ . We obtain:

Proposition 3.2

Every maximal monotonically increasing sequence of contexts is of length $| Φ^{rise} | + | Φ^{fall} | + 1$ . There are at most $(| Φ^{rise} | + | Φ^{fall} |)!$ such sequences.

Example 3.5

For the example in Fig. 2, we have $Φ^{rise} = {φ_{1}, φ_{2}, φ_{3}}$ , and $Φ^{fall} = \emptyset$ . Thus, there are $(| Φ^{rise} | + | Φ^{fall} |)! = 6$ maximal monotonically increasing sequences of contexts. Two of them are $(\emptyset, \emptyset) ⊏ ({φ_{1}}, \emptyset) ⊏ ({φ_{1}, φ_{2}}, \emptyset) ⊏ ({φ_{1}, φ_{2}, φ_{3}}, \emptyset)$ and $(\emptyset, \emptyset) ⊏ ({φ_{3}}, \emptyset) ⊏ ({φ_{1}, φ_{3}}, \emptyset) ⊏ ({φ_{1}, φ_{2}, φ_{3}}, \emptyset)$ . All of them have length $| Φ^{rise} | + | Φ^{fall} | + 1 = 4$ .
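The counting argument of Proposition 3.2 can be reproduced with a short Python sketch: each maximal monotonically increasing sequence corresponds to one order in which the guards are added to the context. The function name `maximal_context_sequences` and the string names of the guards are assumptions for illustration. The sketch enumerates the syntactic upper bound only and does not check whether a sequence is feasible for a concrete automaton.

```python
from itertools import permutations
from typing import FrozenSet, Iterator, List, Sequence, Tuple

Context = Tuple[FrozenSet[str], FrozenSet[str]]  # (Omega_rise, Omega_fall)


def maximal_context_sequences(
    rise: Sequence[str], fall: Sequence[str]
) -> Iterator[List[Context]]:
    """Yield every maximal monotonically increasing sequence of contexts."""
    tagged = [("rise", g) for g in rise] + [("fall", g) for g in fall]
    for order in permutations(tagged):
        omega_rise: FrozenSet[str] = frozenset()
        omega_fall: FrozenSet[str] = frozenset()
        seq: List[Context] = [(omega_rise, omega_fall)]
        for kind, guard in order:
            # Each step adds exactly one guard, so |Omega| grows by one.
            if kind == "rise":
                omega_rise = omega_rise | {guard}
            else:
                omega_fall = omega_fall | {guard}
            seq.append((omega_rise, omega_fall))
        yield seq


# Guards of Example 3.5: three rising guards and no falling guards.
seqs = list(maximal_context_sequences(["phi1", "phi2", "phi3"], []))
print(len(seqs), {len(s) for s in seqs})
```

For the guards of Example 3.5 this yields six sequences, each of length four, in line with the numbers stated in the example.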

To every configuration $σ$ , we attach the context consisting of all guards in $Φ^{rise}$ that evaluate to true in $σ$ , and all guards in $Φ^{fall}$ that evaluate to false in $σ$ :

Definition 3.5

Given a threshold automaton, we define its configuration context as a function $ω : Σ \to 2^{Φ^{rise}} \times 2^{Φ^{fall}}$ that for each configuration $σ \in Σ$ gives a context $(Ω^{rise}, Ω^{fall})$ with $Ω^{rise} = {φ \in Φ^{rise} : σ ⊧ φ}$ and $Ω^{fall} = {φ \in Φ^{fall} : σ ⊭ φ}$ .

The following monotonicity result is a direct consequence of Proposition 3.1.

Proposition 3.3

If a transition t is enabled in a configuration $σ$ , then either $ω (σ) ⊏ ω (t (σ))$ , or $ω (σ) = ω (t (σ))$ .

Definition 3.6

A schedule $τ$ is steady for a configuration $σ$ , if for every prefix $τ^{'}$ of $τ$ , the context does not change, i.e., $ω (τ^{'} (σ)) = ω (σ)$ .

Proposition 3.4

A schedule $τ$ is steady for a configuration $σ$ if and only if $ω (σ) = ω (τ (σ))$ .

In the following definition, we associate a sequence of contexts with a path:

Definition 3.7

Given a configuration $σ$ and a schedule $τ$ applicable to $σ$ , we say that $path (σ, τ)$ is consistent with a sequence of contexts $Ω_{1}, \dots, Ω_{m}$ , if there exist indices $n_{0}, \dots, n_{m}$ , with $0 = n_{0} \leq n_{1} \leq \dots \leq n_{m} = | τ | + 1$ , such that for every k, $1 \leq k \leq m$ , and every i with $n_{k - 1} \leq i < n_{k}$ , it holds that $ω (τ^{i} (σ)) = Ω_{k}$ .

Every path is consistent with a uniquely defined maximal monotonically increasing sequence of contexts. (Some of the indices $n_{0}, \dots, n_{m}$ in Definition 3.7 may be equal.) In Sect. 4, we use such sequences of contexts to construct a schema recognizing many paths that are consistent with the same sequence of contexts.

A context defines which rules of the $TA$ are unlocked. A schedule that is steady for a configuration visits only one context, and thus we can statically remove $TA$ ’s rules that are locked in the context:

Definition 3.8

Given a threshold automaton $TA = (L, I, Γ, Π, R, RC)$ and a context $Ω$ , we define the slice of $TA$ with context $Ω = (Ω^{rise}, Ω^{fall})$ as a threshold automaton ${TA |}_{Ω} = (L, I, Γ, Π, R |_{Ω}, RC)$ , where a rule $r \in R$ belongs to ${R |}_{Ω}$ if and only if $(⋀_{φ \in Ω^{rise}} φ) \to r . φ^{rise}$ and $(⋀_{ψ \in Φ^{fall} \ Ω^{fall}} ψ) \to r . φ^{fall}$ .

In other words, ${R |}_{Ω}$ contains those and only those rules r with guards that evaluate to true in all configurations $σ$ with $ω (σ) = Ω$ . These are exactly the guards from $Ω^{rise} \cup (Φ^{fall} \ Ω^{fall})$ . When $ω (σ) = Ω$ , then all guards from $Ω^{rise}$ evaluate to true, and then $r . φ^{rise}$ must also be true. As $Ω^{fall}$ contains those guards from $Φ^{fall}$ that evaluate to false in $σ$ , then all other guards from $Φ^{fall}$ must evaluate to true, and then $r . φ^{fall}$ must be true too. Figure 5 shows an example of a slice.

Fig. 5 — The slice of the $TA$ in Fig. 2 that is constructed for the context $({φ_{2}}, \emptyset)$
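The slice construction of Definition 3.8 can be sketched in Python under a simplifying assumption: every rule guard is a conjunction of guards taken from $Φ^{rise}$ or $Φ^{fall}$ , and the implication in the definition is approximated by set inclusion. This syntactic test is sufficient for the implication but may miss semantic implications between different guards. The `Rule` class, the function `slice_rules`, and the rule and guard names in the usage note are hypothetical.

```python
from dataclasses import dataclass
from typing import FrozenSet, List, Tuple


@dataclass(frozen=True)
class Rule:
    name: str
    rise: FrozenSet[str] = frozenset()  # conjuncts of r.phi_rise
    fall: FrozenSet[str] = frozenset()  # conjuncts of r.phi_fall


def slice_rules(
    rules: List[Rule],
    phi_fall: FrozenSet[str],
    context: Tuple[FrozenSet[str], FrozenSet[str]],
) -> List[Rule]:
    """Keep the rules whose guards hold in every configuration of the context."""
    omega_rise, omega_fall = context
    # Falling guards outside Omega_fall still evaluate to true.
    true_fall = phi_fall - omega_fall
    return [r for r in rules if r.rise <= omega_rise and r.fall <= true_fall]
```

For instance, with hypothetical rules `Rule("ra")`, `Rule("rb", rise=frozenset({"phi1"}))`, and `Rule("rc", rise=frozenset({"phi2"}))`, the context `(frozenset({"phi2"}), frozenset())` keeps `ra` and `rc` and removes `rb`.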

Model checking problem: parameterized reachability

Given a threshold automaton $TA$ , a state property B is a Boolean combination of formulas that have the form $⋀_{i \in Y} κ [i] = 0$ , for some $Y \subseteq L$ . The parameterized reachability problem is to decide whether there are parameter values $p \in P_{R C}$ , an initial configuration $σ_{0} \in I$ , with $σ_{0} . p = p$ , and a schedule $τ$ , such that $τ$ is applicable to $σ_{0}$ , and property B holds in the final state: $τ (σ_{0}) ⊧ B$ .

Main result: a complete set of schemas

To address parameterized reachability, we introduce schemas, i.e., alternating sequences of contexts and rule sequences. A schema serves as a pattern for a set of paths, and is used to efficiently encode parameterized reachability in SMT. As parameters give rise to infinitely many initial states, a schema captures an infinite set of paths. We show how to construct a finite set of schemas $S$ with the following property: for each schedule $τ$ and each configuration $σ$ there is a representative schedule $s (τ)$ such that: (1) applying $s (τ)$ to $σ$ results in $τ (σ)$ , and (2) $path (σ, s (τ))$ is generated by a schema from $S$ .

Definition 4.1

A schema is a sequence $Ω_{0}, ρ_{1}, Ω_{1}, \dots, ρ_{m}, Ω_{m}$ of alternating contexts and rule sequences. We often write ${Ω_{0}} ρ_{1} {Ω_{1}} \dots {Ω_{m - 1}} ρ_{m} {Ω_{m}}$ for a schema. A schema with two contexts is called simple.

Given two schemas $S_{1} = Ω_{0}, ρ_{1}, \dots, ρ_{k}, Ω_{k}$ and $S_{2} = Ω_{0}^{'}, ρ_{1}^{'}, \dots, ρ_{m}^{'}, Ω_{m}^{'}$ with $Ω_{k} = Ω_{0}^{'}$ , we define their composition $S_{1} \circ S_{2}$ to be the schema that is obtained by concatenation of the two sequences: $Ω_{0}, ρ_{1}, \dots, ρ_{k}, Ω_{0}^{'}, ρ_{1}^{'}, \dots, ρ_{m}^{'}, Ω_{m}^{'}$ .

Definition 4.2

Given a configuration $σ$ and a schedule $τ$ applicable to $σ$ , we say that $path (σ, τ)$ is generated by a simple schema ${Ω} ρ {Ω^{'}}$ , if the following hold:

- For $ρ = r_{1}, \dots, r_{k}$ there is a monotonically increasing sequence of indices $i (1), \dots, i (m)$ , i.e., $1 \leq i (1) < \dots < i (m) \leq k$ , and there are factors $f_{1}, \dots, f_{m} \geq 0$ , so that schedule $(r_{i (1)}, f_{1}), \dots, (r_{i (m)}, f_{m}) = τ$ .
- The first and the last states match the contexts: $ω (σ) = Ω$ and $ω (τ (σ)) = Ω^{'}$ .

In general, we say that $path (σ, τ)$ is generated by a schema S, if $S = S_{1} \circ \dots \circ S_{k}$ for simple schemas $S_{1}, \dots, S_{k}$ and $τ = τ_{1} \dots τ_{k}$ such that each $path (π_{i} (σ), τ_{i})$ is generated by the simple schema $S_{i}$ , for $π_{i} = τ_{1} \dots τ_{i - 1}$ and $1 \leq i \leq k$ .
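The first condition of Definition 4.2 amounts to a subsequence test: the rules of the schedule must occur in the rule sequence $ρ$ in the same order. A minimal Python sketch of this test follows; the function name `match_indices` is an assumption, and the sketch ignores the second condition on contexts. It picks the leftmost matching indices, which is only one of possibly several valid choices.

```python
from typing import List, Optional


def match_indices(rho: List[str], schedule_rules: List[str]) -> Optional[List[int]]:
    """Return 1-based indices i(1) < ... < i(m) into rho, or None if none exist."""
    indices: List[int] = []
    pos = 0
    for rule in schedule_rules:
        # Skip rules of rho that the schedule does not use at this point.
        while pos < len(rho) and rho[pos] != rule:
            pos += 1
        if pos == len(rho):
            return None
        indices.append(pos + 1)
        pos += 1
    return indices
```

For the rule sequence `["r1", "r3", "r4", "r4"]` and a schedule that fires `r1`, `r3`, `r4` once each, the sketch picks the first occurrence of `r4`; choosing the second occurrence instead gives another valid index sequence.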

Remark 4.1

Definition 4.2 allows schemas to generate paths that have transitions with zero acceleration factors. Applying a transition with a zero factor to a configuration $σ$ results in the same configuration $σ$ , which corresponds to a stuttering step. This does not affect reachability. In the following, we will apply Definition 4.2 to representative paths that may have transitions with zero factors.

Example 4.1

Let us go back to the example of a schema S and a schedule $τ^{'}$ introduced in Eqs. (2.1) and (2.2) in Sect. 2.1. It is easy to see that schema S can be decomposed into four simple schemas $S_{1} \circ \dots \circ S_{4}$ , e.g., $S_{1} = {} r_{1}, r_{1} {φ_{1}}$ and $S_{2} = {φ_{1}} r_{1}, r_{3}, r_{4}, r_{4} {φ_{1}, φ_{2}}$ . Consider an initial state $σ_{0}$ with $n = 5$ , $t = f = 1$ , $x = y = 0$ , $κ [ℓ_{1}] = 1$ , $κ [ℓ_{2}] = 3$ , and $κ [ℓ_{i}] = 0$ for $i \in {3, 4, 5}$ . To ensure that $path (σ_{0}, τ^{'})$ is generated by schema S, one has to check Definition 4.2 for schemas $S_{1}, \dots, S_{4}$ and schedules $(τ_{1}^{'} \cdot t_{1})$ , $(τ_{2}^{'} \cdot t_{2})$ , $(τ_{3}^{'} \cdot t_{3})$ , and $τ_{4}^{'}$ , respectively. For instance, $path (σ_{0}, τ_{1}^{'} \cdot t_{1})$ is generated by $S_{1}$ . Indeed, take the sequence of indices 1 and 2 and the sequence of acceleration factors 1 and 1. The path $path (σ_{0}, τ_{1}^{'} \cdot t_{1})$ ends in the configuration $σ_{1}$ that differs from $σ_{0}$ in that $κ [ℓ_{2}] = 1$ , $κ [ℓ_{3}] = 2$ , and $x = 2$ . The contexts $ω (σ_{0}) = ({}, {})$ and $ω (σ_{1}) = ({φ_{1}}, {})$ match the contexts of schema $S_{1}$ , as required by Definition 4.2.

Similarly, $path (σ_{1}, τ_{2}^{'} \cdot t_{2})$ is generated by schema $S_{2}$ . To see that, compare the contexts and use the index sequence 1, 2, 4, and acceleration factors 1.

The language of a schema S —denoted with $L (S)$ —is the set of all paths generated by S. For a set of configurations $C \subseteq Σ$ and a set of schemas $S$ , we define the set $Reach (C, S)$ to contain all configurations reachable from C via the paths generated by the schemas from $S$ , i.e., $Reach (C, S) = {τ (σ) ∣ σ \in C, \exists S \in S . path (σ, τ) \in L (S)}$ . We say that a set $S$ of schemas is complete, if for every set of configurations $C \subseteq Σ$ it is the case that the set of all states reachable from C via the paths generated by the schemas from $S$ , is exactly the set of all possible states reachable from C. Formally, $\forall C \subseteq Σ . {τ (σ) ∣ σ \in C, τ is applicable to σ} = Reach (C, S)$ .

In [41], a quantity $C$ has been introduced that depends on the number of conditions in a TA. It has been shown that for every configuration $σ$ and every schedule $τ$ applicable to $σ$ , there is a schedule $τ^{'}$ of length at most $d = | R | \cdot (C + 1) + C$ that is also applicable to $σ$ and results in $τ (σ)$ [41, Thm. 8]. Hence, by enumerating all sequences of rules of length up to d, one can construct a complete set of schemas:

Theorem 4.1

For a threshold automaton, there is a complete schema set $S_{d}$ of cardinality ${| R |}^{| R | \cdot (C + 1) + C}$ .

Although the set $S_{d}$ is finite, enumerating all its elements is impractical. We show that there is a complete set of schemas whose cardinality solely depends on the number of guards that syntactically occur in the TA. These numbers $| Φ^{rise} |$ and $| Φ^{fall} |$ are in practice much smaller than the number of rules $| R |$ :

Theorem 4.2

For all threshold automata, there exists a complete schema set of cardinality at most $(| Φ^{rise} | + | Φ^{fall} |)!$ . In this set, the length of each schema does not exceed $(3 \cdot | Φ^{rise} \cup Φ^{fall} | + 2) \cdot | R |$ .
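To get a feeling for the gap between the bounds of Theorems 4.1 and 4.2, one can evaluate them numerically. The Python sketch below does this; the function names and the sample values for $| R |$ and $C$ are hypothetical and are not taken from the benchmarks. The length bound assumes that $Φ^{rise}$ and $Φ^{fall}$ are disjoint, so that the size of their union is the sum of their sizes.

```python
from math import factorial


def enumeration_bound(num_rules: int, c: int) -> int:
    """Cardinality of S_d in Theorem 4.1: |R| ** (|R| * (C + 1) + C)."""
    return num_rules ** (num_rules * (c + 1) + c)


def schema_count_bound(num_rise: int, num_fall: int) -> int:
    """Upper bound on the number of schemas in Theorem 4.2."""
    return factorial(num_rise + num_fall)


def schema_length_bound(num_rise: int, num_fall: int, num_rules: int) -> int:
    """Upper bound on the length of each schema in Theorem 4.2."""
    return (3 * (num_rise + num_fall) + 2) * num_rules


# Hypothetical sizes: 8 rules, C = 3, three rising guards, no falling guards.
print(enumeration_bound(8, 3))       # 8 ** 35
print(schema_count_bound(3, 0))      # 6
print(schema_length_bound(3, 0, 8))  # 88
```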

In the following sections we prove the ingredients of the following argument for the theorem: construct the set Z of all maximal monotonically increasing sequences of contexts. From Proposition 3.2, we know that there are at most $(| Φ^{rise} | + | Φ^{fall} |)!$ maximal monotonically increasing sequences of contexts. Therefore, $| Z | \leq (| Φ^{rise} | + | Φ^{fall} |)!$ . Then, for each sequence $z \in Z$ , we do the following:

1. We show that for each configuration $σ$ and each schedule $τ$ applicable to $σ$ and consistent with the sequence z, there is a schedule $s (τ)$ that has a specific structure, and is also applicable to $σ$ . We call $s (τ)$ the representative of $τ$ . We introduce and formally define this specific structure of representative schedules in Sects. 5, 6 and 7. We prove existence and properties of the representative schedule in Theorem 7.1 (Sect. 7). Before that we consider special cases: when all rules of a schedule belong to the same looplet (Theorem 5.1, Sect. 5), and when a schedule is steady (Theorem 6.1, Sect. 6).
2. Next we construct a schema (for the sequence z) and show that it generates all paths of all schedules $s (τ)$ found in (1). The length of the schema is at most $(3 \cdot (| Φ^{rise} | + | Φ^{fall} |) + 2) \cdot | R |$ . This is shown in Theorem 7.2 (Sect. 7).

Theorem 4.2 follows from the above theorems, which we prove in the following.

Remark 4.2

Let us stress the difference between [41] and this work. From [41], it follows that in order to check correctness of a $TA$ it is sufficient to check only the schedules of bounded length $d (TA)$ . The bound $d (TA)$ does not depend on the parameters, and can be computed for each $TA$ . The proofs in [41] demonstrate that every schedule longer than $d (TA)$ can be transformed into an “equivalent” representative schedule, whose length is bounded by $d (TA)$ . Consequently, one can treat every schedule of length up to $d (TA)$ as its own representative schedule. Similar reasoning does not apply to the schemas constructed in this paper: (i) we construct a complete set of schemas, whose cardinality is substantially smaller than $| S_{d} |$ , and (ii) the schemas constructed in this paper can be twice as long as the schemas in $S_{d}$ .

As discussed in Remark 3.2, the looplets in our case studies are typically either singleton looplets or looplets of size two. In fact, most of our benchmarks have singleton looplets only, and thus their threshold automata can be reduced to directed acyclic graphs. The theoretical constructs of Sect. 5.2 are presented for the more general case of looplets of any size. For most of the benchmarks —the ones not using failure detectors —we need only the simple construction laid out in Sect. 5.1.

Case I: one context and one looplet

We show that for each schedule that uses only the rules from a fixed looplet and does not change its context, there exists a representative schedule of bounded length that reaches the same final state. The goal is to construct a single schema per looplet. The technical challenge is that this single schema must generate representative schedules for all possible schedules, where, intuitively, processes may move arbitrarily between all local states in the looplet. As a consequence, the rules that appear in the representative schedules can differ from the rules that appear in the arbitrary schedules visiting a looplet.

We fix a threshold automaton, a context $Ω$ , a configuration $σ$ with $ω (σ) = Ω$ , a looplet c, and a schedule $τ$ applicable to $σ$ and using only rules from c. We then construct the representative schedule ${crep}_{c}^{Ω} [σ, τ]$ and the schema ${cschema}_{c}^{Ω}$ .

The technical details of the construction of ${crep}_{c}^{Ω} [σ, τ]$ for the case when $| c | = 1$ are given in Sect. 5.1, and for the case when $| c | > 1$ in Sect. 5.2. We show in Sect. 5.3 that these constructions give us a schedule that has the desired properties: it reaches the same final state as the given schedule $τ$ , and its length does not exceed $2 \cdot | c |$ .

Note that in [41], the length of the representative schedule was bounded by |c|. However, all representative schedules of a looplet in this section can be generated by a single looplet schema.

Singleton looplet

Let us consider the case of the looplet c containing only one rule, that is, $| c | = 1$ . There is a trivial representative schedule of a single transition:

Lemma 5.1

Given a threshold automaton, a configuration $σ$ , and a schedule $τ = (r, f_{1})$ , ..., $(r, f_{m})$ applicable to $σ$ , one of the two schedules is also applicable to $σ$ and results in $τ (σ)$ : schedule $(r, f_{1} + \dots + f_{m})$ , or schedule (r, 0).

Proof

We distinguish two cases:

Case $r . t o = r . f r o m$ . Then, $r . u = 0$ , and $τ^{k} (σ) = σ$ for $0 \leq k \leq | τ |$ . Consequently, the schedule (r, 0) is applicable to $σ$ , and it results in $τ (σ) = σ$ .

Case $r . t o \neq r . f r o m$ . We prove by induction on the length $k : 1 \leq k \leq m$ of a prefix of $τ$ , that the following constraints hold for all k:

\begin{matrix} (τ^{k} (σ)) . κ [r . f r o m] = σ . κ [r . f r o m] - (f_{1} + \dots + f_{k}) \end{matrix}

5.1

\begin{matrix} (τ^{k} (σ)) . g = σ . g + (f_{1} + \dots + f_{k}) \cdot r . u \end{matrix}

5.2

\begin{matrix} (σ . κ, σ . g + f \cdot r . u, σ . p) ⊧ r . φ^{fall} \land r . φ^{rise} for all f \in {0, \dots, f_{1} + \dots + f_{k}} \end{matrix}

5.3

Base case $k = 1$ . As schedule $τ$ is applicable to $σ$ , its first transition is enabled in $σ$ . Thus, by the definition of an enabled transition, the rule r is unlocked, i.e., for all $f \in {0, \dots, f_{1}}$ , it holds that $(σ . κ, σ . g + f \cdot r . u, σ . p) ⊧ r . φ^{fall} \land r . φ^{rise}$ . By the definition of a transition, once the transition $τ [1]$ is applied, it holds that $(τ^{1} (σ)) . κ [r . f r o m] = σ . κ [r . f r o m] - f_{1}$ and $(τ^{1} (σ)) . g = σ . g + f_{1} \cdot r . u$ . Thus, Constraints (5.1)–(5.3) are satisfied for $k = 1$ .

Inductive step $k > 1$ . As schedule $τ$ is applicable to $σ$ , its prefix $τ^{k}$ is applicable to $σ$ . Hence, transition $τ [k]$ is applicable to $τ^{k - 1} (σ)$ .

By the definition of an enabled transition, for all $f \in {0, \dots, f_{k}}$ , it holds

\begin{matrix} ((τ^{k - 1} (σ)) . κ, (τ^{k - 1} (σ)) . g + f \cdot r . u, σ . p) ⊧ r . φ^{fall} \land r . φ^{rise} . \end{matrix}

By applying Eq. (5.2) of the inductive hypothesis for $k - 1$ , we obtain that for all $f \in {0, \dots, f_{k}}$ , it holds that $(σ . κ, σ . g + (f_{1} + \dots + f_{k - 1} + f) \cdot r . u, σ . p) ⊧ r . φ^{fall} \land r . φ^{rise}$ . By combining this constraint with the constraint (5.3) for $k - 1$ , we arrive at the constraint (5.3) for k.

By applying $τ [k]$ , we get that $(τ^{k} (σ)) . κ [r . f r o m] = (τ^{k - 1} (σ)) . κ [r . f r o m] - f_{k}$ and $(τ^{k} (σ)) . g = (τ^{k - 1} (σ)) . g + f_{k} \cdot r . u$ . By applying (5.1) and (5.2) for $k - 1$ to these equations, we arrive at the Eqs. (5.1) and (5.2) for k.

Based on (5.1) and (5.3) for all values of k, and in particular $k = m$ , we can now show applicability. From Eq. (5.1), we immediately obtain that $σ . κ [r . f r o m] \geq f_{1} + \dots + f_{m}$ . From constraint (5.3), we obtain that $(σ . κ, σ . g + f \cdot r . u, σ . p) ⊧ r . φ^{fall} \land r . φ^{rise}$ for all $f \in {0, \dots, f_{1} + \dots + f_{m}}$ . These are the required conditions for the transition $(r, f_{1} + \dots + f_{m})$ to be applicable to the configuration $σ$ . $□$

Consequently, when c has a single rule r, for configuration $σ$ and a schedule $τ = (r, f_{1}), \dots, (r, f_{m})$ , Lemma 5.1 allows us to take the singleton schedule (r, f) as ${crep}_{c}^{Ω} [σ, τ]$ and to take the singleton schema ${Ω} r {Ω}$ as ${cschema}_{c}^{Ω}$ . The factor f is either $f_{1} + \dots + f_{m}$ or zero.
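Lemma 5.1 can be illustrated by a small Python sketch that collapses repeated firings of one rule into a single accelerated transition. Only the counters are modelled. The guard condition (5.3) and the shared variables are omitted, and the names `singleton_representative` and `move` as well as the triple encoding of a rule are assumptions for this illustration.

```python
from typing import Dict, List, Tuple

Rule = Tuple[str, str, str]  # hypothetical encoding: (name, from, to)


def singleton_representative(rule: Rule, factors: List[int]) -> Tuple[str, int]:
    """Return (r, f) with f = f_1 + ... + f_m, or f = 0 for a self-loop."""
    name, src, dst = rule
    if src == dst:
        return (name, 0)
    return (name, sum(factors))


def move(kappa: Dict[str, int], rule: Rule, factor: int) -> Dict[str, int]:
    """Apply the counter update of the transition (rule, factor)."""
    _, src, dst = rule
    new = dict(kappa)
    if src != dst:
        new[src] = new.get(src, 0) - factor
        new[dst] = new.get(dst, 0) + factor
    return new
```

Applying `move` once with the summed factor yields the same counters as applying it once per original transition, which is the counter part of the lemma's claim.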

Non-singleton looplet

Next we focus on non-singleton looplets. Thus, we assume that $| c | > 1$ . Our construction is based on two directed trees, whose undirected versions are spanning trees, sharing the same root. In order to find a representative of a steady schedule $τ$ which leads from $σ$ to $τ (σ)$ , we determine for each local state how many processes have to move in or out of the state, and then we move them along the edges of the trees. First, we give the definitions of such trees, and then we show how to use them to construct the representative schedules and the schema.

Spanning out-trees and in-trees

We construct the underlying graph of looplet c, that is, a directed graph $G_{c}$ , whose vertices consist of local states that appear as components $f r o m$ or $t o$ of the rules from c, and the edges are the rules of c. More precisely, we construct a directed graph $G_{c} = (V_{c}, E_{c}, L_{c})$ , whose edges from $E_{c}$ are labeled by function $L_{c} : E_{c} \to c$ with the rules of c as follows:

\begin{matrix} V_{c} & = {ℓ ∣ \exists r \in c, r . t o = ℓ \lor r . f r o m = ℓ}, \\ E_{c} & = {(ℓ, ℓ^{'}) ∣ \exists r \in c, r . f r o m = ℓ, r . t o = ℓ^{'}}, \\ L_{c} ((ℓ, ℓ^{'})) & = r, if r . f r o m = ℓ, r . t o = ℓ^{'} for (ℓ, ℓ^{'}) \in E_{c} and r \in c . \end{matrix}

Lemma 5.2

Given a threshold automaton and a non-singleton looplet $c \in R / \sim$ , graph $G_{c}$ is non-empty and strongly connected.

Proof

As $| c | > 1$ , and thus $| E_{c} | \geq 2$ , graph $G_{c}$ is non-empty. To prove that $G_{c}$ is strongly connected, we consider a pair of rules $r_{1}, r_{2} \in c$ . By the definition of a looplet, it holds that $r_{1} ≺_{P}^{+} r_{2}$ and $r_{2} ≺_{P}^{+} r_{1}$ . Thus, there is a path in $G_{c}$ from $r_{1} . t o$ to $r_{2} . f r o m$ , and there is a path in $G_{c}$ from $r_{2} . t o$ to $r_{1} . f r o m$ . As $r_{1}$ and $r_{2}$ correspond to some edges in $G_{c}$ , there is a cycle that contains the vertices $r_{1} . f r o m$ , $r_{1} . t o$ , $r_{2} . f r o m$ , and $r_{2} . t o$ . Thus, graph $G_{c}$ is strongly connected. $□$

As $G_{c}$ is non-empty and strongly connected, we can fix an arbitrary node $h \in V_{c}$ —called a hub —and construct two directed trees, whose undirected versions are spanning trees of the undirected version of $G_{c}$ . These are two subgraphs of $G_{c}$ : a directed tree $T_{out} = (V_{c}, E_{out})$ , whose edges $E_{out} \subseteq E_{c}$ are pointing away from h (out-tree); a directed tree $T_{in} = (V_{c}, E_{in})$ , whose edges $E_{in} \subseteq E_{c}$ are pointing to h (in-tree). For every node $v \in V_{c} \ {h}$ , it holds that $| {u : (u, v) \in E_{out}} | = 1$ and $| {w : (v, w) \in E_{in}} | = 1$ .

Further, we fix a topological order $⪯_{in}$ on the edges of tree $T_{in}$ . More precisely, $⪯_{in}$ is such a partial order on $E_{in}$ that for each pair of adjacent edges $(ℓ, ℓ^{'}), (ℓ^{'}, ℓ^{''}) \in E_{in}$ , it holds that $(ℓ, ℓ^{'}) ⪯_{in} (ℓ^{'}, ℓ^{''})$ . In the same way, we fix a topological order $⪯_{out}$ on the edges of tree $T_{out}$ .

Example 5.1

Consider again the threshold automaton from Example 3.3 and Fig. 3. We construct trees $T_{in}$ and $T_{out}$ for looplet $c_{2}$ , shown in Fig. 6.

Note that $V_{c} = {ℓ_{2}, ℓ_{3}, ℓ_{4}, ℓ_{5}, ℓ_{6}}$ , and $E_{c} = {(ℓ_{2}, ℓ_{3}), (ℓ_{3}, ℓ_{5}), (ℓ_{5}, ℓ_{6}), (ℓ_{6}, ℓ_{4}), (ℓ_{4}, ℓ_{4}), (ℓ_{4}, ℓ_{5}), (ℓ_{4}, ℓ_{2})}$ . Fix $ℓ_{4}$ as a hub. We can fix a linear order $⪯_{in}$ such that $(ℓ_{2}, ℓ_{3}) ⪯_{in} (ℓ_{3}, ℓ_{5}) ⪯_{in} (ℓ_{5}, ℓ_{6}) ⪯_{in} (ℓ_{6}, ℓ_{4})$ , and a linear order $⪯_{out}$ such that $(ℓ_{4}, ℓ_{2}) ⪯_{out} (ℓ_{2}, ℓ_{3}) ⪯_{out} (ℓ_{4}, ℓ_{5}) ⪯_{out} (ℓ_{5}, ℓ_{6})$ .

Note that for the chosen hub $ℓ_{4}$ and this specific example, $T_{in}$ and $⪯_{in}$ are uniquely defined, while an out-tree can be different from $T_{out}$ in Fig. 6 (the rules $r_{8}, r_{2}, r_{3}, r_{4}$ constitute a different tree from the same hub). Because out-tree $T_{out}$ is not a chain, several linear orders different from $⪯_{out}$ can be chosen, e.g., $(ℓ_{4}, ℓ_{2}) ⪯_{out} (ℓ_{4}, ℓ_{5}) ⪯_{out} (ℓ_{2}, ℓ_{3}) ⪯_{out} (ℓ_{5}, ℓ_{6})$ .
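One possible way to obtain such trees is a breadth-first search from the hub, once along the edges and once against them. The following Python sketch is an illustration under that assumption; the text above only requires that some pair of trees and topological orders is fixed, and the function name `spanning_trees` is hypothetical. Edges are encoded as pairs of location names.

```python
from collections import deque
from typing import Dict, List, Tuple

Edge = Tuple[str, str]


def spanning_trees(edges: List[Edge], hub: str) -> Tuple[List[Edge], List[Edge]]:
    """Return (in_edges, out_edges), each listed in a topological order."""

    def bfs(adjacent: Dict[str, List[Tuple[str, Edge]]]) -> List[Edge]:
        seen, order, queue = {hub}, [], deque([hub])
        while queue:
            node = queue.popleft()
            for nxt, edge in adjacent.get(node, []):
                if nxt not in seen:  # the first edge reaching nxt joins the tree
                    seen.add(nxt)
                    order.append(edge)
                    queue.append(nxt)
        return order

    succ: Dict[str, List[Tuple[str, Edge]]] = {}
    pred: Dict[str, List[Tuple[str, Edge]]] = {}
    for (u, v) in edges:
        succ.setdefault(u, []).append((v, (u, v)))
        pred.setdefault(v, []).append((u, (u, v)))

    out_edges = bfs(succ)                 # edges pointing away from the hub
    in_edges = list(reversed(bfs(pred)))  # edges farthest from the hub first
    return in_edges, out_edges


# Edge set E_c and hub of Example 5.1.
E_c = [("l2", "l3"), ("l3", "l5"), ("l5", "l6"), ("l6", "l4"),
       ("l4", "l4"), ("l4", "l5"), ("l4", "l2")]
in_edges, out_edges = spanning_trees(E_c, "l4")
```

On the edge set of Example 5.1, the in-tree comes out as the chain listed in the example, and the out-tree has the same edge set as $T_{out}$ , although its edges may be listed in a different admissible order.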

Representatives of non-singleton looplets

Using these trees, we show how to construct a representative ${crep}_{c}^{Ω} [σ, τ]$ of a schedule $τ$ applicable to $σ$ with $σ^{'} = τ (σ)$ . For a configuration $σ$ and a schedule $τ$ applicable to $σ$ , consider the trees $T_{in}$ and $T_{out}$ . We construct two sequences: the sequence $e_{in} (1), \dots, e_{in} (| E_{in} |)$ of all edges of $T_{in}$ following the order $⪯_{in}$ , i.e., if $e_{in} (i) ⪯_{in} e_{in} (j)$ , then $i \leq j$ ; the sequence $e_{out} (1), \dots, e_{out} (| E_{out} |)$ of all edges of $T_{out}$ following the order $⪯_{out}$ . Further, we define the sequence of rules $r_{in} (1), \dots, r_{in} (| E_{in} |)$ with $r_{in} (i) = L_{c} (e_{in} (i))$ for $1 \leq i \leq | E_{in} |$ , and the sequence of rules $r_{out} (1), \dots, r_{out} (| E_{out} |)$ with $r_{out} (i) = L_{c} (e_{out} (i))$ for $1 \leq i \leq | E_{out} |$ . Using configurations $σ$ and $σ^{'} = τ (σ)$ , we define:

\begin{matrix} δ_{in} (i) & = σ . κ [f] - σ^{'} . κ [f], for f = r_{in} (i) . f r o m and 1 \leq i \leq | E_{in} |, \\ δ_{out} (j) & = σ^{'} . κ [t] - σ . κ [t], for t = r_{out} (j) . t o and 1 \leq j \leq | E_{out} | . \end{matrix}

If $δ_{in} (i) \geq 0$ , then $δ_{in} (i)$ processes should leave the local state $r_{in} (i) . f r o m$ towards the hub, and they do it exclusively using the edge $e_{in} (i)$ . If $δ_{out} (j) \geq 0$ , then $δ_{out} (j)$ processes should reach the state $r_{out} (j) . t o$ from the hub, and they do it exclusively using the edge $e_{out} (j)$ . The negative values of $δ_{in} (i)$ and $δ_{out} (j)$ do not play any role in our construction, and thus, we use $max (δ_{in} (i), 0)$ and $max (δ_{out} (j), 0)$ .

The main idea of the representative construction is as follows. First, we fire the sequence of rules $r_{in} (1), \dots, r_{in} (| E_{in} |)$ to collect sufficiently many processes in the hub. Then, we fire the sequence of rules $r_{out} (1), \dots, r_{out} (| E_{out} |)$ to distribute the required number of processes from the hub. As a result, for each location $ℓ$ in the graph, the processes are transferred from $ℓ$ to the other locations, if $σ . κ [ℓ] > σ^{'} . κ [ℓ]$ , and additional processes arrive at $ℓ$ , if $σ . κ [ℓ] < σ^{'} . κ [ℓ]$ . Using $δ_{in} (i)$ and $δ_{out} (i)$ , we define the acceleration factors for each rule as follows:

\begin{matrix} w_{in} (i) = & \sum_{j : e_{in} (j) ⪯_{in} e_{in} (i)} max (δ_{in} (j), 0) and \\ w_{out} (i) = & \sum_{j : e_{out} (i) ⪯_{out} e_{out} (j)} max (δ_{out} (j), 0) . \end{matrix}

Finally, we construct the schedule ${crep}_{c}^{Ω} [σ, τ]$ as follows:

\begin{matrix} {crep}_{c}^{Ω} [σ, τ] = & (r_{in} (1), w_{in} (1)), \dots, (r_{in} (| E_{in} |), w_{in} (| E_{in} |)), \\ (r_{out} (1), w_{out} (1)), \dots, (r_{out} (| E_{out} |), w_{out} (| E_{out} |)) . \end{matrix}

5.4

Example 5.2

Consider the TA shown in Fig. 7. Let c be the four-element looplet that contains the rules $r_{1}$ , $r_{2}$ , $r_{3}$ , and $r_{4}$ , and $τ$ be the schedule $τ = (r_{4}, 1), (r_{3}, 1), (r_{4}, 1), (r_{1}, 1), (r_{2}, 1), (r_{3}, 1), (r_{1}, 1), (r_{4}, 1), (r_{1}, 1)$ that uses the rules of the looplet c. Consider a configuration $σ$ with $σ . κ [ℓ_{3}] = σ . κ [ℓ_{4}] = 1$ , and $σ . κ [ℓ_{1}] = σ . κ [ℓ_{2}] = 0$ . The final configuration $σ^{'} = τ (σ)$ has the following properties: $σ^{'} . κ [ℓ_{2}] = 2$ and $σ^{'} . κ [ℓ_{1}] = σ^{'} . κ [ℓ_{3}] = σ^{'} . κ [ℓ_{4}] = 0$ . By comparing $σ$ and $σ^{'}$ , we notice that one process should move from $ℓ_{3}$ to $ℓ_{2}$ , and one from $ℓ_{4}$ to $ℓ_{2}$ . We will now show how this is achieved by our construction.

For constructing the representative schedule ${crep}_{c}^{Ω} [σ, τ]$ , we first define trees $T_{in}$ and $T_{out}$ . If we chose $ℓ_{1}$ to be the hub, we get that $E_{in} = {(ℓ_{4}, ℓ_{1}), (ℓ_{3}, ℓ_{4}), (ℓ_{2}, ℓ_{3})}$ , and thus the order is $(ℓ_{2}, ℓ_{3}) ⪯_{in} (ℓ_{3}, ℓ_{4}) ⪯_{in} (ℓ_{4}, ℓ_{1})$ . Therefore, we obtain $e_{in} (1) = (ℓ_{2}, ℓ_{3})$ , $e_{in} (2) = (ℓ_{3}, ℓ_{4})$ and $e_{in} (3) = (ℓ_{4}, ℓ_{1})$ . By calculating $δ_{in} (i)$ for every $i \in {1, 2, 3}$ , we see that $δ_{in} (2) = 1$ and $δ_{in} (3) = 1$ are positive. Consequently, two processes go to the hub: one from $r_{in} (2) . f r o m = ℓ_{3}$ and one from $r_{in} (3) . f r o m = ℓ_{4}$ . The coefficients $w_{in}$ give us acceleration factors for all rules.

Similarly, we obtain $E_{out} = {(ℓ_{1}, ℓ_{2}), (ℓ_{2}, ℓ_{3}), (ℓ_{3}, ℓ_{4})}$ , and the order must be $(ℓ_{1}, ℓ_{2}) ⪯_{out} (ℓ_{2}, ℓ_{3}) ⪯_{out} (ℓ_{3}, ℓ_{4})$ . Thus, $e_{out} (1) = (ℓ_{1}, ℓ_{2})$ , $e_{out} (2) = (ℓ_{2}, ℓ_{3})$ , and $e_{out} (3) = (ℓ_{3}, ℓ_{4})$ . Here only $δ_{out} (1) = 2$ has a positive value, and hence, two processes should move from the hub to the local state $r_{out} (1) . t o = ℓ_{2}$ . To achieve this, the acceleration factor of every rule $r_{out} (i)$ , $1 \leq i \leq 3$ , must be $w_{out} (i)$ .

Therefore, by Eq. (5.4), the representative schedule is

\begin{matrix} {crep}_{c}^{Ω} [σ, τ] = (r_{2}, 0), (r_{3}, 1), (r_{4}, 2), (r_{1}, 2), (r_{2}, 0), (r_{3}, 0) . \end{matrix}

Choosing another hub gives us another representative. For each hub, the representative is not longer than $2 | c | = 8$ , and leads to $σ^{'}$ when applied to $σ$ .
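The construction of Eq. (5.4) can be replayed on Example 5.2 with a minimal Python sketch. It assumes that the orders $⪯_{in}$ and $⪯_{out}$ are read as tree ancestry, so that each factor sums the positive differences over the subtree hanging off an edge; for the chains of this example that coincides with summing over the linear order. The function name `representative`, the dictionary encoding of configurations, and the edge-to-rule map `label` are hypothetical.

```python
from typing import Dict, List, Tuple

Edge = Tuple[str, str]


def representative(
    in_edges: List[Edge],
    out_edges: List[Edge],
    label: Dict[Edge, str],
    sigma: Dict[str, int],
    sigma_prime: Dict[str, int],
) -> List[Tuple[str, int]]:
    """Build the schedule of Eq. (5.4); edge lists are topologically ordered."""
    w_in: Dict[Edge, int] = {}
    for (u, v) in in_edges:
        delta = sigma.get(u, 0) - sigma_prime.get(u, 0)
        # Processes already pushed into u by the in-tree edges that end in u.
        below = sum(w for (_, b), w in w_in.items() if b == u)
        w_in[(u, v)] = max(delta, 0) + below

    w_out: Dict[Edge, int] = {}
    for (u, v) in reversed(out_edges):
        delta = sigma_prime.get(v, 0) - sigma.get(v, 0)
        # Processes that must pass through v to reach its out-tree descendants.
        below = sum(w for (a, _), w in w_out.items() if a == v)
        w_out[(u, v)] = max(delta, 0) + below

    return ([(label[e], w_in[e]) for e in in_edges]
            + [(label[e], w_out[e]) for e in out_edges])


# Data of Example 5.2 with hub l1.
label = {("l1", "l2"): "r1", ("l2", "l3"): "r2",
         ("l3", "l4"): "r3", ("l4", "l1"): "r4"}
rep = representative(
    [("l2", "l3"), ("l3", "l4"), ("l4", "l1")],
    [("l1", "l2"), ("l2", "l3"), ("l3", "l4")],
    label,
    {"l3": 1, "l4": 1},
    {"l2": 2},
)
print(rep)
```

On this input the sketch returns the same six transitions as the representative schedule displayed above.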

In the following, we fix a threshold automaton $TA$ , a context $Ω$ , and a non-singleton looplet c of the slice ${TA |}_{Ω}$ . We also fix a configuration $σ$ of $TA$ and a schedule $τ$ that is contained in c and is applicable to $σ$ . Our goal is to prove Lemma 5.8, which states that ${crep}_{c}^{Ω} [σ, τ]$ is indeed applicable to $σ$ and ends in $τ (σ)$ . To this end, we first prove auxiliary Lemmas 5.3–5.7.

Lemma 5.3

For every $i : 1 \leq i \leq | E_{in} |$ , it holds that $σ . κ [r_{i} . f r o m] \geq max (δ_{in} (i), 0)$ , where $r_{i} = L_{c} (e_{in} (i))$ .

Proof

Recall that by the definition of a configuration, every counter $σ . κ [ℓ]$ is non-negative. If $δ_{in} (i) \geq 0$ , then $max (δ_{in} (i), 0) = δ_{in} (i) = σ . κ [r_{i} . f r o m] - σ^{'} . κ [r_{i} . f r o m]$ , which is bounded from above by $σ . κ [r_{i} . f r o m]$ . Otherwise, $δ_{in} (i) < 0$ , and we trivially have $max (δ_{in} (i), 0) = 0$ and $0 \leq σ . κ [r_{i} . f r o m]$ . $□$

Lemma 5.4

Schedule $τ_{in} = (r_{in} (1), w_{in} (1)), \dots, (r_{in} (| E_{in} |), w_{in} (| E_{in} |))$ is applicable to configuration $σ$ .

Proof

We denote by $α^{i}$ the schedule $(r_{in} (1), w_{in} (1)), \dots, (r_{in} (i), w_{in} (i))$ , for $1 \leq i \leq | E_{in} |$ . Then $τ_{in} = α^{| E_{in} |}$ .

All rules $r_{in} (1), \dots, r_{in} (| E_{in} |)$ are from ${R |}_{Ω}$ , and thus are unlocked. Hence, it is sufficient to show that the values of the locations from the set $V_{c}$ are large enough to enable each transition $(r_{in} (i), w_{in} (i))$ for $1 \leq i \leq | E_{in} |$ . To this end, we prove by induction that $(α^{i - 1} (σ)) . κ [r_{i} . f r o m] \geq w_{in} (i)$ , for $1 \leq i \leq | E_{in} |$ and $r_{i} = L_{c} (e_{in} (i))$ .

Base case $i = 1$ . For $r_{1} = L_{c} (e_{in} (1))$ , we want to show that $σ . κ [r_{1} . f r o m] \geq w_{in} (1)$ . As $e_{in} (1)$ is the first element of the sequence $e_{in} (1), \dots, e_{in} (| E_{in} |)$ , which respects the order $⪯_{in}$ , we conclude that $w_{in} (1) = max (δ_{in} (1), 0)$ . From Lemma 5.3, it follows that $σ . κ [r_{1} . f r o m] \geq max (δ_{in} (1), 0)$ .

Inductive step. We assume that for all $i : 1 \leq i \leq k - 1 < | E_{in} |$ , schedule $α^{i}$ is applicable to $σ$ , and show that $(α^{k - 1} (σ)) . κ [r_{k} . f r o m] \geq w_{in} (k)$ with $r_{k} = L_{c} (e_{in} (k))$ .

To this end, we construct the set of edges $P_{k}$ that precede the edge $e_{in} (k)$ in the topological order $⪯_{in}$ , that is, $P_{k} = {e ∣ e \in E_{in}, e ⪯_{in} e_{in} (k), e \neq e_{in} (k)}$ . We show that the following equation holds:

\begin{matrix} (α^{k - 1} (σ)) . κ [r_{k} . f r o m] = σ . κ [r_{k} . f r o m] + \sum_{e_{in} (j) \in P_{k}} max (δ_{in} (j), 0) . \end{matrix}

5.5

Indeed, if one picks an edge $e_{in} (j) \in P_{k}$ , the edge $e_{in} (j)$ adds $w_{in} (j)$ to the counter $κ [r_{k} . f r o m]$ . As the sequence ${e_{in} (i)}_{i \leq k}$ is topologically sorted, it follows that $j < k$ . Moreover, as the tree $T_{in}$ is oriented towards the root, $e_{in} (k)$ is the only edge leaving the local state $r_{k} . f r o m$ . Thus, no edge $e_{in} (i)$ with $i < k$ decrements the counter $σ . κ [r_{k} . f r o m]$ .

From Eq. (5.5) and Lemma 5.3, we conclude that $(α^{k - 1} (σ)) . κ [r_{k} . f r o m]$ is not less than $max (δ_{in} (k), 0) + \sum_{e_{in} (j) : e_{in} (j) ⪯_{in} e_{in} (k), j \neq k} max (δ_{in} (j), 0)$ , which equals $w_{in} (k)$ . This proves the inductive step.

Therefore, we have shown that $τ_{in} = α^{| E_{in} |}$ is applicable to $σ$ . $□$

The following lemma is easy to prove by induction on the length of a schedule. The base case for a single transition follows from the definition of a counter system.

Lemma 5.5

Let $σ$ and $σ^{'}$ be two configurations and $τ$ be a schedule applicable to $σ$ such that $τ (σ) = σ^{'}$ . Then it holds that $\sum_{ℓ \in L} (σ^{'} . κ [ℓ] - σ . κ [ℓ]) = 0$ .
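The base case of this lemma can be checked mechanically. The following minimal sketch assumes the usual counter-system semantics, in which a transition with acceleration factor f moves f processes from the source local state of its rule to the target local state (this agrees with the vector $Δ_{κ}$ used in the proof of Lemma 6.1). The function name `apply_transition`, the dictionary encoding of the counters, and the sample values are illustrative assumptions; guards are ignored.

```python
def apply_transition(kappa, src, dst, factor):
    """Move `factor` processes from local state `src` to local state `dst`.

    `kappa` maps local states to counters. Guards are ignored in this sketch.
    """
    if factor < 0 or kappa[src] < factor:
        raise ValueError("transition is not applicable")
    result = dict(kappa)
    result[src] -= factor
    result[dst] += factor
    return result


# Hypothetical configuration with three local states.
sigma = {"l1": 3, "l2": 0, "l3": 1}
sigma_prime = apply_transition(sigma, "l1", "l2", 2)

# Sum of the counter differences over all local states.
difference = sum(sigma_prime[l] - sigma[l] for l in sigma)
```

Since every transition subtracts from one counter exactly what it adds to another, `difference` is zero for a single transition, and the induction on the length of the schedule extends this to arbitrary schedules.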

Further, we show that the required number of processes reaches (or leaves) the hub when the transitions derived from the trees $T_{in}$ and $T_{out}$ are executed:

Lemma 5.6

The following equality holds:

\begin{matrix} σ^{'} . κ [h] - σ . κ [h] = \sum_{1 \leq i \leq | E_{in} |} max (δ_{in} (i), 0) - \sum_{1 \leq i \leq | E_{out} |} max (δ_{out} (i), 0) . \end{matrix}

Proof

Recall that $T_{in}$ is a tree directed towards h, and the undirected version of $T_{in}$ is a spanning tree of graph C. Hence, for each local state $ℓ \in V_{c} \ {h}$ , there is exactly one edge $e \in E_{in}$ with $L_{c} (e) . f r o m = ℓ$ . Thus, the following equation holds:

\begin{matrix} \sum_{1 \leq i \leq | E_{in} |} max (δ_{in} (i), 0) = \sum_{ℓ \in V_{c} \ {h}} max (σ . κ [ℓ] - σ^{'} . κ [ℓ], 0) . \end{matrix}

5.6

Similarly, $T_{out}$ is a tree directed outwards from h, and the undirected version of $T_{out}$ is a spanning tree of graph C. Hence, for each local state $ℓ \in V_{c} \ {h}$ , there is exactly one edge $e \in E_{out}$ with $L_{c} (e) . t o = ℓ$ . Thus, the following equation holds:

\begin{matrix} \sum_{1 \leq i \leq | E_{out} |} max (δ_{out} (i), 0) = \sum_{ℓ \in V_{c} \ {h}} max (σ^{'} . κ [ℓ] - σ . κ [ℓ], 0) . \end{matrix}

5.7

By combining (5.6) and (5.7), we obtain the following:

\begin{matrix} \sum_{1 \leq i \leq | E_{in} |} max (δ_{in} (i), 0) - \sum_{1 \leq i \leq | E_{out} |} max (δ_{out} (i), 0) \\ = \sum_{ℓ \in V_{c} \ {h}} (max (σ . κ [ℓ] - σ^{'} . κ [ℓ], 0) - max (σ^{'} . κ [ℓ] - σ . κ [ℓ], 0)) \\ = \sum_{ℓ \in V_{c} \ {h}} (σ . κ [ℓ] - σ^{'} . κ [ℓ]) = \sum_{ℓ \in V_{c}} (σ . κ [ℓ] - σ^{'} . κ [ℓ]) - (σ . κ [h] - σ^{'} . κ [h]) . \end{matrix}

5.8

As the initial schedule $τ$ is applicable to $σ$ , and $τ (σ) = σ^{'}$ , by Lemma 5.5, $\sum_{ℓ \in L} (σ . κ [ℓ] - σ^{'} . κ [ℓ]) = 0$ . As all rules in ${crep}_{c}^{Ω} [σ, τ]$ are from ${R |}_{Ω}$ and thus change only the counters of local states in $V_{c}$ , for each local state $ℓ \in L \ V_{c}$ , its respective counter does not change, that is, $σ . κ [ℓ] - σ^{'} . κ [ℓ] = 0$ . Hence, $\sum_{ℓ \in V_{c}} (σ . κ [ℓ] - σ^{'} . κ [ℓ]) = 0$ . From this and Eq. (5.8), the statement of the lemma follows. $□$

Lemma 5.7

If $τ_{in}$ denotes the schedule $(r_{in} (1), w_{in} (1)), \dots, (r_{in} (| E_{in} |), w_{in} (| E_{in} |))$ , the following equation holds:

\begin{matrix} τ_{in} (σ) . κ [ℓ] = \begin{cases} σ^{'} . κ [h] + \sum_{1 \leq i \leq | E_{out} |} max (δ_{out} (i), 0), & \text{if } ℓ = h \\ min (σ . κ [ℓ], σ^{'} . κ [ℓ]), & \text{if } ℓ \in V_{c} \ {h} . \end{cases} \end{matrix}

Proof

We prove the lemma by case distinction:

Case $ℓ = h$ . We show that $(τ_{in} (σ)) . κ [h] = σ . κ [h] + \sum_{1 \leq i \leq | E_{in} |} max (δ_{in} (i), 0)$ . Indeed, let P be the indices of edges coming into h, i.e., $P = {i ∣ 1 \leq i \leq | E_{in} |, L_{c} (e_{in} (i)) = r, h = r . t o}$ . As all edges in $T_{in}$ are oriented towards h, it holds that $(τ_{in} (σ)) . κ [h]$ equals $σ . κ [h] + \sum_{i \in P} w_{in} (i)$ . By unfolding the definition of $w_{in}$ , we obtain that $(τ_{in} (σ)) . κ [h] = σ . κ [h] + \sum_{1 \leq i \leq | E_{in} |} max (δ_{in} (i), 0)$ . We observe that by Lemma 5.6, this sum equals $σ^{'} . κ [h] + \sum_{1 \leq i \leq | E_{out} |} max (δ_{out} (i), 0)$ . This proves the first case.

Case $ℓ \in V_{c} \ {h}$ . We show that $(τ_{in} (σ)) . κ [ℓ] = min (σ . κ [ℓ], σ^{'} . κ [ℓ])$ . Indeed, fix a node $ℓ \in V_{c} \ {h}$ and construct two sets: the set of incoming edges $I n = {e_{in} (i) ∣ \exists ℓ^{'} \in V_{c} . e_{in} (i) = (ℓ^{'}, ℓ)}$ and the singleton set of outgoing edges $O u t = {e_{in} (i) ∣ \exists ℓ^{'} \in V_{c} . e_{in} (i) = (ℓ, ℓ^{'})}$ . By summing up the effect of all transitions in $τ_{in}$ , we obtain $(τ_{in} (σ)) . κ [ℓ] = σ . κ [ℓ] + \sum_{e_{in} (i) \in I n} w_{in} (i) - \sum_{e_{in} (i) \in O u t} w_{in} (i)$ . By unfolding the definition of $w_{in}$ , we obtain $(τ_{in} (σ)) . κ [ℓ] = σ . κ [ℓ] - \sum_{e_{in} (i) \in O u t} max (δ_{in} (i), 0)$ , which can be rewritten as $σ . κ [ℓ] - max (σ . κ [ℓ] - σ^{'} . κ [ℓ], 0)$ , which, in turn, equals $min (σ . κ [ℓ], σ^{'} . κ [ℓ])$ . This proves the second case. $□$

Now we are in a position to prove that schedule ${crep}_{c}^{Ω} [σ, τ]$ is applicable to configuration $σ$ and results in configuration $τ (σ)$ :

Lemma 5.8

The schedule ${crep}_{c}^{Ω} [σ, τ]$ has the following properties: (a) ${crep}_{c}^{Ω} [σ, τ]$ is applicable to $σ$ , and (b) ${crep}_{c}^{Ω} [σ, τ]$ results in $τ (σ)$ when applied to $σ$ .

Proof

Denote with $τ_{in}$ the prefix $(r_{in} (1), w_{in} (1)), \dots, (r_{in} (| E_{in} |), w_{in} (| E_{in} |))$ of the schedule ${crep}_{c}^{Ω} [σ, τ]$ . For each $j : 1 \leq j \leq | E_{out} |$ , denote with $β^{j}$ the prefix of ${crep}_{c}^{Ω} [σ, τ]$ that has length of $| E_{in} | + j$ . Note that $β^{| E_{out} |} = {crep}_{c}^{Ω} [σ, τ]$ .

Proving applicability of ${crep}_{c}^{Ω} [σ, τ]$ to $σ$ We notice that all rules in ${crep}_{c}^{Ω} [σ, τ]$ are from ${R |}_{Ω}$ and thus are unlocked, and that $τ_{in}$ is applicable to $σ$ by Lemma 5.4. Hence, we only have to check that the values of counters from $V_{c}$ are large enough, so that transitions $(r_{out} (j), w_{out} (j))$ can fire.

We prove that each schedule $β^{j}$ is applicable to $σ$ , for $j : 1 \leq j \leq | E_{out} |$ . We do so by induction on the distance from the root h in the tree $T_{out}$ .

Base case root node h. Denote with $O_{h}$ the set ${(ℓ, ℓ^{'}) \in E_{out} ∣ ℓ = h}$ . Let $j_{1}, \dots, j_{m}$ be the indices of all edges in $O_{h}$ , and $j_{m}$ be the maximum among them.

From Lemma 5.7, $(τ_{in} (σ)) . κ [h] = σ^{'} . κ [h] + \sum_{1 \leq i \leq | E_{out} |} max (δ_{out} (i), 0) = σ^{'} . κ [h] + \sum_{e_{out} (j) \in O_{h}} w_{out} (j)$ . Thus, every transition $(e_{out} (j), w_{out} (j))$ with $e_{out} (j) \in O_{h}$ , is applicable to $β^{j - 1} (σ)$ . Also, $(β^{j_{m}} (σ)) . κ [h] = σ^{'} . κ [h]$ .

Inductive step assume that for a node $ℓ \in V_{c}$ and an edge $e_{out} (k) = (ℓ, ℓ^{'}) \in E_{out}$ outgoing from node $ℓ$ , schedule $β^{k}$ is applicable to configuration $σ$ . Show that for each edge $e_{out} (i)$ outgoing from node $ℓ^{'}$ the following hold: (i) schedule $β^{i}$ is also applicable to $σ$ ; and (ii) $β^{| E_{out} |} (σ) . κ [ℓ^{'}] = σ^{'} . κ [ℓ^{'}]$ .

(i) As the sequence ${e_{out} (j)}_{j \leq | E_{out} |}$ is topologically sorted, for each edge $e_{out} (i)$ outgoing from node $ℓ^{'}$ , it holds that $k < i$ .

From Lemma 5.7, and since $e_{out} (k)$ is the only edge of $T_{out}$ that enters $ℓ^{'}$ , we have that $β^{k - 1} (σ) . κ [ℓ^{'}] = min (σ . κ [ℓ^{'}], σ^{'} . κ [ℓ^{'}])$ . Because the transition $(e_{out} (k), w_{out} (k))$ adds $w_{out} (k)$ to $β^{k - 1} (σ) . κ [ℓ^{'}]$ , we have $β^{k} (σ) . κ [ℓ^{'}] = min (σ . κ [ℓ^{'}], σ^{'} . κ [ℓ^{'}]) + w_{out} (k)$ . Let S be the set of all immediate successors of $e_{out} (k)$ , i.e., $S = {i ∣ \exists ℓ^{''} . (ℓ^{'}, ℓ^{''}) = e_{out} (i)}$ . From the definition of $w_{out} (k)$ , it follows that $w_{out} (k) = max (δ_{out} (k), 0) + \sum_{s \in S} w_{out} (s)$ . Thus, the transition $(e_{out} (i), w_{out} (i))$ for edge $e_{out} (i)$ outgoing from node $ℓ^{'}$ , can be executed.

(ii) Let $j_{1}, \dots, j_{m}$ be the indices of all edges outgoing from $ℓ^{'}$ , and $j_{m}$ be the maximum among them. From (i), it follows that

\begin{matrix} (β^{j_{m}} (σ)) . κ [ℓ^{'}] = min (σ . κ [ℓ^{'}], σ^{'} . κ [ℓ^{'}]) + max (δ_{out} (k), 0), \end{matrix}

which equals $σ^{'} . κ [ℓ^{'}]$ .

This proves that the schedule $β^{| E_{out} |} = {crep}_{c}^{Ω} [σ, τ]$ is applicable to $σ$ .

Proving that ${crep}_{c}^{Ω} [σ, τ]$ results in $τ (σ)$ From the induction above, we conclude that for each $ℓ \in V_{c}$ , it holds that $(β^{| E_{out} |} (σ)) . κ [ℓ] = σ^{'} . κ [ℓ]$ . Edges in the trees $T_{in}$ and $T_{out}$ change only local states from $V_{c}$ . We conclude that for all $ℓ \in L$ , it holds that ${crep}_{c}^{Ω} [σ, τ] (σ) . κ [ℓ] = σ^{'} . κ [ℓ]$ . As the rules in non-singleton looplets do not change shared variables, ${crep}_{c}^{Ω} [σ, τ] (σ) . g = σ . g = σ^{'} . g$ . Therefore, ${crep}_{c}^{Ω} [σ, τ] (σ) = σ^{'}$ . $□$

Representatives for one context and one looplet

We now summarize results from Sects. 5.1 and 5.2, giving the representative of a schedule $τ$ in the case when $τ$ uses only the rules from one looplet, and does not change its context. If the given looplet consists of a single rule, the construction is given in Sect. 5.1, and otherwise in Sect. 5.2. We show that these constructions indeed give us a schedule of bounded length, that reaches the same state as $τ$ .

In the following, given a threshold automaton $TA$ and a looplet c, we will say that a schedule $τ = t_{1}, \dots, t_{n}$ is contained in c, if $[t_{i} . r u l e] = c$ for $1 \leq i \leq n$ .

Theorem 5.1

Fix a threshold automaton, and a context $Ω$ , and a looplet c in the slice ${TA |}_{Ω}$ . Let $σ$ be a configuration and $τ$ be a steady schedule contained in c and applicable to $σ$ . There exists a representative schedule ${crep}_{c}^{Ω} [σ, τ]$ with the following properties:

(a) schedule ${crep}_{c}^{Ω} [σ, τ]$ is applicable to $σ$ , and ${crep}_{c}^{Ω} [σ, τ] (σ) = τ (σ)$ ,
(b) the rule of each transition t in ${crep}_{c}^{Ω} [σ, τ]$ belongs to c, that is, $[t . r u l e] = c$ ,
(c) schedule ${crep}_{c}^{Ω} [σ, τ]$ is not longer than $2 \cdot | c |$ .

Proof

If $| c | = 1$ , then we use a single accelerated transition or the empty schedule as representative, as described in Lemma 5.1.

If $| c | > 1$ , we construct the representative as in Sect. 5.2, so that by Lemma 5.8 property (a) follows. For every edge $e \in E_{c}$ , the rule $L_{c} (e)$ belongs to c, and thus ${crep}_{c}^{Ω} [σ, τ]$ satisfies property (b). As $| E_{in} | \leq | c |$ and $| E_{out} | \leq | c |$ , we conclude that $| {crep}_{c}^{Ω} [σ, τ] | \leq 2 \cdot | c |$ , and thus property (c) is also satisfied. From this and Lemma 5.8, we conclude that ${crep}_{c}^{Ω} [σ, τ]$ is the required representative schedule. $□$

Theorem 5.1 gives us a way to construct schemas that generate all representatives of the schedules contained in a looplet:

Theorem 5.2

Fix a threshold automaton $TA$ , a context $Ω$ , and a looplet c in the slice ${TA |}_{Ω}$ . There exists a schema ${cschema}_{c}^{Ω}$ with the following properties:

Fix an arbitrary configuration $σ$ and a steady schedule $τ$ that is contained in c and is applicable to $σ$ . Let $τ^{'} = {crep}_{c}^{Ω} [σ, τ]$ be the representative schedule of $τ$ , from Theorem 5.1. Then, $path (σ, τ^{'})$ is generated by ${cschema}_{c}^{Ω}$ . Moreover, the length of ${cschema}_{c}^{Ω}$ is at most $2 \cdot | c |$ .

Proof

Note that $τ^{'} = {crep}_{c}^{Ω} [σ, τ]$ can be constructed in two different ways depending on the looplet c.

If $| c | = 1$ , then by Lemma 5.1 we have that $τ^{'} = (r, f)$ for a rule $r \in c$ and a factor $f \in N_{0}$ . In this case we construct ${cschema}_{c}^{Ω}$ to be

\begin{matrix} {cschema}_{c}^{Ω} = {Ω} r {Ω} . \end{matrix}

It is easy to see that $path (σ, τ^{'})$ is generated by ${cschema}_{c}^{Ω}$ , and that the length of ${cschema}_{c}^{Ω}$ is exactly 1, which is less than $2 \cdot | c |$ .

If $| c | > 1$ , then we use the trees $T_{in}$ and $T_{out}$ to construct the schema ${cschema}_{c}^{Ω}$ as follows:

\begin{matrix} {cschema}_{c}^{Ω} = {Ω} r_{in} (1) \dots r_{in} (| E_{in} |) \cdot r_{out} (1) \dots r_{out} (| E_{out} |) {Ω} . \end{matrix}

5.9

Since for an arbitrary configuration $σ$ and a schedule $τ$ , we use the same sequence of edges in Eqs. (5.4) and (5.9) to construct ${crep}_{c}^{Ω} [σ, τ]$ and ${cschema}_{c}^{Ω}$ , the schema ${cschema}_{c}^{Ω}$ generates all paths of the representative schedules, and its length is at most $2 \cdot | c |$ . $□$

Case II: one context and multiple looplets

In this section, we show that for each steady schedule, there exists a representative steady schedule of bounded length that reaches the same final state.

Theorem 6.1

Fix a threshold automaton and a context $Ω$ . For every configuration $σ$ with $ω (σ) = Ω$ and every steady schedule $τ$ applicable to $σ$ , there exists a steady schedule ${srep}_{Ω} [σ, τ]$ with the following properties:

${srep}_{Ω} [σ, τ]$ is applicable to $σ$ , and ${srep}_{Ω} [σ, τ] (σ) = τ (σ)$ ,
$| {srep}_{Ω} [σ, τ] | \leq 2 \cdot | (R |_{Ω}) |$

To construct a representative schedule, we fix a context $Ω$ of a TA, a configuration $σ$ with $ω (σ) = Ω$ , and a steady schedule $τ$ applicable to $σ$ . The key notion in our construction is a projection of a schedule on a set of looplets:

Definition 6.1

Let $τ = t_{1}, \dots, t_{k}$ , for $k > 0$ , be a schedule, and let C be a set of looplets. Given an increasing sequence of indices $i (1), \dots, i (m) \in {1, \dots, k}$ , where $m \leq k$ , i.e., $i (j) < i (j + 1)$ , for $1 \leq j < m$ , a schedule $t_{i (1)} \dots t_{i (m)}$ is a projection of $τ$ on C, if each index $j \in {1, \dots, k}$ belongs to ${i (1), \dots, i (m)}$ if and only if $[t_{j} . r u l e] \in C$ .

In fact, each schedule $τ$ has a unique projection on a set C. In the following, we write ${τ |}_{c_{1}, \dots, c_{m}}$ to denote the projection of $τ$ on a set ${c_{1}, \dots, c_{m}}$ .

Provided that $c_{1}, \dots, c_{m}$ are all looplets of the slice ${R |}_{Ω}$ , ordered with respect to the linear order on looplets, we construct the following sequence of projections on each looplet (note that $π_{0}$ is the empty schedule): $π_{i} = τ |_{c_{1}} \cdot \dots \cdot τ |_{c_{i}}$ for $0 \leq i \leq m$ .

Having defined ${π_{i}}_{0 \leq i \leq m}$ , we construct the representative ${srep}_{Ω} [σ, τ]$ simply as a concatenation of the representatives of each looplet:

\begin{matrix} {srep}_{Ω} [σ, τ] = {crep}_{c_{1}}^{Ω} [π_{0} (σ), τ |_{c_{1}}] \cdot {crep}_{c_{2}}^{Ω} [π_{1} (σ), τ |_{c_{2}}] \cdot \dots \cdot {crep}_{c_{m}}^{Ω} [π_{m - 1} (σ), τ |_{c_{m}}] \end{matrix}

Example 6.1

Consider the TA shown in Fig. 8. It has three looplets, namely $c_{1} = {r_{1}, r_{2}, r_{3}, r_{4}}$ , $c_{2} = {r_{5}}$ , $c_{3} = {r_{6}, r_{7}, r_{8}}$ , and the rules are depicted as solid, dotted, and dashed, respectively. These looplets are ordered such that $c_{1}$ precedes $c_{2}$ , and $c_{2}$ precedes $c_{3}$ .

Let $σ$ be the configuration represented in Fig. 8 left, i.e. $κ [ℓ_{3}] = κ [ℓ_{4}] = κ [ℓ_{5}] = 1$ and $κ [ℓ_{1}] = κ [ℓ_{2}] = κ [ℓ_{6}] = 0$ . Let $τ$ be the schedule $(r_{4}, 1), (r_{6}, 1), (r_{3}, 1),$ $(r_{4}, 1), (r_{1}, 1), (r_{2}, 1), (r_{7}, 1), (r_{3}, 1), (r_{1}, 1), (r_{5}, 1), (r_{7}, 1), (r_{4}, 1), (r_{8}, 1), (r_{1}, 1), (r_{6}, 1),$ $(r_{7}, 1), (r_{5}, 1), (r_{8}, 1), (r_{7}, 1)$ . Note that $τ$ is applicable to $σ$ and that $τ (σ)$ is the configuration $σ^{'}$ from Fig. 8 right, i.e. $κ [ℓ_{5}] = 1$ , $κ [ℓ_{6}] = 2$ and $κ [ℓ_{1}] = κ [ℓ_{2}] = κ [ℓ_{3}] = κ [ℓ_{4}] = 0$ . We construct the representative schedule ${srep}_{Ω} [σ, τ]$ .

Projection of $τ$ on the looplets $c_{1}$ , $c_{2}$ , and $c_{3}$ , gives us the following schedules:

\begin{matrix} {τ |}_{c_{1}} & = (r_{4}, 1), (r_{3}, 1), (r_{4}, 1), (r_{1}, 1), (r_{2}, 1), (r_{3}, 1), (r_{1}, 1), (r_{4}, 1), (r_{1}, 1), \\ {τ |}_{c_{2}} & = (r_{5}, 1), (r_{5}, 1), \\ {τ |}_{c_{3}} & = (r_{6}, 1), (r_{7}, 1), (r_{7}, 1), (r_{8}, 1), (r_{6}, 1), (r_{7}, 1), (r_{8}, 1), (r_{7}, 1) . \end{matrix}
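The projections above can be recomputed mechanically from Definition 6.1. The following is a minimal sketch in which a schedule is encoded as a list of (rule, factor) pairs and a looplet as a set of rule names; the function name `project` and this encoding are illustrative assumptions.

```python
def project(schedule, looplets):
    """Keep, in their original order, the transitions whose rule lies in one of `looplets`."""
    rules = set().union(*looplets)
    return [t for t in schedule if t[0] in rules]


# Looplets and schedule of Example 6.1.
C1 = {"r1", "r2", "r3", "r4"}
C2 = {"r5"}
C3 = {"r6", "r7", "r8"}
ORDER = "r4 r6 r3 r4 r1 r2 r7 r3 r1 r5 r7 r4 r8 r1 r6 r7 r5 r8 r7"
TAU = [(rule, 1) for rule in ORDER.split()]

tau_c1 = project(TAU, [C1])
tau_c2 = project(TAU, [C2])
tau_c3 = project(TAU, [C3])
```

Because the filter keeps exactly the transitions whose rule belongs to the given looplets and never reorders them, the result is the unique projection in the sense of Definition 6.1.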

Recall that

\begin{matrix} {srep}_{Ω} [σ, τ] = {crep}_{c_{1}}^{Ω} [π_{0} (σ), τ |_{c_{1}}] \cdot {crep}_{c_{2}}^{Ω} [π_{1} (σ), τ |_{c_{2}}] \cdot {crep}_{c_{3}}^{Ω} [π_{2} (σ), τ |_{c_{3}}] . \end{matrix}

In order to construct this schedule, we first construct the required configurations. Note that $π_{0} (σ) = σ$ . Then $π_{1} (σ) = τ |_{c_{1}} (σ)$ , and this is the configuration from Fig. 8 lower left, i.e. $κ [ℓ_{2}] = 2$ , $κ [ℓ_{5}] = 1$ and $κ [ℓ_{1}] = κ [ℓ_{3}] = κ [ℓ_{4}] = κ [ℓ_{6}] = 0$ . Configuration $π_{2} (σ) = (τ |_{c_{1}} \cdot τ |_{c_{2}}) (σ) = τ |_{c_{2}} (π_{1} (σ))$ is represented on Fig. 8 lower right, i.e. $κ [ℓ_{5}] = 3$ and all other counters are zero.

Section 5 deals with the construction of representatives of schedules that contain rules from only one looplet. Recall that construction of ${crep}_{c_{1}}^{Ω} [π_{0} (σ), τ |_{c_{1}}]$ corresponds to the one from Example 5.2. Thus, we know that

\begin{matrix} {crep}_{c_{1}}^{Ω} [π_{0} (σ), τ |_{c_{1}}] = (r_{2}, 0), (r_{3}, 1), (r_{4}, 2), (r_{1}, 2), (r_{2}, 0), (r_{3}, 0) . \end{matrix}

As $c_{2}$ is a singleton looplet, we use the result of Sect. 5.1. Thus,

\begin{matrix} {crep}_{c_{2}}^{Ω} [π_{1} (σ), τ |_{c_{2}}] = (r_{5}, 2) . \end{matrix}

Using the result from Sect. 5.2 we obtain that

\begin{matrix} {crep}_{c_{3}}^{Ω} [π_{2} (σ), τ |_{c_{3}}] = (r_{8}, 0), (r_{7}, 2), \end{matrix}

and finally we have the representative for $τ$ , which is

\begin{matrix} {srep}_{Ω} [σ, τ] = (r_{2}, 0), (r_{3}, 1), (r_{4}, 2), (r_{1}, 2), (r_{2}, 0), (r_{3}, 0), (r_{5}, 2), (r_{8}, 0), (r_{7}, 2) . \end{matrix}
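The example can be replayed on the counters alone. The sketch below is hedged: the text does not list the source and target local states of the rules (they are shown only in Fig. 8), so the endpoints in `RULES` are an assumption, chosen so that $τ$ is applicable and all configurations stated in this example are reproduced. In particular, treating $r_{6}$ as a self-loop on $ℓ_{5}$ is an inference, not a fact taken from the figure. Guards and shared variables are ignored, and all names are illustrative.

```python
# Assumed endpoints (from, to) of the rules; Fig. 8 is not reproduced in the text.
RULES = {
    "r1": ("l1", "l2"), "r2": ("l2", "l3"), "r3": ("l3", "l4"), "r4": ("l4", "l1"),
    "r5": ("l2", "l5"),
    "r6": ("l5", "l5"), "r7": ("l5", "l6"), "r8": ("l6", "l5"),
}


def apply_schedule(kappa, schedule):
    """Apply accelerated transitions (rule, factor) to the counters, checking applicability."""
    kappa = dict(kappa)
    for rule, factor in schedule:
        src, dst = RULES[rule]
        if kappa[src] < factor:
            raise ValueError(f"{rule} is not applicable")
        kappa[src] -= factor
        kappa[dst] += factor
    return kappa


SIGMA = {"l1": 0, "l2": 0, "l3": 1, "l4": 1, "l5": 1, "l6": 0}
ORDER = "r4 r6 r3 r4 r1 r2 r7 r3 r1 r5 r7 r4 r8 r1 r6 r7 r5 r8 r7"
TAU = [(rule, 1) for rule in ORDER.split()]
SREP = [("r2", 0), ("r3", 1), ("r4", 2), ("r1", 2), ("r2", 0), ("r3", 0),
        ("r5", 2), ("r8", 0), ("r7", 2)]

after_tau = apply_schedule(SIGMA, TAU)      # original schedule, 19 transitions
after_srep = apply_schedule(SIGMA, SREP)    # representative, 9 transitions
after_c1 = apply_schedule(SIGMA, SREP[:6])  # after the representative of c1
after_c2 = apply_schedule(SIGMA, SREP[:7])  # after the representatives of c1 and c2
```

Under these assumptions both schedules end in the configuration with $κ [ℓ_{5}] = 1$ and $κ [ℓ_{6}] = 2$ , and the two intermediate configurations coincide with $π_{1} (σ)$ and $π_{2} (σ)$ . The representative has 9 transitions, which is within the bound $2 \cdot | (R |_{Ω}) | = 16$ of Theorem 6.1.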

Lemma 6.1

(Looplet sorting) Given a threshold automaton, a context $Ω$ , a configuration $σ$ , a steady schedule $τ$ applicable to $σ$ , and a sequence $c_{1}, \dots, c_{m}$ of all looplets in the slice ${R |}_{Ω}$ with the property that $c_{i}$ precedes $c_{j}$ in the order on looplets for $1 \leq i < j \leq m$ , the following holds:

(1) Schedule ${τ |}_{c_{1}}$ is applicable to the configuration $σ$ .
(2) Schedule ${τ |}_{c_{2}, \dots, c_{m}}$ is applicable to the configuration ${τ |}_{c_{1}} (σ)$ .
(3) Schedule $τ |_{c_{1}} \cdot τ |_{c_{2}, \dots, c_{m}}$ , when applied to $σ$ , results in configuration $τ (σ)$ .

Proof

In the following, we show Points 1–3 one-by-one.

We need extra notation. For a local state $ℓ$ we denote by $1_{ℓ}$ the $| L |$ -dimensional vector, where the $ℓ$ th component is 1, and all the other components are 0. Given a schedule $ρ = t_{1} \dots t_{k}$ , we introduce a vector $Δ_{κ} (ρ) \in Z^{| L |}$ to keep counter difference and a vector $Δ_{g} (ρ) \in N_{0}^{| Γ |}$ to keep difference on shared variables as follows:

\begin{matrix} Δ_{κ} (ρ) = \sum_{1 \leq i \leq | ρ |} t_{i} . f a c t o r \cdot (1_{t_{i} . t o} - 1_{t_{i} . f r o m}) \quad \text{and} \quad Δ_{g} (ρ) = \sum_{1 \leq i \leq | ρ |} t_{i} . u . \end{matrix}

Proof of (1) Assume by contradiction that schedule ${τ |}_{c_{1}}$ is not applicable to configuration $σ$ . Thus, there is a schedule $τ^{'}$ and a transition $t^{*}$ that constitute a prefix of ${τ |}_{c_{1}}$ , with the following property: $τ^{'}$ is applicable to $σ$ , whereas $τ^{'} \cdot t^{*}$ is not applicable to $σ$ . Let $ℓ = t^{*} . f r o m$ and $ℓ^{'} = t^{*} . t o$ .

There are three cases of why $t^{*}$ may be not applicable to $τ^{'} (σ)$ :

(i) There are not enough processes to move: $(σ . κ + Δ_{κ} (τ^{'} \cdot t^{*})) [ℓ] < 0$ . As $τ$ is applicable to $σ$ , there is a transition t of $τ$ with $[t . r u l e] \neq c_{1}$ and $t . t o = ℓ$ as well as $t . f a c t o r > 0$ . From this, by definition of the order on looplets, it follows that $[t . r u l e]$ precedes $c_{1}$ . This contradicts the lemma’s assumption that $c_{1}$ precedes all other looplets in the sequence.

(ii) The condition $t^{*} . φ^{rise}$ is not satisfied, that is, $τ^{'} (σ) ⊭ t^{*} . φ^{rise}$ . Then, there is a guard $φ \in guard (t^{*} . φ^{rise})$ with $τ^{'} (σ) ⊭ φ$ .

Since $τ$ is applicable to $σ$ , there is a prefix $ρ \cdot t$ of $τ$ , for a schedule $ρ$ and a transition t that unlocks $φ$ in $ρ (σ)$ , that is, $ρ (σ) ⊭ φ$ and $t (ρ (σ)) ⊧ φ$ . Thus, transition t changes the context: $ω (ρ (σ)) \neq ω (t (ρ (σ)))$ . This contradicts the assumption that schedule $τ$ is steady.

(iii) The condition $t^{*} . φ^{fall}$ is not satisfied: $τ^{'} (σ) ⊭ t^{*} . φ^{fall}$ . Then, there is a guard $φ \in guard (t^{*} . φ^{fall})$ with $τ^{'} (σ) ⊭ φ$ .

Let $ρ$ be the longest prefix of $τ$ satisfying ${ρ |}_{c_{1}} = τ^{'}$ . Note that $ρ \cdot t^{*}$ is also a prefix of $τ$ . As ${ρ |}_{c_{1}} = τ^{'}$ and no transition decrements the shared variables, we conclude that $(τ^{'} (σ)) . g \leq (ρ (σ)) . g$ . From this and from the fact that $τ^{'} (σ) ⊭ φ$ , it follows that $ρ (σ) ⊭ φ$ . Thus transition $t^{*}$ is not applicable to $ρ (σ)$ . This contradicts the assumption that $τ$ is applicable to $σ$ .

From (i), (ii), and (iii), we conclude that (1) holds.

Proof of (2) We show that ${τ |}_{c_{2}, \dots, c_{m}}$ is applicable to ${τ |}_{c_{1}} (σ)$ .

To this end, we fix an arbitrary prefix $τ^{'}$ of $τ$ , a transition t, and a suffix $τ^{''}$ , that constitute $τ$ , that is, $τ = τ^{'} \cdot t \cdot τ^{''}$ . We show that if schedule $τ^{'} |_{c_{2}, \dots, c_{m}}$ is applicable to ${τ |}_{c_{1}} (σ)$ , then so is $(τ^{'} \cdot t) |_{c_{2}, \dots, c_{m}}$ .

Let us assume that $τ^{'} |_{c_{2}, \dots, c_{m}}$ is applicable to ${τ |}_{c_{1}} (σ)$ , and let $σ^{''}$ denote the resulting state $(τ |_{c_{1}} \cdot τ^{'} |_{c_{2}, \dots, c_{m}}) (σ)$ . We consider two cases:

$[t . r u l e] = c_{1}$ . This case holds trivially, as $(τ^{'} \cdot t) |_{c_{2}, \dots, c_{m}}$ equals $τ^{'} |_{c_{2}, \dots, c_{m}}$ , which is applicable to ${τ |}_{c_{1}} (σ)$ by assumption.
$[t . r u l e] \neq c_{1}$ . In order to prove that $(τ^{'} \cdot t) |_{c_{2}, \dots, c_{m}}$ is applicable to ${τ |}_{c_{1}} (σ)$ , we show that counters $σ^{''} . κ$ and shared variables $σ^{''} . g$ are large enough, so that transition t is applicable to $σ^{''}$ :

(i) We start by showing that $σ^{''} . κ [t . f r o m] \geq t . f a c t o r$ . We distinguish between different cases on source and target states of transition t.

(i.A)

We will show by contradiction that there is no rule $r \in c_{1}$ with $t . t o = r . f r o m$ . Let’s assume it exists. Then, on one hand, as $[t . r u l e] \neq c_{1}$ , by definition of the order on looplets, it follows that $[t . r u l e]$ precedes $c_{1}$ . On the other hand, as $[t . r u l e] \neq c_{1}$ and $c_{1}, \dots, c_{m}$ are all classes of the rules used in $τ$ , it holds that $[t . r u l e] \in {c_{2}, \dots, c_{m}}$ . By the lemma’s assumption, $c_{1}$ precedes $[t . r u l e]$ , and thus, $[t . r u l e]$ does not precede $c_{1}$ . We arrive at a contradiction.

(i.B)

Let’s consider the case of a rule $r \in c_{1}$ with $r . t o = t . f r o m$ . Assume by contradiction that t is not applicable to $σ^{''}$ , that is, $σ^{''} . κ [t . f r o m] < t . f a c t o r$ . On one hand, transition t is not applicable to $σ^{''} = (τ |_{c_{1}} \cdot τ^{'} |_{c_{2}, \dots, c_{m}}) (σ)$ . Then by the definition of $Δ_{κ}$ , it holds that $σ [t . f r o m] + (Δ_{κ} (τ |_{c_{1}} \cdot τ^{'} |_{c_{2}, \dots, c_{m}}) + Δ_{κ} (t)) [t . f r o m] < 0$ . By observing that $τ |_{c_{1}} = τ^{'} |_{c_{1}} + τ^{''} |_{c_{1}}$ , we derive the following inequality:

\begin{matrix} σ [t . f r o m] \\ + (Δ_{κ} (τ^{'} |_{c_{1}}) + Δ_{κ} (τ^{''} |_{c_{1}}) + Δ_{κ} (τ^{'} |_{c_{2}, \dots, c_{m}}) + Δ_{κ} (t)) [t . f r o m] < 0 \end{matrix}

6.1

On the other hand, schedule $τ = τ^{'} \cdot t \cdot τ^{''}$ is applicable to configuration $σ$ . Thus, $σ [t . f r o m] + (Δ_{κ} (τ^{'}) + Δ_{κ} (t) + Δ_{κ} (τ^{''})) [t . f r o m] \geq 0$ . By observing that $τ |_{c_{1}} = τ^{'} |_{c_{1}} + τ^{''} |_{c_{1}}$ and $τ |_{c_{2}, \dots, c_{m}} = τ^{'} |_{c_{2}, \dots, c_{m}} + τ^{''} |_{c_{2}, \dots, c_{m}}$ , we arrive at:

\begin{matrix} σ [t . f r o m] + (Δ_{κ} (τ^{'} |_{c_{1}}) + Δ_{κ} (τ^{'} |_{c_{2}, \dots, c_{m}}) \\ + Δ_{κ} (t) + Δ_{κ} (τ^{''} |_{c_{1}}) + Δ_{κ} (τ^{''} |_{c_{2}, \dots, c_{m}})) [t . f r o m] \geq 0 \end{matrix}

6.2

By subtracting (6.2) from (6.1), and by commutativity of vector addition, we arrive at $Δ_{κ} (τ^{''} |_{c_{2}, \dots, c_{m}}) [t . f r o m] > 0$ . Thus, there is a transition $t^{'}$ in $τ^{''} |_{c_{2}, \dots, c_{m}}$ and a rule $r^{'} \in c_{1}$ such that $t^{'} . t o = r^{'} . f r o m$ . We again arrived at the contradictory Case (i.A). Hence, transition t must be applicable to configuration $σ^{''}$ .

(i.C)

Otherwise, neither $t . f r o m$ nor $t . t o$ belong to the set of local states affected by the rules from $c_{1}$ , i.e., ${t . f r o m, t . t o} \cap {ℓ ∣ \exists r \in c_{1} . r . f r o m = ℓ \lor r . t o = ℓ}$ is empty. Then, schedule ${τ |}_{c_{1}}$ does not change the counter $κ [t . f r o m]$ , and $Δ_{κ} (τ^{'}) [t . f r o m] = Δ_{κ} (τ^{'} |_{c_{2}, \dots, c_{m}}) [t . f r o m]$ . As t is applicable to $τ^{'} (σ)$ , that is, $(τ^{'} (σ)) . κ [t . f r o m] \geq t . f a c t o r$ , we conclude that $σ^{''} . κ [t . f r o m] \geq t . f a c t o r$ .

(ii) We now show that $σ^{''} ⊧ t . φ^{rise} \land t . φ^{fall}$ . Assume by contradiction that $σ^{''} ⊭ t . φ^{rise} \land t . φ^{fall}$ . There are two cases to consider.

If $σ^{''} ⊭ t . φ^{rise}$ : By definition, the shared variables are never decremented in a non-singleton looplet. As $τ^{'}$ is a prefix of $τ$ , schedule $τ |_{c_{1}} \cdot τ^{'} |_{c_{2}, \dots, c_{m}}$ includes all transitions of $τ^{'}$ . Thus, $Δ_{g} (τ |_{c_{1}} \cdot τ^{'} |_{c_{2}, \dots, c_{m}}) \geq Δ_{g} (τ^{'})$ . From this and $σ^{''} ⊭ t . φ^{rise}$ , it follows that $τ^{'} (σ) ⊭ t . φ^{rise}$ . This contradicts applicability of $τ$ to $σ$ .
If $σ^{''} ⊭ t . φ^{fall}$ : Then, there is a guard $φ \in guard (t . φ^{fall})$ with $σ^{''} ⊭ φ$ . On one hand, $τ |_{c_{1}} \cdot τ^{'} |_{c_{2}, \dots, c_{m}}$ is applicable to $σ$ . On the other hand, $τ$ is applicable to $σ$ . We notice that $Δ_{g} (τ) = Δ_{g} (τ |_{c_{1}}) + Δ_{g} (τ^{'} |_{c_{2}, \dots, c_{m}}) + Δ_{g} (τ^{''} |_{c_{2}, \dots, c_{m}}) + Δ_{g} (t) \geq Δ_{g} (τ |_{c_{1}}) + Δ_{g} (τ^{'} |_{c_{2}, \dots, c_{m}})$ . As shared variables are never decreased, it follows that $(τ |_{c_{1}} \cdot τ^{'} |_{c_{2}, \dots, c_{m}}) (σ) ⊭ φ$ . Thus, $ω (σ) \neq ω (τ (σ))$ . This contradicts the assumption that schedule $τ$ is steady.

Having proved that, we conclude that transition t is applicable to configuration $(τ |_{c_{1}} \cdot τ^{'} |_{c_{2}, \dots, c_{m}}) (σ)$ . Thus, by induction, schedule $τ |_{c_{1}} \cdot τ |_{c_{2}, \dots, c_{m}}$ is applicable to $σ$ . We conclude that Point 2 of the lemma holds.

Proof of (3) By the commutativity property of vector addition,

\begin{matrix} Δ_{κ} (τ |_{c_{1}} \cdot τ |_{c_{2}, \dots, c_{m}}) = Δ_{κ} (τ |_{c_{1}}) + Δ_{κ} (τ |_{c_{2}, \dots, c_{m}}) = \sum_{1 \leq i \leq | τ |} Δ_{κ} (t_{i}) = Δ_{κ} (τ) . \end{matrix}

Thus, $(τ |_{c_{1}} \cdot τ |_{c_{2}, \dots, c_{m}}) (σ) = τ (σ)$ , and Point (3) follows.

We have thus shown all three points of Lemma 6.1. $□$

Proof

(of Theorem 6.1) By iteratively applying Lemma 6.1, we prove by induction that schedule $τ |_{c_{1}} \cdot \dots \cdot τ |_{c_{m}}$ is applicable to $σ$ and results in $τ (σ)$ . From Theorem 5.1, we conclude that each schedule ${τ |}_{c_{i}}$ can be replaced by its representative ${crep}_{c_{i}}^{Ω} [π_{i - 1} (σ), τ |_{c_{i}}]$ , whose length is at most $2 \cdot | c_{i} |$ . Thus, ${srep}_{Ω} [σ, τ]$ is applicable to $σ$ and results in $τ (σ)$ . As the looplets $c_{1}, \dots, c_{m}$ partition the rules of the slice ${R |}_{Ω}$ , the length of ${srep}_{Ω} [σ, τ]$ is at most $\sum_{1 \leq i \leq m} 2 \cdot | c_{i} | = 2 \cdot | (R |_{Ω}) |$ . By Proposition 3.4, schedule ${srep}_{Ω} [σ, τ]$ is steady, since $ω (σ) = ω (τ (σ))$ . $□$

Finally, we show that for a given context, there is a schema that generates all paths of such representative schedules.

Theorem 6.2

Fix a threshold automaton and a context $Ω$ . Let $c_{1}, \dots, c_{m}$ be the sorted sequence of all looplets of the slice ${R |}_{Ω}$ , i.e., $c_{i}$ precedes $c_{j}$ in the order on looplets for $1 \leq i < j \leq m$ . Schema ${sschema}_{Ω} = {cschema}_{c_{1}}^{Ω} \circ \dots \circ {cschema}_{c_{m}}^{Ω}$ has two properties: (a) For a configuration $σ$ with $ω (σ) = Ω$ and a steady schedule $τ$ applicable to $σ$ , $path (σ, τ^{'})$ of the representative $τ^{'} = {srep}_{Ω} [σ, τ]$ is generated by ${sschema}_{Ω}$ ; and (b) the length of ${sschema}_{Ω}$ is at most $2 \cdot | (R |_{Ω}) |$ .

Proof

Fix a configuration $σ$ with $ω (σ) = Ω$ and a steady schedule $τ$ applicable to $σ$ . As ${srep}_{Ω} [σ, τ]$ is a sorted sequence of the looplet representatives, all paths of ${srep}_{Ω} [σ, τ]$ are generated by ${sschema}_{Ω}$ , which is not longer than $2 \cdot | (R |_{Ω}) |$ . $□$

Proving the main result

Using the results from Sects. 5 and 6, for each configuration and each schedule (without restrictions) we construct a representative schedule.

Theorem 7.1

Given a threshold automaton, a configuration $σ$ , and a schedule $τ$ applicable to $σ$ , there exists a schedule $rep [σ, τ]$ with the following properties:

$rep [σ, τ]$ is applicable to $σ$ , and $rep [σ, τ] (σ) = τ (σ)$ ,
$| rep [σ, τ] | \leq 2 \cdot | R | \cdot (| Φ^{rise} | + | Φ^{fall} | + 1) + | Φ^{rise} | + | Φ^{fall} |$ .

Proof

Given a threshold automaton, fix a configuration $σ$ and a schedule $τ$ applicable to $σ$ . Let $Ω_{1}, \dots, Ω_{K + 1}$ be the maximal monotonically increasing sequence of contexts such that $path (σ, τ)$ is consistent with the sequence by Definition 3.7. From Proposition 3.2, the length of the sequence is $K + 1 = | Φ^{rise} | + | Φ^{fall} | + 1$ . Thus, there are at most K transitions $t_{1}^{⋆}, \dots, t_{K}^{⋆}$ in $τ$ that change their context, i.e., for $i \in {1, \dots, K}$ , it holds $ω (σ_{i}) ⊏ ω (t_{i}^{⋆} (σ_{i}))$ for $t_{i}^{⋆}$ ’s respective state $σ_{i}$ in $τ$ . Therefore, we can divide $τ$ into $K + 1$ steady schedules separated by the transitions $t_{1}^{⋆}, \dots, t_{K}^{⋆}$ :

\begin{matrix} τ = ν_{1} \cdot t_{1}^{⋆} \cdot ν_{2} \dots ν_{K} \cdot t_{K}^{⋆} \cdot ν_{K + 1} . \end{matrix}

Now, the main idea is to replace the steady schedules with their representatives from Theorem 6.1. That is, using $t_{1}^{⋆}, \dots, t_{K}^{⋆}$ and $ν_{1}, \dots, ν_{K + 1}$ , we construct the schedules $ρ_{1}, \dots, ρ_{K}$ (by convention, $ρ_{0}$ is the empty schedule):

\begin{matrix} ρ_{i} = ρ_{i - 1} \cdot ν_{i} \cdot t_{i}^{⋆} for 1 \leq i \leq K . \end{matrix}

Finally, the representative schedule $rep [σ, τ]$ is constructed as follows:

\begin{matrix} {srep}_{Ω_{1}} [σ, ν_{1}] \cdot t_{1}^{⋆} \cdot {srep}_{Ω_{2}} [ρ_{1} (σ), ν_{2}] \dots {srep}_{Ω_{K}} [ρ_{K - 1} (σ), ν_{K}] \cdot t_{K}^{⋆} \cdot {srep}_{Ω_{K + 1}} [ρ_{K} (σ), ν_{K + 1}] \end{matrix}

From Theorem 6.1, it follows that $rep [σ, τ]$ is applicable to $σ$ and it results in $τ (σ)$ . Moreover, the representative of a steady schedule is not longer than $2 | R |$ , which together with K transitions gives us the bound $2 | R | (K + 1) + K$ . As we have that $K = | Φ^{rise} | + | Φ^{fall} |$ , this gives us the required bound. $□$

Further, given a maximal monotonically increasing sequence z of contexts, we construct a schema that generates all paths of the schedules consistent with z:

Theorem 7.2

For a threshold automaton and a monotonically increasing sequence z of contexts, there exists a schema $schema (z)$ that generates all paths of the representative schedules that are consistent with z, and the length of $schema (z)$ does not exceed $3 \cdot | R | \cdot (| Φ^{rise} | + | Φ^{fall} |) + 2 \cdot | R |$ .

Proof

Given a threshold automaton, let $ρ_{all}$ be the sequence $r_{1}, \dots, r_{| R |}$ of all rules from $R$ , and let $z = Ω_{0}, \dots, Ω_{m}$ be a monotonically increasing sequence of contexts. By the construction in Theorem 7.1, each representative schedule $rep [σ, τ]$ consists of the representatives of steady schedules terminated with transitions that change the context. Then, for each context $Ω_{i}$ , for $0 \leq i < m$ , we compose ${sschema}_{Ω_{i}}$ and ${Ω_{i}} ρ_{all} {Ω_{i + 1}}$ . This composition generates the representative of a steady schedule and the transition changing the context from $Ω_{i}$ to $Ω_{i + 1}$ . Consequently, we construct the $schema (z)$ as follows:

\begin{matrix} ({sschema}_{Ω_{0}} \circ {Ω_{0}} ρ_{all} {Ω_{1}}) \circ \dots \circ ({sschema}_{Ω_{m - 1}} \circ {Ω_{m - 1}} ρ_{all} {Ω_{m}}) \circ {sschema}_{Ω_{m}} \end{matrix}

By inductively applying Theorem 6.2, we prove that $schema (z)$ generates all paths of schedules $rep [σ, τ]$ that are consistent with the sequence z. We get the needed bound on the length of $schema (z)$ by using an argument similar to Theorem 7.1 and by noting that for every context, instead of one rule that is changing it, we add $| R |$ extra rules. $□$
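The two length bounds can be compared with a small calculation. The sketch below only restates the counting arguments of Theorems 7.1 and 7.2; the function names and the sample numbers are hypothetical.

```python
def rep_length_bound(num_rules, num_rise, num_fall):
    """Bound of Theorem 7.1: K + 1 steady representatives plus K context-changing transitions."""
    k = num_rise + num_fall
    return 2 * num_rules * (k + 1) + k


def schema_length_bound(num_rules, num_rise, num_fall):
    """Bound of Theorem 7.2: every context change costs |R| rules instead of one."""
    k = num_rise + num_fall
    steady_part = 2 * num_rules * (k + 1)  # one sschema per context
    switch_part = num_rules * k            # one copy of rho_all per context change
    return steady_part + switch_part


# Hypothetical automaton with 8 rules, 2 rising guards and 1 falling guard.
rep_bound = rep_length_bound(8, 2, 1)
schema_bound = schema_length_bound(8, 2, 1)
```

The schema is longer than the representative because a schema cannot know in advance which rule changes the context, so it has to offer all of $ρ_{all}$ at every context switch.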

Complete set of schemas and optimizations

Our proofs show that the set of schemas is easily computed from the TA: the threshold guards are syntactic parts of the TA, and enable us to directly construct increasing sequences of contexts. To find a slice of the TA for a given context, we filter the rules with unlocked guards, i.e., check whether the context contains the guard. To produce the simple schema of a looplet, we compute a spanning tree over the slice. To construct simple schemas, we do a topological sort over the looplets. For example, it takes just 30 s to compute the schemas in our longest experiment that runs for 4 h. In our tool we have implemented the following optimizations that lead to simpler and fewer SMT queries.

Entailment optimization We say that a guard $φ_{1} \in Φ^{rise}$ entails a guard $φ_{2} \in Φ^{rise}$ , if for all combinations of parameters $p \in P_{R C}$ and shared variables $g \in N_{0}^{| Γ |}$ , it holds that $(g, p) ⊧ φ_{1} \to φ_{2}$ . For instance, in our example, $φ_{3} : y \geq (2 t + 1) - f$ entails $φ_{2} : y \geq (t + 1) - f$ . If $φ_{1}$ entails $φ_{2}$ , then we can omit all monotonically increasing sequences that contain a context $(Ω^{rise}, Ω^{fall})$ with $φ_{1} \in Ω^{rise}$ and $φ_{2} \notin Ω^{rise}$ . If the number of schemas before applying this optimization is m! and there are k entailments, then the number of schemas reduces from m! to $(m - k)!$ . A similar optimization is introduced for the guards from $Φ^{fall}$ .
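To make the entailment check concrete, here is a small brute-force sketch in Python for the two guards of the example. It only inspects small values, so it is a sanity check and not a proof; a tool would pose the same question to an SMT solver. The side condition $0 \leq f \leq t$ and the helper names are assumptions made for this illustration.

```python
from itertools import product


def phi2(y, t, f):
    """Guard y >= (t + 1) - f."""
    return y >= (t + 1) - f


def phi3(y, t, f):
    """Guard y >= (2t + 1) - f."""
    return y >= (2 * t + 1) - f


def entails_up_to(stronger, weaker, limit):
    """Return a counterexample (y, t, f) to 'stronger -> weaker', or None."""
    for y, t, f in product(range(limit), repeat=3):
        if f > t:
            continue  # assumed side condition: 0 <= f <= t
        if stronger(y, t, f) and not weaker(y, t, f):
            return (y, t, f)
    return None


print(entails_up_to(phi3, phi2, 8))  # no counterexample expected
print(entails_up_to(phi2, phi3, 8))  # the converse fails on small values
```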

Control flow optimization Based on the proof of Lemma 6.1, we introduce the following optimization for TAs that are directed acyclic graphs (possibly with self loops). We say that a rule $r \in R$ may unlock a guard $φ \in Φ^{rise}$ , if there is a $p \in P_{R C}$ and $g \in N_{0}^{| Γ |}$ satisfying: $(g, p) ⊧ r . φ^{rise} \land r . φ^{fall}$ (the rule is unlocked); $(g, p) ⊭ φ$ (the guard is locked); $(g + r . u, p) ⊧ φ$ (the guard is now unlocked).

In our example from Fig. 2, the rule $r_{1} : t r u e \mapsto x + +$ may unlock the guard $φ_{1} : x \geq ⌈ (n + t) / 2 ⌉ - f$ .
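The three conditions of "may unlock" can be illustrated by searching small values for a witness. This is only a sketch: the resilience condition $n > 3 t \land t \geq f \geq 0$ is an assumption made here, the search is bounded, and the function names are hypothetical.

```python
from itertools import product


def phi1(x, n, t, f):
    """Guard x >= ceil((n + t) / 2) - f, with the ceiling computed on integers."""
    return x >= -(-(n + t) // 2) - f


def may_unlock_witness(limit):
    """Search small values where rule r1 (guard true, update x++) unlocks phi1."""
    for n, t, f, x in product(range(limit), repeat=4):
        if not (n > 3 * t and t >= f):
            continue                    # assumed resilience condition
        before = phi1(x, n, t, f)       # the guard is locked before r1 fires
        after = phi1(x + 1, n, t, f)    # the guard is unlocked after x++
        if not before and after:
            return {"x": x, "n": n, "t": t, "f": f}
    return None


print(may_unlock_witness(6))
```

Any returned assignment is a pair $(g, p)$ in the sense of the definition: the rule is unlocked (its guard is true), $φ_{1}$ does not hold before the update, and it holds after it.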

Let $φ \in Φ^{rise}$ be a guard, $r_{1}^{'}, \dots, r_{m}^{'}$ be the rules that use $φ$ , and $r_{1}, \dots, r_{k}$ be the rules that may unlock $φ$ . If Inline graphic , for $1 \leq i \leq k$ and $1 \leq j \leq m$ , then we exclude some sequences of contexts as follows (we call $φ$ forward-unlockable). Let $ψ_{1}, \dots, ψ_{n} \in Φ^{rise}$ be the guards of $r_{1}, \dots, r_{k}$ . Guard $φ$ cannot be unlocked before $ψ_{1}, \dots, ψ_{n}$ , and thus we can omit all sequences of contexts, where $φ$ appears in the contexts before $ψ_{1}, \dots, ψ_{n}$ . Moreover, as $ψ_{1}, \dots, ψ_{n}$ are the only guards of the rules unlocking $φ$ , we omit the sequences with different combinations of contexts involving $φ$ and the guards from $Φ^{rise} \setminus \{φ, ψ_{1}, \dots, ψ_{n}\}$ . Finally, as the rules $r_{1}^{'}, \dots, r_{m}^{'}$ appear after the rules $r_{1}, \dots, r_{k}$ in the order Inline graphic , the rules $r_{1}^{'}, \dots, r_{m}^{'}$ appear after the rules $r_{1}, \dots, r_{k}$ in a rule sequence of every schema. Thus, we omit the combinations of the contexts involving $φ$ and $ψ_{1}, \dots, ψ_{n}$ .

Hence, we add all forward-unlockable guards to the initial context (we still check the guards of the rules in the SMT encoding in Sect. 9). If the number of schemas before applying this optimization is m! and there are k forward-unlockable guards, then the number of schemas reduces from m! to $(m - k)!$ . A similar optimization is introduced for the guards from $Φ^{fall}$ .
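The slicing step mentioned at the beginning of this section (filtering the rules whose guards are unlocked in a given context) can be sketched as follows. The `Rule` record, the guard names, and the reading of a context as "rise guards that already hold" plus "fall guards that no longer hold" are assumptions of this sketch, not the data structures of ByMC.

```python
from dataclasses import dataclass
from typing import FrozenSet, List


@dataclass(frozen=True)
class Rule:
    name: str
    src: str
    dst: str
    rise: FrozenSet[str] = frozenset()  # names of the rise guards on the rule
    fall: FrozenSet[str] = frozenset()  # names of the fall guards on the rule


def slice_for_context(rules: List[Rule], omega_rise: FrozenSet[str],
                      omega_fall: FrozenSet[str]) -> List[Rule]:
    """Keep the rules whose guards are all unlocked in the context.

    Assumption: omega_rise holds the rise guards that are already true and
    omega_fall holds the fall guards that are already false.
    """
    return [r for r in rules
            if r.rise <= omega_rise and not (r.fall & omega_fall)]


# Hypothetical three-rule automaton
rules = [
    Rule("r1", "l0", "l1"),
    Rule("r2", "l1", "l2", rise=frozenset({"phi1"})),
    Rule("r3", "l1", "l3", fall=frozenset({"phi4"})),
]
initial = slice_for_context(rules, frozenset(), frozenset())
print([r.name for r in initial])
```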

Checking a schema with SMT

We decompose a schema into a sequence of simple schemas, and encode the simple schemas. Given a simple schema $S = \{Ω_{1}\} r_{1}, \dots, r_{m} \{Ω_{2}\}$ , which contains m rules, we construct an SMT formula such that every model of the formula represents a path from $L (S)$ (the language of paths generated by schema S), and for every path in $L (S)$ there is a corresponding model of the formula. Thus, we need to model a path of $m + 1$ configurations and m transitions (whose acceleration factors may be 0).

To represent a configuration $σ_{i}$ , for $0 \leq i \leq m$ , we introduce two vectors of SMT variables. Given the set of local states $L$ and the set of shared variables $Γ$ , a vector $k^{i} = (k_{1}^{i}, \dots, k_{| L |}^{i})$ represents the process counters, and a vector $x^{i} = (x_{1}^{i}, \dots, x_{| Γ |}^{i})$ represents the shared variables. We call the pair $(k^{i}, x^{i})$ the layer i, for $0 \leq i \leq m$ .

Based on this we encode schemas, for which the sequence of rules $r_{1}, \dots, r_{m}$ is fixed. We exploit this in two ways: First, we encode for each layer i the constraints of rule $r_{i}$ . Second, as this constraint may update only two counters (the counter of the local state that processes move from and the counter of the local state that they move to according to the rule), we do not need $| L |$ counter variables per layer, but only encode the two counters per layer that have actually changed. As is common in bounded model checking, the counters that are not changed are “reused” from previous layers in our encoding. By doing so, we encode the schema rules with $| L | + | Γ | + m \cdot (2 + | Γ |)$ integer variables, 2m equations, and inequalities in linear integer arithmetic that represent threshold guards that evaluate to true (at most the number of threshold guards times m of these inequalities).
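A quick calculation shows what the reuse of counters saves. The sketch below compares the variable count stated above with a naive encoding that allocates all $| L | + | Γ |$ variables on each of the $m + 1$ layers; the naive count and the sample sizes are assumptions added for comparison only.

```python
def layered_vars(num_local: int, num_shared: int, m: int) -> int:
    """|L| + |Gamma| variables for layer 0, then 2 + |Gamma| per rule."""
    return num_local + num_shared + m * (2 + num_shared)


def naive_vars(num_local: int, num_shared: int, m: int) -> int:
    """Assumed baseline: a full copy of all counters on each of the m + 1 layers."""
    return (m + 1) * (num_local + num_shared)


# Hypothetical sizes: 10 local states, 2 shared variables, 50 rules in the schema
print(layered_vars(10, 2, 50), naive_vars(10, 2, 50))
```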

In the following, we use the notation $[k : m]$ to denote the set $\{k, \dots, m\}$ . In order to reuse the variables from the previous layers, we introduce a function $υ : L \times [0 : m] \to [0 : m]$ that for a layer $i \in [0 : m]$ and a local state $ℓ \in L$ , gives the largest number $j \leq i$ of the layer, where the counter $k_{ℓ}^{j}$ is updated:

\begin{matrix} υ (ℓ, i) = \begin{cases} i, & \text{if } i = 0 \lor ℓ \in \{r_{i} . from, r_{i} . to\} \\ υ (ℓ, i - 1), & \text{otherwise.} \end{cases} \end{matrix}
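The function $υ$ is easy to compute by walking backwards through the rule sequence. The following Python sketch represents each rule only by its pair (from, to); this representation and the example rules are illustrative assumptions.

```python
from typing import List, Tuple


def upsilon(rules: List[Tuple[str, str]], loc: str, i: int) -> int:
    """Largest layer j <= i in which the counter of `loc` is written.

    rules[i - 1] holds the pair (from, to) of rule r_i, for i in 1..m.
    """
    while i > 0 and loc not in rules[i - 1]:
        i -= 1  # rule r_i does not touch loc, so look at the previous layer
    return i


# Hypothetical rule sequence r_1, r_2, r_3
rules = [("a", "b"), ("b", "c"), ("a", "b")]
print(upsilon(rules, "c", 3), upsilon(rules, "a", 2), upsilon(rules, "d", 3))
```

A local state that no rule of the prefix touches is mapped to layer 0, which is where its initial counter lives.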

Having defined layers, we encode: the effect of rules on counters and shared variables (in formulas M and U below), the effect of rules on the configuration (T), restrictions imposed by contexts (C), and, finally, the reachability question.

To represent m transitions, for each transition $i \in [1 : m]$ , we introduce a non-negative variable $δ^{i}$ for the acceleration factor, and define two formulas: formula $M^{ℓ} (i - 1, i)$ to express the update of the counter of local state $ℓ \in L$ , and formula $U^{x} (i - 1, i)$ to represent the update of the shared variable $x \in Γ$ :

\begin{matrix} M^{ℓ} (i - 1, i) & \equiv \begin{cases} k_{ℓ}^{i} = k_{ℓ}^{υ (ℓ, i - 1)} + δ^{i}, & \text{for } ℓ = r_{i} . to \text{ and } i \in [1 : m] \\ k_{ℓ}^{i} = k_{ℓ}^{υ (ℓ, i - 1)} - δ^{i}, & \text{for } ℓ = r_{i} . from \text{ and } i \in [1 : m] \\ true, & \text{otherwise} \end{cases} \\ U^{x} (i - 1, i) & \equiv \begin{cases} x^{i} = x^{i - 1} + δ^{i} \cdot u, & \text{if } u = r_{i} . u [j] > 0, \\ true, & \text{otherwise.} \end{cases} \end{matrix}

The formula $T (i - 1, i)$ collects all constraints by the rule $r_{i}$ :

\begin{matrix} T (i - 1, i) \equiv \underset{ℓ \in L}{⋀} M^{ℓ} (i - 1, i) \land \underset{x \in Γ}{⋀} U^{x} (i - 1, i) . \end{matrix}
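As an illustration of how $T (i - 1, i)$ can be turned into solver input, the sketch below prints SMT-LIB assertions for one rule of a schema. It is not the encoder of ByMC. The variable naming scheme, the `Rule` record, and the restriction to rules with distinct from and to states are assumptions. The sketch also adds an explicit copy equation for shared variables that the rule does not change, where the formula above writes true.

```python
from typing import Dict, List, NamedTuple


class Rule(NamedTuple):
    src: str                # r.from
    dst: str                # r.to (assumed to differ from r.from)
    update: Dict[str, int]  # r.u: increment per shared variable


def upsilon(rules: List[Rule], loc: str, i: int) -> int:
    """Latest layer j <= i in which the counter of `loc` was written."""
    while i > 0 and loc not in (rules[i - 1].src, rules[i - 1].dst):
        i -= 1
    return i


def transition(rules: List[Rule], shared: List[str], i: int) -> List[str]:
    """SMT-LIB assertions for T(i-1, i): rule r_i with acceleration factor d_i."""
    r, d = rules[i - 1], f"d_{i}"
    prev_src = f"k_{r.src}_{upsilon(rules, r.src, i - 1)}"
    prev_dst = f"k_{r.dst}_{upsilon(rules, r.dst, i - 1)}"
    out = [
        f"(assert (>= {d} 0))",
        f"(assert (= k_{r.src}_{i} (- {prev_src} {d})))",  # M for r_i.from
        f"(assert (= k_{r.dst}_{i} (+ {prev_dst} {d})))",  # M for r_i.to
    ]
    for x in shared:
        u = r.update.get(x, 0)
        if u > 0:
            out.append(f"(assert (= {x}_{i} (+ {x}_{i - 1} (* {d} {u}))))")
        else:
            # copy equation added in this sketch to keep the layers connected
            out.append(f"(assert (= {x}_{i} {x}_{i - 1}))")
    return out


# Hypothetical two-rule schema over one shared variable x
rules = [Rule("l0", "l1", {"x": 1}), Rule("l1", "l2", {})]
for line in transition(rules, ["x"], 2):
    print(line)
```

In the second transition, the counter of `l2` is read from layer 0 because no earlier rule touched it, while the counter of `l1` is read from layer 1.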

For a formula $φ$ , we denote by $φ [x^{i}]$ the formula, where each variable $x \in Γ$ is substituted with $x^{i}$ . Then, given a context $Ω = (Ω^{rise}, Ω^{fall})$ , a formula $C_{Ω} (i)$ adds the constraints of the context $Ω$ on the layer i:

\begin{matrix} C_{Ω} (i) \equiv \underset{φ \in Ω^{rise}}{⋀} φ [x^{i}] \land \underset{φ \in Φ^{rise} \setminus Ω^{rise}}{⋀} \neg φ [x^{i}] \land \underset{φ \in Ω^{fall}}{⋀} \neg φ [x^{i}] \land \underset{φ \in Φ^{fall} \setminus Ω^{fall}}{⋀} φ [x^{i}] . \end{matrix}
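The four conjunctions of $C_{Ω} (i)$ translate directly into code. In the sketch below, a guard is modelled as a function from a layer index to an SMT-LIB term, which plays the role of the substitution $φ [x^{i}]$ ; this representation and all names are assumptions made for illustration.

```python
from typing import Callable, Dict, List, Set

Guard = Callable[[int], str]  # layer index -> SMT-LIB term over that layer


def context_constraints(rise: Dict[str, Guard], fall: Dict[str, Guard],
                        omega_rise: Set[str], omega_fall: Set[str],
                        i: int) -> List[str]:
    """SMT-LIB assertions for C_Omega(i)."""
    out = []
    for name, guard in rise.items():
        term = guard(i)
        # rise guards in the context hold, all other rise guards do not
        out.append(f"(assert {term})" if name in omega_rise
                   else f"(assert (not {term}))")
    for name, guard in fall.items():
        term = guard(i)
        # fall guards in the context do not hold, all other fall guards do
        out.append(f"(assert (not {term}))" if name in omega_fall
                   else f"(assert {term})")
    return out


# The two rise guards from the entailment example, over the shared variable y
rise = {
    "phi2": lambda i: f"(>= y_{i} (- (+ t 1) f))",
    "phi3": lambda i: f"(>= y_{i} (- (+ (* 2 t) 1) f))",
}
for line in context_constraints(rise, {}, {"phi2"}, set(), 3):
    print(line)
```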

Finally, the formula $C_{Ω_{1}} (0) \land T (0, 1) \land \dots \land T (m - 1, m) \land C_{Ω_{2}} (m)$ captures all the constraints of the schema $S = \{Ω_{1}\} r_{1}, \dots, r_{m} \{Ω_{2}\}$ , and thus, its models correspond to the paths of schedules that are generated by S.

Let I(0) be the formula over the variables of layer 0 that captures the initial states of the threshold automaton, and B(i) be a state property over the variables of layer i. Then, parameterized reachability for the schema S is encoded with the following formula in linear integer arithmetic:

\begin{matrix} I (0) \land C_{Ω_{1}} (0) \land T (0, 1) \land \dots \land T (m - 1, m) \land C_{Ω_{2}} (m) \land (B (0) \lor \dots \lor B (m)) . \end{matrix}

Experiments

We have extended our tool ByMC (Byzantine Model Checker [2]) with the technique discussed in this paper. All of our benchmark algorithms were originally published in pseudo-code, and we model them in a parametric extension of Promela, which was discussed in [27, 34].

Benchmarks

We revisited several asynchronous FTDAs that were evaluated in [33, 41]. In addition to these classic FTDAs, we considered asynchronous (Byzantine) consensus algorithms, namely, BOSCO [57], C1CS [10], and CF1S [18], that are designed to work despite partial failure of the distributed system. In contrast to the conference version of this paper [39], we used a new version of the benchmarks from [37] that have been slightly updated for liveness properties. Hence, for some benchmarks, the running times of our tool may vary from [39]. The benchmarks, their source code in parametric Promela, and the code of the threshold automata are freely available [30].

Implementation

ByMC supports several tool chains (shown in Fig. 1), the first using counter abstraction (that is, process counters over an abstract domain), and the second using counter systems with counters over integers:

Data and counter abstractions In this chain, the message counters are first mapped to parametric intervals, e.g., counters range over the abstract domain $\hat{D} = \{[0, 1), [1, t + 1), [t + 1, n - t), [n - t, \infty)\}$ . By doing so, we obtain a finite (data) abstraction of each process, and thus we can represent the system as a counter system: We maintain one counter $κ [ℓ]$ per local state $ℓ$ of a process, as well as the counters for the sent messages. Then, in the counter abstraction step, every process counter $κ [ℓ]$ is mapped to the set of parametric intervals $\hat{D}$ . As the abstractions may produce spurious counterexamples, we run them in an abstraction-refinement loop that incrementally prunes spurious transitions and unfair executions. More details on the data and counter abstractions and refinement can be found in [33]. In our experiments, we use two kinds of model checkers as backend:

BDD The counter abstraction is checked with nuXmv [11] using Binary Decision Diagrams (BDDs). For safety properties, the tool executes the command check_invar. In our experiments, we used the timeout of 3 days, as there was at least one benchmark that needed a bit more than a day to complete.
BMC The counter abstraction is checked with nuXmv using bounded model checking [6]. To ensure completeness (at the level of counter abstraction), we explore the computations of the length up to the diameter bounds that were obtained in [41]. To efficiently eliminate shallow spurious counterexamples, we first run the bounded model checker in the incremental mode up to length of 30. This is done by issuing the nuXmv command check_ltlspec_sbmc_inc, which uses the built-in SAT solver MiniSAT. Then, we run a single-shot SAT problem by issuing the nuXmv command gen_ltlspec_sbmc and checking the generated formula with the SAT solver lingeling [5]. In our experiments, we set the timeout to 1 day.
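The parametric interval abstraction used in this tool chain maps a concrete counter value to one of the four intervals of $\hat{D}$ . The sketch below evaluates this mapping for fixed values of n and t. The function name is hypothetical, and the assertion that the interval borders are strictly increasing is an assumption that holds, for instance, for $t \geq 1$ and $n > 2 t + 1$ .

```python
def abstract(value: int, n: int, t: int) -> int:
    """Index of the interval of D-hat that contains `value`.

    Intervals: 0 -> [0, 1), 1 -> [1, t+1), 2 -> [t+1, n-t), 3 -> [n-t, oo).
    """
    borders = [0, 1, t + 1, n - t]  # left ends of the four intervals
    assert value >= 0, "counters are non-negative"
    assert 1 < t + 1 < n - t, "assumed: borders are strictly increasing"
    # the interval is the last one whose left end does not exceed the value
    return max(j for j, b in enumerate(borders) if b <= value)


# Hypothetical instance n = 7, t = 2: borders 0, 1, 3, 5
print([abstract(v, 7, 2) for v in range(8)])
```

Two concrete values that fall into the same interval become indistinguishable after the abstraction, which is the source of the spurious behavior that the refinement loop has to remove.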

Reachability for threshold automata In this tool chain, to obtain a threshold automaton, our tool first applies data abstraction over the domain $\hat{D}$ to the Promela code, which abstracts the message counters that keep the number of messages received by every process, while the message counters for the sent messages are kept as integers. More details can be found in [40]. Having constructed a threshold automaton, we compare two verification approaches:

PARA $^{2}$ Bounded model checking with SMT The approach of this article. ByMC enumerates the schemas (as explained in Sect. 4), encodes them in SMT (as explained in Sect. 9) and checks every schema with the SMT solver Z3 [17].
FAST Acceleration of counter automata In this chain, our tool constructs a threshold automaton and checks the reachability properties with the existing tool FAST [3]. For comparison with our tool, we run FAST with the MONA plugin that produced the best results in our experiments.

The challenge in the verification of FTDAs is the immense non-determinism caused by interleavings, asynchronous message passing, and faults. In our modeling, all these are reflected in non-deterministic choices in the Promela code. To obtain threshold automata, as required for our technique, our tool constructs a parametric interval data abstraction [33] that adds to non-determinism.

Compared to [39], in this paper, we have introduced an optimization to schema checking that dramatically reduced the running times for some of the benchmarks. In this optimization, we group schemas in a prefix tree, whose nodes are contexts and edges are simple schemas. In each node of the prefix tree, our tool checks whether there are configurations that are reachable from the initial configurations by following the schemas in the prefix. If there are no such reachable configurations, we can safely prune the whole suffix and thus prove many schemas to be unsatisfiable at once.
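The pruning idea can be sketched with a plain trie. In the code below, a schema is a tuple of simple-schema names, and `reachable` is a hypothetical stand-in for the SMT query that asks whether some configuration is reachable along a prefix; neither reflects the actual data structures of ByMC.

```python
from typing import Callable, Dict, List, Tuple

Schema = Tuple[str, ...]  # a schema as a sequence of simple-schema names


def build_trie(schemas: List[Schema]) -> Dict:
    """Merge schemas that share a prefix into one tree of nested dicts."""
    root: Dict = {}
    for schema in schemas:
        node = root
        for step in schema:
            node = node.setdefault(step, {})
    return root


def explore(node: Dict, prefix: Schema,
            reachable: Callable[[Schema], bool], stats: Dict[str, int]) -> None:
    """Depth-first walk that skips every schema extending an unreachable prefix."""
    for step, child in node.items():
        extended = prefix + (step,)
        stats["queries"] += 1
        if not reachable(extended):
            continue  # prune the whole subtree below this edge
        explore(child, extended, reachable, stats)


# Hypothetical schemas; the prefix ("a", "b") is declared unreachable
schemas = [("a", "b", "c"), ("a", "b", "d"), ("a", "e", "f"), ("g", "h")]
stats = {"queries": 0}
explore(build_trie(schemas), (), lambda p: p[-1] != "b", stats)
print(stats["queries"])
```

In this toy run, one failed query on the shared prefix rules out two schemas at once, so fewer queries are issued than there are edges in the tree.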

Evaluation

Table 1 summarizes the features of threshold automata that are automatically constructed by ByMC from parametric Promela. The number of local states $| L |$ varies from 7 (FRB and STRB) to hundreds (C1CS and CBC). Our threshold automata are obtained by applying interval abstraction to Promela code, which keeps track of the number of messages received by each process. Thus, the number $| L |$ is proportional to the number of control states and $| \hat{D} |^{k}$ , where $\hat{D}$ is the domain of parametric intervals (discussed above) and k is the number of message types. Sometimes, one can manually construct a more efficient threshold automaton that models the same fault-tolerant distributed algorithm and preserves the same safety properties. For instance, Fig. 2 shows a manual abstraction of ABA that has only 5 local states, in contrast to 61 local states in the automatic abstraction (cf. Table 1). We leave open the question of whether one can automatically construct a minimal threshold automaton with respect to given specifications.

Table 1.

The benchmarks used in our experiments. Some benchmarks, e.g., ABA, require us to consider several cases on the parameters, which are mentioned in the column “Case”. The meaning of the other columns is as follows: $| L |$ is the number of local states in the TA, $| R |$ is the number of rules in the TA, $| Φ^{rise} |$ and $| Φ^{fall} |$ are the numbers of (R)- and (F)-guards, respectively. Finally, $| S |$ is the number of enumerated schemas, and Bound is the theoretical upper bound on $| S |$ , as given in Theorem 4.2

Table 2 summarizes our experiments conducted with the techniques introduced in Sect. 10.2: BDD, BMC, PARA $^{2}$ , and FAST. On large problems, our new technique works significantly better than BDD- and SAT-based model checking. BDD-based model checking works very well on top of counter abstraction. Importantly, our new technique does not use abstraction refinement. In comparison to our earlier experiments [39], we verified safety of a larger set of benchmarks with nuXmv. We believe that this is due to the improvements in nuXmv and, probably, slight modifications of the benchmarks from [37].

Table 2.

Summary of our experiments on AMD Opteron®6272, 32 cores, 192 GB. The symbols are: “ Inline graphic ” for timeout (72 h. for BDD and 24 h. otherwise); “” for memory overrun of 32 GB; “” for BDD nodes overrun; “” for timeout in the refinement loop (72 h. for BDD and 24 h. otherwise); “” for spurious counterexamples due to counter abstraction

NBAC and NBACC are challenging as the model checker produces many spurious counterexamples, which are an artifact of counter abstraction losing or adding processes. When using SAT-based model checking, the individual calls to nuXmv are fast, but the abstraction-refinement loop times out, due to a large number of refinements (about 500). BDD-based model checking times out when looking for a counterexample. Our new technique preserves the number of processes, and thus, there are no spurious counterexamples of this kind. In comparison to the general-purpose acceleration tool FAST, our tool uses less memory and is faster on the benchmarks where FAST is successful.

As predicted by the distributed algorithms literature, our tool finds counterexamples, when we relax the resilience condition. In contrast to counter abstraction, our new technique gives us concrete values of the parameters and shows how many processes move at each step of the counterexample.

Our new method uses integer counters and thus does not introduce spurious behavior due to counter abstraction, but still has spurious behavior due to data abstraction on complex FTDAs such as BOSCO, C1CS, and NBAC. In these cases, we manually refine the interval domain by adding new symbolic interval borders, see [33]. We believe that these intervals can be obtained directly from threshold automata, and no refinement is necessary. We leave this question to future work.

Sets of schemas and time to check a single schema

On the one hand, Theorem 4.2 gives us a theoretical bound on the number of schemas to be explored. On the other hand, optimizations discussed in Sect. 8 introduce many ways of reducing the number of schemas. Two columns in Table 1 compare the theoretical bound and the practical number of schemas: the column “Theoretical bound” shows the bound of $(| Φ^{rise} | + | Φ^{fall} |)!$ , while the column $| S |$ shows the actual number of schemas. (For reachability, we are merging the schemas with the prefix tree, and thus the actual number of explored schemas is even smaller.) As one can see, the theoretical bound is quite pessimistic, and is only useful to show completeness of the set of schemas. The much smaller numbers for the fault-tolerant distributed algorithms are due to a natural order on guards, e.g., as $x \geq t + 1$ becomes true earlier than $x \geq n - t$ under the resilience condition $n > 3 t$ . The drastic reduction in the case of CBC is due to the control flow optimization discussed in Sect. 8 and the fact that basically all guards are forward-unlockable.

When doing experiments, we noticed that the only kinds of guards that cannot be treated by our optimizations and blow up the number of schemas are the guards that use independent shared variables. For instance, consider the guards $x_{0} \geq n - t$ and $x_{1} \geq n - t$ that are counting the number of 0’s and 1’s sent by the correct processes. Even though they are mutually exclusive under the resilience condition $n > 3 t$ , our tool has to explore all possible orderings of these guards. We are not aware of a reduction that would prevent our method from exploding in the number of schemas for this example.

Since the schemas can be checked independently, one can check them in parallel. Figure 9 shows a distribution of schemas along with the time needed to check an individual schema. There are only a few divergent schemas that required more than 7 s to get checked, while the large portion of schemas require 1–3 s. Hence, a parallel implementation of the tool should verify the algorithms significantly faster. We leave such a parallel extension for future work.

Fig. 9 — The times required to check individual schemas and the distribution of schemas over these times (the value 0 refers to the running times of less than a second). The benchmarks containing the schemas that are verified in (a) $T \geq 8$ sec. and (b) $T \geq 18$ sec. are: (a) C1CS, CBC, CF1S, and (b) CBC and CF1S

Discussions and related work

We introduced a method to efficiently check reachability properties of FTDAs in a parameterized way. If $n > 7 t$ as for BOSCO, even the simplest interesting case with $t = 2$ leads to a system size that is out of range of explicit state model checking. Hence, FTDAs force us to develop parameterized verification methods.

The problem we consider is concerned with parameterized model checking, for which many interesting results exist [14, 15, 21–23, 35]; cf. [7] for a survey. However, the FTDAs considered by us run under different assumptions.

From a methodological viewpoint, our approach combines techniques from several areas including compact programs [49], counter abstraction [4, 55], completeness thresholds for bounded model checking [6, 16, 42], partial order reduction [8, 28, 53, 59], and Lipton’s movers [48]. Regarding counter automata, our result entails flattability [46] of every counter system of threshold automata: a complete set of schemas immediately gives us a flat counter automaton. Hence, the acceleration-based semi-algorithms [3, 46] should in principle terminate on the systems of TAs, though it did not always happen in our experiments. Similar to our SMT queries based on schemas, the inductive data flow graphs iDFG introduced in [24] are succinct representations of schedules (they call them traces) for systems where the number of processes (or threads) is fixed. The work presented in [25] then considers parameterized verification. Further, our execution schemas are inspired by a general notion of semi-linear path schemas SLPS [45, 46]. We construct a small complete set of schemas and thus a provably small SLPS. Besides, we distinguish counter systems and counter abstraction: the former counts processes as integers, while the latter uses counters over a finite abstract domain, e.g., $\{0, 1, many\}$ [55].

Many distributed algorithms can be represented with I/O Automata [50] or TLA+ [44]. In these frameworks, correctness is typically shown with a proof assistant, while model checking is used as a debugger on small instances. Parameterized model checking is not a concern there, except one notable result [32].

The results presented in this article can be used to check reachability properties of FTDAs. We can thus establish safety of FTDAs. However, for fault-tolerant distributed algorithms liveness is as important as safety: The seminal impossibility result by Fischer, Lynch, and Paterson [26] states that a fault-tolerant consensus algorithm cannot ensure both safety and liveness in asynchronous systems. In recent work [37] we also considered liveness verification, or more precisely, verification of temporal logic specification with the $G$ and $F$ temporal operators. In [37], we use the results of this article as a black box and show that combinations of schemas can be used to generate counterexamples to liveness properties, and that we can verify both safety and liveness by complete SMT-based bounded model checking.

Acknowledgements

Open access funding provided by Austrian Science Fund (FWF). We are grateful to Azadeh Farzan for valuable discussions during her stay in Vienna and to the anonymous reviewers for their insightful comments regarding partial order reduction, and for suggestions that helped us in improving the presentation of the paper.

Footnotes

Our model requires all variables to be non-negative integers. Although these constraints (e.g., $x^{1} \geq 0$ ) have to be encoded in the SMT queries, we omit these constraints here for a more concise presentation.

Supported by: the Austrian Science Fund (FWF) through the National Research Network RiSE (S11403 and S11405), project PRAVDA (P27722), and Doctoral College LogiCS (W1255-N23); and by the Vienna Science and Technology Fund (WWTF) through project APALACHE (ICT15-103). This is an extended version of the paper “SMT and POR beat Counter Abstraction: Parameterized Model Checking of Threshold-Based Distributed Algorithms” that appeared in CAV (Part I), volume 9206 of LNCS, pages 85–102, 2015.

Contributor Information

Igor Konnov, Email: konnov@forsyte.at, http://forsyte.at/konnov.

Marijana Lazić, Email: lazic@forsyte.at, http://forsyte.at/lazic.

Helmut Veith, Email: veith@forsyte.at, http://forsyte.at/veith.

Josef Widder, Phone: +43 (1) 58801-18263, Email: widder@forsyte.at, http://forsyte.at/widder.

References

1.Attiya H, Welch J. Distributed computing. 2. New York: Wiley; 2004.
2.ByMC: Byzantine model checker (2013). http://forsyte.tuwien.ac.at/software/bymc/. Accessed Dec 2016
3.Bardin S, Finkel A, Leroux J, Petrucci L. Fast: acceleration from theory to practice. STTT. 2008;10(5):401–424. doi: 10.1007/s10009-008-0064-3.
4.Basler G, Mazzucchi M, Wahl T, Kroening D (2009) Symbolic counter abstraction for concurrent software. In: CAV. LNCS, vol 5643, pp 64–78
5.Biere A (2013) Lingeling, Plingeling and Treengeling entering the SAT competition 2013. In: Proceedings of SAT competition 2013; Solver and p. 51
6.Biere A, Cimatti A, Clarke EM, Zhu Y (1999) Symbolic model checking without BDDs. In: TACAS. LNCS, vol 1579, pp 193–207
7.Bloem R, Jacobs S, Khalimov A, Konnov I, Rubin S, Veith H, Widder J. Decidability of parameterized verification, synthesis lectures on distributed computing theory. San Rafael: Morgan & Claypool; 2015.
8.Bokor P, Kinder J, Serafini M, Suri N (2011) Efficient model checking of fault-tolerant distributed protocols. In: DSN, pp 73–84
9.Bracha G, Toueg S. Asynchronous consensus and broadcast protocols. J ACM. 1985;32(4):824–840. doi: 10.1145/4221.214134.
10.Brasileiro FV, Greve F, Mostéfaoui A, Raynal M (2001) Consensus in one communication step. In: PaCT. LNCS, vol 2127, pp 42–50
11.Cavada R, Cimatti A, Dorigatti M, Griggio A, Mariotti A, Micheli A, Mover S, Roveri M, Tonetta S (2014) The nuXmv symbolic model checker. In: CAV. LNCS, vol 8559, pp 334–342
12.Chandra TD, Toueg S. Unreliable failure detectors for reliable distributed systems. J ACM. 1996;43(2):225–267. doi: 10.1145/226643.226647.
13.Clarke E, Grumberg O, Jha S, Lu Y, Veith H. Counterexample-guided abstraction refinement for symbolic model checking. J ACM. 2003;50(5):752–794. doi: 10.1145/876638.876643.
14.Clarke E, Talupur M, Touili T, Veith H (2004) Verification by network decomposition. In: CONCUR 2004, vol 3170, pp 276–291
15.Clarke E, Talupur M, Veith H (2008) Proving Ptolemy right: the environment abstraction framework for model checking concurrent systems. In: TACAS’08/ETAPS’08. Springer, Berlin, pp 33–47
16.Clarke EM, Kroening D, Ouaknine J, Strichman O (2004) Completeness and complexity of bounded model checking. In: VMCAI. LNCS, vol 2937, pp 85–96
17.De Moura L, Bjørner N (2008) Z3: an efficient SMT solver. In: Tools and algorithms for the construction and analysis of systems. LNCS, vol 1579, pp 337–340
18.Dobre D, Suri N (2006) One-step consensus with zero-degradation. In: DSN, pp 137–146
19.Drăgoi C, Henzinger TA, Zufferey D (2016) PSync: a partially synchronous language for fault-tolerant distributed algorithms. In: POPL, pp 400–415
20.Drăgoi C, Henzinger TA, Veith H, Widder J, Zufferey D (2014) A logic-based framework for verifying consensus algorithms. In: VMCAI. LNCS, vol 8318, pp 161–181
21.Emerson E, Namjoshi K (1995) Reasoning about rings. In: POPL, pp 85–94
22.Emerson EA, Kahlon V (2003) Model checking guarded protocols. In: LICS. IEEE, pp 361–370
23.Esparza J, Ganty P, Majumdar R (2013) Parameterized verification of asynchronous shared-memory systems. In: CAV, pp 124–140
24.Farzan A, Kincaid Z, Podelski A (2013) Inductive data flow graphs. In: POPL, pp 129–142
25.Farzan A, Kincaid Z, Podelski A (2015) Proof spaces for unbounded parallelism. In: POPL, pp 407–420
26.Fischer MJ, Lynch NA, Paterson MS. Impossibility of distributed consensus with one faulty process. J ACM. 1985;32(2):374–382. doi: 10.1145/3149.214121.
27.Gmeiner A, Konnov I, Schmid U, Veith H, Widder J (2014) Tutorial on parameterized model checking of fault-tolerant distributed algorithms. In: SFM. LNCS, vol 8483. Springer, Berlin, pp 122–171
28.Godefroid P (1990) Using partial orders to improve automatic verification methods. In: CAV. LNCS, vol 531, pp 176–185
29.Guerraoui R. Non-blocking atomic commit in asynchronous distributed systems with failure detectors. Distrib Comput. 2002;15(1):17–25. doi: 10.1007/s446-002-8027-4.
30.https://github.com/konnov/fault-tolerant-benchmarks/tree/master/fmsd17
31.Hawblitzel C, Howell J, Kapritsos M, Lorch JR, Parno B, Roberts ML, Setty STV, Zill B (2015) Ironfleet: proving practical distributed systems correct. In: SOSP, pp 1–17
32.Jensen H, Lynch N (1998) A proof of Burns n-process mutual exclusion algorithm using abstraction. In: Steffen B (ed) TACAS. LNCS, vol 1384. Springer, Berlin, pp 409–423
33.John A, Konnov I, Schmid U, Veith H, Widder J (2013) Parameterized model checking of fault-tolerant distributed algorithms by abstraction. In: FMCAD, pp 201–209
34.John A, Konnov I, Schmid U, Veith H, Widder J (2013) Towards modeling and model checking fault-tolerant distributed algorithms. In: SPIN. LNCS, vol 7976, pp 209–226
35.Kaiser A, Kroening D, Wahl T (2012) Efficient coverability analysis by proof minimization. In: CONCUR, pp 500–515
36.Kesten Y, Pnueli A. Control and data abstraction: the cornerstones of practical formal verification. STTT. 2000;2:328–342. doi: 10.1007/s100090050040.
37.Konnov I, Lazić M, Veith H, Widder J (2017) A short counterexample property for safety and liveness verification of fault-tolerant distributed algorithms. In: POPL, pp 719–734
38.Konnov I, Veith H, Widder J (2014) On the completeness of bounded model checking for threshold-based distributed algorithms: reachability. In: CONCUR. LNCS, vol 8704, pp 125–140
39.Konnov I, Veith H, Widder J (2015) SMT and POR beat counter abstraction: parameterized model checking of threshold-based distributed algorithms. In: CAV (Part I). LNCS, vol 9206, pp 85–102
40.Konnov I, Veith H, Widder J (2016) What you always wanted to know about model checking of fault-tolerant distributed algorithms. In: PSI 2015, revised selected papers. LNCS, vol 9609. Springer, pp 6–21
41.Konnov I, Veith H, Widder J. On the completeness of bounded model checking for threshold-based distributed algorithms: reachability. Inf Comput. 2017;252:95–109. doi: 10.1016/j.ic.2016.03.006.
42.Kroening D, Strichman O (2003) Efficient computation of recurrence diameters. In: VMCAI. LNCS, vol 2575, pp 298–309
43.Lamport L. Time, clocks, and the ordering of events in a distributed system. Commun ACM. 1978;21(7):558–565. doi: 10.1145/359545.359563.
44.Lamport L. Specifying systems: the TLA+ language and tools for hardware and software engineers. Boston: Addison-Wesley Longman Publishing Co. Inc; 2002.
45.Leroux J, Sutre G (2004) On flatness for 2-dimensional vector addition systems with states. In: CONCUR 2004-concurrency theory. Springer, pp 402–416
46.Leroux J, Sutre G (2005) Flat counter automata almost everywhere! In: ATVA. LNCS, vol 3707, pp 489–503
47.Lesani M, Bell CJ, Chlipala A (2016) Chapar: certified causally consistent distributed key-value stores. In: POPL, pp 357–370
48.Lipton RJ. Reduction: a method of proving properties of parallel programs. Commun ACM. 1975;18(12):717–721. doi: 10.1145/361227.361234.
49.Lubachevsky BD. An approach to automating the verification of compact parallel coordination programs. I. Acta Inform. 1984;21(2):125–169. doi: 10.1007/BF00289237.
50.Lynch N. Distributed algorithms. Burlington: Morgan Kaufman; 1996.
51.Mostéfaoui A, Mourgaya E, Parvédy PR, Raynal M (2003) Evaluating the condition-based approach to solve consensus. In: DSN, pp 541–550
52.Padon O, McMillan KL, Panda A, Sagiv M, Shoham S (2016) Ivy: safety verification by interactive generalization. In: PLDI, pp 614–630
53.Peled D (1993) All from one, one for all: on model checking using representatives. In: CAV. LNCS, vol 697, pp 409–423
54.Peluso S, Turcu A, Palmieri R, Losa G, Ravindran B (2016) Making fast consensus generally faster. In: DSN, pp 156–167
55.Pnueli A, Xu J, Zuck L (2002) Liveness with (0,1,$\infty $)-counter abstraction. In: CAV. LNCS, vol 2404, pp 93–111
56.Raynal M (1997) A case study of agreement problems in distributed systems: non-blocking atomic commitment. In: HASE, pp 209–214
57.Song YJ, van Renesse R (2008) Bosco: one-step Byzantine asynchronous consensus. In: DISC. LNCS, vol 5218, pp 438–450
58.Srikanth T, Toueg S. Simulating authenticated broadcasts to derive simple fault-tolerant algorithms. Distrib Comput. 1987;2:80–94. doi: 10.1007/BF01667080.
59.Valmari A (1991) Stubborn sets for reduced state space generation. In: Advances in Petri Nets 1990. LNCS, vol 483. Springer, pp 491–515
60.Wilcox JR, Woos D, Panchekha P, Tatlock Z, Wang X, Ernst MD, Anderson TE (2015) Verdi: a framework for implementing and formally verifying distributed systems. In: PLDI, pp 357–368

