spot/doc/org/oaut.org
Alexandre Duret-Lutz cb9867b7d4 org: fix -o example
* doc/org/oaut.org: Adjust the parameters of randaut so that we get a
mix of deterministic and nondeterministic automata.
2015-03-05 20:42:38 +01:00

28 KiB

Common output options for automata

Spot supports different output syntaxes for automata. This page documents the options, common to all tools where it makes sense, that are used to specify how to output of automata.

Common output options

All tools that can output automata implement the following options:

  -8, --utf8                 enable UTF-8 characters in output (ignored with
                             --lbtt or --spin)
      --dot[=c|h|n|N|s|t|v]  GraphViz's format (default).  Add letters for (a)
                             acceptance display, (c) circular nodes, (h)
                             horizontal layout, (v) vertical layout, (n) with
                             name, (N) without name, (s) with SCCs, (t) force
                             transition-based acceptance.
  -H, --hoaf[=s|t|m|l]       Output the automaton in HOA format.  Add letters
                             to select (s) prefer state-based acceptance when
                             possible [default], (t) force transition-based
                             acceptance, (m) mix state and transition-based
                             acceptance, (l) single-line output
      --lbtt[=t]             LBTT's format (add =t to force transition-based
                             acceptance even on Büchi automata)
      --name=FORMAT          set the name of the output automaton
  -o, --output=FORMAT        send output to a file named FORMAT instead of
                             standard output.  The first automaton sent to a
                             file truncates it unless FORMAT starts with '>>'.
  -q, --quiet                suppress all normal output
  -s, --spin[=6|c]           Spin neverclaim (implies --ba).  Add letters to
                             select (6) Spin's 6.2.4 style, (c) comments on
                             states
      --stats=FORMAT         output statistics about the automaton

The main three output formats (that can also been used as input to some of the tools) are HOAF (activated by -H or --hoaf), LBTT (activated by --lbtt), or Spin never claims (activated by -s or --spin). These three formats also support streaming, i.e., you can concatenate multiple automata (and even mix these three formats in the same stream), and the tools will be able to read and process them in sequence.

The other possible outputs are GraphViz output (--dot), various statistics (--stats), or nothing at all (--quiet). It may seem strange to ask a tool to not output anything, but it makes sense when only the exit status matters (for instance using autfilt to check whether an input automaton has some property) or for timing purposes.

HOAF output

The HOAF output should be the preferred format to use if you want to pass automata between different tools. This format can be requested using the -H option.

Here is an example where ltl2tgba is used to construct two automata: one for a U b and one for (Ga -> Gb) W c.

ltl2tgba -H 'a U b' '(Ga -> Gb) W c'
HOA: v1
name: "a U b"
States: 2
Start: 1
AP: 2 "a" "b"
acc-name: Buchi
Acceptance: 1 Inf(0)
properties: trans-labels explicit-labels state-acc deterministic
--BODY--
State: 0 {0}
[t] 0
State: 1
[1] 0
[0&!1] 1
--END--
HOA: v1
name: "(Gb | F!a) W c"
States: 5
Start: 1
AP: 3 "b" "a" "c"
acc-name: Buchi
Acceptance: 1 Inf(0)
properties: trans-labels explicit-labels trans-acc
--BODY--
State: 0
[0] 0 {0}
State: 1
[0&1&!2] 0
[!1&!2] 1 {0}
[1&!2] 2
[2] 3
State: 2
[!1&!2] 1 {0}
[1&!2] 2
[!1&2] 3
[1&2] 4
State: 3
[t] 3 {0}
State: 4
[!1] 3
[1] 4
--END--

The above output contains two automata, named after the formulas they represent. Here is a picture of these two automata:

/alarsyo/spot/media/commit/cb9867b7d4ce7d67cbb89ceda02bd43b15d8721d/doc/org/hoafex.png

The HOA format supports both state and transition-based acceptance. Although Spot works only with transition-based acceptance, its output routines default to state-based acceptance whenever possible (this is the case in the first of these two automata) and use transition-based acceptance otherwise. You can change this behavior using -Hs (or --hoaf=s), -Ht, or -Hm. Option s corresponds to the default to use state-based acceptance whenever possible. Option t forces transition-based acceptance. For instance compare this output to the previous one:

ltl2tgba -Ht 'a U b'
HOA: v1
name: "a U b"
States: 2
Start: 0
AP: 2 "a" "b"
acc-name: Buchi
Acceptance: 1 Inf(0)
properties: trans-labels explicit-labels trans-acc deterministic
--BODY--
State: 0
[1] 1
[0&!1] 0
State: 1
[t] 1 {0}
--END--

Option m uses mixed acceptance, i.e, some states might use state-based acceptance while other will not:

ltl2tgba -Hm '(Ga -> Gb) W c'
HOA: v1
name: "(Gb | F!a) W c"
States: 5
Start: 1
AP: 3 "b" "a" "c"
acc-name: Buchi
Acceptance: 1 Inf(0)
properties: trans-labels explicit-labels
--BODY--
State: 0 {0}
[0] 0
State: 1
[0&1&!2] 0
[!1&!2] 1 {0}
[1&!2] 2
[2] 3
State: 2
[!1&!2] 1 {0}
[1&!2] 2
[!1&2] 3
[1&2] 4
State: 3 {0}
[t] 3
State: 4
[!1] 3
[1] 4
--END--

It is also possible to output each automaton on a single line, in case the result should be used with line-based tools or embedded into a CSV file… Here is an example using both transition-based acceptance, and single-line output:

ltl2tgba -Htl 'a U b' '(Ga -> Gb) W c'
HOA: v1 name: "a U b" States: 2 Start: 1 AP: 2 "a" "b" acc-name: Buchi Acceptance: 1 Inf(0) properties: trans-labels explicit-labels trans-acc deterministic --BODY-- State: 0 [t] 0 {0} State: 1 [1] 0 [0&!1] 1 --END--
HOA: v1 name: "(Gb | F!a) W c" States: 5 Start: 1 AP: 3 "b" "a" "c" acc-name: Buchi Acceptance: 1 Inf(0) properties: trans-labels explicit-labels trans-acc --BODY-- State: 0 [0] 0 {0} State: 1 [0&1&!2] 0 [!1&!2] 1 {0} [1&!2] 2 [2] 3 State: 2 [!1&!2] 1 {0} [1&!2] 2 [!1&2] 3 [1&2] 4 State: 3 [t] 3 {0} State: 4 [!1] 3 [1] 4 --END--

LBTT output

The LBTT output has two flavors: state-based (which is used to output Büchi automata or monitors) or transition-based (for TGBA).

ltl2tgba --ba --lbtt 'p0 U p1'
2 1
0 1 -1
1 p1
0 & p0 ! p1
-1
1 0 0 -1
1 t
-1

If you want to request transition-based output even for Büchi automata, use --lbtt=t.

ltl2tgba --ba --lbtt=t 'p0 U p1'
2 1t
0 1
1 -1 p1
0 -1 & p0 ! p1
-1
1 0
1 0 -1 t
-1

Note that the LBTT output generalizes the format output by LBT with support for transition-based acceptance. Both formats however are restricted to atomic propositions of the form p0, p1, etc… In case other atomic propositions are used, Spot output them in double quotes. This other extension of the format is also supported by ltl2dstar.

ltl2tgba --ba --lbtt 'a U b'
2 1
0 1 -1
1 "b"
0 & "a" ! "b"
-1
1 0 0 -1
1 t
-1

Spin output

Spin never claims can be requested using -s or --spin. They can only represent Büchi automata, so these options imply --ba.

ltl2tgba -s 'a U b'
never { /* a U b */
T0_init:
  if
  :: ((b)) -> goto accept_all
  :: ((a) && (!(b))) -> goto T0_init
  fi;
accept_all:
  skip
}

Recent versions of Spin (starting with Spin 6.2.4) output never claims in a slightly different style that can be requested using either -s6 or --spin=6:

ltl2tgba -s6 'a U b'
never { /* a U b */
T0_init:
  do
  :: atomic { ((b)) -> assert(!((b))) }
  :: ((a) && (!(b))) -> goto T0_init
  od;
accept_all:
  skip
}

(Note that while Spot is able to read never claims that follow any of these two styles, it is not capable of interpreting an arbitrary piece of Promela syntax.)

Dot output

The --dot option (which usually is the default) causes automata to be output in GraphViz's format.

ltl2tgba '(Ga -> Gb) W c'
digraph G {
  rankdir=LR
  I [label="", style=invis, width=0]
  I -> 1
  0 [label="0"]
  0 -> 0 [label="b\n{0}"]
  1 [label="1"]
  1 -> 0 [label="a & b & !c"]
  1 -> 1 [label="!a & !c\n{0}"]
  1 -> 2 [label="a & !c"]
  1 -> 3 [label="c"]
  2 [label="2"]
  2 -> 1 [label="!a & !c\n{0}"]
  2 -> 2 [label="a & !c"]
  2 -> 3 [label="!a & c"]
  2 -> 4 [label="a & c"]
  3 [label="3"]
  3 -> 3 [label="1\n{0}"]
  4 [label="4"]
  4 -> 3 [label="!a"]
  4 -> 4 [label="a"]
}

This output should be processed with dot to be converted into a picture. For instance use dot -Tpng or dot -Tpdf.

/alarsyo/spot/media/commit/cb9867b7d4ce7d67cbb89ceda02bd43b15d8721d/doc/org/oaut-dot1.png

This output can be customized by passing optional characters to the --dot option. For instance v requests a vertical layout (instead of the default horizontal layout), c requests circle states, s causes strongly-connected components to be displayed, n causes the name (see below) of the automaton to be displayed, and a causes the acceptance condition to be shown as well.

ltl2tgba --dot=vcsna '(Ga -> Gb) W c'

/alarsyo/spot/media/commit/cb9867b7d4ce7d67cbb89ceda02bd43b15d8721d/doc/org/oaut-dot2.png

The acceptance condition is displayed in the same way as in the HOA format. Here Inf(0) means that runs are accepting if and only if they visit some the transitions in the set #0 infinitely often.

The strongly connected components are displayed using the following colors:

  • green components contain an accepting cycle
  • red components contain no accepting cycle
  • orange components may or may not contain an accepting cycle. Such an indecision occur only when using Fin acceptance primitive, deciding that would require a better algorithm than what the output routine is using.
  • black components are trivial (i.e., they contain no cycle)
  • gray components are useless (i.e., they are non-accepting, and are only followed by non-accepting components)

Here is an example involving all colors:

/alarsyo/spot/media/commit/cb9867b7d4ce7d67cbb89ceda02bd43b15d8721d/doc/org/oaut-dot3.png

Statistics

The --stats option takes format string parameter to specify what and how statistics should be output.

Most tool support a common set of statistics about the output automaton (like %s for the number of states, %t for transitions, %e for edges, etc.) Additional statistics might be available depending on what the tool does (for instance autfilt also has %S, %T, and %E to display the same statistics about the input automaton). All the available statistics are displayed when a tool is run with --help.

For instance here are the statistics available in randaut:

  %%                         a single %
  %a                         number of acceptance sets
  %c                         number of SCCs
  %d                         1 if the output is deterministic, 0 otherwise
  %e                         number of edges
  %F                         seed number
  %L                         automaton number
  %m                         name of the automaton
  %n                         number of nondeterministic states in output
  %p                         1 if the output is complete, 0 otherwise
  %r                         processing time (excluding parsing) in seconds
  %s                         number of states
  %t                         number of transitions
  %w                         one word accepted by the output automaton

In most tools %F and %L are the input filename and line number, but as this makes no sense in randaut, these two sequences emit numbers related to the generation of automata.

For instance let's generate 100 random automata with 10 states and density 0.2, and just count the number of edges in each automaton. Then use R to summarize the distribution of these values:

randaut -d 0.2 -S 10 -n 1000 a --stats %e > size.csv
R --slave -e "summary(read.csv('size.csv', header=FALSE, col.names='edges'))"
     edges
 Min.   :17.00
 1st Qu.:25.00
 Median :28.00
 Mean   :27.96
 3rd Qu.:30.00
 Max.   :42.00

For $S=10$ states and density $D=0.2$ the expected degree of each state $1+(S-1)D = 1+9\times 0.2 = 2.8$ so the expected number of edges should be 10 times that.

Naming automata

Automata can be given names. This name can be output in GraphViz output when --dot=n is given, and is also part of the HOA format (as activated by -H).

By default, ltl2tgba will use the input format as name. Other tools have no default name. This name can be changed using the --name option, that takes a format string similar to the one of --stats.

ltl2tgba --name='TGBA for %f' --dot=n 'a U b'

/alarsyo/spot/media/commit/cb9867b7d4ce7d67cbb89ceda02bd43b15d8721d/doc/org/oaut-name.png

If you have an automaton saved in the HOA format, you can extract its name using autfilt --stats=%M input.hoa. The %M escape sequence is replaced by the name of the input automaton.

Here is a pipeline of commands that generates five LTL formulas $\varphi$ such that both $\varphi$ and $\lnot\varphi$ are translated into a 3-state TGBA by ltl2tgba. It starts by generating an infinite stream of random LTL formulas using a and b as atomic propositions, then it converts these formulas as TGBA (in the HOA format, therefore carrying the formula as name), filtering only the TGBA with 3 states and outputting !(%M) (that is the negation of the associated formula), translating the resulting formulas as TGBA, again retaining only the names (i.e. formulas) of the automata with 3 states, and finally restricting the output to the first 5 matches using autfilt -n5.

randltl -n -1 a b |
ltl2tgba -H -F- |
autfilt --states=3 --stats='!(%M)' |
ltl2tgba -H -F- |
autfilt --states=3 --stats=%M -n5
G(F!a & XF(a | G!b))
GFb | G(!b & FG!b)
!a & F((a | b) & (!a | !b))
!a | (b R a)
!b & X(!b U a)

Note that the above result can also be obtained without using autfilt and automata names. We can use the fact that ltl2tgba --stats can output the automaton size, and that ltl2tgba is also capable of reading from a CSV file (-F-/2 instructs ltl2tgba to read the standard input as if it was a CSV file, and to process its second column):

randltl -n -1 a b |                 # generate a stream of random LTL formulas
ltl2tgba -F- --stats='%s,!(%f)' |   # for each formula output "states,negated formula"
grep '^3,' |                        # keep only formulas with 3 states
ltl2tgba -F-/2 --stats='%s,%f' |    # for each negated formula output "states,formula"
grep '^3,' |                        # keep only negated formulas with 3 states
head -n5 | cut -d, -f2              # return the five first formulas
G(F!a & XF(a | G!b))
GFb | G(!b & FG!b)
!a & F((!a | !b) & (a | b))
!a | (b R a)
!b & X(!b U a)

Naming output

By default, all output is sent to standard output, so you can either redirect it to a file, or pipe it to another program. You can also use the --output (a.k.a. -o) option to specify a filename where automata should be written. The advantage over a shell redirection, is that you may build a name using the same escape sequences as used by --stats and --name.

For instance %d is replaced by 0 or 1 depending on whether the automaton is deterministic. We can generate 20 random automata, and output them in two files depending on their determinism:

randaut -n 20 -S2 -d1 1 -H -o out-det%d.hoa
autfilt -c out-det0.hoa    # Count of non-deterministic automata
autfilt -c out-det1.hoa    # Count of deterministic automata
14
6

If you use this feature, beware that the output filename is only truncated by the first file that is output to it: so if no automaton generate some filename, the existing file will be left untouched. For instance we we run the above commands, again, but forcing randaut to output 20 deterministic automata, it may look like we produced more than 20 automata:

randaut -D -n 20 -S2 -d1 1 -H -o out-det%d.hoa
autfilt -c out-det0.hoa    # Count of non-deterministic automata
autfilt -c out-det1.hoa    # Count of deterministic automata
14
20

This is because the out-det0.hoa file hasn't changed from the previous execution, while out-det1.hoa has been overwritten.

In the case where you want to append to a file instead of overwriting it, prefix the output filename with >> as in

randaut -D -n 20 -S2 1 -H -o '>>out-det%d.hoa'

(You need the quotes so that the shell does not interpret '>>'.)