org: document SAT-based minimization

* doc/org/satmin.org, doc/org/satmin.tex: New files.
* doc/Makefile.am: Add them.
* doc/org/tools.org: Point to satmin.org.
* NEWS: Mention satmin.html.
2013-09-15 22:37:55 +02:00 · 2013-09-15 22:37:55 +02:00 · 3076c3da4e
commit 3076c3da4e
parent cda847e207
5 changed files with 519 additions and 1 deletions
--- a/doc/org/satmin.org
+++ b/doc/org/satmin.org
@@ -0,0 +1,403 @@
+#+TITLE: SAT-based Minimization of Deterministic (Generalized) Büchi Automata
+#+EMAIL: spot@lrde.epita.fr
+#+OPTIONS: H:2 num:nil toc:t
+#+LINK_UP: file:tools.html
+
+This page explains how to use [[file:ltl2tgba.org][=ltl2tgba=]] or [[file:dstar2tgba.org][=dstar2tgba=]] to minimize
+deterministic automata using a SAT solver.
+
+Let us first state a few facts about this minimization procedure.
+
+1) The procedure works only on *deterministic* Büchi automata: any
+   recurrence property can be converted into a deterministic Büchi
+   automaton, and sometimes there are several ways of doing so.
+2) Spot actually implements two SAT-based minimization procedures: one
+   that builds a deterministic transition-based Büchi automaton
+   (DTBA), and one that builds a deterministic transition-based
+   generalized Büchi automaton (DTGBA).  For the latter, we can supply
+   the number $m$ of acceptance sets to use.
+3) These two procedures can optionally constrain their output to
+   use state-based acceptance. (They simply restrict all the outgoing
+   transitions of a state to belong to the same acceptance sets.)
+4) A SAT solver should be installed for this to work. (Spot does not
+   distribute any SAT solver.)
+5) [[file:ltl2tgba.org][=ltl2tgba=]] and [[file:dstar2tgba.org][=dstar2tgba=]] will always try to output an automaton.
+   If they fail to determinize the property, they will simply output a
+   nondeterministic automaton; if they managed to obtain a
+   deterministic automaton but failed to minimize it (e.g., the
+   requested number of states in the final automaton is too low), they
+   will return that "unminimized" deterministic automaton.  There are
+   only two cases where these tools will abort without returning an
+   automaton: when the number of clauses output by Spot (and to be fed
+   to the SAT solver) exceeds $2^{31}$, or when the SAT solver was
+   killed by a signal.
+
+* How to change the SAT solver used
+
+The environment variable =SPOT_SATSOLVER= can be used to change the
+SAT solver used by Spot.  The default is "=glucose %I >%O=", therefore
+if you have installed [[https://www.lri.fr/~simon/?page=glucose][=glucose=]] in your =$PATH=, it should work right
+away.  Otherwise you may redefine this variable to point to the correct
+location or to another SAT solver.  The =%I= and =%O= sequences will be
+replaced by the names of temporary files containing the input for the
+SAT solver and receiving its output.  We assume that the SAT solver
+follows the conventions of the [[http://www.satcompetition.org/][SAT competition]] for input and
+output.
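+
+As a minimal sketch of such a redefinition, assume that =glucose= was
+installed under the hypothetical directory =/opt/glucose/bin= (this
+path is only an illustration).  The =%I= and =%O= sequences are kept
+literally in the value so that Spot can substitute the temporary file
+names itself.
+
+#+BEGIN_SRC sh :exports code
+# Hypothetical install location: adjust to wherever your solver lives.
+GLUCOSE=/opt/glucose/bin/glucose
+# The redirection is part of the value; it is not performed here.
+export SPOT_SATSOLVER="$GLUCOSE %I >%O"
+echo "$SPOT_SATSOLVER"
+#+END_SRC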
+
+* Enabling SAT-based minimization for deterministic automata
+
+Both tools follow the same interface, because they use the same
+post-processing steps internally (i.e., the =spot::postprocessor=
+class).
+
+First, option =-D= should be used to declare that you are looking for
+more determinism.  This will tweak the translation algorithm used by
+=ltl2tgba= to improve determinism, and will also instruct the
+post-processing routine used by both tools to prefer a
+deterministic automaton over a smaller equivalent nondeterministic
+automaton.
+
+However, =-D= does not guarantee that the output is deterministic,
+even if a deterministic automaton exists.  For instance, =-D= fails to
+produce a deterministic automaton for =GF(a <-> XXb)=.  Instead we get
+a 9-state nondeterministic automaton.
+
+#+BEGIN_SRC sh :results verbatim :exports both
+ltl2tgba -D "GF(a <-> XXb)" --stats='states=%s, det=%d'
+#+END_SRC
+#+RESULTS:
+: states=9, det=0
+
+Option =-x tba-det= enables an additional
+determinization procedure that would otherwise not be used by =-D=
+alone.  This procedure will work on any automaton that can be
+represented by a DTBA; if the automaton to process uses multiple
+acceptance conditions, it will be degeneralized first.
+
+On our example, =-x tba-det= successfully produces a deterministic
+TBA, but a non-minimal one:
+
+#+BEGIN_SRC sh :results verbatim :exports both
+ltl2tgba -D -x tba-det "GF(a <-> XXb)" --stats='states=%s, det=%d'
+#+END_SRC
+#+RESULTS:
+: states=7, det=1
+
+Option =-x sat-minimize= will turn on SAT-based minimization.  It also
+implies =-x tba-det=, so there is no need to supply both options.
+
+#+BEGIN_SRC sh :results verbatim :exports both
+ltl2tgba -D -x sat-minimize "GF(a <-> XXb)" --stats='states=%s, det=%d'
+#+END_SRC
+#+RESULTS:
+: states=4, det=1
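+
+The three runs above can be compared side by side with a small loop.
+This is only a sketch: the helper name =compare_det_options= is made
+up for this page, and it uses no option beyond those already shown.
+
+#+BEGIN_SRC sh :exports code
+# Print states/determinism for -D alone, with tba-det, and with sat-minimize.
+compare_det_options() {
+  for opts in '' '-x tba-det' '-x sat-minimize'; do
+    printf '%-16s ' "${opts:-(-D only)}"
+    # $opts is deliberately unquoted so that "-x NAME" splits into two words.
+    ltl2tgba -D $opts "$1" --stats='states=%s, det=%d'
+  done
+}
+# Run the comparison only when Spot is installed.
+if command -v ltl2tgba >/dev/null 2>&1; then
+  compare_det_options 'GF(a <-> XXb)'
+fi
+#+END_SRC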
+
+We can draw it:
+
+#+BEGIN_SRC sh :results verbatim :exports code
+ltl2tgba -D -x sat-minimize "GF(a <-> XXb)"
+#+END_SRC
+#+RESULTS:
+#+begin_example
+digraph G {
+  0 [label="", style=invis, height=0]
+  0 -> 1
+  1 [label="1"]
+  1 -> 1 [label="a & !b\n"]
+  1 -> 2 [label="!b & !a\n"]
+  1 -> 2 [label="b & !a\n{Acc[1]}"]
+  1 -> 3 [label="a & b\n{Acc[1]}"]
+  2 [label="2"]
+  2 -> 4 [label="!b & !a\n"]
+  2 -> 4 [label="b & !a\n{Acc[1]}"]
+  2 -> 3 [label="a & !b\n"]
+  2 -> 3 [label="a & b\n{Acc[1]}"]
+  3 [label="4"]
+  3 -> 1 [label="a & !b\n{Acc[1]}"]
+  3 -> 1 [label="a & b\n"]
+  3 -> 2 [label="!b & !a\n{Acc[1]}"]
+  3 -> 2 [label="b & !a\n"]
+  4 [label="3"]
+  4 -> 2 [label="!b & !a\n{Acc[1]}"]
+  4 -> 4 [label="b & !a\n"]
+  4 -> 3 [label="a & !b\n{Acc[1]}"]
+  4 -> 3 [label="a & b\n"]
+}
+#+end_example
+
+#+NAME: gfaexxb3
+#+BEGIN_SRC sh :results verbatim :exports none
+ltl2tgba -D -x sat-minimize "GF(a <-> XXb)" | sed 's/\\/\\\\/'
+#+END_SRC
+#+RESULTS: gfaexxb3
+#+begin_example
+digraph G {
+  0 [label="", style=invis, height=0]
+  0 -> 1
+  1 [label="1"]
+  1 -> 1 [label="a & !b\\n"]
+  1 -> 1 [label="a & b\\n{Acc[1]}"]
+  1 -> 2 [label="!b & !a\\n"]
+  1 -> 2 [label="b & !a\\n{Acc[1]}"]
+  2 [label="2"]
+  2 -> 3 [label="!b & !a\\n"]
+  2 -> 3 [label="b & !a\\n{Acc[1]}"]
+  2 -> 4 [label="a & !b\\n"]
+  2 -> 4 [label="a & b\\n{Acc[1]}"]
+  3 [label="3"]
+  3 -> 1 [label="a & !b\\n{Acc[1]}"]
+  3 -> 3 [label="b & !a\\n"]
+  3 -> 4 [label="!b & !a\\n{Acc[1]}"]
+  3 -> 4 [label="a & b\\n"]
+  4 [label="4"]
+  4 -> 1 [label="a & !b\\n{Acc[1]}"]
+  4 -> 1 [label="a & b\\n"]
+  4 -> 2 [label="!b & !a\\n{Acc[1]}"]
+  4 -> 2 [label="b & !a\\n"]
+}
+#+end_example
+
+#+BEGIN_SRC dot :file gfaexxb3.png :cmdline -Tpng :var txt=gfaexxb3 :exports results
+$txt
+#+END_SRC
+#+RESULTS:
+[[file:gfaexxb3.png]]
+
+Clearly this automaton benefits from the transition-based
+acceptance.  If we want a traditional Büchi automaton, with
+state-based acceptance, we only need to add the =-B= option.  The
+result will of course be slightly bigger.
+
+#+BEGIN_SRC sh :results verbatim :exports code
+ltl2tgba -BD -x sat-minimize "GF(a <-> XXb)"
+#+END_SRC
+#+RESULTS:
+#+begin_example
+digraph G {
+  0 [label="", style=invis, height=0]
+  0 -> 1
+  1 [label="1", peripheries=2]
+  1 -> 2 [label="!a\n{Acc[1]}"]
+  1 -> 3 [label="a & !b\n{Acc[1]}"]
+  1 -> 4 [label="a & b\n{Acc[1]}"]
+  2 [label="2", peripheries=2]
+  2 -> 1 [label="!b & !a\n{Acc[1]}"]
+  2 -> 4 [label="a\n{Acc[1]}"]
+  2 -> 5 [label="b & !a\n{Acc[1]}"]
+  3 [label="4"]
+  3 -> 1 [label="a & b\n"]
+  3 -> 2 [label="b & !a\n"]
+  3 -> 3 [label="a & !b\n"]
+  3 -> 6 [label="!b & !a\n"]
+  4 [label="3"]
+  4 -> 1 [label="!b\n"]
+  4 -> 3 [label="a & b\n"]
+  4 -> 6 [label="b & !a\n"]
+  5 [label="6"]
+  5 -> 1 [label="!b\n"]
+  5 -> 4 [label="a & b\n"]
+  5 -> 5 [label="b & !a\n"]
+  6 [label="5"]
+  6 -> 1 [label="a & b\n"]
+  6 -> 2 [label="b & !a\n"]
+  6 -> 4 [label="a & !b\n"]
+  6 -> 5 [label="!b & !a\n"]
+}
+#+end_example
+
+#+NAME: gfaexxb4
+#+BEGIN_SRC sh :results verbatim :exports none
+ltl2tgba -BD -x sat-minimize "GF(a <-> XXb)" | sed 's/\\/\\\\/'
+#+END_SRC
+#+RESULTS: gfaexxb4
+#+begin_example
+digraph G {
+  0 [label="", style=invis, height=0]
+  0 -> 1
+  1 [label="1", peripheries=2]
+  1 -> 1 [label="!b & !a\\n{Acc[1]}"]
+  1 -> 2 [label="b & !a\\n{Acc[1]}"]
+  1 -> 3 [label="a\\n{Acc[1]}"]
+  2 [label="2"]
+  2 -> 1 [label="!b & !a\\n"]
+  2 -> 2 [label="b & !a\\n"]
+  2 -> 3 [label="a & !b\\n"]
+  2 -> 4 [label="a & b\\n"]
+  3 [label="3", peripheries=2]
+  3 -> 5 [label="!a\\n{Acc[1]}"]
+  3 -> 6 [label="a\\n{Acc[1]}"]
+  4 [label="5"]
+  4 -> 1 [label="!b & !a\\n"]
+  4 -> 5 [label="b & !a\\n"]
+  4 -> 3 [label="a & !b\\n"]
+  4 -> 6 [label="a & b\\n"]
+  5 [label="4"]
+  5 -> 1 [label="b & !a\\n"]
+  5 -> 2 [label="!b & !a\\n"]
+  5 -> 3 [label="a & b\\n"]
+  5 -> 4 [label="a & !b\\n"]
+  6 [label="6"]
+  6 -> 1 [label="b & !a\\n"]
+  6 -> 5 [label="!b & !a\\n"]
+  6 -> 3 [label="a & b\\n"]
+  6 -> 6 [label="a & !b\\n"]
+}
+#+end_example
+
+#+BEGIN_SRC dot :file gfaexxb4.png :cmdline -Tpng :var txt=gfaexxb4 :exports results
+$txt
+#+END_SRC
+#+RESULTS:
+[[file:gfaexxb4.png]]
+
+
+There are cases where =ltl2tgba='s =tba-det= algorithm fails to produce a deterministic automaton.
+In that case, SAT-based minimization is simply skipped.  For instance:
+
+#+BEGIN_SRC sh :results verbatim :exports both
+ltl2tgba -D -x sat-minimize "Ga R (F!b & (c U b))" --stats='states=%s, det=%d'
+#+END_SRC
+#+RESULTS:
+: states=4, det=0
+
+The question, of course, is whether there exists a deterministic
+automaton for this formula, in other words: is this a recurrence
+property?  There are two ways to answer this question using Spot (and
+some help from [[http://www.ltl2dstar.de/][=ltl2dstar=]]).
+
+The first is purely syntactic.  If a formula belongs to the class of
+"syntactic recurrence formulas", it expresses a recurrence property.
+(Of course there are formulas that express recurrence properties
+without being syntactic recurrences.)  [[file:ltlfilt.org][=ltlfilt=]] can be instructed to
+print only formulas that are syntactic recurrences:
+
+#+BEGIN_SRC sh :results verbatim :exports both
+ltlfilt --syntactic-recurrence -f "Ga R (F!b & (c U b))"
+#+END_SRC
+#+RESULTS:
+: Ga R (F!b & (c U b))
+
+Since our input formula was output, it expresses a recurrence property.
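+
+In a script, this check can be wrapped in a small function that tests
+whether =ltlfilt= printed anything.  The function name
+=is_syntactic_recurrence= is hypothetical; the sketch relies only on
+the filtering behavior described above.
+
+#+BEGIN_SRC sh :exports code
+# Succeed if ltlfilt echoes the formula back, i.e., if it passes the filter.
+is_syntactic_recurrence() {
+  [ -n "$(ltlfilt --syntactic-recurrence -f "$1")" ]
+}
+# Run the check only when Spot is installed.
+if command -v ltlfilt >/dev/null 2>&1; then
+  if is_syntactic_recurrence 'Ga R (F!b & (c U b))'; then
+    echo 'syntactic recurrence'
+  else
+    echo 'not a syntactic recurrence (it may still be a recurrence)'
+  fi
+fi
+#+END_SRC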
+
+The second way to check whether a formula is a recurrence is by
+converting a deterministic Rabin automaton using [[file:dstar2tgba.org][=dstar2tgba=]].  The
+output is guaranteed to be deterministic if and only if the input DRA
+expresses a recurrence property.
+
+#+BEGIN_SRC sh :results verbatim :exports both
+ltlfilt -f "Ga R (F!b & (c U b))" -l |
+ltl2dstar --ltl2nba=spin:../../src/bin/ltl2tgba@-Ds - - |
+dstar2tgba -D --stats='input(states=%S) output(states=%s, acc-sets=%a, det=%d)'
+#+END_SRC
+#+RESULTS:
+: input(states=11) output(states=9, acc-sets=1, det=1)
+
+In the above command, =ltlfilt= is used to convert the LTL formula
+into =ltl2dstar='s syntax.  Then =ltl2dstar= creates a deterministic
+Rabin automaton (using =ltl2tgba= as an LTL to BA translator), and the
+resulting 11-state DRA is converted into a 9-state DTBA by
+=dstar2tgba=.  Since that result is deterministic, we can conclude
+that the formula was a recurrence.
+
+As far as SAT-based minimization goes, =dstar2tgba= will take the same
+options as =ltl2tgba=, so we can for instance check that the smallest
+DTBA has 6 states:
+
+#+BEGIN_SRC sh :results verbatim :exports both
+ltlfilt -f "Ga R (F!b & (c U b))" -l |
+ltl2dstar --ltl2nba=spin:../../src/bin/ltl2tgba@-Ds - - |
+dstar2tgba -D -x sat-minimize --stats='input(states=%S) output(states=%s, acc-sets=%a, det=%d)'
+#+END_SRC
+#+RESULTS:
+: input(states=11) output(states=6, acc-sets=1, det=1)
+
+* More acceptance sets
+
+The formula "=Ga R (F!b & (c U b))=" can in fact be minimized into an
+even smaller automaton if we use multiple acceptance sets.
+
+Unfortunately, because =dstar2tgba= does not know the formula being
+translated, and it always converts a DRA into a DBA (with a single
+acceptance set) before further processing, it does not know if using
+more acceptance sets could be useful to further minimize it.   This
+number of acceptance sets can however be specified on the command-line
+with option =-x sat-acc=M=.  For instance:
+
+#+BEGIN_SRC sh :results verbatim :exports both
+ltlfilt -f "Ga R (F!b & (c U b))" -l |
+ltl2dstar --ltl2nba=spin:../../src/bin/ltl2tgba@-Ds - - |
+dstar2tgba -D -x sat-minimize,sat-acc=2 --stats='input(states=%S) output(states=%s, acc-sets=%a, det=%d)'
+#+END_SRC
+#+RESULTS:
+: input(states=11) output(states=5, acc-sets=2, det=1)
+
+Beware that the size of the SAT problem is exponential in the number of acceptance sets.
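+
+To compare several values of =sat-acc=, the DRA can be built once,
+saved in a temporary file, and fed to =dstar2tgba= repeatedly.  This is
+a sketch under two assumptions: =ltl2tgba= is found in =$PATH= (rather
+than through the relative path used above), and the helper name
+=try_acc_sets= is made up for this page.  Keep the list of values
+short, since the SAT problem grows quickly with the number of sets.
+
+#+BEGIN_SRC sh :exports code
+# try_acc_sets FILE M...: minimize the DRA in FILE for each count M.
+try_acc_sets() {
+  input=$1; shift
+  for m in "$@"; do
+    dstar2tgba -D -x sat-minimize,sat-acc=$m \
+      --stats="sat-acc=$m: states=%s, acc-sets=%a, det=%d" <"$input"
+  done
+}
+# Run only when both ltl2dstar and Spot are installed.
+if command -v ltl2dstar >/dev/null 2>&1 &&
+   command -v dstar2tgba >/dev/null 2>&1; then
+  dra=$(mktemp)
+  ltlfilt -f 'Ga R (F!b & (c U b))' -l |
+    ltl2dstar --ltl2nba=spin:ltl2tgba@-Ds - - >"$dra"
+  try_acc_sets "$dra" 1 2
+  rm -f "$dra"
+fi
+#+END_SRC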
+
+The case of =ltl2tgba= is slightly different because it can remember
+the number of acceptance sets used by the translation algorithm, and
+reuse that for SAT-minimization even if the automaton had to be
+degeneralized in the meantime for the purpose of determinization.
+
+* Low-level details
+
+The following figure gives an overview of the processing chains that
+can be used to turn an LTL formula into a minimal DBA/DTBA/DTGBA.  The
+blue area at the top describes =ltl2tgba -D -x sat-minimize=, while
+the purple area at the bottom corresponds to =dstar2tgba -D -x
+sat-minimize=.
+
+[[file:satmin.png]]
+
+The picture is slightly inaccurate in the sense that both =ltl2tgba=
+and =dstar2tgba= are actually using the same post-processing chain:
+only the initial translation to TGBA or conversion to DBA differs, the
+rest is the same.  However, in the case of =dstar2tgba=, no
+degeneralization or determinization is needed.
+
+Also the picture does not show what happens when =-B= is used: any
+DTBA is degeneralized into a DBA, before being sent to "DTBA SAT
+minimization", with a special option to request state-based output.
+
+The WDBA-minimization boxes are able to produce minimal Weak DBA from
+any TGBA representing an obligation property.  In that case using
+transition-based or generalized acceptance will not allow further
+reduction.  This minimal WDBA is always used when =-D= is given
+(otherwise, for the default =--small= option, the minimal WDBA is only
+used if it is smaller than the nondeterministic automaton it has been
+built from).
+
+The "simplify" boxes are actually simulation-based reductions, and
+SCC-based simplifications.
+
+The red boxes "not in TCONG" or "not a recurrence" correspond to
+situations where the tools will produce non-deterministic automata.
+
+The following options can be used to fine-tune this procedure:
+
+- =-x tba-det= :: attempt a powerset construction and check if
+                  there exists an acceptance set such that the
+                  resulting DTBA is equivalent to the input
+- =-x sat-minimize= :: enable SAT-based minimization.  By default it
+     tries to reduce the size of the automaton one state at a time.
+     This option implies =-x tba-det=.
+- =-x sat-minimize=2= :: enable SAT-based minimization, but perform a
+     dichotomy to locate the correct automaton size.  Use this only if
+     you suspect that the optimal size is far away from the input
+     size.  This option implies =-x tba-det=.
+- =-x sat-acc=$m$= :: attempt to build a minimal DTGBA with $m$ acceptance sets.
+     This option implies =-x sat-minimize=.
+- =-x sat-states=$n$= :: attempt to build an equivalent DTGBA with $n$
+     states.  This also implies =-x sat-minimize= but won't perform
+     any loop to lower the number of states.  Note that $n$ should be
+     the number of states in a complete automaton, while =ltl2tgba=
+     and =dstar2tgba= both remove sink states in their output by
+     default (use option =--complete= to output a complete automaton).
+     Also note that even with the =--complete= option, the output
+     automaton may appear to have fewer states because the others
+     are unreachable.
+- =-x state-based= :: force all outgoing transitions of each state
+     to belong to the same acceptance sets.
+- =-x !wdba-minimize= :: disable WDBA minimization.
+
+When options =-B= and =-x sat-minimize= are both used, =-x state-based= and
+=-x sat-acc=1= are implied.
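+
+As an illustration of =-x sat-states=, the following sketch asks for
+a fixed number of states and prints what was actually obtained.  The
+helper name =try_states= is hypothetical.  Following the note above,
+=--complete= is passed so that the reported count is easier to relate
+to the requested one.  If a request cannot be satisfied, the tools
+fall back to the unminimized deterministic automaton, as explained at
+the top of this page.
+
+#+BEGIN_SRC sh :exports code
+# try_states N FORMULA: request an equivalent DTGBA with N states.
+try_states() {
+  ltl2tgba -D -x sat-states=$1 --complete "$2" \
+    --stats="requested=$1, states=%s, det=%d"
+}
+# Run only when Spot is installed.
+if command -v ltl2tgba >/dev/null 2>&1; then
+  for n in 5 4 3; do
+    try_states "$n" 'GF(a <-> XXb)'
+  done
+fi
+#+END_SRC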