Todd-Coxeter algorithm
In FriCAS the main group functionality is in PermutationGroup so it is useful to be able to convert to and from it. For conversions from presentations to PermutationGroup we use a well known algorithm called the Todd-Coxeter algorithm.
So a presentation consists of:
- A set of generators.
- A set of rules.
and each of these generators in the presentation will be a generator in the permutation, but it needs to be defined as a permutation, to do that we use the rules.
Each rule corresponds to a loop in the Cayley table so we need to use this to piece together a permutation for each generator. When we multiply together the permutations in each rule we get the identity permutation. We therefore work back from this to get the individual permutations for each generator.
The most efficient form of the algorithm is when we use subgroups, however just to keep things simple for the first example lets just concentrate on the basic method for now.
Example D3
In order to get some intuition lets look at the dihedral group of dimension 3. We will use 2 generators:
|
Then we get the following presentation and permutation:
presentation | permutation | |
---|---|---|
<r,m | rrr, mm, rmrm > | r := (1,2,3) | m := (1,2) |
We can see how these two views interact by seeing what the rules do to permutations:
rrr | mm | rmrm |
---|---|---|
As we can see each rule sends each point back to itself. This is to be expected because a rule corresponds to a loop in the Cayley table above.
FriCAS Code
Here is a simple version of the code without subgroups:
If you wish to see how the algorithm works (or does not work) try calling the function, with trace set to 'true', like this:
Using Coset Tables
The above method so far is not very efficient because we just apply it to the whole group.
Now we want to use coset tables so that we can work with cosets rather than individual elements.
So again lets use our D3 example. We first want to work with coset tables such as stab(1) here: |
|
Then we can work out the orbit+svc which will be: [orb = [1,2,3], svc = [-1,1,2]] |
A coset table has a column for each generator element (and inverse generator element?) and a row for each coset.
Coset Table | Relator Table for: mm |
Relator Table for: rrr |
Relator Table for: rmrm |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
Spanning Tree
generators | |||||||||
cosets | |||||||||
Further Work To Do
The conversions in both directions can be improved by implementing better simplifications but since there is no canonical form the simplifications can never be perfect. The Todd-Coxeter does not yet detect and remove 'coincidences', that is, duplicated points. So this in the next improvement that needs to be made.
Notes from Waldek
Subject: Re: [fricas-devel] [PATCH] add coerce from PermutationGroup to GroupPresentation From: Waldek Hebisch To: fricas-devel@googlegroups.com Date: Wed, 25 Jan 2017 06:23:37 +0100 (CET) Martin Baker wrote: > > I find it hard to understand the functions in PermutationGroup. Partly > because I cant find any documentation (I searched the usual places but > its easy to miss things) and its hard to follow the code because Rep is > so complicated. Algorithm is described in A. Seress book "Permutation group algorithms", but I do not think this book is freely available. On the net there are notes by A. Hulpke which contain sketch of the algorithm. Data structure in a sense are very natural once you know what algorithms are supposed to do. More precisely, we have: Rep := Record(gens : L PERM S, information : REC2) REC2 ==> Record(order : NNI, sgset : L V NNI, _ gpbase : L NNI, orbs : V REC, mp : L S, wd : L L NNI) REC ==> Record ( orb : L NNI, svc : V I ) in REP gens are original generators, information is extra data coded in efficient form. In particular, instead of S internal computation works with integers: we first create list of points appearing in all permutation and for internal purposes we replace elements by position in the list. Permutations are represented by vectors of integers, so that v(i) gives image of point i. Now in REC2 order is number of elements in the group. Value zero has special meaning -- it means that no information is computed. sgset is strong generators, gpbase is list of base points, mp is list of elements of S moved by some permutation (needed for mapping between permutations on S and internal representation), wd gives representation of strong generators in terms of orignal generators. The most interesting structure is orbs which describes orbits of base point. The orb part is just list of point on the orbit. The svc part allows you to compute element of the group moving given point to base point of the orbit. In detail, -2 means point not in orbit, -1 means base point, positive value is index of strong generator moving given point closer to base point (look at 'strip' to see how this is used). Given that I know how presentation of permutation group can be dane and that I know structure of PermutationGroup I decide to code the convertion. I need to do more test, but at least S5 seem to work correctly. I also noticed that Todd-Coxeter (toPermutationIfCan) may produce something strange. Namely, the following (junky) relations: [[4,4,- 3,- 2,- 1,- 4,- 3], [3,4,- 2,- 4,- 3,- 2], [2,4,- 4,- 3,- 2], [1,4,- 4,- 3,- 1], [4,4,- 3,- 2,- 1,- 4,- 3,- 1], [3,4,- 2,- 4,- 3,- 1], [2,4,- 1,- 4,- 3], [1,4,- 4,- 3], [4,4,- 3,- 2,- 1,- 4,- 3,- 2], [3,4,- 1,- 2,- 1,- 4], [2,4,- 1,- 4,- 3,- 1], [1,4,- 1,- 4,- 3,- 2], [4,4], [3,4,- 4,- 3], [2,4,- 2,- 4], [1,4,- 1,- 4], [3,3,- 2,- 3,- 2], [2,3,- 3,- 2], [1,3,- 3,- 1], [3,3,- 2,- 3,- 1], [2,3,- 1,- 3], [1,3,- 3], [3,3,- 1,- 2,- 1], [2,3,- 1,- 3,- 1], [1,3,- 1,- 3,- 2], [2,2,- 1], [1,2,- 2], [2,2,- 1,- 2,- 1], [1,2,- 2,- 1], [1,1]] and [1, 2, 3, 4] as generators produced set of permutations on 6 elements which generates full symmetric group. The presentation above is junky because relation [1,2,- 2] means that 1 is identity. Then relation [2,2,- 1,- 2,- 1] simplifies to [2,2,- 2] and next [2] which means that 2 is identity. Then [3,3,- 2,- 3,- 1] means that 3 is identity and [4,4,- 3,- 2,- 1,- 4,- 3] means 4 is identity, so we get trivial group. BTW: can you get smaller number of point than number of elements in the group? All correct examples I saw produce just permutation representation on itself. However, Todd-Coxeter in general produces representation on cosets of a subgroup, so if you take notrivial subgroup the set on which permutation act will be smaller than the whole group. -- Waldek Hebisch |
I have now commited improved version of Todd-Coxeter. It now handles coincidencies and is much faster than previus version. Scaling is not great, but much better than before: I was able to do enumeration for Mathieu 11 (using second presentation from Canon et all paper) in 211 seconds. Currently coincidencies are handled using union-find and table of equivalents. It seems that we could handle them more directly by careful updating of coset table. But that is potentially error-prone so I am using simpler implementation now. Coset with respect to a subgroup are still to do. AFAICS still quite a lot of work goes into redundant scanning of coset table. We probably could get large speedup by having some kind of agenda with current work, so that after adding a point we could quickly restart work waiting for this point. -- Waldek Hebisch |
Subject: [fricas-devel] Todd-Coxeter To: fricas-devel@googlegroups.com Date: Wed, 1 Feb 2017 17:33:40 +0100 (CET) From: Waldek Hebisch I looked more at Todd-Coxeter routine. I think I now better understand performance. Basically Todd-Coxeter has tentative set of coset starting from coset 1 and tries to apply relations and perform inferences. When same transition is undefined in coset table we need to add new point. Current routine (in trunk) uses Felsch style strategy: we do all possible inferences before adding a new point. However, this repeats a lot of work: if (partial) coset table is closed under inferences than after adding a link any new inference must involve added link. So it is enough to try all relators trough a link. In practice Felsch is starting all rotations of relators and their inverses at one end of link. One can give simple estimate of running time for Felsch strategy in terms of generators, relations and number of generated cosets, number of operations is proportional to M|G|\sum_i 2|r_i|^2 where M is number of generated cosets |G| is number of generators and inverses, |r_i| is length of relator i. Our implementation performs full scan of coset table before adding a point so it is of order M^2|G|\sum_i |r_i| When M > max |r_i| our current code looses compared to Felsch. Actually, it is hard to imagine presentation for which our current strategy would work better than Felsch: Felsch has extra cost due to square of relator lengths, So one would have to use long relators such that we get small number of cosets -- looks weird. There is different strategy: Haselgrove-Leech-Trotter (HLT) method tries to complete relators trough given point adding new point if necessary. Because it adds point early it may add point which later will be proven to be equal to already added point. So for given order of adding point HLT will need the same or larger number of point than Felsch. But HLT saves work on perfroming inferences: we apply given relator (or invere) to given coset only once. So one can give simple estimate of number of operations: M|G|\sum_i |r_i| where M is number of generated cosets |G| is number of generators and inverses, |r_i| is length of relator i. Since HLT may generate more cosets there is a tradeoff. I have now implemented HLT type strategy. It is worse, because after adding point I come back to applying all relators to the coset. For m11 result is good (about 120 ms) in other not so. Anyway some results are: Current (trunk with small improvenents): m11 w 182.19 using 14830 cosets HLT style: m11 w 0.12 sec using 47966 cosets a5 from relationsInGenerators 0.01 sec using 678 cosets a6 from relationsInGenerators 2.62 sec using 35831 cosets s5 from relationsInGenerators 0.63 sec using 10692 cosets s6 from relationsInGenerators after long time run out of stack space ... As you can see relationsInGenerators seem to produce bad (expensive to enumenrate) presentations. You can see the HLT thing at: http://www.math.uni.wroc.pl/~hebisch/fricas/gpresent.spad I reused location of previous version so click reload in brewser to get fresh one. TODO: - real HLT (without rescanning coset from the beginning), should get rid of quadratic behaviour for long relators - real Felsch (probably need to keep relators in array to have easy access to rotated version). - get rid of find and reuse entries in coset table. I will do few more test with this version. Early tests discoverd bug in handling coincidencies and I could do m11 only after fixing this bug. If all goes well I will commit this or better version soon. -- Waldek Hebisch |