Complexity of some algorithmic problems in groups:
a survey
Abstract.
In this survey, we address the worst-case, average-case, and generic-case time complexity of the word problem and some other algorithmic problems in several classes of groups and show that it is often the case that the average-case complexity of the word problem is linear with respect to the length of an input word, which is as good as it gets if one considers groups given by generators and defining relations. At the same time, there are other natural algorithmic problems, for instance, the geodesic (decision) problem or Whitehead’s automorphism problem, where the average-case time complexity can be sublinear, even constant.
1. Introduction
Algorithmic problems for infinite groups have received a lot of attention over the last 100+ years, since Dehn’s work back in 1911. More recently, with the increasing interest in computational complexity, many algorithmic problems of group theory have been reexamined with regard to efficiency.
The worst-case complexity of group-theoretic algorithms has been studied probably since small cancellation groups were introduced and it was noticed that the word problem in these groups admits an efficient solution by Dehn’s algorithm, see e.g. [26]. Time complexity of Dehn’s algorithm was eventually shown to be linear (with respect to the “length” of the input). The interest in complexity of group-theoretic algorithms was afterwards gradually increasing over the rest of the 20th century, and then experienced explosive growth in the 21st century, in part due to input from theoretical computer science (notably the Knuth-Morris-Pratt algorithm and the Master theorem for divide-and-conquer recurrences), and in part due to the introduction of the generic-case and the average-case complexity into group theory about 20 years ago.
Genericity of group-theoretic properties and generic-case complexity of group-theoretic algorithms were introduced in [2] and [19]. Average-case complexity, formally introduced in [24] (although informally considered before in [23]), was addressed in the context of group theory for the first time in [20]. Specifically, the authors of [20] addressed the average-case complexity of the word and subgroup membership problems in non-amenable groups and showed that this complexity was often linear. The fact that the considered groups were non-amenable played an essential role in computing the average-case complexity due to the fact that the cogrowth rate of any non-amenable group is strictly smaller than the growth rate of the ambient free group, see [11], [12]. In contrast, most groups considered in a recent paper [32] are amenable. The case of amenable groups is more difficult; general tricks of [20] are not applicable in this case, and one has to actually investigate the structure of groups in question.
There are several ways to define the average-case time complexity of a group-theoretic algorithm that takes words as inputs; below is one of the natural ways. Let denote the set of all words of length in a finite group alphabet. For a word , let denote the time that the algorithm (for a classical Turing machine) works on input .
One can then define the average-case time complexity of the algorithm on inputs of length as
(1) |
1.1. Why the average-case complexity is often strictly lower than the worst-case complexity
On the intuitive level, the reason is that inputs for which is high are “sparse”. If one is able to quantify this “sparsity”, then one can try to split in a disjoint union of sets , so that the formula (1) is stratified as
(2) |
This formula is not very practical though because the number of summands may be too large. Keeping in mind that our goal is typically reduced to finding an upper bound for the average-case complexity rather than its precise value, we will use a slightly different stratification that is more practical. Denote by an upper bound on for and use the following sum instead of (2):
(3) |
In this sum, the number of summands can be rather small (typically 2 or 3, although it can be larger in some cases), depending on an upper bound on the average-case complexity one would like to establish.
Thus, our strategy will be to find suitable upper bounds on for those where is high.
An alternative strategy (used in [20] and in [39], say) is, for a given input, to run two algorithms in parallel. One algorithm, call it honest, always terminates in finite time and gives a correct result. The other algorithm, a Las Vegas algorithm, is a fast randomized algorithm that never gives an incorrect result; that is, it either produces the correct result or informs about the failure to obtain any result. (In contrast, a Monte Carlo algorithm is a randomized algorithm whose output may be incorrect with some (typically small) probability.)
A Las Vegas algorithm can improve the time complexity of an honest, “hard-working”, algorithm that always gives a correct answer but is slow. Specifically, by running a fast Las Vegas algorithm and a slow honest algorithm in parallel, one often gets an algorithm that always terminates with a correct answer and whose average-case complexity is somewhere in between. This idea was used in [20] where it was shown, in particular, that if a group has the word problem solvable in subexponential time and if has a non-amenable factor group where the word problem is solvable in a complexity class , then there is an honest algorithm that solves the word problem in with average-case complexity in .
2. Background
Bibliography on the worst-case complexity of various group-theoretic problems (the word problem, conjugacy problem, subgroup membership problem, geodesic problem, equations in groups) includes hundreds of papers, so a survey on those would take a hefty monograph. Perhaps just as a few milestones, we can mention here papers [5], [6], [28], [33], [37] and the monograph [25].
The shift of the interest, in the last 20 years or so, from the worst-case to the generic-case and average-case complexity is in part due to applications of non-abelian groups in information security, see [29], [30], [17], [18] and, more generally, due to emerging connections between group theory and theoretical computer science (see e.g. [3] for a general overview).
Some ideas in the spirit of the average-complexity in group theory were actually put forward by Gromov in his essay [13]. Specifically, he proposed to consider the averaged Dehn function instead of the usual Dehn function and suggested that the former might be asymptotically smaller than the latter. Relevance of the averaged Dehn function to the average-case complexity of the word problem was pointed out by Young [46]: the averaged Dehn function represents the complexity of verifying that an element of a group represents the identity, while the technique of [20] established low average-case complexity in many (non-amenable) groups by showing that in those groups, most elements are not the identity.
We also mention here an earlier paper by Wang [43] with “average-case complexity” in the title. However, Wang’s approach is very different in substance: in his approach the set of instances of a problem involves all finite presentations of a group, rather than just a fixed one.
Below we first briefly survey the approach of [20] applicable to non-amenable groups, and then focus on a more difficult case of amenable groups.
2.1. Non-amenable groups
The basic idea of [20] is very straightforward and is often used in practice. If we have a total algorithm solving a decision problem whose worst-case complexity is “not too high” and we also have a partial algorithm solving the problem with “very low” generic-case complexity, then by running and in parallel we get a total algorithm for which we can often prove to have low average-case complexity.
To formalize this idea, let us start with a definition. Below denotes the set of all group words in the alphabet .
Definition 1.
Let be a finite alphabet with elements. A discrete probability measure on is a function such that .
We say that is length-invariant if for any words with we have .
We say that a measure is tame if there exists a subexponential function such that for any we have .
Then the main result of [20] is
Theorem 2.1.
Let be a finitely presented group where the word problem is in (i.e., is solvable in subexponential time). Suppose has a subgroup of finite index which has a non-amenable quotient group whose word problem is solvable in a complexity class , where .
Then the word problem in is solvable with average-case complexity in , uniformly relative to the family of all length-invariant discrete probability measures .
Moreover, for any tame discrete probability measure on the word problem in is solvable with average-case complexity in relative to .
Theorem 2.1 applies to a surprisingly wide range of examples, although the idea behind the proof of this theorem is quite simple. We run in parallel the total subexponential algorithm for the word problem of and the partial generic algorithm provided by the quotient group . If the image of an element of is nontrivial in , then the element is obviously nontrivial in . On all but an exponentially negligible set of inputs this partial algorithm will actually terminate within the complexity bound and it turns out that the average-case complexity of the combined algorithm is in .
Below are some more concrete corollaries of Theorem 2.1.
Corollary 1.
Let be a finitely presented group where the word problem is solvable in subexponential time. Let be a generating set of and let . Let be the family of all length-invariant discrete probability measures .
-
(1)
Suppose that has a subgroup of finite index that has a non-elementary word-hyperbolic quotient group. Then the word problem in is solvable with linear time average-case complexity uniformly relative to .
Moreover, for any tame discrete probability measure on the word problem in is solvable with linear time average-case complexity relative to .
-
(2)
Suppose that has a subgroup of finite index that has a non-amenable automatic quotient group. Then the word problem in is solvable with quadratic time average-case complexity uniformly relative to .
Moreover, for any tame discrete probability measure on the word problem in is solvable with quadratic time average-case complexity relative to .
This corollary applies, in particular, to braid groups since has a subgroup of finite index, the pure braid group , that admits a homomorphism onto the free group . Since the word problem for is solvable in quadratic time (see e.g. [6]), Corollary 1 implies that the word problem in has linear time average-case complexity.
Another application is to Artin groups of extra large type. An Artin group has a presentation
where for
and where for each . We allow in which case the relation is omitted from the above presentation. is of extra large type if and all . Peifer [34] proved that all such groups are automatic and therefore have the word problem solvable in quadratic time. An Artin group has an associated Coxeter group which is the quotient group obtained by setting the squares of the generators equal to the identity. If is of extra large type then so is its Coxeter quotient
This is known to be a nonelementary word-hyperbolic group if . Thus, by Corollary 1, the word problem in any Artin group of extra large type is solvable in linear time on average.
2.2. Amenable groups
For amenable groups, the trick used in the proof of Theorem 2.1 may not work, as we have explained in the Introduction. Below we mention several results obtained by using different approaches.
In [32], it was shown that in several classes of (amenable) groups, as well as in a large class of groups that are free products, the average-case complexity of the word problem is linear. More specifically, the following results were established:
-
1.
The average-case time complexity of the word problem in groups of matrices over integers (or rationals) is often linear; in particular, this is the case in all polycyclic groups. These results apply, in particular, to finitely generated nilpotent groups. It is also noteworthy that the worst-case complexity of the word problem in these groups is , where is the length of the input (as a word in the generating matrices). The proof uses a recent result [15] on fast multiplication of integers.
For finitely generated nilpotent groups, the worst-case time complexity of the word problem is, in fact, for any integer , where denotes the function , with logarithms.
-
2.
If a finitely generated group has polynomial-time worst-case complexity of the word problem and maps onto a (generalized) lamplighter group (the wreath product of and ), then the average-case time complexity of the word problem in is linear. This applies, in particular, to free solvable groups.
-
3.
In popular groups like solvable Baumslag-Solitar groups and Thompson’s group , the average-case time complexity of the word problem is linear.
We recall that the group has a presentation We also note, in passing, that it is a major open problem whether or not Thompson’s group is amenable, see [14] for a survey of work done on this problem.
-
4.
If a finitely generated group is a free product of nontrivial groups and , both having polynomial-time worst-case complexity of the word problem, then has linear average-case time complexity of the word problem.
-
5.
If a finitely generated group is a free product of nontrivial groups and , both having polynomial-time worst-case time complexity of the subgroup membership problem, then has linear average-case time complexity of the subgroup membership problem. All subgroups in question should be finitely generated.
We also mention that the problem in some sense dual to finding the average-case complexity of the word problem in a given group is finding “non-cooperative” words for which deciding whether or not in is as computationally hard as it gets; these words are “responsible” for a high worst-case complexity of the word problem in . This is relevant to properties of the Dehn function of , see e.g. [5], [31], [46] and references therein.
The word and subgroup membership problems are not the only group-theoretic problems whose average-case complexity can be significantly lower than the worst-case complexity. In [39], we showed that the average-case complexity of the problem of detecting a primitive element in a free group has constant time complexity (with respect to the length of the input) if the input is a cyclically reduced word. The same idea was later used in [36] to design an algorithm, with constant average-case complexity, for detecting relatively primitive elements, i.e., elements that are primitive in a given subgroup of a free group.
3. Directions of research
In this section, we mention several interesting (in our opinion) problems relevant to the average-case time complexity of the word problem in several classes of groups. We emphasize that it is often the case that the average-case complexity of the word problem is linear with respect to the length of an input word, which is as good as it gets if one considers groups given by generators and defining relations, see e.g. [38].
With some other problems, e.g. the geodesic (decision) problem, it can be the case for some groups that the average-case complexity is sublinear, even constant.
More specifically, we suggest the following directions of research. Some of them come from the “Open problems” section of [32].
Direction 1. Prove that the average-case time complexity of the word problem in any finitely generated metabelian group is linear.
We note that in [22], the authors showed that there are finitely generated solvable groups of derived length 3 (even residually finite ones) with super-exponential worst-case complexity of the word problem. The average-case time complexity of the word problem in such groups cannot be polynomial since the super-exponential runtime, even on a very small set of words, will dominate everything else.
With metabelian groups, the situation is different since it is known that, unlike in solvable groups of derived length [21], the word problem in any finitely presented metabelian group is solvable. This follows from the fact that these groups are residually finite, but the first explicit algorithm was offered in [42].
The question of the worst-case complexity of the word problem in metabelian groups is a problem of independent interest that would have to addressed along the way. Here we just note that the worst-case complexity of the word problem in a finitely generated free metabelian groups was shown to be , i.e., it is quasilinear in the length of the input [28].
Direction 2. Prove that the average-case time complexity of the word problem in any finitely generated group of matrices over is linear.
This is a natural direction prompted by the results of [32] about complexity of the word problem in matrix groups mentioned in Section 2 of this survey.
We also note that attacking this problem will probably involve estimating the generic-case complexity of matrix multiplication. Since the latter is a problem of independent interest, we single it out as Direction 5 below.
Direction 3. Let be a finitely generated (or even a finitely presented) group where the average-case time complexity of the word problem is linear. We plan to address the following question: is it necessarily the case that the generic-case time complexity (i.e., complexity on “most” inputs) of the word problem in is linear, too?
We note that the converse is not true, see the comment to the Direction 1 above. We also note that if the sums in the formula (1) are bounded by a linear function of , then for any superlinear function the densities of subsets of where (for ) are approaching 0 when goes to infinity. This means that if the average-case time complexity of the word problem in is linear, then the generic-case time complexity of the word problem in can be bounded by any superlinear function.
A similar argument shows that if the average-case time complexity of the word problem in is polynomial, then the generic-case time complexity of the word problem in is polynomial, too, although the degree of a polynomial may be a little higher. More accurately, if the average-case complexity is , then the generic-case complexity is for any .
Another implication that we can mention in this context is that the exponential time average-case time complexity of the word problem in implies the exponential time (as opposed to super-exponential time) worst-case complexity of the word problem in . This is because super-exponential complexity of the word problem even on just a single input will dominate everything else, so the average-case complexity would be super-exponential as well.
Direction 4. This direction is somewhat informal. Prompted by a result of [39] mentioned at the end of Section 1, look for other algorithmic problems where the result of [10] on exponentially negligible proportion of words with “forbidden subwords” can lead to the constant-time average-case complexity.
One way to generalize the result of [39] about fast (on average) detection of primitive elements of a free group would be finding other “orbit-blocking” words, see [4, Problem (F40)]. In case that problem has a positive answer for a particular word , the following restricted version of Whitehead’s automorphism problem will have constant-time average-case complexity (with respect to the length of the input word ): given a cyclically reduced word , find out if for some .
Another problems of that kind is the geodesic (decision) problem: given a finitely presented group and a word in the generators of , find out whether or not there is a shorter word that represents the same element of . “Forbidden subwords” in this case are subwords of that are longer subwords of defining relators of . By a “longer subword” of we mean a subword whose length is more than half of the length of (in particular, any defining relator is such a “forbidden subword”). If has such a subword, then is not a geodesic word.
Based on this simple observation, one can show, in particular, that the average-case time complexity of the geodesic problem in any hyperbolic group is constant (with respect to the length of the input word). This is because the worst-case time complexity of this problem in any hyperbolic group is at most quadratic. In fact, if the geodesic (decision) problem in a group is solvable in subexponential time in the worst case, then the average-case complexity of this problem in is linear-time.
One can look for other algorithmic problems where the presence of “forbidden subwords” can quickly lead to a decision, and then use this to bound the average-case complexity.
Direction 5. This direction is relevant to Direction 2, but is also of independent interest because of connections to Cayley hash functions, see [8]. More specifically, it is relevant to a well-known problem of estimating the girth of the Cayley graph of some special 2-generator groups of matrices over ; the latter problem has received a lot of attention over the last 15 years or so, see e.g. [1], [7], [8], [16].
The problem to consider here is as follows. Let and be matrices over that generate a free group. Let be a freely reduced (semi)group word of length . After evaluating as a product of matrices, we get a matrix, call it . What is the largest (by the absolute value) possible entry of , over all of length , as a function of ? The latter function is usually (although not always) exponential, so we are looking for an answer in the form , i.e., we are looking for the base of the exponent.
The relevance of this problem to the problem of estimating the girth of the Cayley graph of 2-generator (semi)groups of matrices over is almost obvious. If and generate a free sub(semi)group of , then there cannot be any relations of the form in unless at least one of the entries of the matrix or is at least . Thus, if the largest entry in a product of matrices is of the size , then the girth of the Cayley graph of our sub(semi)group of is .
A more difficult problem is to estimate the growth of a “random” product of matrices, each of which is or . That is, each matrix in such a product is either or , with probability . This is relevant to Direction 2, specifically to the generic-case complexity of matrix multiplication and therefore to the generic-case complexity of the word problem in (two-generator) groups of matrices. We discuss an approach to this problem at the end of Section 4.
4. Some possible approaches
In this section, we describe what we believe are promising approaches along some of the research avenues mentioned in Section 3.
In Direction 1, one will first need to address the worst-case complexity of the word problem in finitely generated metabelian groups. There is an explicit algorithm, due to Timoshenko [42], for solving this problem, but the worst-case complexity of this algorithm is unknown. Perhaps it was discouraging that Timoshenko’s algorithm employed Gröbner bases (in Laurent polynomial rings), and these are not known to be efficiently constructible. However, constructing relevant Gröbner bases is not part of solving the word problem for a given input word. These bases can be precomputed and then used for any input word . With this in mind, it is quite likely that the worst-case time complexity of the word problem in finitely generated metabelian groups is polynomial.
If this is the case, then for the average-case complexity one can use an approach from [32] based on the following
Theorem 4.1.
[45, Theorem 15.8a] Let be a polycyclic group, and suppose that all words of length (in the given generators of ) are sampled with equal probability. Then one has the following alternative:
(a) has polynomial growth with degree and for a random word of length , the probability of the event is .
(b) has exponential growth and the probability of the event is for some constant .
As an example of using this result, we show below how one can establish that in any polycyclic group, the average-case complexity of the word problem is linear. Let be an infinite polycyclic group. It has a factor group that is virtually free abelian of rank . Thus, we first check if an input word is equal to 1 modulo ; this takes linear time in , the length of . Indeed, let and let be an abelian normal subgroup of such that is finite. Then first check if in ; this can be done in linear time. If , then rewrite in generators of using the Reidemeister-Schreier procedure. This, too, takes linear time. Indeed, if we already have a Schreier representative for a product , then we multiply its image in the factor group by the image of and get a representative for . This step is performed in time bounded by a constant since the group is finite. Therefore, the whole rewriting takes linear time in .
The obtained word should have length in generators of . Then we solve the word problem for in the abelian group , which can be done in linear time in the length of , and therefore the whole solution takes linear time in .
If modulo , then we apply an algorithm for solving the word problem in with the worst-case complexity ; such an algorithm exists by a result of [32] mentioned in Section 2. (Recall that polycyclic groups are representable by matrices over integers.) The probability that modulo is by Theorem 4.1.
Therefore, the expected runtime of the combined algorithm is .
Along the same lines, one can establish linear-time average-case complexity of the word problem in many groups that have an (infinite) abelian factor group. It is important however to have a polynomial bound on the worst-case time complexity of the word problem in . (In some cases, a subexponential bound may suffice.) In particular, this is how one can approach linear-time average-case complexity of the word problem in finitely generated metabelian groups, by first establishing a polynomial bound on the worst-case complexity.
4.1. Constant-time average-case complexity
One might find surprising the fact that the average-case complexity of some natural group-theoretic problems can be constant. Below we illustrate how this can happen by using a special case of the Whitehead problem for a free group. The idea of using “forbidden subwords” in estimating the average-case complexity can be fruitful in other situations as well.
Let be a free group with a free generating set and let . Call an element primitive if there is an automorphism of that takes to .
We will also need the definition of the Whitehead graph of an element . The Whitehead graph of has vertices that correspond to . For each occurrence of a subword in the word , there is an edge in that connects the vertex to the vertex ; if has a subword , then there is an edge connecting to , etc. There is one more edge (the external edge): this is the edge that connects the vertex corresponding to the last letter of to the vertex corresponding to the inverse of the first letter.
It was observed by Whitehead himself (see [44] or [26]) that the Whitehead graph of any cyclically reduced primitive element of length has either an isolated edge or a cut vertex, i.e., a vertex that, having been removed from the graph together with all incident edges, increases the number of connected components of the graph.
Now call a group word primitivity-blocking if it cannot be a subword of any cyclically reduced primitive element of . For example, if the Whitehead graph of (without the external edge) is complete (i.e., any two vertices are connected by at least one edge), then is primitivity-blocking because in this case, if is a subword of , then the Whitehead graph of , too, is complete and therefore does not have a cut vertex or an isolated edge. Examples of primitivity-blocking words are: (for any ), (for an even ), etc. Here denotes .
The usual algorithm for deciding whether or not a given element of is primitive is a special case of the general Whitehead algorithm that decides, given two elements , whether or not can be taken to by an automorphism of . If , the worst-case complexity of the Whitehead algorithm is at most quadratic in . Here we are going to address the average-case complexity of the Whitehead algorithm run in parallel with a “fast check” algorithm, when applied to recognizing primitive elements of .
A “fast check” algorithm to test primitivity of an input (cyclically reduced) word would be as follows. Let be the length of . The algorithm would read the initial segments of of length , adding one letter at a time, and build the Whitehead graph of this segment, excluding the external edge. Then the algorithm would check if this graph is complete. If it is complete, then in particular it does not have a cut vertex or an isolated edge, so the input element is not primitive.
Note that the Whitehead graph always has vertices, so checking the property of such a graph to be complete takes constant time with respect to the length of , although reading a segment of of length takes time . If the Whitehead graph of is complete, the algorithm returns “ is not primitive”. If it is not, the algorithm just stops.
Denote the “usual” Whitehead algorithm by . Now we are going to run the algorithms and in parallel; denote the composite algorithm by . Suppose that inputs of the algorithm are cyclically reduced words that are selected uniformly at random from the set of cyclically reduced words of length . Then we claim that the average-case time complexity (a.k.a. expected runtime) of the algorithm , working on a classical Turing machine, is , a constant that depends on but not on .
Indeed, for complexity of the algorithm we can use a result of [10] saying that the number of (freely reduced) words of length with (any number of) forbidden subwords grows exponentially slower than the number of all freely reduced words of length .
In our situation, if the Whitehead graph of a word is not complete, that means does not have at least one as a subword. Thus, if the Whitehead graph of any initial segment of the input word is not complete, we have at least one forbidden subword. Therefore, the probability that the Whitehead graph of the initial segment of length of the word is not complete is bounded by for some . Thus, the average time complexity of the algorithm is
(4) |
which is bounded by a constant.
Now suppose that the Whitehead graph of the input word of length is not complete, so that we have to employ the Whitehead algorithm . The probability of this to happen is bounded by for some , as was mentioned before.
Then, the worst-case complexity of the Whitehead algorithm for detecting primitivity of the input word is known to have time complexity .
Thus, the average-case complexity of the composite algorithm is
(5) |
which is bounded by a constant.
These results may be generalized as follows. Call an -blocking word if it cannot be a subword of any cyclically reduced . Suppose there are -blocking words for a (cf. [4], Problem (F40)). Then, by using the same kind of argument as above, one would be able to show that the average-case complexity (with respect to the length of the input word ) of the following problem is constant: given a cyclically reduced word , find out if for some .
4.2. Complexity of a product of matrices
Finally, we comment on Direction 5 from Section 3. To begin, let
The largest entry in a product of matrices each of which is or is ; this was proved in [8]. (In fact, there is given an explicit function for the largest entry.) The fastest growing largest entry in a product of matrices is when such a product is of the form (assuming that is even).
To estimate the average growth of entries in a random product of matrices or (where each factor is either or with probability ), one can try to solve the following pair of recurrence relations for, say, the entries in the first row of a matrix:
– with probability , , ;
– with probability , , .
One can compute the “expectation” of as
The same for the “expectation” of
Thus, “on average” , hence from the above, , whence grows as , and so does .
The variance, however, seems to be pretty large, as suggested by computer experiments. In particular, the largest entry in a product of 1000 random matrices ranges from to . The most popular largest entry is on the order of , which suggests that “generically”, the largest entry grows like .
If one has in mind applications to Cayley hash functions (see Section 3, Direction 5), then one wants the generic growth to be as slow as possible. To that end, one can consider a slightly different pair of matrices:
With this pair, the largest entry in a product of matrices or is when such a product is of the form (assuming that is a multiple of 4). The growth rate of the largest entry in the matrix is . Note that .
We note in passing that, interestingly, entries of exhibit linear(!) growth in this case.
The average growth for the largest entry in a product of matrices or in this case is , and in a generic product it is about . We make a disclaimer that all these growth estimates for the pair are based just on computer experiments.
We also note that Pollicott [35] studied the (maximal) Lyapunov exponent for random matrix products of strictly positive matrices. In our situation, the maximal Lyapunov exponent is the natural logarithm of the maximal base of the exponent in the growth rate of the entries in a random product of matrices or . In [35], an algorithm is given for estimating with any desired precision. However, our matrices and are not strictly positive, so this algorithm may not be applicable in our situation.
Finally, we note that it seems natural to use (for hashing purposes) pairs of matrices with smaller entries, like or . However, the matrices do not generate a free semigroup (they satisfy the “braid relation” ), so our approach to collision resistance does not work in this case. The matrices do generate a free semigroup, but they generate the whole semigroup of matrices in with nonnegative entries. This property was used in [41] to arrange an attack that was successful in recovering a preimage of a hash, thereby showing that the corresponding Cayley hash function is not preimage resistant.
We leave it at that.
References
- [1] G. Arzhantseva and A. Biswas, Logarithmic girth expander graphs of , J. Algebraic Combinatorics 56 (2022), 691–723.
- [2] G. Arzhantseva and A. Olshanskii, Genericity of the class of groups in which subgroups with a lesser number of generators are free, (Russian) Mat. Zametki 59 (1996), no. 4, 489–496.
- [3] F. Bassino, I. Kapovich, M. Lohrey, A. Myasnikov, C. Nicaud, A. Nikolaev, I. Rivin, V. Shpilrain, A. Ushakov, P. Weil, Complexity and Randomness in Group Theory: GAGTA Book 1. Walter de Gruyter, 2020.
-
[4]
G. Baumslag, A. G. Myasnikov, V. Shpilrain, Open problems in
combinatorial group theory,
http://shpilrain.ccny.cuny.edu/gworld/problems/oproblems.html - [5] J.-C. Birget, A. Yu. Olshanskii, E. Rips, M. V. Sapir, Isoperimetric functions of groups and computational complexity of the word problem, Ann. of Math. (2) 156 (2002), 467–518.
- [6] J. Birman, K. H. Ko, S. J. Lee, A new approach to the word and conjugacy problems in the braid groups, Adv. Math. 139 (1998), 322–353.
- [7] J. Bourgain, A. Gamburd, Uniform expansion bounds for Cayley graphs of . Ann. of Math. (2) 167 (2008), 625–642.
- [8] L. Bromberg, V. Shpilrain, A. Vdovina, Navigating in the Cayley graph of and applications to hashing, Semigroup Forum 94 (2017), 314–324.
- [9] A. Carnevale, M. Cavaleri, Partial word and equality problems and Banach densities, Adv. Math. 368 (2020), 107133.
- [10] T. Ceccherini-Silberstein, W. Woess, Growth and ergodicity of context-free languages, Trans. Amer. Math. Soc. 354 (2002), 4597–4625.
- [11] J. M. Cohen, Cogrowth and amenability of discrete groups, J. Funct. Anal. 48 (1982), 301–309.
- [12] R. I. Grigorchuk, Symmetrical random walks on discrete groups, in: Advances in Probability and Related Topics, Vol. 6, pp. 285–325, Marcel Dekker 1980.
- [13] M. Gromov, Asymptotic invariants of infinite groups, Geometric group theory, Vol. 2 (Sussex, 1991), 1–295. London Math. Soc. Lecture Note Ser., 182 Cambridge University Press, Cambridge, 1993.
- [14] V. Guba, Amenability problem for Thompson’s group : state of the art, Groups, Complexity, Cryptology 15 (2023.
- [15] D. Harvey and J. van der Hoeven, Integer multiplication in time , Ann. of Math. 193 (2021), 563–617.
- [16] H. A. Helfgott,Growth and generation in ) Ann. of Math. (2) 167 (2008), 601–623.
- [17] D. Kahrobaei, R. Flores, M. Noce, Group-based cryptography in the quantum era, Notices Amer. Math. Soc. 70 (2023), 752–763.
- [18] D. Kahrobaei, R. Flores, M. Noce, M. Habeeb, C. Battarbee, Applications of Group Theory in Cryptography: Post-quantum Group-based Cryptography, Amer. Math. Soc. Surveys and Monographs, 2023, to appear.
- [19] I. Kapovich, A. G. Myasnikov, P. Schupp, V. Shpilrain, Generic-case complexity, decision problems in group theory and random walks, J. Algebra 264 (2003), 665–694.
- [20] I. Kapovich, A. G. Myasnikov, P. Schupp, V. Shpilrain, Average-case complexity and decision problems in group theory, Advances in Math. 190 (2005), 343–359.
- [21] O. G. Kharlampovich, A finitely presented solvable group with unsolvable word problem. (Russian) Izv. Akad. Nauk SSSR Ser. Mat. 45 (1981), 852–873.
- [22] O. G. Kharlampovich, A. G. Myasnikov, M. Sapir, Algorithmically complex residually finite groups, Bull. Math. Sci. 7 (2017), 309–352.
- [23] D. E. Knuth, The analysis of algorithms. Actes du Congrès International des Mathématiciens (Nice, 1970), Tome 3, pp. 269–-274. Gauthier-Villars, Paris, 1971.
- [24] L. Levin, Average case complete problems, SIAM J. Comput. 15 (1986), 285–286.
- [25] M. Lohrey, The compressed word problem for groups, Springer Briefs Math., Springer, New York, 2014.
- [26] R. Lyndon and P. Schupp, Combinatorial Group Theory, Springer-Verlag, 1977. Reprinted in the “Classics in mathematics” series, 2000.
- [27] C. Mattes, A. Weiss, Improved Parallel Algorithms for Generalized Baumslag Groups, in: LATIN 2022: Theoretical Informatics, Lecture Notes in Comput. Sci. 13568 (2022), 658–675.
- [28] A. G. Myasnikov, V. Roman’kov, A. Ushakov, and A. Vershik, The word and geodesic problems in free solvable groups, Trans. Amer. Math. Soc. 362 (2010), 4655–4682.
- [29] A. G. Myasnikov, V. Shpilrain, and A. Ushakov, Group-based cryptography. Birkhäuser Verlag, Basel-Boston-Berlin, 2008.
- [30] A. G. Myasnikov, V. Shpilrain, and A. Ushakov, Non-commutative cryptography and complexity of group-theoretic problems. Amer. Math. Soc. Surveys and Monographs, 2011.
- [31] A. G. Miasnikov, A. Ushakov, Random van Kampen diagrams and algorithmic problems in groups, Groups Complex. Cryptol. 3 (2011), 121–185.
- [32] A. Olshanskii, V. Shpilrain, Linear average-case complexity of algorithmic problems in groups, preprint.
- [33] M. S. Paterson, A. A. Razborov, The set of minimal braids is co-NP-complete, J. Algorithms 12 (1991), 393–408.
- [34] D. Peifer, Artin groups of extra-large type are biautomatic, J. Pure Appl. Algebra 110 (1996), 15–56.
- [35] M. Pollicott, Maximal Lyapunov exponents for random matrix products, Invent. Math. 181 (2010), 209–226.
- [36] M. Roy, E. Ventura, P. Weil, The central tree property and algorithmic problems on subgroups of free groups, J. Group Theory, to appear.
- [37] S. Schleimer, Polynomial-time word problems, Comment. Math. Helv. 83 (2008), 741–765.
- [38] V. Shpilrain, Sublinear time algorithms in the theory of groups and semigroups, Illinois J. Math. 54 (2011), 187–197.
- [39] V. Shpilrain, Average-case complexity of the Whitehead problem for free groups, Comm. Algebra 51 (2023), 799–806.
- [40] A. Sisto, Tracking rates of random walks, Israel J. Math. 220 (2017), 1–28.
- [41] J.-P. Tillich and G. Zémor, Group-theoretic hash functions, in Proceedings of the First French-Israeli Workshop on Algebraic Coding, Lecture notes Comp. Sci. 781 (1993), 90–110.
- [42] E. I. Timoshenko, Certain algorithmic questions for metabelian groups, Algebra and Logic 12 (1973), 132–137.
- [43] J. Wang, Average-case completeness of a word problem in groups, Proc. of the 27-th Annual Symposium on Theory of Computing, ACM Press, New York, 1995, 325–334.
- [44] J. H. C. Whitehead, On equivalent sets of elements in free groups, Ann. of Math. 37 (1936), 782–800.
- [45] W. Woess, Random walks on infinite graphs and groups, Cambridge Tracts in Mathematics, 138. Cambridge University Press, Cambridge, 2000.
- [46] R. Young, Averaged Dehn functions for nilpotent groups, Topology 47 (2008), 351–367.