Abstract
We study progression-free sets in the abelian groups . Let denote the maximal size of a set that does not contain a proper arithmetic progression of length k. We give lower bound constructions, which e.g. include that , when m is even. When this is of order at least . Moreover, if the progression-free set satisfies a technical condition, which dominates the problem at least in low dimension, then holds. We present a number of new methods which cover lower bounds for several infinite families of parameters m, k, n, which includes for example: . For we determine the exact values, when , e.g. , and for we determine the exact values, when , e.g. . With regard to affine caps, i.e. sets without 3 points on a line, the new methods asymptotically improve the known lower bounds, when and : in from to , and when from to . This last improvement modulo 5 appears to be the first asymptotic improvement of any cap in AG(n, m), when over a tensor lifting from dimension 6 (see Edel, in Des Codes Crytogr 31:5–14, 2004).
Keywords: Permutation polynomial, Triple-cycle permutation, Finite field; Block cipher
Introduction
There has been great interest in finding progression-free sets in , especially when or 4. When the properties “no arithmetic progression of length 3 modulo m” and “no 3 points on any line” are equivalent. The last property is also well known under the name cap-sets. In spite of this great interest in progression-free sets and caps there is not much literature on progression-free sets in , in the case of general , and of general progressions of length k, and hardly any explicit values of the maximal size of such sets is known.1
This paper intends to fill this gap and provides several new techniques to find lower bounds, and even to find exact values in the case , which are comparable in size to the known values for .
However, before we come to this, we briefly summarize a number of related questions. The problem of finding sets with, or without, a given property has been investigated frequently. Often one is actually interested in the maximal size of |S|. Also, often even the one-dimensional case has been of fundamental interest. Let us recall some of the properties that have been investigated.
-
Erdős and Turán [28] raised the problem of studying the maximal size of sets in without an arithmetic progression of length k. There are important contributions by Behrend, Bloom and Sisask, Bourgain, Gowers, Green, Roth, Salem and Spencer, Sanders, Szemerédi, Tao [5, 7, 31, 33, 50, 51, 53, 55]. In particular, the proof of , as N tends to infinity, and quantitative versions thereof, proved to be very influential in this area. It is interesting to note that the size of progression-free sets even enters the complexity of matrix multiplication, see [14, 58].
The question of arithmetic progressions has also been studied modulo m, see e.g. Croot [15]. In this setting “modulo m” one has to clarify if elements of the progression can occur more than once. For example (1, 3, 1, 3) can possibly be considered as a progression of length 4 modulo . In this paper, however, we study “proper arithmetic progressions” meaning that all elements in the progression are distinct, unless otherwise stated.
- Assume that S does not have k elements that satisfy (for fixed constants ) a linear equation
- The case was first investigated by Erdős, Ginzburg and Ziv [29], who proved that for any elements in , where in this problem repetition is allowed, there exists a subset of m elements with sum . (There are hundreds of papers on generalizations and variants, the general topic is called “zero sums in finite abelian groups”). In the case there has been important work by Reiher [48]. The multidimensional case with is widely open, even though there are lower bounds by Edel, Elsholtz et al. [19, 22, 25], and upper bounds by Alon and Dubiner [2], Naslund [42] and Hegedüs [35].
-
The case corresponds to sum-free sets. In the one-dimensional case it is known that the maximal size is , if all are distinct, or if is allowed. In the case modulo a prime m it follows from the Cauchy-Davenport theorem that the maximal size satisfies ( all distinct).
The case of no geometric line (of m points) in the integer grid is known as Moser’s cube problem, see [41, 45]. Closely related is the question of finding the maximal number of lattice points in the same cube , but without any combinatorial line. The famous upper bound by Hales-Jewett [34] of points, when m is fixed and n tends to infinity, became very influential.
In this paper we concentrate on sets of maximal size with no k distinct elements in arithmetic progression. Observe that an arithmetic progression of length k can be expressed by means of linked linear conditions .
The multidimensional case of no 3 points in arithmetic progression has frequently been studied, especially modulo . Here the questions of “no zero sums ” and “no arithmetic progression ” turn out to be equivalent as . The problem is known as the “cap set problem”. There were important contributions by Brown and Buhler [8], Frankl, Graham and Rödl [30], Meshulam [40], Lev [38], Bateman and Katz [4], Croot et al. [16], Ellenberg and Gijswijt [24].
For a long time it was an important open problem if there is a such that holds, for all progression-free sets . Various authors mentioned this statement with varying degree of certainty or doubt, (see [1, 2, 18, 32, 36, 56]) until the solution by Croot et al. [16] (when ), and finally Ellenberg and Gijswijt [24].
Meshulam’s [40] long-standing bound for odd values of was extended by Lev [38] to even values . Improving this, Sanders [52] proved the following result:
for some positive c. Green and Tao [33] write that is admissible. Introducing an entirely new approach, based on the polynomial method rather than Fourier techniques, Croot et al. [16] proved that
where . The new methods introduced in [16] also led to the result in the case by Ellenberg and Gijswijt [24]. Again, the case of cap sets has applications to the complexity of matrix multiplication, see [3, 6].
The corresponding problem on lower bounds of progression-free sets in has also been studied in detail. It is known (see [18] for the history and current record) that there is a set S with with . The currently strongest lower bound example comes from a product construction, based on an example in dimension 480.
For a lower bound when Sanders [52] proved: there exists which does not contain a proper three term arithmetic progression with
This result follows from finding an example in with 16 elements and using a product construction. (Note that .)
The following is known:
and for primes and some positive constant
Indeed the argument yields the bound
where
| 1 |
Furthermore, it is known that J(s) is decreasing and (see equation (4.11) of [6]).
Remark 1.1
As J(s) is decreasing and , with the additional consideration of composite m (see below), one can conclude, that for every the following holds (see e.g. [6, 44]).
| 2 |
For m not being a power of 2 this also holds: if p is an odd prime divisor of m, then . For integers divisible by 4 this follows from [16], since .
There are only very few explicit values known. In the case of cap sets modulo the following is known:
The author of the 6-dimensional result (Potechin [46]), and the authors of the classification of the unique 5-dimensional maximum cap [21], (required for the 6-dimensional case by Potechin) mentioned they used computer calculations. Y. Edel informed us that for the paper [21] the computation time was a few weeks.
The remaining part of the paper is organized as follows: After some necessary notation and describing the results we first prove the asymptotic lower bounds in Sect. 4, as these proofs are shorter. In Sect. 5 we give a reformulation for the problem of finding and . In Sect. 3.1 we give a lower bound for , we then prove that this construction gives the exact values up to dimension 5 (Sects. 6 and 7), which require some detailed case studies. Finally, in Sect. 8 we prove the exact values for up to dimension 4.
Notation
We use the Landau O and o-notation such as , where the O-constant depends at most on a parameter t. We also use the Vinogradov notation, where or has the same meaning as the O-expression above.
We use the standard coding notation A(n, d) on the maximal size of a binary code of words of length n and minimum distance d.
In Sections 6, 7, 8 we will work with linear and affine subspaces of . If L is a linear subspace of dimension d, for brevity we will say that L is a d-subspace. The smallest linear subspace containing the vectors will be denoted by .
Similarly, if L is an affine subspace of dimension d, we will say that L is an affine d-subspace and the smallest affine subspace containing will be denoted by .
Throughout the paper for a subset we use the notation for the sumset and for the restricted sumset.
Results and methods
Progression-free sets
Theorem 3.1
For sets without arithmetic progression of length 3 we have the following results:
We give quite uniform proofs for all these dimensions. The value was stated before by Sanders [52] (and was indeed a computer calculation by O. Sisask), and the value was determined in a Masters’ Thesis by Lawrence Newcombe [43] (a student of the first author). From that proof it was already apparent that could be much smaller than , due to a -substructure of , but proceeding to higher dimension might have been very tedious.
Next we give a lower bound on . In the construction we use binary codes with certain minimum distances. Let A(m, d) denote the largest possible size of a (possibly non-linear) code in with minimum distance at least d. Note that (all vectors can be taken) and (all codewords can be taken with even Hamming-weight). Here are links to tables of exact values of maximal codes or bounds: https://www.win.tue.nl/~aeb/codes/binary-1.html and http://www.codetables.de/
Theorem 3.2
For we have .
Proof of Theorem 3.2
For a point define . If form an arithmetic 3-progression, then for we have:
Hence and a and c differ only at positions .
Fix t and let be such that for every and such that has minimum Hamming distance at least for every T with . Then S does not contain a proper 3-progression, since, if form a 3-progression, then , which implies that .
We can construct such S as follows. For every of size we take a binary code in of size of minimum distance and add symbols ‘1’ in the positions to get a code . The set gives the stated lower bound.
As a consequence of this result one can prove a quite good lower bound.
Corollary 3.3
which implies that there exists a progression-free set with
The exponent 0.7924 is not only much larger than the previous one of 2/3, but it is also much larger than the corresponding one 0.724851 when . This can be interpreted that the progression-free sets in are denser than those in . The ultimate reason for this is that we find a geometrically well structured subset, namely , on which we can find a very dense progression-free subset. As two elements from the same coset of the subgroup forbid other points, namely an affine copy of , it comes handy that this forbidden set has some geometric-algebraic structure.
This Corollary is the first nontrivial case of the lower bound constructions and is suitable for discussing various methods. We first prove it as a direct application of Theorem 3.2.
Proof of Corollary (Proof 1)
Calculations show that the optimal choice for t in Theorem 3.2 satisfies . In particular, for the optimal choice is . Note that the sum of only the first two terms in the lower bound , with an optimal value of t, is
The total sum is not much larger as it is bounded above by (see also [12]).
The proof of Theorem 3.2 and Corollary 3.3 above may appear a bit formal. In Section 4 we explain in detail the geometric motivation of a direct proof of Corollary 3.3, i.e. the connection to Moser’s cube problem, and to the Behrend and Salem-Spencer constructions.
Finally, we found a quite different proof, based on weighted Sperner capacity of the 2-vertex graph with one directed edge, and vertex weights 1 and 2, but decided not to include it.
Corollary 3.4
Let us explain this with two examples: when , choose . Then
which is best possible by Theorem 3.1. When , choose .
Theorem 3.5
For sets without arithmetic progression of length 4 we have the following results:
It is well known that results of this type can be lifted to higher dimensions and yield asymptotic results by a simple product construction, compare also Proposition 3.5 [22] in the similar setting of zero-sum free sets.
Lemma 3.6
Let q be a prime power.
- Let and be k-progression-free sets, then is also k-progression-free.
- A repeated application of part a) gives:
Lifting the largest known exact values and gives:
Corollary 3.7
The first result is considerably weaker than Corollary 3.3, while the second one is the strongest that is currently known. The product construction only makes use of “local” information from small dimensions. The “relative density” for the high dimensional problem is the same as for the low dimensional base-example that was lifted. Lifting for example the bound , (which is not known to be sharp), gives a better estimate . But for it is better to use the “global” information from the digits giving the lower bound . But for we do not know how to replace the product construction by a better strategy.
In many cases we present constructions much better than the product construction. These make use of “global” properties i.e. making full use of the actual dimension n. With our current understanding this only works when . For the product construction appears to be the strongest available method, see also Edel [18].
These proofs describe a set explicitly in terms of its coordinate entries, similar to the constructions by Salem and Spencer [51], and Behrend [5]. Salem and Spencer constructed progression-free sets in the integers by representing integers in an m-ary digit system, m odd, and using the digits a fixed number of times, namely with frequency n/d for integers of length n. Restricting the digits avoids wrapping over modulo m. Behrend constructed large progression-free sets in the integers by mapping a high-dimensional sphere, which by convexity is progression-free, to the integers. He also represented integers in an m-ary system with digits , where m is odd, and fixed value . In the integer case the optimization of the values of m and n shows that Behrend’s construction is greatly superior. In our setting we make use of both ideas, and observe that m, n are fixed by the problem, and the method of Behrend, when applicable, is only slightly stronger, but a bit more complicated.
Proposition 3.8
Let where the prime power q and k are fixed. The limit
exists.
It follows from Theorems 3.11 and 3.12 that . For more is known, see Sect. 1: , when is an odd prime, and where J(p) was defined by (1). We state a more general conjecture:
Conjecture 3.9
By the proposition above and the theorems below we know: for each and each prime there exists a constant , which is certainly in , such that holds, as n tends to infinity.
We conjecture that the following limit exists, and thus is in . (If the bound would mean that an exponential saving for the upper bound holds, as is the case with .)
In view of the above results, and also in view of an upper bound in a relevant case, see Theorem 3.18, we state the following conjecture:
Conjecture 3.10
Tao and Vu [57, exercise 10.1.3] observe that there is a construction in with at least points without 3-progression (based on Behrend’s construction).2
Lin and Wolf [39] proved the following: If m is a prime and
Their proof makes use of a product construction, as explained in Lemma 3.6. They also have some results, when m is a pure prime power, but this refers to finite fields , which are different from . In particular, when m is prime and is large, and n increases, the exponential growth of the lower bound is based on the constant , compared to here.
We now give our general theorems, which improve the above lower bound and remove the prime condition on m:
Theorem 3.11
Let be odd. There exists some such that
Moreover, with the value is admissible. For increasing odd m asymptotically holds.
The case also improves the asymptotic lower bound of affine caps, for details see Sect. 3.2 In the case this would give a lower bound of only which is smaller than the trivial lower bound by taking all elements with coordinate entries 0 or 1. Also note that in view of the theorem trivially induces lower bounds for any (also in the theorem below).
A crucial idea again is to avoid any product construction and to use one more digit than Tao and Vu [57, exercise 10.1.3] used, with some extra constraints, which are less costly (if m is constant and n increases). Their lower bound in case would also be weaker than the trivial progression-free set with elements.
Theorem 3.12
Let be even. There exists some such that
With one can choose . For large m one has that .
(A version of this result, in the special case has also been observed in [44], having seen a precursor of this manuscript. Their main concern is an improvement of the upper bound.)
As is well known from Behrend’s construction there are good reasons to restrict to half of the available digits. In the above cases we go up to one element more than half of the digits. In the cases of even m one additionally has to study progressions of type carefully. In the examples below we go even further, and note that those progressions which actually use the reduction modulo m cause quite a bit of extra work. (For example, in the case we have to care about progressions of type 1, 6, 0, 5 modulo 11.)
Theorem 3.13
The following holds
(No attempt was made to reduce the exponent 3.) For comparison Lin and Wolf [39] have a lower bound of about about . (For fixed k the improvement increases, as m increases.)
It is clear that on a case by case study one can prove related results for several individual values of m and k. Here we present two further cases where these ideas are generalized to infinite families (or respectively), where p is prime. It should be noted that in this case the set of digits used is not consecutive, but makes use of the structure of orbits of length p, and hence the algebraic structure. As can be seen, several good properties are preserved: many progression types can be excluded by the Salem-Spencer “same-frequency property”, and the “all-elements-distinct” property, (i.e. proper progressions).
Theorem 3.14
Let be a pure prime power, . Let . Then there exist constants and such that the following holds:
Corollary 3.15
There exist positive constants and such that the following holds:
Theorem 3.16
Let be a pure prime power, . Let . Then there exist constants and such that the following holds:
For , this is certainly not best possible. By Theorem 3.12 for one can use 5 digits, rather than 4.
Corollary 3.17
There exist positive constants and such that the following holds:
We are not aware of any earlier results of this type.
We now briefly discuss some aspects of the proofs of the exact values, and of a conditional upper bound.
For the estimations of we shall need a reformulation of the problem which is presented in Section 5. Let us say that a system of subsets () satisfies property , if the following implication holds:
(Note that for we define .) In Lemma 5.1 we will show that the largest possible total size of a system of subsets satisfying property is exactly . Hence, estimating the maximal total size of a system of subsets satisfying is equivalent with our original question.
As it turns out it is very useful that we can reduce the case of arbitrary subsets A(x) to the case of subspaces. We do not know, if this can be done for higher dimension, but for the low dimensions studied here explicitly this is a quite powerful method. In this case, the upper bound is quite close to the general lower bound in the unrestricted case, namely . This is the heuristic reason why we state Conjecture 3.10.
Theorem 3.18
If the system of subsets A(x) satisfies and all non-empty subsets A(x) are subspaces, then .
Note that for any 2-element subset forms a progression-free subset in . If , then the extremal construction is also unique in the following sense:
Theorem 3.19
Let . If the systems of subsets and both have total size and they satisfy , then there is an invertible affine linear transformation and vectors () such that for every .
Caps
An affine cap in AG(n, q) is a set in with no three points on a line. Here we set , and we study sets without three points on a line in . In other words, when is a prime, caps in and AG(n, q) are the same. The condition “no three points on a line” can be expressed by linear equations of the type (where ). As we see below when it is enough to consider the case of arithmetic 3-progressions, . The case does not correspond to affine caps, as , but in with any line with three points actually contains 3 points in arithmetic progression. To see this modulo , one just has to examine all cases in one dimension. There are obvious cases such as or . The crucial case is that also the example is a progression, as is a progression.
Modulo the cap set problem is known even from the popular card game SET. The size of the caps in dimension up to 6 are . More generally modulo prime the following is known about affine caps in , (we would like to thank Yves Edel for this collection): in dimension
one has points (a so called oval)
one has points (which is the affine part of an ovoid in projective space)
at least 65 points (which is the affine part of a projective cap of 66 points, see [20].
one has at least 195 points.3
one has at least points, see Edel [18].
For large dimensions, the best lower bound constructions are due to Edel [18] and are based on a tensor product construction of this best cap in dimension 6. For prime m, this gives a lower bound of , (which is also the construction of Lin and Wolf [39]), and this gives an asymptotic exponent of about 2/3.
Some refinements are known, when or (finite field case, different from ). Edel ([18]) writes: “No better lower bound seems to be known for general q, except for the ternary and quaternary cases.”
Especially in the case there have been a number of refinements to an exponent of 0.724851 (see [18]). The progress over the previous record 0.7218 (see [9]) seems small, but this progress is, of course, on a logarithmic scale.
When the above lifting from dimension 6 gives a lower bound of points. In contrast, Theorem 3.11 above gives the lower bound of points. It may be possible to optimise the constant in the construction modulo 5, similar to the case in Theorem 3.2. In any case this improvement appears to be the first improvement for any affine cap in AG(n, m), when .
As the case of affine caps modulo primes (or prime powers) has been well studied in the literature it seems somewhat surprising to us that the quite simple construction of vectors with n/3 of the entries being 1, and the other 2n/3 of the entries being 0 or 2 has not been observed before, and still asymptotically breaks the record. The reason may be that the improvement actually can only be seen for . Even with an improved constant one will not see the improvement for small dimension.
Proofs of the asymptotic lower bounds
We will use several times that the central multinomial coefficients can be approximated by Stirling’s formula:
Lemma 4.1
Let be an integer. There exists a constant such that
Here we give a geometrically inspired proof of Corollary 3.3.
Proof of Corollary 3.3 (Proof 2):
The crucial idea is that an arithmetic progression of length 3 (with 3 distinct points) in has a uniquely defined middle point. For comparison, this is not the case in .
We relate the problem to a problem posed by Moser [41]. Find in the maximal set of elements without “three on a line” (which is also known as Moser’s cube problem). Observe that in this case there is no reduction modulo 3. Let f(n) denote the largest such number in . It is known that , (see [13]), (see [11]), [45]. In dimensions 1, 2, 3 and 5 these values are the same as , but in dimension 4 one has that .
A simple observation by Komlós [37] shows that , and the implicit constant was refined again by Chvátal [12]. The construction by Chvátal relates the problem to coding theory and gives , for example.
Let us adapt Komlós’ [37] observation to our situation: the set
has the claimed number of elements and has no three points on a line.
Let us count the number of such points, let n be a multiple of 3, then by Stirling’s formula S has
elements. When we have the same order of magnitude, up to a constant factor, for example, by filling the extra 1 or 2 coordinates with entries from . Further observe that for three points to be on a line (in this order), one would need, in each coordinate, that
i) all entries are the same, or ii) the entries are 0, 1, 2 or 2, 1, 0 (in this order). Since the number of “middle entries 1” is constant for all points, there cannot be an arithmetic progression of three distinct digits.
Let us embed the set S from canonically into . Think of G as the lattice points but now with reduction modulo 4 in each coordinate. Observe that the set S does not have a single “3”-entry. An arithmetic progression of length 3 modulo 4 that does not make use of in any coordinate must be of one of the types below, in a given coordinate.
The digits are:
-
(i)
the same,
-
(ii)
or are 0, 1, 2 or 2, 1, 0 in this order,
-
(iii)
or 0, 2, 0, or 2, 0, 2.
We will show that the set does not contain a proper 3-progression. Suppose S does contain three distinct points in arithmetic progression. The case i) where all entries are the same does not play any role. Let us look at those coordinates where the entries differ. Since all points have the same number of 1 entries, let us study, where one of the three elements uses a “1”, but another point does not: For this, the only possibilities are 0, 1, 2 and 2, 1, 0. But here only the middle point can make use of a 1. So, the two points and cannot make use of their ones, unless all three entries are identically 1. This means that all three points have their ones in exactly the same position, and that there is no coordinate with a progression 012 or 210. So, let us look at the other coordinates. The only possibilities left are 020 or 202. But then and would be the very same point, a contradiction to the definition of a proper progression.
Proof of Proposition 3.8
The idea of this proof might go back to Shannon [54], see also Davis and Maclagan [17]. Let , so that we have the following properties: By the product construction (Lemma 3.6) we have
i.e. and therefore
Therefore, the sequence is superadditive. By Fekete’s Lemma on superadditive sequences the limit exists and equals .
Proof of Theorem 3.11:
We first prove a slightly weaker result based on the Salem-Spencer construction [51] for sets of integers without arithmetic 3-progression. Recall that m is odd and that we only need to study . Assume first that n is a multiple of . Choose vectors with digits
with exactly entries of digit i, where . The number of such vectors is maximized when for every i. This gives at least points, for positive constants . If n is not a multiple of one can fill the remaining coordinates with entries , which slightly weakens the constant .
We show that there is no arithmetic 3-progression: by the choice of the allowed digits, if the digit occurs, then the digit is forbidden, so 0 is never in the centre of a proper 3-progression. As all vectors have the same number of 0-entries, all of these digits 0 must occur in the same coordinate position, giving a trivial 000-progression. One then continues: All nontrivial 3-progressions, without the digit 0 do not have a digit 1 in the centre, and hence the digit 1 can only come from a 111-progression.
To do an explicit example, let , we use the digits: 0, 1, 2, 3, 4, 5. A complete list of all possible 3-progressions of these digits is:
As there are three distinct points, there must be a proper 3-progression of 3 distinct digits abc. As the digit 0 is never in the centre of any of these nontrivial 3-progressions, and as all vectors have the same number of 0-entries, the digit can only occur in the trivial way: 000. This leaves the following shorter list of nontrivial 3-progressions:
Now the digit 1 is never in the centre, and 1 can only occur in the trivial 111 progression. leaving the list 234, 345, 432, 543. Now, the digit 2 is never in the centre, so 2 can only occur as 222, leaving 345, 543. Now 3 is never in the centre, which gives the final contradiction.
Note that initially we have restricted the frequency of all digits 0, 1, 2, 3, 4, 5, but we can now observe that restricting the frequency of the digits 0, 1, 2, 3 is enough.
Based on a comment of a referee, we can also observe that one gets some saving on the number of restrictions, when fixing the total number of occurrences of “digit is 0 or 5”. As there is no 0 or 5 in the centre position one can remove 0 and 5 so that the list of nontrivial progressions 012, 024, 123, 135, 234, 210, 345, 321, 420, 432, 531, 543 immediately shrinks to 123, 234, 321, 432. Now a second condition such as “the total number of occurrences of 1 and 4 is constant” also forbids these cases, so that we have used only two restrictions. In general it seems that about m/4 such restrictions of joint occurrence of digits a and are sufficient.
We now prove the theorem in its full strength, based on Behrend’s construction. The number of elements used is larger by a factor only.
Let m be odd, and n be a multiple of . Let
Here can be thought of as a sphere about centre with R as squared radius. We prove that all are progression-free and there exists an of size at least .
Suppose there are three distinct points in arithmetic progression. None of the progressions in a fixed coordinate makes use of the reduction modulo m, so the convexity of the geometric sphere gives a contradiction. But let us look at this arithmetically: Let the progression in the i-th coordinate be . Then for the three points one has that . Then
This gives =0. Hence for all i. In other words, the three points are identical, which is a contradiction. The size of large sets follows from the observation that most elements in have a value of in an interval of size the standard deviation around the mean value. To make this more precise, we follow Elkin [23] and consider as independent random variables , distributed uniformly in , and . The expected value is and . The variance is
and . The standard deviation is and , where depends only on m. By Chebychev’s inequality . With we see that for at least two thirds of all elements in the sum of digit squares-distances from the centre point is in the interval . By the pigeonhole principle there exists a squared radius R with frequency at least , where .
Note that . As the proof only makes use of effective bounds, the result is valid for all odd and all n. If the odd value m tends to infinity, then, asymptotically holds, giving the claimed value of .
Remark
While the Salem-Spencer type construction with all frequencies of the digits being constant is completely explicit, the above Behrend-type proof uses the pigeonhole principle, which is not explicit, and in algorithmic terms slowly, as one would need to search for a good value R. However, a result of Rankin [47] gives entirely explicit bounds on the number of representations of numbers as a sum of n squares of bounded size. In particular this shows that not only there are good values R but that all values R in the interval are good, when weakening the constant by a small factor only. In particular, one can choose . In another direction, as the above argument does not make use of reduction modulo m, it seems possible to implement the improvement by Elkin [23], which might gain an extra factor, maybe of size . Elkin observed that 3-progressions in a suitable union of spheres (annulus) are geometrically quite restricted. One can then prove that there is a large subset of this union which is progression-free.
Proof of Theorem 3.12:
Again, we first prove a sightly weaker version based on the Salem-Spencer construction. This proof is similar to the previous case, but as m is even there is one extra complication to care for. Assume first that n is a multiple of , and that there is an arithmetic progression of three distinct points.
Choose vectors with exactly entries of digit i, where . The number of such vectors is maximized when for every i. This gives at least points. If n is not a multiple of one can fill the remaining coordinates with 0-entries, which will slightly weaken the constant .
Working out the set of all nontrivial 3-progressions, one observes that the boundary values 0 and m/2 occur as values in the middle position only in the progressions of type , or constant progressions. This means that the values of 0 or can occur in constant 3-progressions, 000, and the same number of progressions of type and . Hence other nontrivial progressions using 0, or , like 012 never occur.
By definition of a proper 3-progression we search for three distinct points, this means there must be somewhere another nontrivial progression abc with three distinct digits in . One can then continue iteratively as before, and concludes there is no nontrivial 3-progression of 3 distinct points. (As in the case of odd m it is possible to reduce the number of restrictions, for example by fixing the joint occurrences of digits a and .)
Let us define the Behrend-sphere:
We prove that is 3-progression-free in . The estimate on the number of points is as in the case of odd m above.
Suppose there are three distinct points in arithmetic progression. The non-constant progressions in a fixed coordinate do not make use of the reduction modulo m, with the two exceptions of and . Let denote the number of coordinates with a fixed progression-pattern such as 000, 012, 024 etc. Of these, let count the pattern and and let count the pattern . As all other patterns do not wrap over modulo m let count the pattern .
Hence . The points in lie on a sphere with centre . Let the progression pattern of the j-th coordinates be . Then for the three points one has that . Then
This gives . Hence for all non-constant patterns with one has that . The three points only consist of patterns aaa, or . Therefore the first and the third point are exactly the same point, in contradiction to the assumption.
We estimate as above: for consider as independent random variables, distributed uniformly in , and . The expected value is and . The variance is
and . The standard deviation is and , where depends only on m. By Chebychev’s inequality . With we see that for at least two thirds of all elements in the sum of digit squares-distances from the centre point is in the interval . By the pigeonhole principle there exists a squared radius R with frequency at least , where .
Note that . As the proof only makes use of effective bounds, the result is valid for all even and all n. If the even value of m tends to infinity, then asymptotically holds, giving the claimed value of .
Note that the values of the constants in the two cases m odd and even are quite similar.
Proof of Theorem 3.13
Let n be a multiple of 7 and let . Let
The list of trivial and nontrivial arithmetic progressions of length 4 with digits in D modulo 11 is:
Let denote the number of coordinates, where the pattern occurs among the 4 points which are in arithmetic progression. As the digit 0 occurs in all 4 positions with the same frequency, and applying it to positions 3 and 1 we see that the number of occurrences of a pattern 1605 equals the sum of the number of occurrences of patterns 0123 and 0246 together. (See underlined symbols in the list of patterns.) Also looking at digit 1 at positions 2 and 1, and combining these gives:
As 1234 is the only nontrivial progression with digit 4 in the last position, all 4’s must occur in form of a trivial progression, 4444. Therefore
This leaves only the following nontrivial progressions.
Here we observe that there are no digits 2 or 6 at the boundary, and also no digits 3 or 5 in the positions 2 and 3. So, in each coordinate there can only be a constant progression, which contradicts that we have a proper progression of distinct points in S. The number of elements in S is the multinomial coefficient for some constant , by Stirling’s formula. If n is not a multiple of 7, say , one adds further coordinates with constant digits, which weakens the overall lower bound by a small factor.
Proof of Theorem 3.14
In this situation we do not take the digits consecutively, but make use of the algebraic structure of . In particular generates a subgroup of order p, and k-progressions in with gap size divisible by p have the property that the first element is the same as the last element. We choose the digits as follows:
Observe that D contains complete cycles of length p, and one extra element, and so . There are three types of progressions of length in D:
Type I progressions have a non-zero gap size divisible by p. In this case the first element and the last element of the progression are the same.
For Type II progressions the gap size is not divisible by p. In this case all residue classes modulo occur, and the first and last element are the same modulo , but cannot be the same modulo . The residue class must occur, as D contains only one element . We observe that no such k-progression can start with , as it would have to end at another element , which is impossible.
Type III progressions are constant.
So far this was the part which generalized the algebraic situation from to prime powers. The last part is the set-theoretic trick inspired by Salem and Spencer.
Let and let
The number |S| of elements is the multinomial coefficient according to Lemma 4.1. Suppose that S contains a proper arithmetic progression of length .
Let us study the occurrence of the digit in the first vector. It cannot be part of a type I or type II progression, and hence must be a constant type III progression. Therefore all coordinate entries in all vectors occur in the same positions. In all other coordinates we only have type I and type III progressions. For these the first and the last elements are the same, modulo m. Hence there cannot be a proper arithmetic progression of length k, which by definition consists of k distinct elements.
Proof of Theorem 3.16
Recall that . Let ,
and . Choose the digits:
Observe that . For example, when , then
There are four types of progressions of length in D:
Type I progressions with gap size , which therefore contain a cycle of length . Here the first element and the last element is the same. Note that the class 0 cannot be part of such a progression, as the element is not in D.
Type II: progressions of gap size p. They must use exactly one of the digits in , but cannot use it in the first or last position: starting with and gap size p the longest progression size is , as otherwise a digit in would be needed, which is impossible. Also the progression cannot contain 0, as it would then also contain , which is impossible. (Example, : the longest progression with gap size 3 is: 22, 25, 1, 4, 7.)
Type III progressions have a gap size coprime to p, and do not contain any cycle. They consist of distinct digits, and in particular go through all residue classes modulo , and therefore contain the special element 0. But note that no such progression can start with 0, as it would also have to end at another element , which is impossible.
Type IV progressions are constant.
Note that progressions starting with 0 must be of type IV. Now let |D| divide n and let
The number |S| of elements is the multinomial coefficient . As all elements contain the same number of 0-entries, the constant progressions (type IV) are the only ones that contain any 0-entry.
Now suppose that S has a proper progression of length . All k elements contain in positions an entry . Looking at the first element of the progression we see that these progressions starting with can only be of type IV, i.e. constant. Hence all digits cannot take part in any nontrivial progression. With all other digits in and with all progression types we observe that the first and the last elements are the same. Altogether, the set S of vectors does not have a proper arithmetic progression of length k, which by definition consists of k distinct elements.
Subset reformulation
In this section we give a “subset formulation” for the question of determining and . As an application of the former one, we give another proof for Corollary 3.3, then we prove Theorem 3.18.
Reformulation for 3AP-free-ness
Let us say that a system of subsets () satisfies property , if the following implication holds:
(Note that for we define .) Let denote the maximal possible size of , if the system of subsets satisfies .
The proof of Lemma 5.1 (below) shows that property nicely captures the condition that the “corresponding” is 3AP-free.
Lemma 5.1
For every we have .
Proof
Let and . Every element can be written as () in a unique way. Let . Let us assign to every (where ) a subset in the following way: . Three distinct elements (where ) form an arithmetic progression (in this order) if and only if , that is, if . As , this implies , so the condition gives . Such elements can be found in A if and only if for distinct we have and . Note that , and this is equivalent with the condition that the system of subsets satisfies property . Furthermore, , so the maximal possible size of a 3AP-free subset of is equal to the maximal possible total size of a system of subsets A(x) satisfying property .
3AP-free sets: lower bound and subspace version
In this subsection, first, as an illustration, we give an alternative – different from the proof presented in Section 4 – proof (using the subset reformulation) for Corollary 3.3, then we prove Theorem 3.18.
Alternative proof of Corollary 3.3
For let . Let us fix some . Let if and otherwise. We claim that the system of subsets A(x) satisfies . Indeed, if , then , thus yields .
The total size of the subsets A(x) is . The optimal choice is yielding .
Proof of Theorem 3.18
For let contain those x for which A(x) is a subspace of codimension k. If there is an A(x) of codimension 0, that is, , then all the other A(y) sets are empty, thus the total size of the subsets is only . From now on, we assume that each nonempty subset is a subspace of positive codimension.
Let us fix k. For let be a basis for the orthogonal complement of A(x), that is, .
Let and . Now, for every we have . If , then , thus for some we have . However, this implies that , that is, . Let and . If , then , so the vectors (u(x), v(x)) (with ) form a biorthogonal system of vectors, specially, the u(x) vectors are linearly independent. However, all the u(x) vectors lie in a subspace of dimension , thus . Therefore, the total size of the subsets A(x) is at most .
Now we use the tensor power trick to get rid of the factor 6. Let us assume that in the system of subsets A(x) satisfies and all the non-empty subsets are subspaces. Let . Now, we can define a system of subsets in as follows. For let . It is easy to check that this system satisfies , all the non-empty subsets are subspaces and the total size of the subspaces is . Therefore, , thus . This holds for every t, so the statement is proven.
Reformulation for 4AP-free-ness
Let us say that a system of subsets () satisfies property , if the following implication holds:
(Note that for we define .) Let denote the maximal possible size of , if the system of subsets satisfies .
Lemma 5.2
For every we have .
Proof
Similarly to the proof of Lemma 5.1 let us write every element in the form (where ). Let . Let us assign to every (where ) a subset in the following way: .
Now four distinct elements (where form an arithmetic progression (in this order) if and only if and , that is, if and . This implies and , so the condition gives and . Such elements exist in A if and only if for distinct elements we have and . Note that . Hence, A is 4AP-free if and only if the system of subsets satisfies property .
Furthermore, , so the maximal possible size of a progression-free subset of is the same as the maximal possible total size of a family of subsets A(x) satisfying property .
3AP-free subsets of , if
Now, we are ready to prove Theorem 3.1. In this section we give a proof for , the case is covered in the next section.
Before starting the proof we give a brief outline of the main strategy. If we take a look at condition or , then heuristically it seems to be a good idea to use sets with small doubling, since and seem to be less restrictive for sets with a small doubling. Subspaces have a small doubling, and working with them is easier, an important step will be to show that it can be assumed (up to ) that in a maximal configuration all the (non-empty) subsets are subspaces. To arrive at this all-subspace state, we can use arguments of the following type. If for a large subspace V (where “large” means that ), then we can replace A(x) by V, since (or ) remains true (that is, the corresponding subset is still 3AP/4AP-free) and the total size of the subsets is larger (not smaller). So the general plan is to replace the subsets with subspaces, and then solve the subspace version of the problem. If the dimension is small, then for almost all subsets A(x) we can do this reduction step easily, there are just a few cases, when does not contain a sufficiently large subspace. However, even in these exceptional cases turns out to be too large, so these cases can be excluded, as well. As the dimension increases, both the reduction step and both handling the all-subspace problem is getting more difficult. The 5-dimensional case is considerably more difficult than the previous cases, the proof of it is presented in the next section. Now, we continue with the proof of the cases .
Proof of Theorem 3.1in the cases . According to Lemma 5.1 and Corollary 3.4 it suffices to show that .
Case 1: . If the dimension is 1, then it is trivial that every 2-element subset of is 3AP-free and any three elements form a 3AP, so .
We continue with some general observations that are going to be used when the dimension is at least 2. Let us take a system of subsets (indexed by elements ) satisfying . For brevity let .
Observation 1
If for some , then by the pigeon-hole principle . Since, for every we have , so, for some we have , that is, . Therefore, , so all the subsets are empty except A(x), thus . Hence, in this case the statement holds.
From now on, let us assume that for every x.
Observation 2
Let A(x) be a nonempty subset: . It can be assumed that , since changing A(x) to a translate of itself, , preserves .
Observation 3
If , then A(x) is automatically a subspace, as . If , let u and v be two different nonzero elements of A(x), that is, . Clearly, for we have , so we may replace A(x) by the 2-dimensional linear subspace . This way is still satisfied, and either S does not change or it increases by 1.
Now we consider the cases one by one.
Case 2: .
Now, we continue with the case when the dimension is 2. If none of the subsets is empty, then all of them can have size at most 1, thus . Otherwise, by Observation 1 we can assume that every nonempty subset has size at most 2, thus , since there must be an empty set.
Case 3: .
If the dimension is 3, then let be a basis for .
According to Observations 1-3 we can assume that all subsets have size at most 4 and every nonempty subset is a subspace (of dimension at most 2).
Let k denote the number of 2-subspaces and l the number of empty sets. If , then , and we are done. Note that in fact , since either all subsets have size at most 1 or at least one of them is empty.
So we can assume that . If a 2-subspace, then are all empty, that is, we can assign an “empty triple” to each 2-subspace. To different 2-subspaces we assign different triples, as the sum of the elements in the triple is x. That is, . We have , if , equality holds if and only if . If , then . Therefore, is shown and the maximum occurs when .
We continue with the 4-dimensional case.
Case 4: .
We will show that if the system of subsets satisfies , then .
At first it is going to be shown that “in most of the cases” it can be assumed that all the nonempty A(x) subsets are linear subspaces, then we will prove the statement for the special case when the non-empty A(x) subsets are all linear subspaces and finally we will also cover the remaining cases.
By Observations 1-3 we can assume that all subsets have size at most 8 and every nonempty subset of size at most 4 is a subspace (of dimension at most 2).
Let . As , we may choose three linearly independent vectors from A(x). Let these be and let . As , we have that , that is, contains all the elements of the subspace , possibly with the exception of .
We claim that if there exists some , then . To see this, we only need to show that . However, either (with some distinct ) and (where ) or and is a good representation. Therefore, in this case we can replace A(x) by . It remains to check the case when any four vectors in are linearly independent.
Step 1. Assuming that A(x) is not a subspace, and any four vectors in are linearly independent we prove under the additional assumption that at most two subsets have size 8.
Without loss of generality it can be assumed that , where is a basis. The 3-subspaces spanned by three out of these basis vectors cover with the exception of . That is, if , then , but in this case for , so we can replace A(x) by a larger set . So, it suffices to check the case when . The system of subsets can be replaced by a “translate” of itself: where for some fixed (not depending on y). So by taking we may suppose that . Then , so at least 10 subsets are empty. The size of A(0) is 5 and the size of the other five (possibly) nonempty subsets is at most 8. If at least two out of these five subsets have size at most 5, then . If this does not hold, then at least four of them are of size 8. We will cover this case later: indeed, it is going to be shown that if at least three subsets are of size 8, then .
Step 2. From now on, we will assume that
either all the nonempty A(x) sets are linear subspaces of dimension at most 3, or
some of the subsets are of size 5 but there are at least three subsets of size 8,
and show that in these cases, too.
Let h be the number of 3-subspaces. We distinguish 4 subcases.
-
Subcase 1 () In this case all of the subsets are of size at most 4. If is a 2-dimensional subspace for some x, then , thus and are all empty. So for each 2-subspace A(x) we can assign an “empty triple”, since the subsets assigned to the elements of are all empty. Moreover, the triple determines x, since the sum of the vectors in the triple is x. Let k be the number of 2-subspaces and l be the number of empty subsets (among the A(x) sets). As empty triples can be assigned to the 2-subspaces by an injective mapping, we have .
Hence, . If , then this yields . Moreover, for we obtain that . If , then .
In all cases we obtained that .
Subcase 2 () Let . As , at least 7 subsets are empty and consequently .
Subcase 3 () Let A(u) and A(v) be the two 3-subspaces. Then and are 3-dimensional affine subspaces. If , then and for all , so . Otherwise, is a 2-dimensional affine subspace, so , that is, at least 10 subsets are empty. Then .
Subcase 4 () Finally, let us assume that A(u), A(v), A(w) are 3-subspaces. Note that in this case it can happen that some of the nonempty subsets are not subspaces (these sets have size 5 and contain 5 affine independent vectors). According to Subcase 3, at least 10 subsets are empty. If at least 11 subsets are empty, then , and we are done. So it can be assumed that exactly 10 subsets are empty. Let . Since there are only 10 empty subsets, from the argument of Subcase 3 it follows that these are exactly the 10 subsets A(x) which are assigned to the 10 elements . However, are all affine subspaces, so the sum of the vectors in U adds up to 0 and the same holds for V and . Thus the sum of the vectors in is also 0. Hence, the sum of all vectors to which the empty set is assigned is . However, we can repeat this argument with U and W and get that the sum is also equal to , which is a contradiction. We are done.
Proof of Theorem 3.19
We are going to use the implications of the previous proof.
When , one of the sets must be empty and all other sets must have size 2 in order to get 6 elements. If, say, , then for any the set A(x) must contain two elements whose difference is x. Two such configurations always can be mapped to each other in the required way.
When , then we need four empty sets and four 2-subspaces to get the total size of 16. Assume that . We claim that are affine independent. Otherwise they form an affin 2-subspace, however, taking some the affine 2-subspace would have to contain exactly three of (and x as the fourth element) which is impossible. Therefore, are affine independent, and by some affine linear transformation these can be mapped to , for simplicity. Now, we can assume that 0 is contained in every nonempty A(x) (by suitable translations). Then it follows that , for and .
Finally, let . Note that can hold only in Subcase 1 when .
From the proof it follows that is possible only if there are exactly five empty sets, ten 2-subspaces and one 1-subspace. Moreover, if are the vectors to which the empty set is assigned, then the 3-term sums made out of these 5 vectors have to be all distinct. Clearly, by applying a suitable affine linear transformation we can assume that and are linearly independent. If , then all the 10 triple sums lie in a 3-subspace, so they can not be all distinct. Thus are linearly independent. Therefore, by renaming (if necessary), let , where is a basis. The set can not be a 2-subspace, since all vectors in it must have Hamming-weight at least 3 to satisfy . So it is the unique 1-subspace, for instance is an appropriate choice, but is also fine with any 3-subset of . By permuting with a suitable affine linear transformation we might assume that .
The remaining 10 sets need to be 2-subspaces. For the unique appropriate choice is , with this choice holds. For the unique appropriate choice is , with this choice is satisfied.
Proof of
We will show that if the system of subsets satisfies , then .
Again, by Observations 1-3 we can assume that all subsets have size at most 16 and every nonempty subset of size at most 4 is a subspace (of dimension at most 2).
Now, let us assume that . The set A(x) must contain at least 4 linearly independent vectors. (Note that by Observation 2 we have .)
Step 1. First, let us assume that a set A(x) with size spans a 4-dimensional subspace. Our aim is to show it can be assumed that A(x) itself is a 4-subspace.
Let be linearly independent. Then contains all the pairwise sums . If also lies in A(x), then , since the 3-term sums like can be obtained as and . Hence, if , then A(x) can be replaced by .
Now, let us assume that . Let us call the 2-term sums (with ) pairs and the 3-term sums (with i, j, k distinct) triples. The pair can be identified with the set of indices , let us call this subset also a pair, and similarly the 3-element subset will be called a triple corresponding to the vector . As the size of A(x) is at least 9, the set A(x) must contain at least elements among the six pairs and four triples.
Now, we will prove that (at least) one of following cases holds:
-
(i)
A(x) contains two disjoint pairs: for instance ,
-
(ii)
A(x) contains a pair and a triple such that their intersection has size 1: for instance: and ,
-
(iii)
A(x) contains all triples,
-
(iv)
A(x) contains a triple and all the three pairs contained in it: for instance: .
For the sake of contradiction let us assume that none of (i-iv) holds. Since we need at least four more vectors, at least one triple is contained in A(x), by symmetry we shall assume that . Note that the pairs are not in A(x), since (ii) does not hold. Therefore, there must be (at least) one more triple in A(x), otherwise (iv) would hold. We may assume that . Now, it follows that the pairs are not in A(x). However, A(x) must contain at least one pair, this pair can only be (since the other five pairs are already excluded). The two remaining triples ( and ) intersect the pair in a single element, so they are not contained in A(x) which contradicts that A(x) contains at least 4 elements from the 4 triples and 6 pairs.
Finally, we show that the equality holds in all of the four cases (i)-(iv).
In case (i) we have and each triple contains either or , thus they can be expressed like .
In case (ii) we have , the triples and can be obtained like , furthermore, and , as all .
In case (iii) all the triples can be written like and .
In case (iv) all the triples can be written like or and .
Thus in all cases we get . Hence, if A(x) is a set of size at least 9 (and at most 16) such that A(x) is not a 4-subspace, then we can assume that .
Step 2. We show that it can be assumed that there is no subset for which and . Our aim is to show that A(x) can be replaced by a 4-subspace. Together with Step 1 this implies that we can assume that all sets having size larger than 8 are 4-subspaces. Moreover, we show that there can be at most one such subset.
Our aim is to show that either there is a 4-subspace such that or the total size S of the sets is at most 124.
Let us assume that , where is a basis. Then all singletons and pairs lie in . If a 4-term sum, like lies in A(x), then contains and we are done: A(x) can be replaced by . More generally we can formulate the following observation:
Observation 4
If it is possible to choose 6 vectors from A(x) in such a way that they span a 4-dimensional affine subspace and their sum is 0, then A(x) can be replaced by a 4-subspace, since translating A(x) by and taking gives , so this case can be handled in the same way as the previous case.
Therefore, , since adds up to 0. Thus the remaining elements of A(x) are all pairs and triples. We claim that the following cases can be excluded with the help of Observation 4:
-
(i)
there are two disjoint pairs, e.g.
-
(ii)
there are two triples intersecting each other in a single element, e.g.
-
(iii)
there is a pair and a triple intersecting each other in a single element, e.g.
In case (i) .
In case (ii) .
In case (iii) .
Finally, let us assume that (i-iii) do not hold. From (i) it follows that the pairs either form a star or a triangle. If they form a triangle, let us assume that it is . Since would imply , we have . Furthermore, (iii) implies that none of the other triples is in A(x). Hence, , we will refer to this as case (a). From now on, we assume that the pairs in A(x) form a star.
If this star contains 4 vectors, e.g. , then A(x) can not contain any triples because of (iii). (Case (b).)
If this star contains 3 vectors, e.g. , then A(x) can not contain any triples because of (iii). (Case (c).)
If this star contains 2 vectors, e.g. . At least one triple must lie in A(x) and (iii) implies that this triple is . (Case (d).)
If only one pair is in A(x), e.g. . There are at least two more vectors (thus triples) in A(x). If one of them is , then the other triple intersects the pair or the triple in one element, contradicting (ii) or (iii). Thus, by (iii) these two triples must contain , which gives case (e).
If there are no pairs, then there are at least three triples. Any two of them have an intersection of size 2, giving case (f) or case (g).
We summarize this:
Note that the size of A(x) is 10 in case (b) and 9 in the remaining cases (a) and (c-g). Also, the size of is 21 in cases (b), (c), (e), (f) and 22 in cases (a), (d), (g).
Let us assume that there is at least one subset A(x) having size at least 9 and not being a 4-subspace. Then at least 21 subsets out of the 32 sets A(y) are empty, so at most 11 subsets are non-empty. Let k denote the number of 4-subspaces among the subsets A(x). Then . If , then this is at most 122. So let us assume that there are at least three 4-subspaces, namely, A(y), A(z), A(u). Let , then K, L, M are affine subspaces of dimension 4.
If two of them are disjoint, for instance , then , giving that for every , which is a contradiction. So any two of them intersect nontrivially each other, and then any pairwise intersection is a 3-dimensional affine subspace. As , we have that , since both of them is an 8-element subset of the 15-element set , hence . Then has size 4 or 8. By inclusion-exclusion principle, in both cases , therefore, at least subsets are empty and .
Therefore, it can be assumed that all subsets having at least 9 elements are 4-subspaces, moreover there are at most 2 such subsets. If there are 2 such subsets A(x) and A(y), then , so at least subsets are empty and . Hence, it can be assumed that there is at most one 4-subspace.
Step 3. Now we show that if , then it can be assumed that A(x) is either a 3-subspace or a set of 5 or 6 affine independent points.
Let us assume that . If has dimension 3, then A(x) can be replaced with this 3-subspace. If , then it can be assumed that . If at least one more element is in A(x), then contains a 3-subspace and we can replace A(x) by this 3-subspace, otherwise , we will refer to this case as case (A).
If , then it can be assumed that . If at least one more element with Hamming-weight at most 4 is in A(x), then contains a 3-subspace. If , then , otherwise , we will refer to this case as case (B).
Hence, it can be assumed that if there is a subset A(x) (with size in [5, 8]) which is not a subspace, then it contains 5 or 6 affine independent points:
-
(A)
-
(B)
Note that the size of A(x) in these cases is either 5 or 6.
Step 4. We show that it can be assumed that all subsets have size at most 8.
Note that we have already seen (in Step 2) that there can be at most one 4-subspace, so let us assume that there exists a (unique) 4-subspace A(y). Then , so there are at least 15 empty subsets. All the other subsets are 3-subspaces or have size at most 6. If there is no 3-subspace, then , and we are done. Let A(x) be a 3-subspace and . As , we have , so there are at least empty subsets, thus at most 14 non-empty ones implying , and we are done. Therefore, none of the subsets can be a 4-subspace, and consequently all the subsets have size at most 8.
Step 5. We show that it can be assumed that all nonempty subsets are subspaces of dimension at most 3 or a set of 5 or 6 affine independent points. Furthermore, the number of empty sets among the A(x) subsets is at most 16 and there exists a subset of size at least 5.
If , then by Observations 1-3 it can be assumed that A(x) is a subspace.
Now, we can assume that all the subsets have size at most 8 and all those non-empty subsets that are not subspaces are of type (A) or (B).
If there are at least 17 empty subsets, then , so it can be assumed that at most 16 subsets are empty.
If there is no subset with size larger than 2, then . If there is no subset with size larger than 4, then there must be a subset with size 4 and there are at most 29 non-empty sets, so . So there is a subset of size at least five, this can be either of type (A) or (B) or a 3-subspace.
Now our aim is to show that we can assume that there is no subset of type (A) neither of type (B).
Step 6. We show that there is no subset of type (B).
Let us assume that there is a subset of type (B). Without loss of generality this is . Then , that is, has size 15 and for every . As , at least 11 subsets are 3-subspaces. Let A(x) be a 3-subspace and . As , we have . We claim that .
For the sake of contradiction, let us assume the contrary. Let and . As , the set can not be a 3-subspace. If is a 2-subspace, then without loss of generality, and . However, none of the translates of this set is contained in , thus we must have , which leads to a contradiction, as well.
Hence, there exists some such that , so the number of the empty subsets is at least 16. If the number of 3-subspaces is at most 14, then , and we are done. So we can suppose that the number of 3-subspaces is at least 15 and one subset has size 6. The set is not a 3-subspace, since any affine 3-subspace containing contains at least 2 more elements that are not in T. Hence is the 16th empty subset. Now, we claim that is not a 3-subspace. This holds, since any affine 3-subspace containing has at least one more element outside of .
Therefore, there is no subset of type (B).
Step 7. We show that there is no subset of type (A).
Let us assume that there is a subset of type (A), it can be assumed that it is . Then and has size 10. That is, we already have 10 empty subsets.
For brevity let us write for if
(E.g. , and so on.)
Let us assume first that the total size of the subsets
is at most 32.
Consider the following 16 subsets: (). Let k denote the number of 3-subspaces among these and l the number of empty ones. If , then , thus , and then
| 3 |
If is a 3-subspace, then is an affine 3-subspace containing . The 1-codimensional affine subspace contains either all 8 elements of or 4 elements of . In the first case we get 7 new empty subsets, so the total number of empty subsets is at least 17 and we are done: . So for every 3-subspace exactly 4 elements of lie in R. The sum of these 4 vectors is 0, so the sum of the three vectors in is . Hence, for every 3-subspace we get an “empty triple” of vectors from R, therefore,
| 4 |
By (3) and (4) we obtain that , which yields . Then (3) implies that , which is a contradiction, since .
Hence, it can be assumed that the total size of the sets
is at least 33, on the other hand, it is clearly at most 40. It follows that none of them is empty and at least three of them are 3-subspaces, so we can assume that A(123) is a 3-subspace. is not possible, since then would contain or or or . Since, if an affine 3-subspace of contains but none of the other 4 vectors, then it is , however, contains 0, as well, contradiction.
So intersects nontrivially R, so , thus at least 4 subsets (among subsets A(x) with ) are empty: . Note that that the sum of the four corresponding vectors is 0. Also, note that in this case (similarly to (3) in the previous case) we shall assume that
| 5 |
Now (5) yields that at least 7 such subsets are 3-subspaces: . Then (4) implies that the number of empty ones is at least 5. Again, by (5) we get . If , then we have triples, but there is a 4-term zero-sum, so 4 triples can not be “empty triples”, thus there is a 6th empty subset: , and by (5) we obtain that . So . As , all the sets A(123), A(124), A(134), A(234), A(1234) must be 3-subspaces. If , then intersects R in 4 vectors whose sum is 0. It can be checked that this set of 4 vectors can not be the same for all the 5 possible v-s. (Otherwise would contain at least 15 vectors to which the empty set is assigned, however, there are only 10 such vectors.) So there must be at least two such 4-element sets. Their intersection has size at least 2, since we have only 6 vectors in R to which the empty set is assigned, and also at most 2, since otherwise they would be the same. Let be the empty ones, and let us assume that the two 4-zero-sum-sets are and . Then . 20 triples can be chosen out of these 6 vectors, but just 8 of them can be “empty triples”, contradiction.
Therefore, we can assume that there is no subset of type (A), that is, all the nonempty subsets are subspaces of dimension at most 3. According to Step 5 there must be at least one 3-subspace among the subsets, as Steps 6-7 imply that all the sets of size at least 5 are 3-subspaces.
Step 8. We show that the number of empty subsets is at least 13.
Let be the number of 3-subspaces and l the number of empty subsets. Let us colour the elements of : x is coloured red if and x is coloured blue if A(x) is a 3-subspace. (If A(x) is a subspace of dimension at most 2, then x is not coloured.) Let , specially, if x is blue, then is a 3-dimensional affine subspace containing x and seven red vectors.
If , then which yields . Now we are going to show that . If x is blue, then in there are two kinds of triples: the 2-subspace spanned by them either contains x or not. The number of triples in is 35 and 7 of these triples span a 2-subspace containing x. These triples are not contained in any other affine 3-subspace .
Furthermore, we claim that if , then a triple can appear in at most two 3-subspaces. For the sake of contradiction, let us assume that a triple is contained in , where are 3-subspaces. Let H be the 2-subspace spanned by this triple, then and , , are disjoint, thus . However, in all the vectors are red except x, y, z, hence , contradiction.
Now, since each triple appears in at most two 3-subspaces, we obtain that
thus
implying that .
Therefore, , as we claimed.
Step 9. We show that if A(x), A(y), A(z) are 3-subspaces (with distinct x, y, z), then is not an affine 2-subspace.
Now, for the sake of contradiction, assume that there are three 3-subspaces, A(x), A(y), A(z) whose intersection is an affine 2-subspace L. Without loss of generality we can assume that L is a linear (2-)subspace. Note that .
Note that can be partitioned into 8 translates of L. Every affine 3-subspace contains the same number of vectors from those L-translates that has a nonempty intersection with it. That is, given a 2-subspace L, we can distinguish three types of affine 3-subspaces, we are going to say that a 3-subspace is of
type-1, if it contains 1-1 vector from each L-translate,
type-2, if it contains 2-2 vectors from four L-translates (and none from the remaining four L-translates),
type-4, if it contains 4-4 vectors from two L-translates (and none from the remaining six L-translates).
In there are 13 red elements, namely, all the vectors except x, y, z. If is blue, then is a 3-subspace of type-1, type-2 or type-4 which contains t and 7 seven red vectors.
If at least two L-translates do not contain any red vector, then the elements of these translates can not be blue, so , contradiction. Hence, there is at most one L-translate without any red vector. In particular, this means that , since there are 13 red vectors in M and at least 3 red vectors outside of M.
Thus . Let us assume that the red vectors outside of M are , these vectors must be in different L-translates. Let be the unique L-translate not containing any red vector. If , then at most one of the sets can be a 3-subspace (namely, ), contradiction. Now assume that . By symmetry we can assume that also holds. But then the union of the sets (that are all affine 3-subspaces of type-1) cover and the (unique) for which can not be blue (since x is not red). Hence, no three-wise intersection of 3-subspaces can be a 2-subspace.
Step 10. Now we know that and no three-wise intersection of 3-subspaces is a 2-subspace. We finish the proof of the upper bound 124 by verifying the statement in these cases.
Let N be the number of those pairs of 3-subspaces whose intersection is a 2-subspace. Then
| 6 |
since each of the k 3-subspaces contain 35 empty triples. Hence, for we have , that is, two of the 3-subspaces assigned to blue vectors intersect each other in a 2-subspace. In the following subcases we always take two such subsets first.
Subcase 1. If , then we can assume that L is a linear 2-subspace and are 3-subspaces corresponding to blue vectors x and y. At least 2 translates of L does not contain any red vector, and in these translates there can not be any blue vectors, either. So the number of blue vectors is at most , contradiction.
Subcase 2. If , then again let L be a linear 2-subspace and be 3-subspaces corresponding to blue vectors x and y. Note that in there are 10 red vectors. We have 4 more red vectors, say, , which must lie in different L-translates. (Otherwise there would be two L-translates without any red vector, which would imply that the 8 vectors in these translates are not coloured, contradicting that that the number of non-coloured vectors is at most 4.)
Note that all the 3-subspaces assigned to some blue vector different from x, y are of type-1 or type-2. To get a 3-subspace of type-2 we need to take 2-2 red vectors from . Moreover, these pairs must determine parallel vectors in these three L-translates (that is, in each pair the sum of the two vectors is the same), so there are at most 6 such subspaces. A type-1 3-subspace must correspond to a (blue) vector from the last L-translate, so there are at most 4 such subspaces. Hence , contradiction.
Subcase 3. Let us assume that . Again, we can assume that for some linear 2-subspace L the sets are two 3-subspaces. Let be the remaining five L-translates. They contain altogether 5 red vectors. If at least two of them do not contain any red vector, then in these two L-translates there aren’t any blue vectors either, so the number of blue vectors is at most 9, contradiction. So without the loss of generality it can be assumed that either (i) contains two red vectors and contain one-one red vector: in () or (ii) contain one-one red vector: in ().
In case (i) let be the two directions that are different from the direction determined by the two red vectors of . That is, and are those two nonzero elements of L that are different from the sum of the two red vectors in . Let us consider the following 6 vectors in : (for ). If such a vector is blue, then the corresponding 3-subspace is of type-2, moreover, contain one-one red pair of this 3-subspace, and in each pair the sum is the same, either or . There are only 4 such triples (of pairs of vectors) meaning that at least two of the vectors () are not blue. To get 15 blue vectors all vectors in must be blue (as there are at most 2 non-coloured vectors). Note that the corresponding 3-subspaces must be of type-1. If , then there can be at most one blue element in (namely ). If , then by symmetry we can also assume that . If is blue, then the corresponding 3-subspace is , but these four 3-subspaces cover , which contradicts that contains only 3 red vectors.
In case (ii) there are two 3-subspaces of type-4. To get a 3-subspace of type-2, we have to choose one-one red pair from in such a way that these pairs determine parallel directions. This can be done in 6 ways, and every affine 3-subspace is determined by 6 points of it, so there are at most six 3-subspaces of type-2. To get a 3-subspace of type-1 we have to choose a red vector from all but one of the L-translates. First assume that no four-element subset of is a 2-subspace. Then the (at least) four red vectors chosen to be in this 3-subspace from determine uniquely a 3-subspace, so the number of 3-subspaces of type-1 is at most 5, thus , a contradiction. Now assume that a 4-element subset, say, forms a 2-subspace. Each 3-subspace of type-1 contains at least 3 elements of , hence all of them contain all these four vectors. Then the blue vector is in , so there are at most 3 such subspaces, thus, , a contradiction.
Subcase 4. Finally, let us assume that , that is, all vectors are either red or blue. First we show that there are two 3-subspaces whose intersection is a 2-subspace. For the sake of contradiction, assume the contrary. Let be four 3-subspaces assigned to blue vectors. If every pairwise intersection has size less than 4 (that is, the intersection is either empty or has size 2), then
| 7 |
so contains at least red vectors. Since there are only 16 red vectors, we must have equality in (7), so each pairwise intersection has size 2 and each triple-intersection has size 0. Clearly, these hold for any four 3-subspaces assigned to blue vectors. Pick such a 3-subspace, for instance, . Then the other fifteen 3-subspaces have to intersect in pairwise disjoint pairs, which is impossible. Therefore, there are two 3-subspaces whose intersection is a 2-subspace.
Hence, we can assume that this 2-subspace is a linear 2-subspace L and the sets are two 3-subspaces corresponding to blue vectors x and y. Let be the remaining five L-translates. These contain 6 more red vectors. As there can be at most one L-translate without any red vector, we can assume that the number of red vectors among them is i) 3-1-1-1-0 or ii) 2-2-1-1-0 or iii) 2-1-1-1-1.
In case (i) let be the red vectors in . If , then in there is at most one blue vector (namely, ), contradiction. Assume that . We can assume that . If is blue, then , but these cover , which contradicts that contains a blue element.
In case (ii) let us assume that the direction is different from the direction(s) determined by the pairs in . Let be the red vectors in these translates. Consider the blue vectors and . The 3-subspaces corresponding to them are of type-2, and both of them contain one-one pair from , moreover, all these pairs determine direction . In and these pairs are uniquely determined. In there are two choices (two disjoint pairs). However, these two pairs in together with the pairs from and determine two pairs in the same L-translate, which contradicts the existence of such a pair in both and .
Finally, we consider case (iii). Let and . For every let . We can assume that contains 4 red vectors and contains 3-3 red vectors.
First assume that the L-translate containing 2 red vectors is , we can assume that these vectors are and . Let t be a blue vector in one of the four L-translates . Then is either of type-2 or type-1. However, only contain at least two red vectors, which means that any 3-subspace of type-2 must contain at least 6 vectors from , which is a 4-subspace, thus the remaining two vectors of the 3-subspace must also lie in this subspace, too. So is of type-1. As is an affine 4-subspace, it intersects in an affine 2-subspace. Therefore, if, say, , then t and the red vectors from form an affine 2-subspace, that is, t is the sum of these three red vectors. But then in the only blue vector is t, contradiction.
Hence, contains one red vector. By symmetry, we can assume that contains 2 red vectors and these are and . Let the red vector in be for .
Let . We claim that and must be of type-1. Otherwise, or would contain at least two vectors from , so it would have to contain two more red vectors from one of the L-translates , these could only be and from . But then the blue vector would also lie in the 3-subspace (to get parallel pairs from the different translates), a contradiction. Hence, and are of type-1.
Consider and . Each of these two subspaces contain either or and they contain and . As the intersection of with the 1-codimensional affine subspace must be of size 2, they contain different elements from . Without loss of generality we can assume that contains . Then , that is, . Now and are determined, since they must contain :
Similarly, the type-1 3-subspaces containing 2-2 blue vectors from and are:
So the set of red vectors in is and in is .
Now consider which is a blue vector in . Note that is of type-2 (otherwise the three 3-subspaces corresponding to blue vectors from would have a 2-subspace intersection, contradicting Step 9). Also, it must contain . As it contains at least 2 vectors from the 1-codimensional affine subspace , it must contain two more, which can only be . The remaining two red pairs are in two of . As , these two L-translates must be and . Also, the difference of the vectors from the same L-translate must be , so the 3-subspace is:
As , we get that has to coincide with . However, this leads to , contradiction.
4AP-free subsets of
Proof of Theorem 3.5
According to Lemma 5.2 it suffices to show that and . In other words, we will show that if the system of subsets satisfies , then is at most 3, 10, 36, 128 for , respectively. Then, we will present constructions of these sizes.
By the pigeon-hole principle we get that if . Hence, holds for at most one x, since and would imply that , contradicting .
This observation immediately yields that . For we obtain the claimed upper bounds 3, 10, 36, respectively.
For we obtain that , now we will show that also holds. We have already seen (in the proof of Theorem 3.1) that it can be assumed that all the nonempty A(x) subsets are linear subspaces or a set of 5 affine independent points. If all the subsets are of size at most 8, then clearly . So we can assume that one of them is , without loss of generality let . It can be assumed that the number of 3-subspaces among the A(x) sets is at least 13, since otherwise . If A(x) is a 3-subspace, then for some (uniquely determined) we have . As , we obtain that . We claim that is injective, that is, if A(x) and A(y) are 3-subspaces (with ), then . Otherwise, , so and similarly . So this would lead to , which contradicts property . Therefore, is injective. Also, if A(x) and A(y) are 3-subspaces (and ), then or , since would imply that and so , which would contradict property .
Now, let us assume that for some z the set A(z) is a set of 5 affine independent points. Let . As , we have . Note that contains (distinct) sums, thus we have . For all we have . We know that at least 13 subsets are 3-subspaces, so there are at most three subsets that are not 3-subspaces: A(0), A(z) and possibly one more. Thus for at least 9 elements of X the set A(x) is a 3-subspace. For such an x the condition implies that , hence . As is injective, this would mean that the 3-subspace contains at least 9 different vectors, which is a contradiction. Hence, it can be assumed that all the nonempty A(x) sets are linear subspaces.
At least 14 of the A(x) subsets are 3-subspaces, since otherwise clearly holds, as for every for which A(x) is not a 3-subspace. Therefore, the mapping is defined on with the exception of at most one point. Also, is injective, so it can be extended to a bijective mapping from to . Let . Then either or for some u. Let
At first assume that . As at least one of and is equal to 0 for every , we get that . On the other hand . Therefore, and for any two distinct elements of H exactly one of and is equal to 0. In other words, for any two different elements . Let for every . Then , thus is a biorthogonal system, implying that , which is a contradiction.
Now assume that there is a subset A(u) (with ) which is not a 3-subspace: . As at least one of and is equal to 0 for every , we get that . However, . Hence, and there is at most one pair of distinct elements such that . By dropping out one of the two elements of this pair from H (if such a pair exists at all) we obtain a 13-element subset such that for every , . Again, let for every . Then , thus is a biorthogonal system, implying that , which is a contradiction.
Hence, it is shown that .
Now we give constructions to prove the lower bounds.
Case 1: .
give . (In fact, any 3-element subset of is free of arithmetic progressions of length 4, trivially.)
Case 2: .
Let . Furthermore, let . Then for every and for every , . For let . Then , since . Also, for any two nonzero vectors x and y either or . We can assume that . (Otherwise we swap x and y.) Then implies that , so the condition holds. Thus .
Case 3: . Let . Similarly to the previous case it suffices to define a bijective mapping such that for every and for every . It is easy to check that the following mapping satisfies these conditions: . Hence, .
Case 4: .
Let . Let us extend the mapping defined in Case 3 with . For every let . Let and . We have to show that
unless and . If , then , that is, . Similarly, implies that .
If , then yields , and we are done. From now on, let us assume that .
If , then by adding up the two equations: , which is a contradiction. Similarly, also leads to a contradiction.
Finally, let us assume that and . Then by adding up the two equations we get , which is a contradiction, too.
Hence, the system satisfies property , and .
Acknowledgements
Open access funding provided by Graz University of Technology. We would like to thank Yves Edel for useful comments, especially on the case of cap sets and the anonymous referees for their useful comments. In particular, we also thank one referee for the quite compact formulation of the proof of Theorem 3.2. C.E. was partially supported by The Austrian Science Fund (FWF), grant W1230, P.P.P. was supported by the Lendület program of the Hungarian Academy of Sciences (MTA), the National Research, Development and Innovation Office of Hungary (Grant Nr. PD115978 and K129335) and the János Bolyai Research Scholarship of the Hungarian Academy of Sciences.
Footnotes
There is certainly an extensive literature in the related area of finite geometry over finite fields, but in literature from an additive combinatorics point of view we are essentially aware of an exercise in the book by Tao and Vu, and a paper by Lin and Wolf, details below.
It seems they possibly intended the denominator to be (in our notation).
in fact, here the caps in PG(5, 5) and AG(5, 5) have the same number of points, see https://www.mathi.uni-heidelberg.de/~yves/Matritzen/CAPs/Matrizen/(195,5,5).html
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Christian Elsholtz, Email: elsholtz@math.tugraz.at.
Péter Pál Pach, Email: ppp@cs.bme.hu.
References
- 1.Alon N., Dubiner M.: Zero-sum sets of prescribed size. In: Combinatorics, Paul Erdős is Eighty. Bolyai Society, Mathematical Studies, Keszthely, Hungary, pp. 33–50 (1993).
- 2.Alon N, Dubiner M. A lattice point problem and additive number theory. Combinatorica. 1995;15:301–309. [Google Scholar]
- 3.Alon N, Shpilka A, Umans C. On sunflowers and matrix multiplication. Comput. Complexity. 2013;22(2):219–243. [Google Scholar]
- 4.Bateman M, Katz NH. New bounds on cap sets. J. Am. Math. Soc. 2012;25(2):585–613. [Google Scholar]
- 5.Behrend FA. On sets of integers which contain no three terms in arithmetical progression. Proc. Natl Acad. Sci. U.S.A. 1946;32:331–332. doi: 10.1073/pnas.32.12.331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Blasiak J., Church T., Cohn H., Grochow J., Naslund E., Sawin W., Umans C.: On cap sets and the group-theoretic approach to matrix multiplication. Discret. Anal. Paper No. 3, 27 pp. (2017)
- 7.Bourgain J. On triples in arithmetic progression. Geom. Funct. Anal. 1999;9(5):968–984. [Google Scholar]
- 8.Brown TC, Buhler JP. A density version of a geometric Ramsey theorem. J. Combin. Theory Ser. A. 1982;25:20–34. [Google Scholar]
- 9.Calderbank AR, Fishburn PC. Maximal three-independent subsets of . Des. Codes Cryptogr. 1994;4(3):203–211. [Google Scholar]
- 10.Cameron P.J.: Sum-free sets of a square. Manuscript. http://www.maths.qmul.ac.uk/~pjc/odds/sfsq.pdf.
- 11.Chandra AK. On the solution of Moser’s problem in four dimensions. Can. Math. Bull. 1973;16:507–511. [Google Scholar]
- 12.Chvátal V. Remarks on a problem of Moser. Can. Math. Bull. 1972;15:19–21. [Google Scholar]
- 13.Chvátal V.: Edmonds polytopes and a hierarchy of combinatorial problems Discret. Math. 4, 305–337 (1973). Reprinted: Discret. Math. 306, 886–904 (2006).
- 14.Coppersmith D., Winograd S.: Matrix multiplication via arithmetic progressions STOC ’87 (Proceedings of the Nineteenth Annual ACM Symposium on Theory of Computing) Pages 1–6, also: J. Symbol. Comput. 9(3), 251–280 (1990).
- 15.Croot E. The minimal number of three-term arithmetic progressions modulo a prime converges to a limit. Can. Math. Bull. 2008;51(1):47–56. [Google Scholar]
- 16.Croot E, Lev VF, Pach PP. Progression-free sets in are exponentially small. Ann. Math. (2) 2017;185(1):331–337. [Google Scholar]
- 17.Davis BL, Maclagan D. The card game SET. Math. Intell. 2003;25(3):33–40. [Google Scholar]
- 18.Edel Y. Extensions of generalized product caps. Des. Codes Cryptogr. 2004;31:5–14. [Google Scholar]
- 19.Edel Y. Sequences in abelian groups of odd order without zero-sum subsequences of length Des. Codes Cryptogr. 2008;47(1–3):125–134. [Google Scholar]
- 20.Edel Y, Bierbrauer J. Large caps in small spaces. Des. Codes Cryptogr. 2001;23(2):197–212. [Google Scholar]
- 21.Edel Y, Ferret S, Landjev I, Storme L. The classification of the largest caps in AG(5,3) J. Comb. Theory Ser. A. 2002;99:95–110. [Google Scholar]
- 22.Edel Y, Elsholtz C, Geroldinger A, Kubertin S, Rackham L. Zero-sum problems in finite abelian groups and affine caps. Q. J. Math. 2007;58(2):159–186. [Google Scholar]
- 23.Elkin M. An improved construction of progression-free sets. Isr. J. Math. 2011;184:93–128. [Google Scholar]
- 24.Ellenberg JS, Gijswijt D. On large subsets of with no three-term arithmetic progression. Ann. Math. (2) 2017;185(1):339–343. [Google Scholar]
- 25.Elsholtz C. Lower bounds for multidimensional zero sums. Combinatorica. 2004;24(3):351–358. [Google Scholar]
- 26.Elsholtz C, Rackham L. Maximal sum-free sets of integer lattice grids. J. Lond. Math. Soc. (2) 2017;95(2):353–372. [Google Scholar]
- 27.Erdős P, et al. Problems and results on combinatorial number theory. In: Srivastava JN, et al., editors. A Survey of Combinatorial Theory. Amsterdam: North Holland; 1973. pp. 117–138. [Google Scholar]
- 28.Erdős P, Turán P. On some sequences of integers. J. Lond. Math. Soc. 1936;11:261–264. [Google Scholar]
- 29.Erdős P., Ginzburg A., Ziv A.: Theorem in the additive number theory. Bull. Res. Council Israel F (10), 41–43 (1961).
- 30.Frankl P, Graham RL, Rödl V. On subsets of abelian groups with no 3-term arithmetic progression. J. Comb. Theory Ser. A. 1987;45(1):157–161. [Google Scholar]
- 31.Gowers WT. A new proof of Szemerédi’s theorem. Geom. Funct. Anal. 2001;11(3):465–588. [Google Scholar]
- 32.Green B.J.: Finite field models in additive combinatorics. In: Surveys in Combinatorics . London Mathematical Society. Lecture Note Series, vol. 327, pp. 1–27. Cambridge University Press, Cambridge (2005).
- 33.Green B., Tao T.: New bounds for Szemerédi’s theorem. I. Progressions of length 4 in finite field geometries. Proc. Lond. Math. Soc. (3) 98(2), 365–392 and correction: New bounds for Szemeredi’s theorem, Ia: Progressions of length 4 in finite field geometries revisited. arXiv:1205.1330 (2009).
- 34.Hales AW, Jewett RI. Regularity and positional games. Trans. Am. Math. Soc. 1963;106:222–229. [Google Scholar]
- 35.Hegedüs G.: A new exponential upper bound for the Erdős–Ginzburg–Ziv constant. arXiv:1712.00228.
- 36.Kalai G.: Webblog, 7th February 2009. http://gilkalai.wordpress.com/2009/02/07/frankl-rodls-theorem-and-variations-on-the-cap-set-problem-a-recent-research-project-with-roy-meshulam-a/ (2009).
- 37.Komlós J. Solution to problem P.170 by Leo Moser. Can. Math. Bull. 1972;15:312–313. [Google Scholar]
- 38.Lev VF. Progression-free sets in finite abelian groups. J. Number Theory. 2004;104:162–169. [Google Scholar]
- 39.Lin Y, Wolf J. Subsets of containing no -term progressions. Eur. J. Comb. 2010;31(5):1398–1403. [Google Scholar]
- 40.Meshulam R. On subsets of finite abelian groups with no -term arithmetic progressions. J. Comb. Theory Ser. A. 1995;71:168–172. [Google Scholar]
- 41.Moser L. Problem P.170. Can. Math. Bull. 1970;13:268. [Google Scholar]
- 42.Naslund E. Exponential bounds for the Erdős-Ginzburg-Ziv constant. J. Combin. Theory Ser. A. 2020;174:105185. [Google Scholar]
- 43.Newcombe L.: MSc Thesis, Royal Holloway (2008).
- 44.Petrov F, Pohoata C. Improved bounds for progression-free sets in . Israel J. Math. 2020;236(1):345–363. [Google Scholar]
- 45.Polymath D.H.J.: Density Hales-Jewett and Moser numbers. In: An Irregular Mind, pp. 689–753. Bolyai Soc. Math. Stud., vol. 21, János Bolyai Math. Soc., Budapest (2010).
- 46.Potechin A. Maximal caps in AG(6,3) Des. Codes Cryptogr. 2008;46(3):243–259. [Google Scholar]
- 47.Rankin R.A.: Representations of a number as the sum of a large number of squares. Proc. R. Soc. Edinb. Sect. A 65, 318–331 (1960/1961).
- 48.Reiher C. On Kemnitz’ conjecture concerning lattice-points in the plane. Ramanuj. J. 2007;13:333–337. [Google Scholar]
- 49.Riddel J. A lattice point problem related to sets containing no -term arithmetic progression. Can. Math. Bull. 1971;14:535–538. [Google Scholar]
- 50.Roth KF. On certain sets of integers. J. Lond. Math. Soc. (2) 1953;28(1):104–109. [Google Scholar]
- 51.Salem R, Spencer DC. On sets of integers which contain no three terms in arithmetical progression. Proc. Natl Acad. Sci. U.S.A. 1942;28:561–563. doi: 10.1073/pnas.28.12.561. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Sanders T. Roth’s theorem in . Anal. PDE. 2009;2(2):211–234. [Google Scholar]
- 53.Sanders T. On Roth’s theorem on progressions. Ann. Math. (2) 2011;174(1):619–636. [Google Scholar]
- 54.Shannon CE. The zero-error capacity of a noisy channel. IRE Trans. Inf. Theory. 1956;2:8–19. [Google Scholar]
- 55.Szemerédi E. On sets of integers containing no elements in arithmetic progression. Acta Arith. 1975;27:199–245. [Google Scholar]
- 56.Tao T.: Webblog, 23 February 2007. Open question: best bounds for cap sets. http://terrytao.wordpress.com/2007/02/23/open-question-best-bounds-for-cap-sets/ (2007).
- 57.Tao T, Vu V. Additive Combinatorics. Cambridge: Cambridge University Press; 2006. [Google Scholar]
- 58.Williams V.V.: Multiplying matrices faster than Coppersmith–Winograd. In: STOC’12–Proceedings of the 2012 ACM Symposium on Theory of Computing, pp. 887–898. ACM, New York, (2012).
- 59.Wolf J. Finite field models in arithmetic combinatorics-ten years on. Finite Fields Appl. 2015;32:233–274. [Google Scholar]
