Abstract
An arc in is a set such that no three points of P are collinear. We use the method of hypergraph containers to prove several counting results for arcs. Let denote the family of all arcs in . Our main result is the bound
This matches, up to the factor hidden in the o(1) notation, the trivial lower bound that comes from considering all subsets of an arc of size q. We also give upper bounds for the number of arcs of a fixed (large) size. Let , and let denote the family of all arcs in with cardinality k. We prove that
This result improves a bound of Roche-Newton and Warren [12]. A nearly matching lower bound
follows by considering all subsets of size k of an arc of size q.
Keywords: Arcs, Hypergraph containers, Supersaturation
Introduction
Over a century ago, Dudeney [8] asked how many points can be placed in an grid such that no three are collinear. In 1951 Erdős (published by Roth [13]) showed that when is a prime number the set mod : contains no collinear triple, whereas a simple upper bound of follows from the pigeonhole principle. Hall et al. [9] subsequently improved Erdős’s lower bound, but despite receiving considerable attention, the problem remains open. In this paper we will consider a closely related problem in the finite field setting.
Let be the finite field of order for some prime p. An arc in is a subset of with no three points collinear. Let denote the family of all arcs in . One of the main goals of this paper is to provide bounds for the cardinality of . For context, observe that the set
| 1 |
is an arc of cardinality q. This is essentially the same as the Erdős construction mentioned above. Since each subset of an arc is also an arc, it immediately follows that
| 2 |
We prove the following upper bound, which almost matches (2).
Theorem 1.1
Let q be a prime power which is sufficiently large. There is an absolute constant C such that the set of all arcs in satisfies the bound
In particular, .
We also consider the set of all arcs of a fixed size k, with a focus on the case when k is large. The problem of bounding the size of was the main focus of a paper of the second author and Warren [12], and this paper is a sequel to [12].
For the case when , this question was largely settled in [12], where the bounds
| 3 |
were established. In (3), c and C are positive absolute constants. We interpret (3) as a statement that, for small k, a random set of k elements has a fairly high probability of being an arc, with this probability even tending towards 1 for .
Write . For , a significant change of behaviour in terms of the size of was observed in [12]. A better upper bound than that of (3) was established, see [12, Thm. 2]. However, it was not clear whether or not the upper bound given in [12] for this range was optimal.
In this paper, we give improved upper bounds for the size of when k is large.
Theorem 1.2
Let q be a prime power which is sufficiently large and let . Then
A more precise version of the statement of Theorem 1.2 which gives information about the value hidden in the o(1) term is given as Theorem 4.2. Theorem 1.2 is close to optimal; the lower bound
can be seen by considering all subsets of size k of the set C described in (1).
This paper follows a similar approach to that of [12]. In particular, the main tool is the method of hypergraph containers. The theory of hypergraph containers was developed independently by Balogh et al. [2] and Saxton and Thomason [14]. We defer the full statement of the container theorem we use until Sect. 2.1. Roughly speaking, it says that if a hypergraph has a reasonably good edge distribution, we can obtain strong information about where the independent sets of the hypergraph may be found.
In comparison with the predecessor paper [12], there are two main new ideas which allow us to break new ground.
To prove Theorem 1.1, we need a supersaturation lemma, namely Corollary 2.3, which gives good bounds for the number of arcs determined by sets with slightly more than q elements, and in particular sets of size when . See Sect. 2.2 for more background on supersaturation results and their interaction with the method of hypergraph containers.
As was the case in [12], we repeatedly apply the hypergraph container theorem to obtain a set of containers for the family of arcs . However, we observe that, as these containers get smaller, we can eventually make use of a non-trivial bound for the maximum co-degree . See Sect. 2.1 for the definition of this parameter. This results in better quantitative information about the set of containers, which in turn yields the improved bound stated in Theorem 1.2.
Recent Work of Chen, Liu, Nie and Zeng
Since the first version of this paper was published online, work of Chen, Liu, Nie and Zeng [7] has appeared which considers many of the problems studied in this paper and their extensions to the higher dimensional setting. In particular, they prove that there is an absolute constant c such that
holds for all integers . Although the result in [7] is weaker than Theorem 1.2 in the sense that there is a multiplicative constant on the top of the binomial coefficient, it has the significant advantage that it extends to consider almost all possible values of k, thus essentially closing the problem of counting arcs of a fixed size, up to the presence of the constant c. The same paper also closes a gap which was left by our forthcoming Theorem 4.3 concerning the typical size of the largest arc in a random set. The paper [7] uses similar methods to ours, with the augmentation of a so-called “balanced supersaturation" argument allowing for additional progress.
Containers and Supersaturation
Statement of the Container Theorem
The main tool of this paper is a container theorem for 3-uniform hypergraphs. The method of hypergraph containers has had a remarkable impact on extremal combinatorics in recent years (see for example [4] and [10]). This impact is also being felt in Additive Combinatorics (see [1] and [11]) and discrete geometry (see [5]). See [3] for a fairly recent survey of this topic.
Before stating the container theorem to be used, it is necessary to introduce some related quantities. Since we will only apply the container theorem for 3-uniform hypergraphs, we give all of the definitions we need only for this case. A more general form of the statement and definitions we need, adapted to k-uniform hypergraphs, can be found in [5], amongst other places.
For a 3-uniform hypergraph and , d(v) denotes the degree of v, i.e. the number of edges which contain v. Let denote the average degree of , so
| 4 |
We can also define the co-degree for a subset of vertices as
Using this definition we define the maximum co-degree as
More generally, one can define the parameter , which counts the maximum co-degree among all sets of cardinality t. However, since we only consider 3-uniform hypergraphs in this paper, it turns out that we only use this definition for the case .
For any , denotes the sub-hypergraph induced by .
We now state the container theorem we need, which is a special case of Corollary 3.6 in [14].
Theorem 2.1
Let be a 3-uniform hypergraph on n vertices, and let . Suppose that
| 5 |
Then there exists a set of subsets of V such that
-
i.
if is an independent set then there exists such that ;
-
ii.
for all ;
-
iii.
,
where c is an absolute constant (we can take ).
The set above is referred to as the set of containers, and a set is itself a container.
Supersaturation Results
In all applications of the method of hypergraph containers, it is necessary to have a supersaturation result. In general terms, a supersaturation lemma is a result that says that, once we have enough elements in our set to guarantee the existence of a certain substructure, we quickly generate many copies of the substructure in question. In our case, this means that we need to show that sets in with significantly more than q elements must contain many collinear triples.
Given a set , let T(P) denote the number of collinear triples in P. That is,
It was first proven by Bose [6] that, for q even, any point-set of size in contains a collinear triple, and, for q odd, any point-set of size in contains a collinear triple. This makes the construction (1) almost best possible. In [12], an application of the Cauchy–Schwarz inequality was used to prove that, for an unspecified absolute constant ,
| 6 |
This statement is optimal up to the multiplicative constant. This can be seen by taking a p-random subset of for some p with order of growth at least 1/q. However, (6) does not say anything about what happens in the range when , i.e. when P is only slightly larger than the threshold for guaranteeing at least one arc. In order to prove Theorem 1.1, a good supersaturation result for this range is required. We prove the following rather general supersaturation result for arcs, which includes information about this range.
Lemma 2.2
Let and let x be an integer satisfying . Let with . Then
Recall that existence of a collinear triple in a set of size of follows from the result of Bose [6]. From Lemma 2.2 we can say a set of size of for will contain more than collinear triples.
Before proving Lemma 2.2, it is convenient to record two corollaries that will be used later.
Corollary 2.3
Let q be a prime power and let . Suppose such that
Then .
Proof
We will prove the contrapositive form of the statement; assuming that , we prove that .
Since , Lemma 2.2 can be applied with , after passing to a subset if necessary. We have
and thus . The latter inequality follows from the assumption that . Lemma 2.2 then gives
as required.
Lemma 2.2 can be used to reprove (6). We state the following version, with a concrete multiplicative constant.
Corollary 2.4
Let be a prime power and let with . Then
Proof
Write with . By Lemma 2.2,
The final inequality uses the assumption that .
We remark here that, if we take a closer inspection of the last step in the previous proof, we can carry out a small case analysis to check that the condition is not really needed. However, since we will need q to be sufficiently large in several other steps of our forthcoming arguments, we omit the details of this calculation. Moreover, the constant in Corollary 2.4 can be improved further, at the expense of some slightly more involved calculations that we prefer to skip.
We now turn towards the proof of Lemma 2.2. The proof uses Karamata’s inequality. For two finite non-increasing sequences of real numbers and , we say that majorizes if
| 7 |
and
| 8 |
Lemma 2.5
(Karamata’s Inequalty) Let be a convex function defined on an interval I. Suppose that and are non-increasing sequences in I such that majorizes . Then
Proof of Lemma 2.2
For any point in ; there are exactly lines passing through it. Let the set of lines passing through be . We label the lines in according the number of elements of P they contain, in non-increasing order. That is, we write
such that
We assign weights to points in in a way such that the sum of weight of all the points in will be T(P). For a point in assign a weight to it as follows:
| 9 |
where for all . Note that W(v) counts one third of the number of collinear triples in P which contain v. Therefore, assigning weights in this fashion satisfies the required property; that is,
| 10 |
Also observe that for any ,
| 11 |
We will state and prove the following claim in order to finish the proof.
Claim 2.6
For any point in ,
Proof of Claim
We observed in (11) that the sum is fixed, for all . It’s a natural intuition from (9) that the quantity reaches its minimum when the terms are almost equal, in other words for all . We will prove that this intuition is indeed true via an application of Karamata’s inequality.
The function is convex, and the non-increasing sequence consisting of occurrences of and occurrences of is majorized by every other non-increasing sequence of integers of length and total sum . It therefore follows from Karamata’s Inequality that
This finishes the proof of the Claim 2.6.
Alternatively, instead of using Karamata’s inequality Claim 2.6 can also be proved by using the fact that
for all .
Now applying Claim 2.6 in (10) we conclude that
Hence Lemma 2.2 is proved.
Container Lemmas for Arcs
Basic Properties of the Graph Encoding Triples
Define a 3-uniform hypergraph with vertices corresponding to points in , with three points forming a hyperedge if they are collinear. Note that the number of edges in this graph is
In this section, we will make repeated applications of the container theorem for this graph and its induced subgraphs until we obtain a family of containers for arcs in (i.e. a family of subsets of with the property that, for any arc , there exists such that ) with the properties we need. Before starting this iterative process, we collect a few inequalities that will be used repeatedly in the proofs of the forthcoming three lemmas.
A reformulation of Corollary 2.4 states that, for any ,
| 12 |
It therefore follows from the definition (4) that
| 13 |
We will need to bound the quantity to apply Theorem 2.1. A first observation is that, for any ,
| 14 |
Indeed, given a pair of points in the plane, the number of points in which are collinear with the given pair is , and (14) follows.
A better bound for is available when T(A) is smaller. We have
| 15 |
Indeed,
A line containing M elements of P gives rise to collinear triples, and so
A rearrangement of this inequality gives (15).
A First Container Lemma for Arcs
The argument for our first container lemma largely follows that of [12, Lem. 2]. The only difference in what follows is that we keep track of the number of collinear triples determined by the containers, rather than their size.
Lemma 3.1
Suppose that q is a sufficiently large prime power. Then there exists a family of subsets of such that
,
For all , ,
For every arc , there exists such that .
In the first point above, K is an absolute constant.
Proof
In this proof, we will iteratively apply Theorem 2.1 to subsets of . The structure of the argument is similar to that of the proof of the main result in [5]. We begin by applying it to the graph encoding collinear triples, which we defined at the beginning of this section. Note that independent sets in this hypergraph are the same thing as arcs in . As a result, we obtain a set of containers. We iterate by considering each . If A contains too many collinear triples, then we apply Theorem 2.1 to the graph to get a family of containers . If the number of collinear triples in A is sufficiently small then we put this A into a final set of containers (or to put it another way, we write ).
Repeating this for all we obtain a new set of containers
Note that is a container set for . Indeed, suppose that X is an independent set in . Then there is some such that . Also, X is an independent set in the hypergraph , which implies that for some .
We then repeat this process, defining
By choosing the values of and appropriately, we can ensure that after relatively few steps we have for all of the sets . We then declare . It turns out that, because of m being reasonably small, is also fairly small.
Fig. 1.

An illustration of the beginning of the construction of the set of containers , which can be viewed as a tree. For each , we check the size of , and if it is still too large we apply the container theorem again. If the set contains few enough collinear triples, we stop the process. This is what happens with the set in the diagram. The final set of containers consists of the leaves of the tree
Now we give more precise details of how to run this argument. Let , with , and write . If then we add A to the final set of containers . Otherwise, we apply Theorem 2.1 to . We choose the parameters and to be
| 16 |
In order for this application of Theorem 2.1 to be legitimate, we need to make some calculations involving d(H[A]) and , and check that the conditions of Theorem 2.1 are satisfied. The main challenge is to verify that (5) holds.
Since we have , it follows from (13) that
| 17 |
Applying (17) and (14) and then using the two components of the definition of , it follows that
We have therefore verified that (5) holds. Since , the condition that is guaranteed to hold as long as we choose q to be sufficiently large. The condition is immediate.
Theorem 2.1 therefore gives the bound
| 18 |
for some absolute constant c (which may be different from the absolute constant c in the statement of Theorem 2.1). Applying (12) and the fact that , we see that
| 19 |
It then follows from (19), (18), the definition of , and the upper bound , that
for some absolute constant .
Since each step of this process reduces the number of edges in the containers by a factor of 3, it follows that the process will terminate after at most steps. The final set therefore contains at most
elements, for some absolute constant K.
A Second Container Lemma for Arcs
We use Lemma 3.1 as a basis for the following improved result, in which we reduce the number of edges further without paying too serious a price. The main quantitative cost of this reduction is an increase in the size of the final set of containers, and this increase depends on the new error-term parameter . The proof of the following lemma again uses the supersaturation results established in Sect. 2, and this time we use both Corollaries 2.3 and 2.4.
Lemma 3.2
Suppose that q is a sufficiently large prime power. There exist positive absolute constants c and such that, for any in the range , following statement is true. There exists a family of subsets of such that
,
For all , ,
For every arc , there exists such that .
Proof
Let be the set of containers given by Lemma 3.1, and fix . Similar to the proof of Lemma 3.1, we will iteratively apply Theorem 2.1 to obtain a family of containers whose components contain fewer collinear triples.
Let A be a container obtained during this iterative process and write . If then we add A to the final set of containers . Otherwise, we will apply Theorem 2.1 to to obtain a family of containers for .
The main difference between this proof and that of Lemma 3.1 is that we bound using (15). Since , this bound is better than the bound (14) that we used in the proof of Lemma 3.1. In particular, we have
| 20 |
Using (12), along with the fact that and , yields
| 21 |
It therefore follows that
| 22 |
We choose the parameters and to be
| 23 |
We again need to check that (5) holds. Indeed, by (20), (22), and the definition of ,
We have therefore verified that (5) holds. We also need to ensure that the condition holds. This follows from the assumption that , along with the fact that , provided that the absolute constant c is chosen to be sufficiently large. Indeed,
Theorem 2.1 can therefore be legitimately applied, and it gives the bound
| 24 |
for some absolute constant . Applying (21) and the definition of , one obtains the bound
where C is an absolute constant. Combining this with (24) gives
with a new absolute constant .
Since each step of this process reduces the number of collinear triples in the containers by a factor of 3, it follows that the process will comfortably terminate after at most steps. We also need to take into account that we already started with containers, and that we apply this process for each of the elements of . Therefore, the final set contains at most
elements, where is an absolute constant.
It remains to check that for all . This follows immediately from Corollary 2.3.
Counting Arcs
We are now ready to prove the main results of the paper. We start with Theorem 1.1, which is restated below for convenience.
Theorem 4.1
There exists an absolute constant C such that, for all q sufficiently large,
Proof
Let denote the set of containers given by Lemma 3.2, applied with . All of the sets in are subsets of some . Since , it follows that
The result holds, with .
Now we prove Theorem 1.2, which we will derive from the following more general statement.
Theorem 4.2
Let q be a sufficiently large prime power and let C is a sufficiently large absolute constant. Let satisfy and define . Then
Proof
Apply Lemma 3.2 with this choice of . The assumption that is sufficient to ensure that the condition on in Lemma 3.2 is satisfied. All of the sets in are subsets of some . Since , it follows that
The final inequality follows from the lower bound on k in the statement of the theorem and the assumption that C is sufficiently large (one may take ).
Proof of Theorem 1.2
If we now apply Theorem 4.2 with the slightly stronger lower bound , we see that , thus proving Theorem 1.2
Finally, we consider the size of the largest arc contained in a random point set, improving a result from [12]. Given , let a(P) denote the size of the largest arc such that . Let be a random subset of with the events being independent with probability . We say that is a p-random set.
Theorem 4.3
Suppose that , where is the constant from Lemma 3.2, and let be a p-random set. Then
By contrast, it was established in [12] that, with high probability, . Combining this observation with Theorem 4.3, we see that a random set with p relatively large is very likely to have .
Other ranges of p for this problem were also considered in [12], and a near-optimal bound for was proven. The problem of determining the behaviour of in the range was very recently settled by Chen, Liu, Nie and Zeng [7].
Proof of Theorem 4.3
Fix p in the given range and let . Apply Lemma 3.2 with to obtain a family of containers . For q sufficiently large, the probability that contains an arc of size m is at most
This is because an arc of size m must be contained in some , and each subset of size m belongs to the random subset with probability . Applying the bounds from Lemma 3.2, as well as the bound , it follows that
The second from last of these inequalities is an application of the inequality , which follows from the lower bound on p assumed in the statement of the theorem. It follows that
Acknowledgements
The authors were supported by the Austrian Science Fund FWF Project P 34180. Part of this work was carried out during the Focused Research Workshop “Testing Additive Structure", which was supported by the Heilbronn Institute for Mathematical Research. We are grateful to Sam Mattheus, Cosmin Pohoata, Audie Warren and Adam Zsolt Wagner for helpful discussions. Finally, we are particular thankful to the anonymous referees, who gave many comments, corrections and suggestions which have improved this paper.
Funding
Open access funding provided by Johannes Kepler University Linz.
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
-
1.Balogh, J., Liu, H., Sharifzadeh, M.: The number of subsets of integers with no
-term arithmetic progression. Int. Math. Res. Not. IMRN, no. 20, 6168–6186
- 2.Balogh, J., Morris, R., Samotij, W.: Independent sets in hypergraphs. J. Am. Math. Soc. 28(3), 669–709 (2015) [Google Scholar]
- 3.Balogh, J., Morris, R., Samotij, W.: The method of hypergraph containers. In: Proceedings of the International Congress of Mathematicians-Rio de Janeiro 2018. Vol. IV. Invited lectures, pp. 3059–3092 (2018)
-
4.Balogh, J., Samotij, W.: The number of
-free graphs. J. Lond. Math. Soc. 83, 368–388 (2011) [Google Scholar] - 5.Balogh, J., Solymosi, J.: On the number of points in general position in the plane. Discrete Anal., Paper No. 16, 20 pp
- 6.Bose, R.C.: Mathematical theory of the symmetrical factorial design. Sankhya 8, 107–166 (1947) [Google Scholar]
- 7.Chen, Y., Liu, X., Nie, J., Zeng, J.: Random Turán and counting results for general position sets over finite fields. arXiv:2309.07744
- 8.Dudeney, H.E.: 317. A puzzle with pawns. Amusements Math. 94, 222 (1917) [Google Scholar]
- 9.Hall, R.R., Jackson, T.H., Sudbery, A., Wild, K.: Some advances in the no-three-in-line problem. J. Comb. Theory Ser. A 18, 336–341 (1975) [Google Scholar]
-
10.Morris, R., Saxton, D.: The number of
-free graphs. Adv. Math. 298, 534–580 (2016) [Google Scholar] - 11.Pohoata, C., Roche-Newton, O.: Four-term progression free sets with three-term progressions in all large subsets. Random Struct. Algorithms 60(4), 749–770 (2022) [Google Scholar]
-
12.Roche-Newton, O., Warren, A.: Arcs in
. Eur. J. Comb. 103, Paper No. 103512, 15 pp (2022)
- 13.Roth, K.F.: On a problem of Heilbronn. J. Lond. Math. Soc. 26(2), 198–204 (1951) [Google Scholar]
- 14.Saxton, D., Thomason, A.: Hypergraph containers. Invent. Math. 201(3), 925–992 (2015) [Google Scholar]
