Hostname: page-component-cd9895bd7-7cvxr Total loading time: 0 Render date: 2024-12-26T18:36:00.209Z Has data issue: false hasContentIssue false

Random growth via gradient flow aggregation

Published online by Cambridge University Press:  25 November 2024

Stefan Steinerberger*
Affiliation:
University of Washington
*
*Postal address: Department of Mathematics, University of Washington, Seattle, WA 98195, USA. Email: steinerb@uw.edu
Rights & Permissions [Opens in a new window]

Abstract

We introduce gradient flow aggregation, a random growth model. Given existing particles $\{x_1,\ldots,x_n\} \subset \mathbb{R}^2$, a new particle arrives from a random direction at $\infty$ and flows in direction of the vector field $\nabla E$ where $ E(x) = \sum_{i=1}^{n}{1}/{\|x-x_i\|^{\alpha}}$, $0 < \alpha < \infty$. The case $\alpha = 0$ refers to the logarithmic energy ${-}\sum\log\|x-x_i\|$. Particles stop once they are at distance 1 from one of the existing particles, at which point they are added to the set and remain fixed for all time. We prove, under a non-degeneracy assumption, a Beurling-type estimate which, via Kesten’s method, can be used to deduce sub-ballistic growth for $0 \leq \alpha < 1$, $\text{diam}(\{x_1,\ldots,x_n\}) \leq c_{\alpha} \cdot n^{({3 \alpha +1})/({2\alpha + 2})}$. This is optimal when $\alpha = 0$. The case $\alpha = 0$ leads to a ‘round’ full-dimensional tree. The larger the value of $\alpha$, the sparser the tree. Some instances of the higher-dimensional setting are also discussed.

Type
Original Article
Copyright
© The Author(s), 2024. Published by Cambridge University Press on behalf of Applied Probability Trust

1. Introduction and results

1.1. Aggregation models

The purpose of this paper is to introduce a model describing random growth of particles aggregating in the plane. We will almost exclusively work in $\mathbb{R}^2$ and we will consider particles to be disks of radius $\frac{1}{2}$ (which are then often identified with their center).

Perhaps the most celebrated random growth model is diffusion limited aggregation (DLA) introduced in [Reference Witten and Sander24] in 1981. DLA models random growth by assuming that a particle coming from ‘infinity’ performs a random walk until it touches an already existing particle for the first time, at which point it gets stuck for all time. The model is frequently considered on the lattice $\mathbb{Z}^d$ where random walks are particularly easy to define. The main results [Reference Kesten8, Reference Kesten9, Reference Kesten10] proved that the diameter of a cluster of n particles under DLA on the lattice $\mathbb{Z}^2$ is $\lesssim n^{2/3}$ (along with generalizations to $\mathbb{Z}^d$ ). No non-trivial lower bound (better than $\gtrsim n^{1/2}$ ) is known. Many other models have been proposed; we specifically mention the Eden model [Reference Eden4], the Vold–Sutherland model [Reference Sutherland21, Reference Vold23], the dielectric breakdown model [Reference Losev and Smirnov13, Reference Niemeyer, Pietronero and Wiesmann15], and the Hastings–Levitov model [Reference Berger, Procaccia and Turner3, Reference Hastings and Levitov7, Reference Liddle and Turner12, Reference Norris, Silvestri and Turner16, Reference Norris, Silvestri and Turner17, Reference Norris and Turner18, Reference Silvestri20]. They have received a lot of attention because of the intricate emerging complexity and the simplicity of the setup; nonetheless, there are relatively few rigorous results. In particular, DLA remains ‘notoriously resilient to rigorous analysis’ [Reference Benjamini and Yadin2]. We were motivated by the question of whether the underlying randomness (a Brownian walker coming from infinity) could be replaced by more elementary ingredients and whether this would lead to a model where the corresponding theory would also simplify (and thus, hopefully, allow for further insight). This motivated GFA, which seems to exhibit emergent complexity similar to that of many of the previous models. Additionally, it admits an analogous theory (a Beurling-style estimate leading to a growth bound via Kesten’s method) where each proof requires only ‘elementary’ real analysis.

Figure 1. A tree grown with $\alpha = 2$ and the trajectory of 100 incoming gradient flows (equispaced in angle). There is a tendency to avoid existing valleys.

1.2. Gradient flow aggregation

We start with a single particle $x_1 \in \mathbb{R}^2$ . If we already have $\{x_1,\ldots,x_n\} \subset \mathbb{R}^2$ , then a new particle $x_{n+1}$ is created randomly at ‘infinity’ (uniformly over all directions, made precise below) and flows in the direction of $\nabla E$ (gradient ascent), where

\begin{align*}E(x) = \sum_{i=1}^{n}\frac{1}{\|x - x_i\|^{\alpha}}, \quad 0 < \alpha < \infty.\end{align*}

The particle flows until it reaches distance 1 from one of the existing points for the first time; at that point, the flow stops and the particle then remains at that location for all time (see Figure 1). There is a natural interpretation of the model where new particles are drawn to the existing particles via a ‘field’ generated by the existing particles. The only randomness in the model is the starting angle at infinity of the new particle (a single uniformly distributed random variable on $[0,2\pi]$ ). In particular, the model requires less ‘random input’ (a single random variable) than DLA (where an entire Brownian path is required). There are two obvious modifications of the energy E at the endpoints $\alpha\in\{0,\infty\}$ . We define

\begin{equation*} E(x) = \begin{cases} \displaystyle\sum_{i=1}^{n}\log\bigg(\frac{1}{\|x-x_i\|}\bigg) \quad & \text{when}\,\alpha = 0, \\ \max_{1 \leq i \leq n}\dfrac{1}{\|x-x_i\|} & \text{when}\,\alpha =\infty. \end{cases}\end{equation*}

It remains to consider how to proceed with critical points $\nabla E(x^*) = 0$ which may have the effect of trapping a gradient flow. In practice, this is not actually an issue. We work, for the remainder of the paper, under a non-degeneracy assumption.

Definition 1. We say that $\{x_1,\ldots,x_n\} \subset \mathbb{R}^2$ satisfies property (P) if $E(x) = \sum_{i=1}^{n} \|x-x_i\|^{-\alpha}$ has a finite number of critical points.

A result of Gauss implies that when $\alpha = 0$ , then every set has property (P). It is widely assumed to be true in general (a version of this question is known as ‘Maxwell’s problem’). Since it stands to reason that it is either true for all points or, at very least, typically true (in the sense of a hypothetical exceptional set being rare), it is a very weak assumption; we comment more on this in Section 2.1.

It remains to give a precise definition of what it means for a new particle to appear ‘randomly at infinity’. Fix an existing particle $x_i$ and an interval $J \subset \big\{y\,:\,\|x_i-y\| = \frac{1}{2}\big\}$ . For any finite $r \gg 1$ , we can consider points at distance $\|x\| = r$ from the origin (with uniform probability) and ask for the likelihood $p_J(r)$ that gradient flow started in the random point reaches distance 1 from $x_i$ , this is the first time the gradient flow is distance 1 from any of the points in the existing set, and the disks touch in J. The limit as $r \rightarrow \infty$ exists; the simplest statement one can make in this direction is as follows.

Theorem 1. (GFA exists.) Assuming property (P), the limit $\lim_{r \rightarrow \infty} p_J(r)$ exists.

The role of $\alpha$ (illustrated in Figures 2, 3, and 4) is interesting. Larger values of $\alpha$ lead to a stronger pull by nearby particles. Philosophically, it is similar to the parameter $\eta$ in the dielectric breakdown model (DBM). Newly incoming particles are being pulled by the existing particles with a force determined by distance: particles that are closer exert a stronger pull. Larger values of $\alpha$ lead to sparser trees.

Figure 2. Simulation of gradient flow aggregation for $n=1000$ particles with various parameters of $\alpha$ .

Figure 3. A fixed cluster of 40 particles. Gradient descent lines are colored by the color of the particle they eventually attach to (particles colored black were not hit by any gradient descent in the simulation). Likelihoods are determined by gradient flow lines at infinity. As $\alpha$ increases, the exposed endpoints gain more mass.

1.3. The Beurling estimate

The crucial estimate in the theory of DLA is Beurling’s estimate for harmonic measure which provides a uniform upper bound for the likelihood of a new incoming particle attaching itself to any fixed particle. We prove an analogous result in the setting of GFA for $0 \leq \alpha \leq 1$ .

Theorem 2. (Beurling estimate.) Let $0 \leq \alpha \leq 1$ and suppose $\{x_1,\ldots,x_n\} \subset \mathbb{R}^2$ satisfies property (P). Then, for some $c_{\alpha}>0$ depending only on $\alpha$ ,

\begin{align*}\max_{1 \leq i \leq n}\mathbb{P}(\textit{new particle hits}\,x_i) \leq c_{\alpha} \cdot n^{({\alpha-1})/({2\alpha+2})}.\end{align*}

The result is optimal for $\alpha = 0$ where it gives rate $\lesssim n^{-1/2}$ . The estimate gives non-trivial results all the way up to $\alpha =1$ . There is no reason to assume the exponent to be optimal; we comment on this more extensively after the proof.

1.4. The growth estimate

Beurling-type estimates can be used to derive growth bounds on the diameter. Originally due to [Reference Kesten8, Reference Kesten9, Reference Kesten10] (‘Kesten’s method’), this is now well understood: we adapt a robust and very general framework developed for DLA on graphs [Reference Benjamini and Yadin2].

Figure 4. GFA for $n=5000$ particles with $\alpha=0$ (left) and $n=2500$ particles with $\alpha =2$ . Theorem 3 implies that the structure on the left grows as slowly as possible, $\text{diam}(x_1, \ldots, x_n) \leq c \sqrt{n}$ .

Theorem 3. (Growth bound) Let $0 \leq \alpha \leq 1$ , and let $(x_k)_{k=1}^{\infty}$ be obtained via GFA and satisfy property (P) throughout. Then, with high probability,

\begin{align*}\mathrm{diam}\{x_1,\ldots,x_n\} \leq c_{\alpha} \cdot n^{({3 \alpha +1})/({2\alpha + 2})}.\end{align*}

This is optimal for $\alpha = 0$ : any set of n points in $\mathbb{R}^2$ satisfying $\|x_i - x_j\| \geq 1$ has diameter at least $c \cdot n^{1/2}$ . Thus, GFA with $\alpha = 0$ grows at the slowest possible rate (see Figure 4) and GFA with $0 < \alpha \ll 1$ small behaves similarly. Kesten’s method allows us to translate Beurling estimates into growth estimates, but it’s clear that the argument is inherently lossy; this is the case for DLA and it is equally the case for GFA. It would be very desirable if the simplified framework of GFA were to simplify the investigation of new arguments that might possibly break this barrier.

1.5. The case $\alpha \rightarrow \infty$

One nice aspect of GFA is that the case of $\alpha = \infty$ is particularly easy to describe and simulate (see Figure 5). It should be related, in spirit, to the behavior of DBM for a suitable $\eta$ (maybe; see [Reference Hastings6], with a value of $\eta$ close to 4?). We first describe the model. There is a natural limit in that

\begin{align*}\lim_{\alpha\rightarrow\infty}\Bigg(\sum_{i=1}^{n}\frac{1}{\|x - x_i\|^{\alpha}}\Bigg)^{{1}/{\alpha}} =\max_{1 \leq i \leq n}\frac{1}{\|x-x_i\|}.\end{align*}

This suggests a particularly natural limiting process: given $\{x_1,\ldots,x_n\} \subset \mathbb{R}^2$ , create the next point $x_{n+1}$ by adding a ‘random point at infinity’ (in the same way as above) and then moving it along $\nabla E$ . In this case, the gradient $\nabla E$ (which is defined almost everywhere) is simply going to point to the nearest particle in the set. It is not difficult to see that this never changes along the entire flow. This means the particle follows a straight line until the corresponding disks touch and the process stops. The geometry in this case becomes a lot simpler to analyze; in particular, new particles only attach themselves to the convex hull of the existing particles (and the likelihoods can be computed in terms of opening angles).

Figure 5. $n=100$ (left), $n=500$ (middle), and $n=2000$ (right) steps of the evolution of GFA with an $\alpha = \infty$ tree.

Proposition 1. $x_{n+1}$ can only attach to particles in the convex hull of $\{x_1,\ldots,x_{n}\}$ . If $x_k$ is a particle in the complex hull with (interior) opening angle $\alpha$ then

\begin{align*}\mathbb{P}(x_{n+1}\,\textit{attaches itself to}\,x_k) = ({\pi - \alpha})/{2\pi}.\end{align*}

See Figure 6 for an illustration. We note that this also means that the simulation of GFA for $\alpha = \infty$ is numerically simple: it suffices to keep track of the convex hull of the existing set of points, add $x_{n+1}$ , and then recompute the new convex hull. Since the cardinality of the convex hull seems to be, asymptotically, bounded, computation of $x_{n+1}$ can be as cheap as $\mathcal{O}(1)$ . This allows for the construction of trees with a large number of points.

Figure 6. The likelihood of a new particle attaching itself to an existing particle $x_k$ on the convex hull is $\beta/(2\pi) = (\pi - \alpha)/(2\pi)$ .

This appears to be an interesting dynamical system. The growth of its diameter is approximately inversely proportional to the number of points that constitute its convex hull. It is intuitively clear that the dynamical system does not favor a large number of points in the convex hull, which then suggests ballistic growth. The reason for the small number of points in the convex hull is, heuristically, the following: one of the existing points in the convex hull is bound to have a smallest angle and will then attract the largest growth. If all opening angles are the same, then random fluctuations will naturally lead to some of them being a little bit larger than others and then the natural drift sets in. In examples, see Figure 7, we tend to observe a natural tendency to shapes that appear something like three or four long branches that meet roughly at an equal angle. However, this does not seem to be a stable configuration and we sometimes observe shorter segments trying to branch off (and sometimes succeeding). It could be an interesting avenue for further research to understand the case $\alpha = \infty$ better.

Figure 7. Starting with the 10 vertices of a regular polygon at distance 100 from the origin, the evolution after 1000, 5000, 15 000, and 25 000 steps, respectively.

1.6. What to expect in higher dimensions

Some of our arguments employ the particularly simple geometry/topology in two dimensions but it seems conceivable that many of the arguments extend to higher dimensions. We note that there is one particularly simple case, the case of

\begin{align*}E(x) = \sum_{i=1}^{n}\frac{1}{\|x-x_i\|^{d-2}}\end{align*}

in $\mathbb{R}^d$ with $d \geq 3$ . The main reason is, unsurprisingly, that E is now a harmonic function and $\Delta E = 0$ . This allows us to use Lemma 2 almost verbatim while avoiding the use of Lemma 3 (whose generalization to higher dimensions seems non-trivial).

Theorem 4. Consider GFA in $\mathbb{R}^d$ , $d \geq 3$ , and $\alpha = d-2$ . Given $\{x_1,\ldots,x_n\} \subset \mathbb{R}^d$ ,

\begin{align*}\max_{1 \leq i \leq n} \mathbb{P}(\textit{new particle hits}\,x_i) \leq c_{d} \cdot n^{({1}/{d}) - 1}\end{align*}

and $\mathrm{diam}\{x_1,\ldots,x_n\} \leq c_{d} \cdot n^{({d-1})/{d}}$ .

We note that both estimates are clearly optimal. For any aggregation scheme, no Beurling estimate can be better than $n^{({1}/{d}) - 1}$ and this can be seen as follows: consider n points arranged in some lattice structure so that only ${\sim}n^{(d-1)/d}$ are exposed as boundary points; by the pigeonhole principle at least one of them is hit with likelihood $n^{-(d-1)/d}$ . Likewise, any set of n points that are 1-separated has diameter at least ${\sim}n^{(d-1)/d}$ . Theorem 4 suggests a phase transition when $\alpha = d-2$ . We could therefore be led to expect GFA to produce ball-like shapes when $\alpha \leq d-2$ and to do something else entirely when $\alpha > d-2$ .

1.7. Problems

These results suggest a large number of questions. We list some and note that there are many others.

  1. (i) We know the growth rate for the logarithmic case $\alpha =0$ which grows at the smallest possible rate. Is the limiting shape a disk?

  2. (ii) What is the sharpest possible Beurling estimate for GFA with $0 \leq \alpha < \infty$ ? Is the extremal set a set of n points on a line?

  3. (iii) How does the growth rate of GFA depend on $\alpha$ ? This is likely very difficult, and already heuristic/numerical guesses might be of interest. We know that, for $\alpha$ close to 0, the rate is $\leq n^{\beta}$ with $\frac{1}{2} \leq \beta \leq \frac{1}{2} + \alpha$ .

  4. (iv) Related to the previous question, are there fast numerical methods to simulate GFA? The fast multipole method appears to be a natural candidate.

  5. (v) Is the growth rate sub-ballistic for all $1 \leq \alpha < \infty$ ? Or is there a threshold parameter $0 < \alpha_0 < \infty$ such that GFA becomes ballistic for $\alpha \geq \alpha_0$ ?

  6. (vi) Is it possible to prove a ballistic growth rate when $\alpha = \infty$ ? Numerically, this seems to be obviously the case. Do initial conditions matter? More generally, what can be proven about GFA when $\alpha = \infty$ ?

  7. (vii) There is a notion that DBM and Hastings–Levitov might belong to the same universality class. Does GFA fit into the picture or is it something else entirely? Is there a way in which GFA with parameter $\alpha$ behaves like DBM with parameter $\eta = \eta(\alpha)$ ? DBM seems to undergo a phase transition [Reference Hastings6] for $\eta = 4$ ; does a similar phase transition happen for GFA?

  8. (viii) Natural aspects of interest when studying GFA might be the evolution of E(x) and $\nabla E(x)$ , and the distribution of the critical points. Is there anything rigorous that can be said about these objects?

  9. (ix) What happens if the potential $\phi(\|x-x_i\|)=\|x-x_i\|^{-\alpha}$ is replaced with some other monotonically decreasing and convex radial function? As long as there is control on the sign of the Laplacian of the radial function, some of our arguments might generalize. Is there a universality phenomenon in the sense that the evolution depends only weakly on $\phi$ ?

  10. (x) We could naturally consider all these questions in dimension $d \geq 3$ , and many of our arguments will generalize (see also Section 1.6). As a tendency, things do not become easier in higher dimensions. However, it appears as if the simulation of the $\alpha = \infty$ tree remains equally simple in higher dimensions (see Figure 8) and it would be of interest to see what can be said.

Figure 8. Four examples of GFA with $\alpha=\infty$ in $\mathbb{R}^3$ : the first 10 000 points. We see a tendency towards five tentacles.

2. Proof of Theorem 1

This section is structured as follows. Section 2.1 details what is known about the set of critical points of E(x) and discusses the role of property (P). Section 2.2 presents a quick argument that a finite number of critical points cannot have a big impact on the gradient flows (a more general version of the statement follows as a by-product from our main argument and is later given as Lemma 4 in Section 3.4). Section 2.3 shows that the set of gradient flows that end up touching a particular particle has to be topologically simple. Section 2.4 summarizes an important trick, and Section 2.5 gives a proof of Theorem 1.

2.1. The critical set

We discuss the set of critical points of $E(x) = \sum_{i=1}^{n}{1}/{\|x-x_i\|^{\alpha}}$ , which is the set of points x for which $\nabla E(x) = 0$ . It can also be written as

\begin{align*}\text{set of critical points} = \Bigg\{x\in\mathbb{R}^2\,:\,\sum_{i=1}^{n}\frac{x-x_i}{\|x-x_i\|^{\alpha+2}} = 0\Bigg\}.\end{align*}

The case $\alpha = 0$ , corresponding to logarithmic energy, is completely understood: when $\alpha = 0$ , then critical points can be written as the derivative of a complex-valued polynomial and there are at most $n-1$ distinct critical points because a complex polynomial of degree $n-1$ has exactly $n-1$ roots (this observation is sometimes attributed to Gauss). Note that this argument is restricted to $\mathbb{R}^2$ .

Let us now consider the case $\alpha > 1$ . As was only recently rediscovered [Reference Gabrielov, Novikov and Shapiro5], a particular special case dates back to James Clerk Maxwell [Reference Maxwell14, Section 113] who claimed that, for $\{x_1,\ldots,x_n\} \subset \mathbb{R}^3$ ,

\begin{align*}\#\,\text{critical points of} \ E(x) = \sum_{i=1}^{n} \frac{1}{\|x-x_i\|} \leq (n-1)^2.\end{align*}

J. J. Thomson, when proofreading the book in 1891, added the remark ‘I have not been able to find any place where this result is proved.’ The problem is already interesting when $n=3$ : the problem is then really two-dimensional since critical points can only occur in the convex hull of the three points. The question is thus whether any set of $n=3$ points in $\mathbb{R}^2$ leads to E(x) with $\alpha = 1$ having at most four critical points. This is true, and a rather non-trivial 2015 result [Reference Tsai22]. For general n, it is not even known whether the critical set is always finite, not even in the plane (a situation accurately described in [Reference Silvestri20] as ‘very irritating’). Under a non-degeneracy assumption, [Reference Killian11] (improving on [Reference Gabrielov, Novikov and Shapiro5]) showed that n points in $\mathbb{R}^2$ generate at most $2^{2n-2} (3n-2)^2$ critical points. We refer to [Reference Gabrielov, Novikov and Shapiro5], a recent result [Reference Zolotov25], and references therein for more details.

We note that our setting suggests further relaxations of property (P). For example, suppose the set $\{x_1,\ldots,x_n\}$ fails to have property (P). Then it appears to be quite conceivable that the set of points x with the property that $\{x_1,\ldots,x_{n-1},x\}$ fails to have property (P) is necessarily small (say, measure 0). Any failure of property (P) should be very delicate. A result like this might not be out of reach and would suffice to show that property (P) is automatically satisfied along GFA with probability 1. It is also conceivable that a more measure-geometric analysis of the set of critical points may lead to another relaxation. We have not pursued these possibilities at this time; they may be interesting future directions.

2.2. Gradient flows and critical points

We argue that a fixed critical point does not capture a positive mass of gradient flows. We give two different arguments, one based on stable/unstable manifolds that works for $\alpha > 0$ , and a second argument in the case of $\alpha = 0$ . The second argument is more robust and can also be made to work in the case $\alpha > 0$ where it can be used to produce a result of independent interest which we postpone to Lemma 4 in Section 3.4 (since it relies on some other ideas that have yet to be introduced). Recall that

\begin{align*}\nabla E(x) = \alpha\sum_{j=1}^{n}\frac{x_j - x}{\|x_j - x\|^{\alpha+2}}, \qquad \Delta E(x) = \alpha^2\sum_{j=1}^{n}\frac{1}{\|x - x_j\|^{\alpha+2}} \geq 0.\end{align*}

Assume that $\nabla E(x^*) = 0$ . Since $\alpha > 0$ , we have $\Delta E(x^*) > 0$ and can perform a local Taylor expansion of the energy around $x^*$ , $E(x^* + y) = E(x^*) + \frac{1}{2}\langle y, ((D^2 E)(x^*))(y)\rangle + \mathcal{O}(\|y\|^3)$ . Since $\text{tr}((D^2 E)(x^*)) = \Delta E(x^*) > 0$ , we know that the symmetric $2 \times 2$ matrix $(D^2 E)(x^*)$ has at least one positive eigenvalue (since the sum of the two eigenvalues is positive). If it has two positive eigenvalues, then E has a strict local minimum in $x^*$ . This means that the gradient flow started at $\infty$ can never reach $x^*$ ; here, it is important that we are always considering gradient ascent which necessarily avoids local minima (gradient descent very well might end up in a local minimum). This means that the only relevant remaining case is when $((D^2 E)(x^*))$ has one positive and one non-positive eigenvalue. After a change of variables, we may assume without loss of generality (after a translation) that $x^* = 0$ and (after a rotation) that the eigenvectors of $((D^2 E)(x^*))$ are given by (1,0) and (0, 1). Then

\begin{align*}E(y) = E(0) + \frac{\lambda_1}{2} y_1^2 + \frac{\lambda_2}{2} y_2^2 + \mathcal{O}(\|y\|^3),\end{align*}

where $\lambda_1 > 0$ and $\lambda_2 \leq 0$ . Then $\nabla E(y) = ( \lambda_1 y_1, \lambda_2 y_2) + \mathcal{O}(\|y\|^2)$ . There is one stable and one unstable direction, which means that there is exactly one gradient flow line that can end up in the critical point. In particular, if there are finitely many critical points, then this set of trajectories has measure 0. The case $\alpha = 0$ is different (since the Laplacian is zero) and a different argument is needed; we will provide a more general statement (that also works for $\alpha \geq 0$ ) as a consequence of the main argument in the paper as Lemma 4 in Section 3.4.

2.3. A topological lemma

We establish a basic topological fact showing that the set of points at distance $\|x\| = r \gg n $ from the origin whose gradient flow transports them to a fixed existing particle cannot be too complicated.

Lemma 1. Let $\{x_1,\ldots,x_n\} \subset \mathbb{R}^2$ be a set of n particles satisfying property (P). Then, for $1 \leq i \leq n$ arbitrary and $r \geq 10n$ , the set $A_r = \{\|x\| = r\,:\, \text{gradient flow ends up touching}\,x_i\}$ is, up to a finite number of points, the union of at most six open intervals.

Proof. We start by considering the circle of radius $\frac{1}{2}$ centered at $x_i$ . By the construction of the set, it is touched by at least one other circle of radius $\frac{1}{2}$ centered at some other point. It may be touched by more than one circle, but it cannot be touched by more than six circles. Therefore the set

\begin{align*}\bigg\{\|x - x_i\| = \frac{1}{2} \bigg\} \setminus \bigcup_{\substack{k=1\\ {k \neq i}}}^n \bigg\{\|x-x_k\|=\frac{1}{2}\bigg\}\end{align*}

is almost all of $\big\{\|x - x_i\| = \frac{1}{2}\big\}$ with at most six points removed. In particular, it can be written as the union of at most six connected sets, which we label $C_{i,1}, C_{i,2}, \ldots$

Let us now assume that $A_r$ contains at least seven open intervals (separated by more than just a finite number of points). Then, using the pigeonhole principle, there are two points $x,y\in A_r$ that belong to two different open intervals in $A_r$ which, under the gradient flow, get sent to the same connected set $C_{i,k}$ . The main idea of the argument is now shown in Figure 9. Starting the gradient flow for any point between x and y now leads to a flow that is trapped between the two existing gradient flows in the region $\Omega$ . This can be seen as follows: first, because $r \geq 10n$ and thus all the forces $\nabla E$ point inside the disk, it is clear that the gradient flow can never escape through $A_r$ . It can never touch the gradient lines induced by x and y by uniqueness. It can also not get within distance 1 of any of the existing points because the existing particles are connected. It may get stuck in a critical point but, by assumption, there are only finitely many of those, which then shows that, in the set $A_r$ , the open interval containing x and the open interval containing y may actually be joined into a single open interval (up to finitely many points).

Figure 9. Creating a trapping region $\Omega$ .

2.4. Gradient flow asymptotics

Both in the proof of Theorem 1 (in Section 2.5) and in the proof of Lemma 3 (Section 3.2), we will use a specific trick that is worth pointing out in detail. Our goal is to understand the behavior of gradient ascent flows with respect to $\nabla E$ coming from infinity. It is equivalent, and sometimes easier, to analyze the behavior of gradient descent flows going off to infinity. As soon as we are far away from the set of n existing particles, there is an important simplification: we have, for all $1 \leq j \leq n$ ,

\begin{align*}\nabla E(x) = \alpha\sum_{j=1}^{n}\frac{x_j - x}{\|x_j - x\|^{\alpha+2}} =(\alpha \cdot n+o(1)) \cdot \frac{x_j-x}{\|x_j-x\|^{\alpha+2}} \quad \text{as}\,\|x\| \rightarrow \infty,\end{align*}

where the error o(1) naturally depends on which $x_j$ has been chosen. Alternatively, we could also pick the center of mass of $x_1, \ldots, x_n$ or any other point in their convex hull. This principle is well understood in physics (‘far-field expansion’): the effect of N electrons, when measured far away, is not too different from the effect of N electrons all placed in the exact same position. The most extreme instance is Newton’s shell theorem, stating that a spherically symmetric body has a gravitational effect that is indistinguishable from the gravitational effect it would have if all its mass were concentrated at a point in the center.

The relevance for our problem is as follows. Suppose now that $I \subset \{x \in \mathbb{R}^2\,:\, \|x\| = r\}$ is some connected subset with the property that gradient ascent started in that set flows to some specific subset of the boundary of the disks. Then, once $r \gg n$ is sufficiently large, the gradient descent of that set I off to infinity should be very regular: I should evolve very nearly as if under radial projection from the fixed point $(x_1 + \cdots + x_n)/n$ . This is then suggestive of the existence of a limit. One way to proceed is to make this precise by making the asymptotic expansion above precise. We opt for a slightly more elementary route that uses the following basic fact: if $C \subset \mathbb{R}^2$ is a cone and if $x_j - x \in C$ for all $1 \leq j \leq n$ , then $\nabla E \in C$ . This argument holds for a great number of radially decreasing non-negative functions (and, in particular, is true uniformly in $\alpha$ ). It can be used to narrow down the directions in which the gradient flow may flow. We refer to Figure 10 for a sketch of the argument.

Figure 10. A set of n particles contained in a ball of radius $\sim n$ , a far away point x, and the cone C induced by the convex hull of the particles. Gradient descent flows along the cone.

This narrows down the possible options of where the gradient flow can be in the future. Note that the opening angle of the cone C is bound to be $\lesssim n/\|x-x_j\|$ independently of how the particles are arranged. As $\|x\|$ becomes large, the size of the angle goes to 0. Applying this argument twice (once for each endpoint of an interval) can then be used to force convergence of the underlying probabilities as $r \rightarrow \infty$ .

Figure 11. Sketch of the proof of Theorem 1.

2.5. Proof of Theorem 1

Proof. We will show that, for any point $x_i$ and any interval $J \subset \big\{y\,:\, \|x_i -y\| = \frac{1}{2}\big\}$ , there exists an asymptotic likelihood of ending up in that interval (in the sense of the limit of the measure existing when $r \rightarrow \infty$ ). Thus, by considering random points at distance $\|x\| = r$ , there is an emerging limit as $r \rightarrow \infty$ of where these points impact on the boundary. We can, without loss of generality, assume that J is an interval on the disk with the property that no element in the interval is distance $\frac{1}{2}$ from any other point (if not, we can always split the interval into two intervals and argue for each one separately).

Repeating the argument from Lemma 1, we see that the set $A_r$ , which is now the set of points on $\{x\,:\, \|x\| = r\}$ that end up flowing into J, are a connected set (up to at most finitely many points that are transported to critical points; here we use property (P)). There are two cases: either $\lim_{r \rightarrow \infty} {|A_r|}/{2\pi r} = 0$ or not. If the limit is 0, we have the desired convergence (the likelihood of hitting the interval J is then 0). Suppose now that the limit is not 0. Then there exists $\varepsilon_0$ such that, for a sequence $(r_k)_{k=1}^{\infty}$ tending to infinity, $|A_{r_k}| \geq \varepsilon_0 r_k$ . We pick such a very large radius $r_k$ , say one that is so large that it ensures that $|A_{r_k}| \geq {\mathrm{e}}^n$ , and then argue as illustrated in Figure 11. We don’t exactly know how the existing n particles are distributed inside the original disk of radius $\leq n$ , but they all exert a radial force. This shows that, at the endpoints of $A_{r_k}$ , we can actually predict up to an angle of size $\sim n/r_k$ how the flow lines will look like (by assuming, simultaneously, that all the existing points are concentrated in the northern and the southern tips of the disk). This means that the asymptotic probability is actually determined up to a factor of ${\sim} n/r_k$ . This can be made arbitrarily small by letting $r \rightarrow \infty$ , which forces convergence of $|A_r|/r$ .

3. Proof of Theorem 2

The main ingredient is Lemma 2 in Section 3.1. Section 3.2 proves a geometric result, Lemma 3, which then, together with Lemma 2, proves Theorem 2 in Section 3.3. A different look at Lemma 2 leads to Lemma 4 in Section 3.4.

3.1. The main step of the argument

We start by proving the key estimate. The constant 10 in the condition $r \geq 10n$ is somewhat arbitrary and chosen for the sake of concreteness of exposition.

Lemma 2. For any $x_i$ , $ 1\leq i \leq n$ , we define, for $r \geq 10n$ , the set

\begin{align*}A_r = \{x \in \mathbb{R}^2\,:\, \|x\| = r \, {and\ gradient\ flow\ leads\ to}\,x_i \}.\end{align*}

Then, for $c_{\alpha}>0$ depending only on $\alpha > 0$ ,

\begin{align*}\frac{|A_{r}|}{2 \pi r} \leq c_{\alpha} r^{\alpha} n^{-{\alpha}/{2} - {1}/{2}}.\end{align*}

Figure 12. Sketch of the proof of Lemma 2.

Proof. We define the set $\Omega_r \subset \mathbb{R}^2$ as all the points in $\mathbb{R}^2$ that are elements of some gradient flow started by some point in $A_i$ (and ending up within distance 1 of $x_i$ ); see Figure 12. Green’s theorem implies that $\int_{\Omega_r}\Delta E\,{\mathrm{d}} x = \int_{\partial\Omega_r}\nabla E\cdot{\mathrm{d}} n$ , where n is the normal vector pointing outside.

The gradient and the Laplacian of the energy are given by

\begin{align*}\nabla E(x) = \alpha\sum_{j=1}^{n}\frac{x_j - x}{\|x_j - x\|^{\alpha+2}}, \qquad \Delta E(x) = \alpha^2 \sum_{j=1}^{n} \frac{1}{\|x - x_j\|^{\alpha+2}} \geq 0.\end{align*}

This implies that $\int_{\partial\Omega_r}\nabla E\cdot{\mathrm{d}} n \geq 0$ . The remainder of the argument is dedicated to deducing information from this inequality. We decompose the boundary of $\Omega_r$ into three sets,

\begin{align*}\partial\Omega_r = (\partial\Omega_r\cap\{z\,:\,\|z-x_i\|=1\}) \cup A_r\cup \text{remainder},\end{align*}

where $\partial\Omega_r\cap\{z\,:\,\|z-x_i\| = 1\}$ is the part of the boundary of $\Omega_r$ that is within distance 1 of $x_i$ , the set $A_r$ is the initial starting points of the gradient flow that are far away, and the remainder is everything else (comprised of a union of gradient flows). We note that

\begin{align*}\int_{\mathrm{remainder}} \nabla E \cdot \textrm{d}n = 0\end{align*}

because the remainder is given by gradient flows flowing in direction $\nabla E$ : the normal derivative vanishes everywhere. We arrive at

\begin{align*}\int_{\partial\Omega_r\cap\{z\,:\,\|z-x_i\| = 1\}}\nabla E\cdot{\mathrm{d}} n\geq-\int_{A_r}\nabla E\cdot{\mathrm{d}} n.\end{align*}

We now bound first the left-hand side from above and then the right-hand side from below. On the part of the boundary that is distance 1 from $x_i$ , the normal vector can be written as $(x_i - x)$ , and thus

\begin{align*} \int_{\partial\Omega_r\cap\{z\,:\,\|z-x_i\|=1\}}\nabla E\cdot{\mathrm{d}} n = \int_{\partial\Omega_r\cap\{z\,:\,\|z-x_i\|=1\}}\Bigg\langle\alpha\sum_{j=1}^{n}\frac{x_j - x}{\|x_j - x\|^{\alpha+2}}, x_i - x\Bigg\rangle\,{\mathrm{d}}\sigma, \end{align*}

where $\sigma$ is the arclength measure. We conclude that

\begin{align*}\Bigg\langle\sum_{j=1}^{n}\frac{x_j - x}{\|x_j - x\|^{\alpha+2}},x_i - x\Bigg\rangle = \sum_{j=1}^{n}\frac{\langle x_j - x, x_i - x\rangle}{\|x_j - x\|^{\alpha+2}} \leq \sum_{j=1}^{n}\frac{1}{\|x_j - x\|^{\alpha+1}},\end{align*}

where the last step used Cauchy–Schwarz and the fact that $\|x_i - x\|=1$ . Thus,

\begin{align*} \bigg|\int_{\partial\Omega_r\cap\{z\,:\,\|z-x_i\|=1\}}\nabla E\cdot{\mathrm{d}} n\bigg| & \leq \alpha\int_{\partial\Omega_r\cap\{z\,:\,\|z-x_i\|=1\}}\sum_{j=1}^{n}\frac{1}{\|x_j - x\|^{\alpha+1}}\,{\mathrm{d}}\sigma \\ & \leq 2\pi\alpha\max_{\partial\Omega_r\cap\{z\,:\,\|z-x_i\|=1\}}\sum_{j=1}^{n}\frac{1}{\|x_j - x\|^{\alpha+1}}. \end{align*}

At this point, we use a packing argument. The existing points $\{x_1,\ldots,x_n\}$ are 1-separated, meaning $\|x_i - x_j\| \geq 1$ when $i \neq j$ . Likewise, the new point x is also a distance $\geq 1$ from all the existing points. The sum is therefore maximized if all the existing points $x_i$ are packed as tightly as possible around x. Switching to polar coordinates around x, we have that, for some absolute constants,

\begin{align*} \max_{\partial\Omega_r\cap\{z\,:\,\|z-x_i\|=1\}}\sum_{j=1}^{n}\frac{1}{\|x_j - x\|^{\alpha+1}} \leq c^*_{\alpha}\sum_{\ell=1}^{\sqrt{n}}\ell\frac{1}{\ell^{\alpha+1}} \leq c_{\alpha} \cdot n^{{1}/{2}-{\alpha}/{2}}. \end{align*}

We could make the constant $c_{\alpha}$ explicit; indeed, since the sphere-packing problem is solved in two dimensions, it stands to reason that the optimal constant should be attained for the hexagonal lattice (a result in a similar spirit has been shown in [Reference Beltran, Betermin, Grabner and Steinerberger1]). We have not tracked explicit constants anywhere and will not start here; however, this may be a useful avenue to pursue in regards to whether the asymptotic shape is a disk when $\alpha = 0$ .

It remains to bound the second integral from below. It can be written as

\begin{align*} -\int_{A_r}\nabla E\cdot{\mathrm{d}} n = \alpha \int_{A_r}\Bigg\langle\sum_{j=1}^{n}\frac{x-x_j}{\|x_j - x\|^{\alpha+2}}, \frac{x}{\|x\|}\Bigg\rangle\,{\mathrm{d}}\sigma. \end{align*}

Since $\|x\| \geq 10n$ and $\|x_i\| \leq n$ , and thus $\|x-x_j\| \geq 9n$ , we have

\begin{align*}\langle x-x_j, x\rangle = \|x-x_j\|^2 + \langle x-x_j, x_j\rangle \geq \frac{1}{2} \|x-x_j\|^2\end{align*}

and thus

\begin{align*}-\int_{A_r}\nabla E\cdot{\mathrm{d}} n \geq \frac{\alpha}{100}\int_{A_r}\sum_{j=1}^{n}\frac{1}{\|x_j - x\|^{\alpha+1}}\,{\mathrm{d}}\sigma.\end{align*}

Again using $r \geq 10n$ together with $\|x- x_i\| \geq r/2$ , we have

\begin{align*}\frac{\alpha}{100}\int_{A_r}\sum_{j=1}^{n}\frac{1}{\|x_j - x\|^{\alpha+1}}\,{\mathrm{d}}\sigma \geq c_{\alpha}^* \frac{n}{r^{\alpha+1}}|A_r|.\end{align*}

Combining the upper bound and the lower bound, we obtain ${|A_{r}|}/{2 \pi r} \leq c_{\alpha} r^{\alpha} n^{-{\alpha}/{2} - {1}/{2}}$ .

3.2. Bounding the likelihood at infinity

In order to conclude the result, we need to show that the size of $A_{r}$ for finite r is related to the probability of selecting the particle (which is governed by the asymptotic behavior of $|A_r|/r$ as $r \rightarrow \infty$ ).

Lemma 3. Suppose $\{x_1,\ldots,x_n\} \subset \mathbb{R}^2$ satisfies property (P). Then the asymptotic likelihood of hitting particle $x_i$ satisfies

\begin{align*}p = \lim_{r\rightarrow\infty}\frac{|A_r|}{2\pi r} \leq 100\cdot\frac{|A_{{100n}/{p}}|}{{100n}/{p}}.\end{align*}

Figure 13. Sketch of the proof of Lemma 3. Connected components flowing into $A_r$ have to be contained inside the cone.

Proof. We may think of each individual existing particle as exerting a force, and that these forces, once we are far away from the set, are contained in a convex cone. We refer to Figure 13 for a sketch of the argument. It remains to compute the opening angle of the cone.

We assume, for convenience, the disk of radius n to be centered in the origin and, again for convenience, that one (of possibly several but at most six, see Lemma 1) interval is centered symmetrically around the x-axis. We choose r to be large. The length of the set $A_r$ is then, for r very large, asymptotically given by $2 \pi p r$ , where $0 \leq p \leq 1$ is the asymptotic probability. We choose r so large that $2 \pi p r \gg {\mathrm{e}}^n \geq n$ . The two lines in Figure 13 can be approximated by $y= px - n$ and $y= -px + n$ . These lines meet at $x = n/p$ . Plugging in $x = 100n/p$ shows that $|A_{100{n}/{p}}| \geq 10n = 100({n}/{p})({p}/{10})$ , and therefore

\begin{equation*} \frac{|A_{100{n}/{p}}|}{100{n}/{p}} \geq \frac{p}{10}. \end{equation*}

3.3. Proof of Theorem 2

Proof. With all these ingredients in place, the proof of Theorem 2 is now straightforward. Using Lemmas 2 and 3, we have

\begin{align*} p = \lim_{r\rightarrow\infty}\frac{|A_r|}{2\pi r} \leq 100\cdot\frac{|A_{{100n}/{p}}|}{{100n}/{p}} \leq c_{\alpha}\bigg(\frac{n}{p}\bigg)^{\alpha}n^{-{\alpha}/{2} - {1}/{2}}, \end{align*}

from which we deduce that $p^{1+\alpha} \leq c\cdot n^{({\alpha-1})/{2}}$ and thus $p \leq c\cdot n^{({\alpha-1})/({2\alpha+2})}$ .

It seems likely that there is at least some room for improvement here. Lemma 3 uses only basic convexity properties of the forces that are being exerted by the existing particles. It is not inconceivable that something like

\begin{equation*} \qquad\qquad\qquad\qquad\quad\qquad c_{\alpha,1}\frac{|A_{10n}|}{10n} \leq \lim_{r\rightarrow\infty}\frac{|A_r|}{r} \leq c_{\alpha,2}\frac{|A_{10n}|}{10n} \quad\quad\qquad\qquad\qquad\qquad (\diamond)\end{equation*}

might be true. One motivation is that if $\mu$ is a probability measure in the unit disk D(0, 1), then

\begin{align*}\nabla_x\int_{D(0, 1)}\frac{{\mathrm{d}}\mu(y)}{\|x-y\|^{\alpha}}\,{\mathrm{d}} y\end{align*}

for $\|x\| \geq 10$ should have a greater degree of regularity than is exploited by the simple geometric argument in Lemma 3. It appears that multipole expansions might be a natural avenue to pursue in this direction. If $(\diamond)$ were the case, we would obtain the Beurling estimate $\mathbb{P} \leq c_{\alpha} n^{\alpha/2 - 1/2}$ , which is a slight improvement over Theorem 2. However, there is still no reason to assume that this would be optimal (except when $\alpha = 0$ , where Theorem 2 is already optimal). Theorem 3 would then lead to a slight improvement on the growth rate and show that $\text{diam} \leq c_{\alpha} n^{\alpha/2 + 1/2}$ . There is also no reason to assume that this improved rate would be optimal. None of this works for $\alpha \geq 1$ ; it seems that truly new ideas are required to deal with this regime.

3.4. A useful by-product

We use this section to point out an interesting by-product of the previous arguments (which can also be used to give an alternative approach to the argument from Section 2.2).

Lemma 4. Let $\alpha \geq 0$ and suppose $\{x_1,\ldots,x_n\} \subset \mathbb{R}^2$ satisfies property (P). Let y be a point that is at least distance 1 from all the existing points, $\|y - x_i\| \geq 1$ . Then, for $\varepsilon \leq \frac{1}{10}$ , the likelihood of hitting the disk $B_{\varepsilon}(y)$ satisfies

\begin{align*} \mathbb{P}({new\ incoming\ particle\ hits}\ B_{\varepsilon}(y)) \leq c_{\alpha}\cdot\varepsilon^{{1}/({1 + \alpha})} \cdot n^{({\alpha-1})/({2\alpha+2})}. \end{align*}

Proof. Lemma 1 implies that the set $A_r \subset \{x\,:\, \|x\| = r\}$ with the property that the gradient flow ends up hitting $B_{\varepsilon}(y)$ before getting within distance 1 of any the $x_i$ must be connected. We use the argument from Lemma 2, and argue that

\begin{align*}\int_{\partial\Omega_r\cap\{z\,:\,\|z-y\|=\varepsilon\}}\nabla E\cdot{\mathrm{d}} n \geq -\int_{A_r}\nabla E\cdot{\mathrm{d}} n.\end{align*}

Arguing as in the proof of Lemma 2, we have

\begin{align*}\bigg|\int_{\partial\Omega_r\cap\{z\,:\,\|z-y\|=\varepsilon\}}\nabla E\cdot\,{\mathrm{d}} n\bigg| \leq c_{\alpha}\cdot\varepsilon\cdot n^{{1}/{2} - {\alpha}/{2}},\end{align*}

as well as $-\int_{A_r}\nabla E\cdot{\mathrm{d}} n \geq c_{\alpha}n{|A_r|}/{r^{\alpha+1}}$ . Thus, as in the proof of Lemma 2, we have ${|A_r|}/{2\pi r} \leq c_{\alpha}\varepsilon r^{\alpha}n^{-{\alpha}/{2} - {1}/{2}}$ . Applying Lemma 3 once more,

\begin{align*} p = \lim_{r\rightarrow\infty}\frac{|A_r|}{2\pi r} \leq 10\cdot\frac{|A_{{100n}/{p}}|}{{100n}/{p}} \leq c_{\alpha}\varepsilon\bigg(\frac{n}{p}\bigg)^{\alpha}n^{-{\alpha}/{2} - {1}/{2}}, \end{align*}

and thus $p \leq c_{\alpha}\cdot\varepsilon^{{1}/({1 + \alpha})} \cdot n^{({\alpha-1})/({2\alpha+2})}$ .

We may think of Lemma 4 as a measure of the maximum amount of distortion that the gradient flows can experience. In particular, letting $\varepsilon \rightarrow 0$ gives us quantitative control on the statement that critical points are hit with probability 0. The case $\alpha = 0$ is also instructive because only very little distortion can take place.

4. Proof of Theorem 3

The purpose of this section is to deduce the growth estimate from a Beurling estimate. We first present a simple argument showing that we can consider the problem in a suitable discrete setting on graphs. After that, we present an existing argument [Reference Benjamini and Yadin2] (suitably simplified and adapted to our setting).

4.1. Going discrete

We first note a basic fact. GFA leads, with probability 1, to particles whose disks touch and which form a tree structure. This from the fact that a new incoming particle will only attach itself to one existing particle with likelihood 1. There are points where a new circle would touch two circles, but this is a finite set of points and gradient descent shows that they lead to a set of measure 0. Moreover, since every disk in the Euclidean plane can only touch at most six other circles, we can furthermore deduce that every vertex in the degree has at most six neighbors. We deduce the following fact.

Proposition 2. The adjacency structure of GFA can be embedded into an infinite 6-regular tree (taken to be rooted in the first particle $x_1$ ) with likelihood 1.

Each particle in GFA has $\leq 5$ neighbors with probability 1 (having six neighbors would require a perfect alignment of disks and happens with probability 0). Since the Benjamini–Yadin argument [Reference Benjamini and Yadin2] is somewhat insensitive to the choice of constant, we choose 6. One way of studying the diameter of GFA is to study the diameter of the evolving induced subgraph on the infinite 6-regular tree. Note that the likelihoods of attaching a new particle/vertex to an existing vertex in the tree is still induced by the underlying continuous setting in $\mathbb{R}^2$ (we may think of GFA as running in the background and the evolution on the tree as being a cartoon picture). The argument then simply uses that the diameter of the induced subgraph on the infinite tree immediately bounds the diameter of GFA by the triangle inequality.

4.2. A concentration inequality

This section is almost verbatim from [Reference Benjamini and Yadin2] and is included for the convenience of the reader.

Lemma 5. Let $B = \sum_{k=1}^{n} Z_k$ be a sum of independent Bernoulli random variables taking values in $\{0, 1\}$ . Then, for any $C > 1$ , we have $\mathbb{P}(B \geq C\cdot\mathbb{E}B) \leq \exp[-C(\mathbb{E}B)\log({C}/{{\mathrm{e}}})]$ .

Proof. This follows from the classical Bernstein method. For any $\alpha > 0$ , we have

\begin{align*}\mathbb{E}{\mathrm{e}}^{\alpha B} = \prod_{k=1}^{n}\mathbb{E}{\mathrm{e}}^{\alpha Z_k} = \prod_{k=1}^{n}(({\mathrm{e}}^{\alpha}-1)\cdot\mathbb{E}Z_k+1).\end{align*}

Now, using $\log(1+x) \leq x$ , we have

\begin{align*} \prod_{k=1}^{n}(({\mathrm{e}}^{\alpha}-1)\cdot\mathbb{E}Z_k+1) & = \exp\Bigg(\sum_{k=1}^{n}\log(1 + ({\mathrm{e}}^{\alpha}-1)\cdot\mathbb{E}Z_k)\Bigg) \\ & \leq \exp\Bigg(\sum_{k=1}^{n}({\mathrm{e}}^{\alpha}-1)\cdot\mathbb{E}Z_k\Bigg) = \exp\{({\mathrm{e}}^{\alpha}-1)\cdot\mathbb{E}B\}. \end{align*}

Markov’s inequality leads to

\begin{align*} \mathbb{P}(B \geq C\cdot\mathbb{E}B) & = \mathbb{P}({\mathrm{e}}^{\alpha B}\geq{\mathrm{e}}^{\alpha C\mathbb{E}B}) \\ & = \mathbb{P}({\mathrm{e}}^{\alpha B - \alpha C\mathbb{E}B} \geq 1) \\ & \leq \mathbb{E}{\mathrm{e}}^{\alpha B - \alpha C\mathbb{E}B} = {\mathrm{e}}^{-\alpha C\mathbb{E}B}\cdot\mathbb{E}{\mathrm{e}}^{\alpha B} \leq {\mathrm{e}}^{-\alpha C\mathbb{E}B}\exp\{(\textrm{e}^{\alpha}-1)\cdot\mathbb{E}B\}. \end{align*}

It remains to optimize the function $-\alpha C + {\mathrm{e}}^{\alpha} - 1$ over $\alpha$ . Differentiating suggests choosing $\alpha$ so that ${\mathrm{e}}^{\alpha} = C$ (which is possible since $C>1$ ). Plugging in, we get

\begin{align*}\mathbb{P}(B \geq C\cdot\mathbb{E}B) \leq \exp[(\mathbb{E}B)(-(\log{C})C + C - 1)].\end{align*}

Since the random variables are non-negative, we have $\mathbb{E}B \geq 0$ and can thus use

\begin{equation*} -(\log{C})C + C - 1 \leq -(\log{C})C + C = -C\log\bigg(\frac{C}{{\mathrm{e}}}\bigg). \end{equation*}

4.3. Growth on the tree

The argument in this subsection is also fully contained in [Reference Benjamini and Yadin2]. We have merely specialized the general argument to our setting. We will now suppose that we have n GFA particles $\{x_1,\ldots,x_n\} \subset \mathbb{R}^2$ . We now consider the process of adding the next n particles $x_{n+1},\ldots,x_{2n}$ . We first consider the subgraph $G_n \subset H_6$ of the infinite 6-regular tree, and then the second induced subgraph $G_{2n} \subset H_6$ . We use d(v, w) to denote the usual graph distance and, for a set $A \subset V$ , the distance $d(v,A) = \inf_{a \in A}d(v,a)$ . The statement we are going to prove is as follows.

Lemma 6. Assuming GFA satisfies a Beurling estimate

\begin{align*}\max_{1 \leq i \leq n}\mathbb{P}({new\ particle\ hits}\ x_i) \leq c_{\alpha} \cdot n^{-\alpha},\end{align*}

there exists a constant $c>0$ (depending only on $c_{\alpha}$ ) such that for all $v \in G_{2n}$ , $d(v, G_n) \leq c \cdot n^{1-\alpha}$ with likelihood at least $1 - n \cdot {\mathrm{e}}^{-c n^{1-\alpha}}$ .

Proof. The argument is based on analyzing an individual path and then taking a union bound over all paths. Let $v \in G_n$ be a vertex that is adjacent to a vertex in $H_{6} \setminus G_n$ . Consider a fixed path of length $c \cdot n^{1-\alpha}$ emanating from v and never touching either itself or $G_n$ . We ask ourselves: what is the likelihood that this path is being added when we add in the next n points? Using $Z_i$ to denote the indicator variable of the event of the next vertex in a (fixed) path being added when adding particle $v_{n+i}$ , we have that $Z_i\in\{0, 1\}$ is Bernoulli with $\mathbb{E}Z_i \leq c_{\alpha}n^{-\alpha}$ and $\mathbb{E}\sum_{i=1}^{n}Z_i \leq c_{\alpha}n^{1-\alpha}$ . The likelihood of the entire path of length $c n^{1-\alpha}$ being added is thus bounded from above by the likelihood of having $\sum_{i=1}^{n}Z_i$ exceed $ c n^{1-\alpha}$ ,

\begin{align*}\mathbb{P}\Bigg(\sum_{i=1}^{n}Z_i \geq cn^{1-\alpha}\Bigg) \leq \mathbb{P}\bigg(B \geq \frac{cn^{1-\alpha}}{\mathbb{E}B}\mathbb{E}B\bigg).\end{align*}

Appealing to Lemma 5, we have

\begin{equation*} \mathbb{P}\bigg(B \geq \frac{cn^{1-\alpha}}{\mathbb{E}B}\mathbb{E}B\bigg) \leq \exp\bigg({-}cn^{1-\alpha}\log\bigg(\frac{cn^{1-\alpha}}{\mathbb{E}B}\bigg)\bigg) \leq \exp\bigg({-}c\cdot n^{1-\alpha}\log\bigg(\frac{c}{c_{\alpha}}\bigg)\bigg). \end{equation*}

This is the likelihood of any particular path venturing far. It remains to count the number of such paths. Since we have n original vertices to start from and are on a 6-regular tree, the number of such paths is trivially bounded from above by $n6^{c n^{1-\alpha}}$ and thus, by the union bound,

\begin{align*}\mathbb{P}(\text{long path exists}) \leq n\cdot\exp\bigg\{cn^{1-\alpha}\bigg(\log{(6)} - \log\bigg(\frac{c}{c_{\alpha}}\bigg)\bigg)\bigg\}.\end{align*}

Choosing $c = 1000 \cdot c_{\alpha}$ then implies the result.

The growth bound now follows from the Borel–Cantelli lemma. We refer to [Reference Benjamini and Yadin2] for details and implementations of a similar idea in more general settings.

5. Proof of Theorem 4

Proof. The argument is a subset of the existing arguments. We briefly summarize them in the correct order. Suppose $\{x_1,\ldots,x_n\} \subset \mathbb{R}^d$ is given, $d \geq 3$ , and we consider the harmonic energy $E(x) = \sum_{i=1}^{n}{1}/{\|x- x_i\|^{d-2}}$ . Let us pick a large radius $R \gg n \gg 1$ ; let $A_R$ be all points on $\{x\,:\,\|x\|=R\}$ for which the gradient flow attaches itself to particle $x_i$ , and denote the union of all these flows by $\Omega$ . Then $\int_{\partial\Omega}\nabla E\cdot{\mathrm{d}} n = \int_{\Omega}\Delta E\,{\mathrm{d}} x = 0$ . The boundary integral can be decomposed into two parts: the parts close to $x_i$ and the parts on the sphere $\{x\,:\,\|x\|=R\}$ : $\partial\Omega = \partial\Omega_{x_i} \cup \partial\Omega_{R}$ . We note that, as before, when $R \rightarrow \infty$ ,

\begin{align*} \int_{\partial\Omega_R}\nabla E\cdot{\mathrm{d}} n & = (d-2+o(1))\int_{\Omega_R}\frac{x_i - x}{\|x_i - x\|^d}\,{\mathrm{d}} n \\ & = (1+o(1))\frac{n}{R^{d-1}}\int_{\Omega_R}{\mathrm{d}}\sigma = (1+o(1))\cdot n\cdot\frac{|\Omega_R|}{R^{d-1}}. \end{align*}

Meanwhile, on the other side we argue with the triangle inequality, 1-separation of points, and a constant $c_d$ (which may change its value every time it appears) that

\begin{align*} \bigg|\int_{\partial\Omega_{x_i}}\nabla E\cdot{\mathrm{d}} n\bigg| & \leq \int_{\partial\Omega_{x_i}}|\nabla E|\cdot{\mathrm{d}}\sigma \\ & \leq c_d\int_{\partial\Omega_{x_i}}\sum_{i=1}^{n}\frac{1}{\|x-x_i\|^{d-1}}\,{\mathrm{d}} \sigma \leq c_d\sum_{\ell=1}^{n^{1/d}}\ell^{d-1}\frac{1}{\ell^{d-1}} \leq c_d n^{1/d}. \end{align*}

This proves the Beurling estimate. As for the growth estimate, we can simply modify the existing argument as follows. A GFA tree in $\mathbb{R}^d$ can, with likelihood 1, be embedded in an infinite regular tree $H_k$ where k is the kissing number of the space; that is, the maximal number a sphere of radius 1 can be simultaneously touched by other disjoint spheres with the same radius. The remainder of the argument is unchanged (though all the implicit constants do of course change).

6. Proof of Proposition 1

Proof. We first note, when picking a point y at distance $R \gg n$ from the existing set of points, that the set $\|y-x\| = \text{const}$ is a circle with curvature $1/\text{const}$ . These circles become flat lines as $\|y\| \rightarrow \infty$ and therefore only points in the convex hull are considered at each step. We can now consider three consecutive points on the convex hull and assume there is an (interior) angle $\alpha$ in the middle point. We can then draw the half-lines separating closest distances between any of the points. These half-lines do not meet but, asymptotically, the only relevant questions is the opening angle $\beta$ that they create. Elementary trigonometry shows that $\beta = \pi - \alpha$ . The sum of the opening angles in a convex polygon with m sides is $\sum_{i=1}^{m}\alpha_i = (m-2)\pi$ . Therefore, the likelihoods in our case sum to $\sum_{i=1}^{m}(\pi - \alpha_i) = m\pi - (m-2)\pi = 2\pi$ . Thus, the likelihood of attaching itself to a new point is given by $(\pi - \alpha)/(2\pi)$ .

Funding information

The author was partially supported by the NSF (DMS-2123224).

Competing interests

There were no competing interests to declare which arose during the preparation or publication process of this article.

References

Beltran, C., Betermin, L., Grabner, P. and Steinerberger, S. (2022). How well-conditioned can the eigenvector problem be? Math Comput, 91, 12371245.Google Scholar
Benjamini, I, and Yadin, A. (2017). Upper bounds on the growth rate of diffusion-limited aggregation. Preprint, arXiv:1705.06095.Google Scholar
Berger, N., Procaccia, E. and Turner, A. (2022). Growth of stationary Hastings–Levitov. Ann. Appl. Prob. 32, 33313360.CrossRefGoogle Scholar
Eden, M. (1961). A two-dimensional growth process. In Proc. 4th Berkeley Sympos. Math. Statist. Prob., Vol. IV, pp. 223–239. University of California Press, Berkeley, CA.Google Scholar
Gabrielov, A., Novikov, D. and Shapiro, B. (2007). Mystery of point charges. Proc. London Math. Soc. 95, 443472.CrossRefGoogle Scholar
Hastings, M. (2001). Fractal to nonfractal phase transition in the dielectric breakdown model. Phys. Rev. Lett. 87, 175502.CrossRefGoogle ScholarPubMed
Hastings, M. and Levitov, L. (1998). Laplacian growth as one-dimensional turbulence. Physica D 116, 244252.CrossRefGoogle Scholar
Kesten, H. (1987). Hitting probabilities of random walks on $\mathbb{Z}^d$ . Stoch. Process. Appl. 25, 165184.CrossRefGoogle Scholar
Kesten, H. (1987). How long are the arms in DLA? J. Phys. A 20, L29.CrossRefGoogle Scholar
Kesten, H. (1990). Upper bounds for the growth rate of DLA. Physica A 168, 529535.CrossRefGoogle Scholar
Killian, K. (2009). A remark on Maxwell’s conjecture for planar charges. Complex Var. Elliptic Equ. 54, 10731078.CrossRefGoogle Scholar
Liddle, G. and Turner, A. (2023). Scaling limits of anisotropic growth on logarithmic time-scales. Electron. J. Prob. 28, 132.CrossRefGoogle Scholar
Losev, I. and Smirnov, S. (2023). How long are the arms in DBM? Preprint, arXiv:2307.14931.Google Scholar
Maxwell, J. C. (1954). A Treatise on Electricity and Magnetism, Vol. 1. Dover Publications, Mineola, NY.Google Scholar
Niemeyer, L., Pietronero, L. and Wiesmann, H. J. (1984). Fractal dimension of dielectric breakdown. Phys. Rev. Lett. 52, 10331036.CrossRefGoogle Scholar
Norris, J., Silvestri, V. and Turner, A. (2023). Scaling limits for planar aggregation with subcritical fluctuations. Prob. Theory Relat. Fields 185, 185250.CrossRefGoogle Scholar
Norris, J., Silvestri, V. and Turner, A. (2024). Stability of regularized Hastings–Levitov aggregation in the subcritical regime. Commun. Math. Phys. 405, 74.CrossRefGoogle Scholar
Norris, J. and Turner, A. (2012). Hastings–Levitov aggregation in the small-particle limit. Commun. Math. Phys. 316, 809841.CrossRefGoogle Scholar
Shapiro, B. (2015). Problems around polynomials: The good, the bad and the ugly… Arnold Math. J. 1, 9199.CrossRefGoogle Scholar
Silvestri, V. (2017). Fluctuation results for Hastings–Levitov planar growth. Prob. Theory Relat. Fields 167, 417460.CrossRefGoogle Scholar
Sutherland, D. N. (1967). A theoretical model of floc structure. J. Colloid Interface Sci. 25, 373380.CrossRefGoogle Scholar
Tsai, Y. L. (2015). Maxwell’s conjecture on three point charges with equal magnitudes. Physica D 309, 8698.CrossRefGoogle Scholar
Vold, M. J. (1963). Computer simulation of floc formation in a colloidal suspension. J. Colloid Sci. 18, 684695.CrossRefGoogle Scholar
Witten, T. A. and Sander, L. M. (1981). Diffusion-limited aggregation, a kinetic critical phenomenon. Phys. Rev. Lett. 47, 14001403.CrossRefGoogle Scholar
Zolotov, V. (2023). Upper bounds for the number of isolated critical points via the Thom–Milnor theorem. Anal. Math. Phys. 13, 81.CrossRefGoogle Scholar
Figure 0

Figure 1. A tree grown with $\alpha = 2$ and the trajectory of 100 incoming gradient flows (equispaced in angle). There is a tendency to avoid existing valleys.

Figure 1

Figure 2. Simulation of gradient flow aggregation for $n=1000$ particles with various parameters of $\alpha$.

Figure 2

Figure 3. A fixed cluster of 40 particles. Gradient descent lines are colored by the color of the particle they eventually attach to (particles colored black were not hit by any gradient descent in the simulation). Likelihoods are determined by gradient flow lines at infinity. As $\alpha$ increases, the exposed endpoints gain more mass.

Figure 3

Figure 4. GFA for $n=5000$ particles with $\alpha=0$ (left) and $n=2500$ particles with $\alpha =2$. Theorem 3 implies that the structure on the left grows as slowly as possible, $\text{diam}(x_1, \ldots, x_n) \leq c \sqrt{n}$.

Figure 4

Figure 5. $n=100$ (left), $n=500$ (middle), and $n=2000$ (right) steps of the evolution of GFA with an $\alpha = \infty$ tree.

Figure 5

Figure 6. The likelihood of a new particle attaching itself to an existing particle $x_k$ on the convex hull is $\beta/(2\pi) = (\pi - \alpha)/(2\pi)$.

Figure 6

Figure 7. Starting with the 10 vertices of a regular polygon at distance 100 from the origin, the evolution after 1000, 5000, 15 000, and 25 000 steps, respectively.

Figure 7

Figure 8. Four examples of GFA with $\alpha=\infty$ in $\mathbb{R}^3$: the first 10 000 points. We see a tendency towards five tentacles.

Figure 8

Figure 9. Creating a trapping region $\Omega$.

Figure 9

Figure 10. A set of n particles contained in a ball of radius $\sim n$, a far away point x, and the cone C induced by the convex hull of the particles. Gradient descent flows along the cone.

Figure 10

Figure 11. Sketch of the proof of Theorem 1.

Figure 11

Figure 12. Sketch of the proof of Lemma 2.

Figure 12

Figure 13. Sketch of the proof of Lemma 3. Connected components flowing into $A_r$ have to be contained inside the cone.