Hostname: page-component-745bb68f8f-cphqk Total loading time: 0 Render date: 2025-01-28T01:51:10.742Z Has data issue: false hasContentIssue false

On the asymptotic normality of persistent Betti numbers

Published online by Cambridge University Press:  03 December 2024

Johannes Krebs*
Affiliation:
KU Eichstätt-Ingolstadt
Wolfgang Polonik*
Affiliation:
University of California at Davis
*
*Postal address: KU Eichstätt-Ingolstadt, Ostenstraße 28, 85072 Eichstätt, Germany. Email address: johannes.krebs@ku.de
**Postal address: Department of Statistics, University of California, Davis, CA 95616, USA. Email address: wpolonik@ucdavis.edu
Rights & Permissions [Opens in a new window]

Abstract

Persistent Betti numbers are a major tool in persistent homology, a subfield of topological data analysis. Many tools in persistent homology rely on the properties of persistent Betti numbers considered as a two-dimensional stochastic process $ (r,s) \mapsto n^{-1/2} (\beta^{r,s}_q ( \mathcal{K}(n^{1/d} \mathcal{X}_n))-\mathbb{E}[\beta^{r,s}_q ( \mathcal{K}( n^{1/d} \mathcal{X}_n))])$. So far, pointwise limit theorems have been established in various settings. In particular, the pointwise asymptotic normality of (persistent) Betti numbers has been established for stationary Poisson processes and binomial processes with constant intensity function in the so-called critical (or thermodynamic) regime; see Yogeshwaran et al. (Prob. Theory Relat. Fields 167, 2017) and Hiraoka et al. (Ann. Appl. Prob. 28, 2018).

In this contribution, we derive a strong stabilization property (in the spirit of Penrose and Yukich, Ann. Appl. Prob. 11, 2001) of persistent Betti numbers, and we generalize the existing results on their asymptotic normality to the multivariate case and to a broader class of underlying Poisson and binomial processes. Most importantly, we show that multivariate asymptotic normality holds for all pairs (r, s), $0\le r\le s<\infty$, and that it is not affected by percolation effects in the underlying random geometric graph.

Type
Original Article
Copyright
© The Author(s), 2024. Published by Cambridge University Press on behalf of Applied Probability Trust

1. Introduction

In this manuscript we address an important question in topological data analysis (TDA), namely, the study of the weak convergence of persistent Betti numbers

(1) \begin{align} \Big( n^{-1/2}\Big( \beta^{r_i,s_i}_q(\mathcal{K}(n^{1/d} \mathcal{X}_n) ) - \mathbb{E}\Big[ \beta^{r_i,s_i}_q(\mathcal{K}(n^{1/d} \mathcal{X}_n) ) \Big]\Big)\;:\; 1\le i \le \ell \Big),\end{align}

where $0\le q \le d-1$ and $0\le r_i\le s_i <\infty$ for $1\le i \le \ell$ ( $\ell\in\mathbb{N}$ ) and where $\mathcal{X}_n$ is either an n-binomial process with a bounded density $\kappa$ defined on the unit cube $[0,1]^d$ or the corresponding Poisson process with intensity function $n \kappa$ for $n\in\mathbb{N}$ .

So far, there exist results on the pointwise asymptotic normality of Betti numbers (i.e., $\ell = 1$ ) in the case of a homogeneous Poisson process or a binomial process with a constant density; see Yogeshwaran et al. [Reference Yogeshwaran, Subag and Adler42]. In the case of a homogeneous Poisson process this result was extended to persistent Betti numbers by Hiraoka et al. [Reference Hiraoka, Shirai and Trinh16].

Based on the pioneering central limit theorem of Penrose and Yukich [Reference Penrose and Yukich32] for stabilizing functionals on the homogeneous Poisson process, Trinh [Reference Trinh37] extends the central limit theorem to strongly stabilizing functionals in the case of an underlying inhomogeneous Poisson process. We will apply the abstract result of Trinh to persistent Betti functions (see below). For this we establish the strong stabilization property of the persistent Betti function, and this is one of our main contributions.

The theory of random geometric complexes is growing rapidly. For pioneering contributions see Kahle [Reference Kahle17], Yogeshwaran and Adler [Reference Yogeshwaran and Adler41], or Owada and Adler [Reference Owada and Adler27]. We refer the reader to Bobrowski and Kahle [Reference Bobrowski and Kahle5] for a survey.

Recent contributions in the context of Betti numbers are those of Kahle and Meckes [Reference Kahle and Meckes18], Owada [Reference Owada26], Goel et al. [Reference Goel, Trinh and Tsunoda15], Divol and Polonik [Reference Divol and Polonik10] and Owada and Thomas [Reference Owada and Thomas28].

TDA is a comparatively young field that has emerged from several contributions in algebraic topology and computational geometry. Milestone contributions which helped to popularize TDA in its early days were made by Edelsbrunner et al. [Reference Edelsbrunner, Letscher and Zomorodian11], Zomorodian and Carlsson [Reference Zomorodian and Carlsson44], and Carlsson [Reference Carlsson7]. Given a point cloud sampled from a distribution with density f on a d-dimensional manifold, TDA encompasses various techniques aimed at understanding the topology of the manifold and of the density f. TDA methods have been successfully implemented in applied sciences such as biology ([Reference Yao40]), materials science ([Reference Lee21]), and chemistry ([Reference Nakamura24]). From the mathematical statistician’s point of view, a topic of particular interest is the application of TDA to time series; see, e.g., the pioneering works of Seversky et al. [Reference Seversky, Davis and Berger35] and Umeda [Reference Umeda38], as well as the contributions of Gidea et al. to the analysis of financial time series ([Reference Gidea12Reference Gidea and Katz14]).

Our contribution falls within the area of persistent homology, which is one of the major tools in TDA. We can give only a short introduction to this topic here; a more detailed introduction offering insights into the basic concepts, ideas, and applications of persistent homology can be found in Chazal and Michel [Reference Chazal and Michel9], Oudot [Reference Oudot25], and Wasserman [Reference Wasserman39].

Given a point cloud in $\mathbb{R}^d$ (a sample of a point process), one first builds simplicial complexes over this point cloud according to a rule that describes the neighborhood relation between points. The two most frequently used simplicial complex models are the Vietoris–Rips complex and the Čech complex, defined below. The topological properties of simplicial complexes considered as geometric structures are characterized by the number of q-dimensional holes they contain—most notably connected components, loops, and cavities (zero-, one- and two-dimensional features). These holes are theoretically defined via a tool from algebraic topology, known as homology. The qth homology of a simplicial complex is determined by a quotient space. Its dimension is the so-called qth Betti number. Intuitively, the qth Betti number counts the number of q-dimensional holes in the simplicial complex.

For a given simplicial complex model, a filtration is an increasing collection of simplicial complexes indexed by a one-dimensional parameter, the so-called filtration parameter, which can be understood as time. Given a filtration on a finite time interval, we can consider the evolution of the qth homology groups, i.e., the dynamic behavior of the Betti numbers. As the underlying simple point process (e.g., a Poisson process on a Euclidean space) is random, these Betti numbers are also random; we are thus considering a stochastic process.

From the applied point of view, the mere knowledge of the evolution of the Betti numbers is often not enough, especially when considering objects obtained from persistence diagrams, such as persistent landscapes. In this context the more general concept of persistent Betti numbers is the appropriate tool, and this is the object studied here.

The remainder of this manuscript is organized as follows. In Section 2 we introduce our notation. The main results are presented in Section 3, where we state the property of strong stabilization and present two central limit theorems for persistent Betti numbers. Section 4 offers a short review of important related results in the literature that are also used in our study. The framework of stabilization is treated in detail in Section 5, which also contains further results on the stabilizing properties of persistent Betti numbers. The technical details are given in Section 6 and in Appendix A.

2. Notation

Given a finite subset P of the Euclidean space $\mathbb{R}^d,$ the Čech filtration $\mathcal{C}(P)=(\mathcal{C}_r(P)\;:\;r\ge 0)$ and the Vietoris–Rips filtration $\mathcal{R}(P)=(\mathcal{R}_r(P)\;:\;r\ge 0)$ are defined by

\begin{align*} \mathcal{C}_r(P) &= \bigg\{ \sigma \subseteq P, \bigcap_{x\in \sigma} B(x,r)\neq \emptyset \bigg\}, \\[5pt] \mathcal{R}_r(P) &= \{ \sigma \subseteq P, \operatorname{diam}(\sigma)\le r \},\end{align*}

respectively, where $B(x,r) = \{y\in\mathbb{R}^d \;:\; \|x-y\|\le r\}$ , $\|\cdot\|$ is the Euclidean norm, and $\operatorname{diam}$ is the diameter of a subset of $\mathbb{R}^d$ . Each $\sigma \subseteq P$ of size $q+1$ is a q-simplex, and for each $r\ge 0$ , the collections of simplices $\mathcal{C}_r(P)$ and $\mathcal{R}_r(P)$ , respectively, form simplicial complexes. Throughout this article, $\mathcal{K}_r(P)$ denotes either the Čech or the Vietoris–Rips complex built from P for some $r\ge 0$ . The Čech or the Vietoris–Rips filtration $\mathcal{K}(P)$ is the nested sequence of complexes $\mathcal{K}_r(P)$ as r goes from 0 to $+\infty$ .

Consider a filtration $\mathcal{K}(P)$ and a time $r\ge 0$ . The chain group generated by the q-dimensional simplices at time r is $C_q( \mathcal{K}_r(P))$ . Write $Z_q(\mathcal{K}_r(P))$ for the qth cycle group of the simplicial complex $\mathcal{K}_r(P)$ and $B_q(\mathcal{K}_r(P))$ for the qth boundary group, and let $H_q(\mathcal{K}_r(P))$ be the homology of the simplicial complex $\mathcal{K}_r(P)$ with respect to to the base field $\mathbb{F}_2=\{0,1\}$ . A q-simplex $\sigma$ is positive in the filtration $\mathcal{K}(P)$ if, at its filtration time $r(\sigma)$ (the time it enters the complex), its inclusion in the simplicial complex $\mathcal{K}_{r(\sigma)-}$ creates a q-dimensional cycle. Here $\mathcal{K}_{r(\sigma)-}$ is the simplicial complex $\mathcal{K}_{r(\sigma)}$ without the complexes containing $\sigma$ (as a simplex or as a face). If the q-simplex $\sigma$ is not positive, then we call it negative.

Let $0\le q \le d-1$ . Then the (r, s)-persistent Betti number of a simplicial complex $\mathcal{K}(P)$ (see [Reference Edelsbrunner, Letscher and Zomorodian11]) is defined by

\begin{align*} \beta^{r,s}_q(\mathcal{K}(P)) &= \dim \frac{Z_q (\mathcal{K}_r(P))}{Z_q (\mathcal{K}_r(P)) \cap B_q (\mathcal{K}_s(P))} \\[5pt] &= \dim Z_q (\mathcal{K}_r(P)) - \dim Z_q (\mathcal{K}_r(P)) \cap B_q (\mathcal{K}_s(P)).\end{align*}

The Betti number $\beta^r_q(\mathcal{K}(P))$ is defined as $\beta^{r,r}_q(\mathcal{K}(P))$ , $r\ge 0$ ; in particular, $\beta^r_q(\mathcal{K}(P)) = \dim Z_q (\mathcal{K}_r(P)) - \dim B_q (\mathcal{K}_r(P))$ .

The persistent Betti number $\beta^{r,s}_q(\mathcal{K}(P))$ is closely related to the persistence diagram of the underlying point cloud P; see [Reference Hiraoka, Shirai and Trinh16] for further details. It equals the number of q-dimensional topological features (points in the qth persistence diagram) born before time r and still alive at time s (see Figure 1). Persistent Betti numbers are translation-invariant, i.e., $\beta^{r,s}_q(\mathcal{K}(P+v)) = \beta^{r,s}_q(\mathcal{K}(P))$ for each $v\in\mathbb{R}^d$ . The add-one cost function

\begin{align*} \mathfrak{D}_0 \beta^{r,s}_q(\mathcal{K}(P)) = \beta^{r,s}_q(\mathcal{K}(P \cup \{0\} )) - \beta^{r,s}_q(\mathcal{K}(P)) \end{align*}

is an important tool in our analysis.

Figure 1. The persistent Betti number $\beta^{r,s}_q(\mathcal{K}(P))$ equals the number of points in the gray-shaded rectangle; the point on the dashed red line is not counted, whereas the point on the solid red line is.

We let $\mathcal{P}$ and $\mathcal{P}'$ be two independent and homogeneous Poisson processes on $\mathbb{R}^d$ with unit intensity, observed on increasing observation windows $B_n=[-2^{-1} n^{1/d},2^{-1} n^{1/d}]^d$ . Given a function $w \ge 0$ , we denote by $\mathcal{P}(w)$ the (inhomogeneous) Poisson process with intensity function w.

We also use the following notation: $\Delta=\{(r,s)\;:\; 0\le r\le s<\infty\}$ denotes the domain of the persistent Betti function. We let $Q(x,r) = \{y\in\mathbb{R}^d\;:\; |y_i - x_i| \le r \text{ for } 1\le i \le d \}$ and $Q(x) = (\!-\!1/2,1/2]^d + x$ for $x\in\mathbb{R}^d$ and $r>0$ . For $y,z\in\mathbb{Z}^d$ , we write $y \prec z$ if y precedes z in the lexicographic ordering on $\mathbb{Z}^d$ , and we write $y \preceq z$ if either $y\prec z$ or $y=z$ . If $f\colon\mathbb{R}\rightarrow\mathbb{R}$ , we write $\|f\|_{\infty}$ for the sup-norm of f. We let $\Rightarrow$ denote convergence in distribution of a sequence of random variables. Throughout the article, we let $(\Omega,\mathcal{F},\mathbb{P})$ be a sufficiently rich probability space, on which all random variables are defined.

3. Main results

We now present the first main result, discussed in detail later in Section 5.

Theorem 1. Let $\lambda>0$ , $(r,s)\in\Delta$ , and $q \in \{0,\ldots,d-1\}$ . There is an $\mathcal{F}$ -measurable random variable $S^{(r,s)}_q := S^{(r,s)}_q(\mathcal{P}(\lambda)) $ which is almost surely (a.s.) finite, such that for all finite sets $A \subseteq \mathbb{R}^d\setminus B(0, S^{(r,s)}_q)$ , the add-one cost function satisfies

\begin{align*} &\mathfrak{D}_0 \beta^{r,s}_q\Big( \mathcal{K}\big( \big(\,\mathcal{P}(\lambda)\cap B(0,S^{(r,s)}_q)\,\big) \cup A \big) \Big) \equiv \mathfrak{D}_0 \beta^{r,s}_q\Big( \mathcal{K}\big( \mathcal{P}(\lambda)\cap B(0,S^{(r,s)}_q)\big)\Big) \quad \textit{a.s.}\end{align*}

Thus, the persistent Betti function is strongly stabilizing on the homogeneous Poisson process $\mathcal{P}(\lambda)$ , in the spirit of Penrose and Yukich [Reference Penrose and Yukich32], for each intensity $\lambda\in\mathbb{R}_+$ , each pair $(r,s)\in\Delta$ , and each dimension q. The proof of Theorem 1 relies on an abstract stabilization result stated in Theorem 4; see Section 5. The proofs of both theorems are given in Subsection 6.1.

By the property of strong stabilization, there are random variables $\Delta^{r,s}_0(\infty)$ taking values in $\mathbb{Z}$ and $N_0$ taking values in $\mathbb{N}$ such that, for all $n \ge N_0$ ,

\begin{align*} &\beta_q^{r,s} (\mathcal{K}(\mathcal{P}\cap B_n)) - \beta_q^{r,s} (\mathcal{K}(( [\mathcal{P}\setminus Q(0)]\cup[\mathcal{P}'\cap Q(0)])\cap B_n) ) \equiv \Delta^{r,s}_0(\infty);\end{align*}

see Lemma 3. Let $\mathcal{F}_0$ be the $\sigma$ -field generated by $\mathcal{P}$ restricted to $\bigcup_{y\in\mathbb{Z}^d: y \preceq 0} Q(y)$ . Define $\gamma( (u,v), (r,s)) = \mathbb{E}\left [ \mathbb{E}\left [ \Delta^{u,v}_0(\infty)|\mathcal{F}_0 \right ] \mathbb{E}\left [ \Delta^{r,s}_0(\infty)|\mathcal{F}_0 \right ] \right ]$ . The asymptotic normality in the Poisson sampling scheme can be derived directly from the the strong stabilization stated in Theorem 3 and the abstract result of Trinh [Reference Trinh37] via the Cramér–Wold device, as follows.

Theorem 2. Let $\mathcal{P}_n=\mathcal{P}(n\kappa)$ be a Poisson process with intensity $n\kappa$ on $[0,1]^d$ , where $\kappa$ is a bounded and measurable density function. Let $X \sim \kappa$ and let $(r_i,s_i)\in\Delta$ for $1\le i\le \ell$ and $\ell\in\mathbb{N}$ . Then

\[ \begin{pmatrix} &n^{-1/2}\left( \beta^{r_1,s_1}_q(\mathcal{K}(n^{1/d} \mathcal{P}_n) ) - \mathbb{E}\left [ \beta^{r_1,s_1}_q(\mathcal{K}(n^{1/d} \mathcal{P}_n) ) \right ] \right) \\[5pt] &\vdots\\[5pt] &n^{-1/2}\left( \beta^{r_\ell,s_\ell}_q(\mathcal{K}(n^{1/d} \mathcal{P}_n) ) - \mathbb{E}\left [ \beta^{r_\ell,s_\ell}_q(\mathcal{K}(n^{1/d} \mathcal{P}_n) ) \right ] \right) \end{pmatrix} \Rightarrow \Psi,\]

where $\Psi\sim \mathcal{N}(0,\Sigma)$ has a multivariate normal distribution with mean zero and covariance matrix $\Sigma\, \ge 0$ given by

\[ \Sigma(i,j) = \mathbb{E}\Big [ \gamma( \kappa(X)^{1/d} ( (r_i,s_i),(r_j,s_j)) ) \Big ] \quad (1\le i,j\le \ell).\]

Furthermore,

\begin{align*} \lim_{n\to\infty} n^{-1} \operatorname{Cov}\! \Big( \beta^{r_i,s_i}_q(\mathcal{K}(n^{1/d} \mathcal{P}_n) ) , \beta^{r_j,s_j}_q(\mathcal{K}(n^{1/d} \mathcal{P}_n) ) \Big) = \Sigma(i,j) \qquad (1\le i,j\le \ell).\end{align*}

Moreover, for $0\le r\le s< \infty$ , define

(2) \begin{align}\begin{split} \alpha(r,s) := \mathbb{E}\Big [ \mathfrak{D}_0 \beta^{r,s}_q\Big(\mathcal{K}\big( \mathcal{P} \cap B(0,S^{(r,s)}_q ) \big) \Big) \Big],\end{split}\end{align}

where $S^{(r,s)}_q = S^{(r,s)}_q(\mathcal{P})$ is as in Theorem 1. Then for the binomial sampling scheme the result is as follows.

Theorem 3. Let $\mathbb{X}_n$ be an n-binomial process with density function $\kappa$ on $[0,1]^d$ , which is bounded and measurable. Let $X \sim \kappa$ and let $(r_i,s_i)\in\Delta$ for $1\le i \le \ell$ and $\ell\in\mathbb{N}$ . Then

\[ \begin{pmatrix} &n^{-1/2}\left( \beta^{r_1,s_1}_q(\mathcal{K}(n^{1/d} \mathbb{X}_n) ) - \mathbb{E}\left [ \beta^{r_1,s_1}_q(\mathcal{K}(n^{1/d} \mathbb{X}_n) ) \right ] \right) \\[5pt] &\vdots\\[5pt] &n^{-1/2}\left( \beta^{r_\ell,s_\ell}_q(\mathcal{K}(n^{1/d} \mathbb{X}_n) ) - \mathbb{E}\left [ \beta^{r_\ell,s_\ell}_q(\mathcal{K}(n^{1/d} \mathbb{X}_n) ) \right ] \right) \end{pmatrix} \Rightarrow \widetilde\Psi,\]

where $\widetilde\Psi\sim \mathcal{N}(0,\widetilde\Sigma)$ has a multivariate normal distribution with mean zero and covariance matrix $\widetilde\Sigma\,\ge 0$ given by

\begin{align*} \widetilde\Sigma(i,j) &= \mathbb{E}\Big [ \gamma( \kappa(X)^{1/d} ( (r_i,s_i),(r_j,s_j)) ) \Big ] \\[5pt] &\qquad - \mathbb{E}\Big[ \alpha( \kappa(X)^{1/d} (r_i,s_i)) \Big] \, \mathbb{E}\Big[ \alpha( \kappa(X)^{1/d} (r_j,s_j)) \Big] \quad (1\le i,j\le \ell).\end{align*}

Furthermore,

\begin{align*} \lim_{n\to\infty} n^{-1} \operatorname{Cov} \Big( \beta^{r_i,s_i}_q(\mathcal{K}(n^{1/d} \mathbb{X}_n) ) , \beta^{r_j,s_j}_q(\mathcal{K}(n^{1/d} \mathbb{X}_n) ) \Big) = \widetilde\Sigma(i,j) \qquad (1\le i,j\le \ell).\end{align*}

We conclude this section with some discussion of the results and the techniques used in the proofs of these theorems, which are given in Subsections 6.2 and 6.3. The univariate central limit theorems for Betti numbers ( $r=s$ ) have already been formulated by Trinh [Reference Trinh37] (see also Proposition 2 in this manuscript) under the condition that the parameter r is chosen to be such that no percolation occurs. Here, as we can rely on the strong stabilization property, we can omit this restriction.

For the derivation of the multivariate results in the Poisson sampling scheme, we can rely on the abstract result of Trinh [Reference Trinh37, Theorem 3.3] and derive the covariance structure with the help of results from Penrose and Yukich [Reference Penrose and Yukich32]. Multivariate central limit theorems in the spatial context are also studied by Penrose [Reference Penrose30]. The results in the binomial setting are established using Trinh [Reference Trinh37, Theorem 3.9], which itself relies on a de-Poissonization argument.

Finally, we mention that it is currently unknown whether or not the limiting covariance matrices are strictly positive definite.

4. Related results

Below we discuss some literature closely related to our study. The techniques employed to obtain these results are tools from geometric probability, which studies geometric quantities deduced from simple point processes. A classical result of Steele [Reference Steele36] shows the convergence of the total length of the minimum spanning tree built from an independent and identically distributed (i.i.d.) sample of n points in the unit cube. There are several generalizations of this work; for notable contributions see McGivney and Yukich [Reference McGivney and Yukich22], Yukich [Reference Yukich43], Penrose and Yukich [Reference Penrose and Yukich33], and the monograph of Penrose [Reference Penrose29].

A different type of contribution, equally important, is that of Penrose and Yukich [Reference Penrose and Yukich32], which considers asymptotic normality of functionals built on Poisson and binomial processes. For completeness, we mention that the study of Gaussian limits (as in [Reference Penrose and Yukich32, Reference Penrose and Yukich33]) is not limited to the total mass functional. It can be extended to random point measures obtained from the points of a marked point process; see, e.g., [Reference Baryshnikov and Yukich3, 4, Reference Penrose31].

Goel et al. [Reference Goel, Trinh and Tsunoda15] prove a convergence result for the expectation of Betti numbers in the critical regime. Their result generalizes directly to persistent Betti numbers, giving us the following well-known result.

Proposition 1. Let $0<r\le s < \infty$ . Let $\mathcal{X}_n$ be either a Poisson process with intensity $n\kappa$ on $[0,1]^d$ or an n-binomial process with density $ \kappa$ . Then

\begin{align*} \lim_{n\rightarrow \infty} n^{-1} \mathbb{E}\Big [ \beta^{r,s}_q( \mathcal{K}( n^{1/d} \mathcal{X}_n) ) \Big] = \mathbb{E}\Big[ \hat{b}_q(r \kappa(X')^{1/d},s \kappa(X')^{1/d}) \Big],\end{align*}

where X has density $\kappa$ and where $\hat{b}_q(r,s)$ is the limit of $n^{-1} \mathbb{E}\big[ \beta^{r,s}_q(\mathcal{K}((n^{1/d} \mathbb{X}^*_n)) \big]$ for a homogeneous Poisson process $\mathbb{X}^*_n$ on $[0,1]^d$ with intensity n.

So far, normality results for (persistent) Betti numbers exist only in a pointwise sense and are rather direct consequences of Theorems 2.1 and 3.1 from [Reference Penrose and Yukich32]. We quote them here in a way that is more in line with our framework. For this we need the notion of the interval of co-existence $I_d(\mathcal{P})$ of a Poisson process $\mathcal{P}$ with unit intensity on $\mathbb{R}^d$ . This is determined by the critical radii for percolation of the occupied and the vacant component, respectively, which are defined as follows:

\[ r_c(\mathcal{P}) \;:\!= \inf\{ r\;:\; \mathbb{P}(\mathcal{C}_r(\mathcal{P}) \text{ percolates})>0 \}\]

and

\[ r^*_c(\mathcal{P}) \;:\!=\; \sup\{ r\;:\; \mathbb{P}( \mathbb{R}^d\setminus \mathcal{C}_r(\mathcal{P}) \text{ percolates})>0 \} .\]

Both probabilities inside the infimum and supremum are either 0 or 1 by Kolmogorov’s 0–1 law. The quantity $r_c(\mathcal{P})$ is called the critical radius for percolation of the occupied component, and $r_c^*(\mathcal{P})$ is called the critical radius for percolation of the vacant component. The interval of co-existence, on which unbounded components of both the (Boolean) model $\mathcal{C}_r(\mathcal{P})$ and its complement co-exist, is defined as follows:

\[ I_d(\mathcal{P}) \;:\!=\; \begin{cases} (r_c, r^*_c] & \text{ if } \mathbb{P}( \mathcal{C}_{r_c}(\mathcal{P}) \text{ percolates}) = 0, \\[5pt] [r_c,r^*_c] & \text{otherwise.} \end{cases}\]

We know that in two dimensions, $I_2(\mathcal{P}) = \emptyset$ from [Reference Meester and Roy23, Theorem 4.4 and 4.5]. Moreover, from [Reference Sarkar34, Theorem 1], we know that $I_d(\mathcal{P})\neq\emptyset$ for each $d\ge 3$ . Thus we have the following results.

Proposition 2. (Pointwise normality of (persistent) Betti numbers.)

  1. (i) (Hiraoka et al. [Reference Hiraoka, Shirai and Trinh16, Theorem 5.2].) Let $\mathcal{P}|_{[0,n^{1/d}]^d}$ be the restriction of $\mathcal{P}$ to $[0,n^{1/d}]^d$ , and let $0\le r \le s<\infty$ . Then there is a $\sigma^2(r,s)\in\mathbb{R}_+$ such that

    \[ n^{-1/2} \Big(\beta^{r,s}_q (\mathcal{K}(\mathcal{P}|_{[0,n^{1/d}]^d})) - \mathbb{E}\Big [ \beta^{r,s}_q (\mathcal{K}(\mathcal{P}|_{[0,n^{1/d}]^d})) \Big] \Big) \Rightarrow \Phi_1, \]
    where $\Phi_1$ has a normal distribution with mean zero and variance $\sigma^2(r,s)\, \ge 0$ .
  2. (ii) (Yogeshwaran et al. [Reference Yogeshwaran, Subag and Adler42, Theorem 4.7].) Let $\mathcal{K}$ be the Čech filtration, and let $0\le r < \infty$ be such that $r \notin I_d(\mathcal{P})$ . For each $n\in\mathbb{N}$ , let $\mathbb{X}_n$ be an n-binomial process with a uniform density on $[0,1]^d$ . Then there is a $0<\tau^2(r) \le \sigma^2(r,r)$ (with $\sigma^2$ from (i)) such that

    \[ n^{-1/2} \Big(\beta^{r}_q (\mathcal{K}( n^{1/d} \mathbb{X}_n)) - \mathbb{E}\Big[ \beta^{r}_q (\mathcal{K}(n^{1/d} \mathbb{X}_n)) \Big] \Big) \Rightarrow \Phi_2, \]
    where $\Phi_2$ has a normal distribution with mean zero and variance $\tau^2(r)\,> 0$ .
  3. (iii) (Trinh [Reference Trinh37, Theorem 4.1].) Let $\kappa$ be a bounded density function with compact support, and let $\mathcal{K}$ be the Čech filtration. Let $0\le q\le d-1$ . Let $r\in (0, (\sup \kappa)^{-1/d} \ r_c)$ . Then

    \begin{align*} n^{-1/2}\Big( \beta^{r}_q(\mathcal{K}(n^{1/d} \mathcal{P}(n\kappa)) ) - \mathbb{E}\Big[ \beta^{r}_q(\mathcal{K}(n^{1/d} \mathcal{P}(n\kappa) )) \Big ] \Big) \Rightarrow \Phi_3, \end{align*}
    where $\Phi_3$ has a normal distribution with mean zero and variance $\widetilde\sigma^2 > 0$ ; here $\widetilde\sigma^2 = \int \sigma^2(\kappa(x)^{1/d}(r,r)) \kappa(x) \textrm{d}x$ with $\sigma^2$ from (i). A similar statement is true for the binomial process.

First, we remark that the above statements in their original versions are also valid for more general domains $\tilde B_n\subseteq\mathbb{R}^d$ which are not necessarily rectangular. Furthermore, we remark that Hiraoka et al. [Reference Hiraoka, Shirai and Trinh16] prove their theorem for a general class of filtrations which contains, among others, the Čech and the Vietoris–Rips filtration. Moreover, Theorem 4.7 of Yogeshwaran et al. [Reference Yogeshwaran, Subag and Adler42] also contains a version of (ii) for Betti numbers of the homogeneous Poisson process, which, in the above list, is contained in the result (i). The result of Trinh [Reference Trinh37] is already for Betti numbers from a general density function $\kappa$ , but the parameter choice for r depends on $r_c$ . Moreover, Trinh [Reference Trinh37] points out that in the case $d=2$ there are no restrictions on the choice of r as $I_2(\mathcal{P})$ is empty; this can be shown with a duality property. Finally, regarding (iii), Yogeshwaran et al. [Reference Yogeshwaran, Subag and Adler42] remark that the condition $r\notin I_d(\mathcal{P})$ is likely to be superfluous; as already mentioned, we show that this condition can indeed be removed.

5. Background on strong stabilization

In our analysis of the multivariate asymptotic normality of persistent Betti numbers, stabilization properties are crucial. Kesten and Lee [Reference Kesten and Lee19] introduced stabilization to prove asymptotic normality for the weight of the Euclidean minimal spanning tree. The concept was extended by Penrose and Yukich [Reference Penrose and Yukich32, Reference Penrose and Yukich33] to treat general functionals defined on Poisson and binomial point processes; it goes as follows. Consider a functional H which is defined on finite subsets of $\mathbb{R}^d$ , and define its add-one cost function as

\[ \mathfrak{D}_0 H(\mathcal{H}) := H(\mathcal{H} \cup \{0\}) - H(\mathcal{H}) \]

for $\mathcal{H} \subseteq\mathbb{R}^d$ finite. The functional H is strongly stabilizing on the homogeneous Poisson process with intensity $\lambda\in (0,\infty)$ on $\mathbb{R}^d$ , denoted by $\mathcal{P}(\lambda)$ , if there exist a.s. finite random variables S and $\mathfrak{D}_{\infty}H$ such that for all finite $A\subseteq \mathbb{R}^d\setminus B(0,S)$ ,

\[ \mathfrak{D}_0 H( (\mathcal{P}(\lambda)\cap B(0,S) ) \cup A ) = \mathfrak{D}_{\infty} H \quad \text{a.s.} \]

Recall that, for $n\in\mathbb{N}$ , the sets $B_n = [-n^{1/d}/2, n^{1/d}/2]^d$ denote observation windows, and let $\mathcal{A}$ be the collection $\{B_n+x\;:\; x\in\mathbb{R}^d, n\in\mathbb{N} \}$ . The functional H is weakly stabilizing on $\mathcal{A}$ (for $\mathcal{P}(\lambda)$ ) if there is an a.s. finite random variable $\mathfrak{D}'_{\infty} H$ such that, for any such sequence $(A_n\;:\; n\in\mathbb{N})$ from the collection $\mathcal{A}$ with $\lim_{n\to \infty}A_n = \mathbb{R}^d$ ,

\[ \mathfrak{D}_0 H( \mathcal{P}(\lambda) \cap A_n ) \to \mathfrak{D}'_{\infty} H \quad \text{a.s.\ as $n \to \infty$.} \]

Recall that the set-theoretic limit $\lim_{n\to \infty}A_n = \mathbb{R}^d$ is equivalent to $\lim_{n \to \infty}\textbf{1}_{A_n}(x) =1 $ for all $x \in \mathbb{R}^d$ . The stabilization of a functional defined on subsets of a point process roughly means that a local change in the point process (e.g., the addition or subtraction of finitely many points) affects the value of the functional only locally. This latter phenomenon can be described in terms of various notions. We consider two radii of stabilization for the persistent Betti function $\beta^{r.s}_q$ . Their functionality is related to the classical weak and strong stabilization properties given above. Properties of these radii are addressed in detail below.

Consider a point process P on $\mathbb{R}^d$ without accumulation points, and let Q be finite with circumcenter $z_Q\in\mathbb{R}^d$ and circumradius $L_Q$ . Thus, $Q \subseteq B(z_Q,L_Q)$ for $L_Q\ge 0$ minimal. For short, write $\mathcal{K}_{r,a} = \mathcal{K}_r( P \cap B(z_Q,a) )$ and $\mathcal{K}'_{r,a}= \mathcal{K}_r( (P\cup Q) \cap B(z_Q,a) )$ for $a,r \ge 0$ . In the following, the reference case is that $Q\subset Q(0)$ , so that $z_Q\in Q(0)$ and $L_Q \le \sqrt{d}/2$ . (Recall that Q(0) is defined in Section 2.)

Radius of weak stabilization: Define the radius of weak stabilization of (r, s) by

(3) \begin{align} \rho^{(r,s)}_q (P,Q) &:= \inf\{ R>0\;:\; \dim Z_q(\mathcal{K}'_{r,a}) - \dim Z_q(\mathcal{K}_{r,a}) = \text{const. } \forall a\ge R \text{ and } \nonumber \\[5pt] &\quad \dim Z_q(\mathcal{K}'_{r,a})\cap B_q(\mathcal{K}'_{s,a}) - \dim Z_q(\mathcal{K}_{r,a})\cap B_q(\mathcal{K}_{s,a}) = \text{const. }\; \forall a\ge R\}, \nonumber \\[5pt] \rho^{(r,s)}(P,Q) &:= \max_{0\le q\le d-1} \rho_q^{(r,s)}(P,Q). \end{align}

One can use similar ideas as in the proof of Lemma 5.3 in Hiraoka et al. [Reference Hiraoka, Shirai and Trinh16] to show that if P a.s. has no accumulation points and if Q is finite, then $\rho^{(r,s)}(P,Q)$ is a.s. finite; we do this in Lemma 5 in the appendix. A similar result was also obtained by Hiraoka et al. [Reference Hiraoka, Shirai and Trinh16] for the add-one cost function of persistent Betti numbers. Our definition of the radius of weak stabilization implies that for all $0\le r \le s$ and for all $ q\in \{0,\ldots,d-1\},$

\[ \beta^{r,s}_q\Big( \mathcal{K}\big( (P\cup Q)\cap B(z_Q,R) \big) \Big) - \beta^{r,s}_q\Big( \mathcal{K}\big( P\cap B(z_Q,R) \big) \Big) = \text{const.}\]

as a function in R, for $R\ge \rho_q^{(r,s)}(P,Q).$

Radius of strong stabilization: Let $r>0$ be an arbitrary but fixed filtration parameter. Let $\mu(r)$ be an upper bound on the diameter of simplices in the filtration at time r. For the Vietoris–Rips filtration, $\mu(r)$ equals r. For the Čech filtration $\mu(r)=2r$ is a sharp bound. We choose $a\ge a^*(r) = L_Q + \mu(r)$ sufficiently large so that all simplices containing at least one point of Q have a filtration time of at most $a^*(r)$ ; recall that $L_Q$ denotes the circumradius of Q.

Given P and Q, let $\sigma^r_{q,i}$ , $i=1,\ldots,m_q$ , be the q-simplices in $\mathcal{K}'_{r,a}\setminus \mathcal{K}_{r,a}$ contained in the ball $B(z_Q,a)$ that are created up to filtration time r by the addition of the points in Q to the point process P. Without loss of generality, we may assume the simplices are already ordered according to their filtration times; simplices with the same filtration time are ordered at random.

The number R that limits the knowledge of a point process P to the ball B(z, R) is referred to as the information horizon (with respect to z); i.e., we observe only the process $P'\cap B(z,R) = P'|_{B(z,R)}$ and the corresponding simplicial complexes restricted to $P'|_{B(z,R)}$ , i.e., the complexes $\mathcal{K}_r( P'|_{B(z,R)} )$ , $r\ge 0$ .

Let $\partial_q$ denote the qth boundary map. For $i=1,\ldots,m_q$ , define the following quantities depending on the filtration parameter $r\ge 0:$

\[ C^{\,r}_{q,i}(a) = C_q ( \mathcal{K}_{r,a}) \oplus \langle \sigma^r_{q,1},\ldots,\sigma^r_{q,i} \rangle \quad \text{ and } \quad Z^r_{q,i}(a) = \operatorname{ker}(\partial_q\colon C^{\,r}_{q,i}(a) \to B^r_{q-1,i}(a) ),\]

where $B^r_{q-1,i}(a) = \partial_q(C^{\,r}_{q,i}(a) ) \subseteq C^{\,r}_{q-1,i}(a)$ is the image of $\partial_q$ . First, we define

(4) \begin{align} \widetilde\rho_q^{\,r}(P,Q) := \inf\Big\{& R \ge a^*(r) \ \Big| \ \text{for each } \sigma^r_{q,i}, i \in \{1,\ldots,m_q \}\;:\; \nonumber\\[5pt] & \text{ either } \Big[ \exists c\in Z^r_{q,i}(R) \;:\; \sigma^r_{q,i}\ \text{ is contained in } c \Big] \nonumber \\[5pt] & \text{ or } \Big[ \text{ conditional on $(P\cup Q) |_{B(z_Q,R)}$ } \nonumber \\[5pt] & \big[ \forall a\ge R, \forall c\in Z^r_{q,i}(a)\;:\; \sigma^r_{q,i} \text{ is not contained in \textit{c} } \big] \text{ is true } \Big] \Big \}. \end{align}

This definition means the following. Consider an information horizon $R > \widetilde\rho_q^{\,r}(P,Q)$ ; i.e., we observe all points from the point process $(P\cup Q)\cap B(z_Q,R)$ . Then, if we include the q-simplex $\sigma^r_{q,i}$ in the simplicial complex, we already have the information either that $\sigma^r_{q,i}$ creates a new q-cycle or that it does not, even when we are additionally including points from an infinite information horizon. In other words, given $(P\cup Q)\cap B(z_Q,R)$ , the event of which simplices $\sigma^r_{q,i}$ are ultimately positive is decidable (i.e. computable or recursive). By a simplex being ‘ultimately positive’ we mean that if the information horizon is large enough, then we see that the simplex is part of a cycle. Similarly, a simplex staying negative means that it never becomes part of a cycle, even if the information horizon is infinite.

Thus, the event of which simplices $\sigma^r_{q,i}$ are ultimately positive being decidable means that having observed $P \cup Q$ up to the information horizon R, i.e., $(P \cup Q)\cap B(z_Q,R)$ , where R is ‘large enough’ ( $ R > \widetilde\rho_q^{\,r}(P,Q)$ ), we know that each potential cycle in $C_q( \mathcal{K}_r( \mathcal{P}(\lambda)|_{B(z_Q,R)} ) ) \oplus \langle \sigma^r_{q,1},\ldots,\sigma^r_{q,i} \rangle$ containing $\sigma^r_{q,i}$ has already terminated, meaning that $\sigma^r_{q,i}$ is positive, or that $\sigma^r_{q,i}$ will stay negative.

In order to state whether the persistent Betti number for a given pair (r, s) and a given dimension q has stabilized, we need to know whether a new cycle in $Z_q (\mathcal{K}'_{r,\widetilde\rho^{\,r}_q})$ is also a q-dimensional feature, i.e., is not eventually a boundary in $B_q (\mathcal{K}'_{s,a})\cap Z_q (\mathcal{K}'_{r,a})$ for some $a>\widetilde\rho^{\,r}_q$ .

For this purpose, apart from the additional q-simplices $\sigma^r_{q,1},\ldots,\sigma^r_{q,m_q}$ , we write $\widetilde\sigma^s_{q+1,1},\ldots,\widetilde\sigma^s_{q+1,\widetilde m_{q+1}}$ for the additional $(q+1)$ -simplices in $\mathcal{K}'_{s,a}\setminus \mathcal{K}_{s,a}$ for $a \ge a^*(s)$ .

We can repeat the considerations from above for the $(q+1)$ -dimensional simplices and a filtration parameter equal to s. Then by definition, after time $\widetilde\rho^{\,s}_{q+1}(P,Q)$ , we know for each $\widetilde\sigma^s_{q+1,i}$ whether it is positive or it is negative for all $a>\widetilde\rho^{\,s}_{q+1}(P,Q)$ .

Consequently, the radius of strong stabilization for the pair $(r,s)\in\Delta$ is as follows:

(5) \begin{align} S_q^{(r,s)}(P,Q) = \max\{ \widetilde\rho^{\,r}_q(P,Q), \widetilde\rho^{\,s}_{q+1}(P,Q) \}.\end{align}

At this stage there is a major difference between the Čech complex and the Vietoris–Rips complex if $q=d-1$ . For the Čech filtration, there are no q-dimensional cycles in d-dimensional Euclidean space for $q\ge d$ . Thus, for the Čech filtration, $S^{(r,s)}_{d-1}(P,Q) = \max\{\widetilde\rho^{\,r}_{d-1}(P,Q), a^*(s)\}$ . For the Vietoris–Rips filtration, however, there can be q-dimensional cycles for every possible dimension q; see Bobrowski and Kahle [Reference Bobrowski and Kahle5]. In particular, $S^{(r,s)}_{d-1}(P,Q) > \max\{\widetilde\rho^{\,r}_{d-1}(P,Q), a^*(s)\}$ can occur.

In the following, if $Q=\{0\}$ , we simply write $\rho^{(r,s)}(P)$ , $\widetilde\rho^{\,r}_q(P)$ or $S^{(r,s)}_q(P)$ for convenience. Next, we show in Theorem 4 that the radius $\widetilde\rho_q^{\,r}(P,Q)$ is a.s. finite for each $q\ge 0$ and $r\in\mathbb{R}_+$ if P equals a homogeneous Poisson process modulo a finite set of points and $Q\subseteq\mathbb{R}^d$ is finite. In particular, this implies the strong stabilization property of the persistent Betti number $\beta^{r,s}_q$ , in the sense that $S^{(r,s)}_q$ from (5) is finite, which in turn leads to Theorem 1.

Theorem 4. For a Poisson process with constant intensity $\lambda\in\mathbb{R}_+$ and two finite (disjoint) sets $Q_1,Q_2\subseteq Q(0)$ , the radius $ \widetilde\rho_q^{\,r}(\mathcal{P}(\lambda)\cup Q_1,Q_2)$ is a.s. finite for each q and each $r>0$ . In particular, the radius of strong stabilization $S_q^{(r,s)}$ is finite for each q and each $(r,s) \in \Delta$ .

We apply arguments from continuum percolation theory to prove this theorem. These arguments are also used to prove uniqueness of the occupied and vacant component in the Boolean model; see, e.g., Aizenman et al. [Reference Aizenman, Kesten and Newman1, Reference Aizenman, Kesten and Newman2] and Burton and Keane [Reference Burton and Keane6], as well as the monograph of Meester and Roy [Reference Meester and Roy23].

Furthermore, we have the following relation between the two radii.

Lemma 1. We have $\rho^{(r,s)}_q(P,Q) \le S_q^{(r,s)}(P,Q)$ for each pair $(r,s)\in \Delta$ , $q\in\{0,\ldots,d-1\}$ .

Thus, weak stabilization measured in terms of $\rho^{(r,s)}_q$ is always implied by strong stabilization measured in terms of $S_q^{(r,s)}$ . The proof of this lemma is given in Subsection 6.1.

Moreover, our results are not limited to this static case, in which we consider only one Poisson process and the persistent Betti number for one pair (r, s): we show in Theorem 5 that Borel probability measures induced by the radius of strong (and of weak) stabilization are tight over a variety of parameter ranges.

The theorem is divided into three parts. Part (1) concerns uniform stabilization over a variety of homogeneous Poisson processes. These stabilization properties then enable us to derive the results in Parts (2) and (3), where we consider stabilization for the binomial and the Poisson sampling schemes, respectively. The proof of Theorem 5 is deferred to Appendix 7.

Theorem 5. (Uniform stabilization) For $m \in \mathbb N$ , let $\mathfrak{Q}_m = \{ \{y_1,\ldots, y_k\}\;:\; y_i\in Q(0),$ $i=1,\ldots,k, k\le m\}$ be the class of sets with at most m points in Q(0). Let $+\infty> \overline{r}\ge \underline{r}>0$ and $m\in\mathbb{N}$ be arbitrary but fixed. Then we have the following:

  1. (1) Stabilization for the homogeneous Poisson case: The laws of

    \[\{ \widetilde\rho_q^{\,r} (\mathcal{P}(\lambda)\cup Q_1,Q_2))\;:\; \underline{r}\le r \le \overline{r},\lambda\in \mathbb{R}_+, Q_1,Q_2\in \mathfrak{Q}_{m}, q=0,\ldots,d-1 \}\]
    are tight for each $m\in\mathbb{N}$ .
  2. (2) Stabilization in the Poisson sampling scheme: Let $\nu$ be a probability density on $[0,1]^d$ . For $n \in \mathbb{N}$ and $L>0$ , set $B''_{\!\!\!n,L} = \{z\in\mathbb{R}^d\;:\; B(z,L) \subseteq [0,n^{1/d}]^d \}$ . Consider a specific continuous density $\kappa$ on $[0,1]^d$ .

    Let $\varepsilon>0$ . Then there are $b>0$ , $n_0\in\mathbb{N}$ , and $L\in\mathbb{R}_+$ such that, uniformly in $q=0,\ldots,d-1$ and $r\in[\underline{r},\overline{r}],$

    \[ \sup_{n\ge n_0} \quad \sup_{z \in B^{\prime\prime}_{n,L}} \quad \sup_{Q_1,Q_2 \in z + \mathfrak{Q}_{m} } \quad \mathbb{P}( \widetilde\rho_q^{\,r} (n^{1/d} \mathcal{P}(n \nu) \cup Q_1, Q_2 ) \ge L ) \le \varepsilon\]
    for all densities $\nu$ on $[0,1]^d$ satisfying $ \| \nu - \kappa\|_\infty \le b$ .

    For each $n\in\mathbb{N}$ , let $\mathcal{V}_n, \mathcal{W}_n$ be Poisson processes on $[0,n^{1/d}]^d$ that are independent of $n^{1/d}\mathcal{P}(n\nu)$ , and whose intensity functions on $\mathbb{R}^d$ are uniformly bounded in n. Then for each $\varepsilon>0$ , there are $b>0$ , $n_0\in\mathbb{N}$ , and $L>0$ such that, uniformly in $q=0,\ldots,d-1$ and $r\in[\underline{r},\overline{r}],$

    \[ \sup_{n\ge n_0} \quad \sup_{z \in B^{\prime\prime}_{n,L}} \quad \mathbb{P}( \widetilde\rho_q^{\,r}(n^{1/d} \mathcal{P}(n \nu) \cup ( \mathcal{V}_n \cap Q(z) ), \mathcal{W}_n \cap Q(z) ) \ge L ) \le \varepsilon\]
    for all densities $\nu$ on $[0,1]^d$ satisfying $\| \nu - \kappa \|_{\infty} \le b$ .
  3. (3) Stabilization in the binomial sampling scheme: Let $\mathbb{X}_n$ be an n-binomial process on $[0,1]^d$ obtained from an i.i.d. sequence $(X_k\;:\;k\in\mathbb{N})$ with common density $\kappa$ . Let X be a random variable independent of $(X_k\;:\;k\in\mathbb{N})$ , and with continuous density $\kappa$ on $[0,1]^d$ . Write $Q_{m,n}$ for the point process $n^{1/d}(\mathbb{X}_m - X')$ for $m\in J_n=[n-h(n),n+h(n)]$ , where the function h satisfies $h(n)\rightarrow\infty$ and $h(n)/n\rightarrow 0$ as $n\rightarrow\infty$ . Then the family $\{ \widetilde\rho_q^{\,r}(Q_{m,n},\{0\})\;:\; n\in\mathbb{N},m\in J_n, \underline{r}\le r\le \overline{r}, q=0,\ldots,d-1\}$ is tight.

Furthermore, with $\widetilde\Delta = \{(r,s)\in\Delta\;:\; w_1\le r\le s\le w_2\}$ , where $w_2\ge w_1 >0$ are arbitrary, all these results remain valid if $\widetilde\rho_q^{\,r}$ , $\underline r \le r\le \overline{r}$ , is replaced by $\rho^{(r,s)}_q, (r,s) \in \widetilde\Delta$ .

6. Technical results

This section consists of three parts. In the first part we derive the stabilization results. In the second part we prove the asymptotic normality of the finite-dimensional distributions in the case of underlying Poisson processes. In the third part we prove the same limit result in the case of an underlying sequence of binomial processes.

The next result is crucial for the upcoming proofs. It is a direct consequence of Lemma 2.11 in Hiraoka et al. [Reference Hiraoka, Shirai and Trinh16]. This so-called geometric lemma enables us to obtain upper bounds on moments.

Lemma 2. (Corollary of [Reference Hiraoka, Shirai and Trinh16, Lemma 2.11]) Let $\mathbb{X} \subseteq \mathbb{Y}$ be two finite point sets of $\mathbb{R}^d$ . Then

\[ \left|\beta^{r,s}_q (\mathcal{K}(\mathbb{Y})) - \beta^{r,s}_q (\mathcal{K}(\mathbb{X})) \right| \le \sum_{j=q}^{q+1} |\mathcal{K}_j(\mathbb{Y},s) \setminus \mathcal{K}_j(\mathbb{X},s)|.\]

6.1. Stabilization

We start with the proof of the fundamental Theorem 4, the last step of which uses Proposition 3. While this proposition is stated immediately after the proof of Theorem 4, it is of course helpful to first read the proposition before passing to the last step of the proof.

Subsequently, we derive Theorem 1 and Lemma 1. The proof of Theorem 5 is deferred to Appendix A.

Proof of theorem 4. If $q=0$ , $\widetilde\rho_0^{\,r}$ is clearly finite. So we can assume that $q>0$ , and we have to study chains that prevent $\widetilde\rho_q^{\,r}(\mathcal{P}(\lambda)\cup Q_1,Q_2)$ from being finite. We can consider $\mathcal{P} = \mathcal{P}(1)$ because we consider a general positive r. Moreover, we can assume that the circumcenter $z_{Q_2}$ of $Q_2$ coincides with the origin. We write $a^*(r) = L_{Q_2} + \mu(r)$ with $L_{Q_2}$ being the circumradius of $Q_2$ . (Here, $\mu(r) = r$ for the Vietoris–Rips filtration and $\mu(r)=2r$ for the Čech filtration.) Clearly, as $Q_1,Q_2$ are finite, $\widetilde\rho_q^{\,r}$ is finite if the random geometric graph $G(\mathcal{P},\mu(r))$ does not percolate. If $G(\mathcal{P},\mu(r))$ percolates, $\widetilde\rho_q^{\,r}$ is infinite if and only if for each finite information horizon $R\in\mathbb{R}_+$ , $R \ge a^*(r)+\mu(r)$ , there is a simplex $\sigma_{q,i}\in \mathcal{K}_r( (\mathcal{P}\cup Q_1 \cup Q_2)|_{B(0,R)})$ which intersects the additional points $Q_2$ and which is negative until R, but we cannot exclude the possibility that it might ultimately become positive. Formally, this means there is a chain

(6) \begin{align} \tau = \sum_{i} \sigma_i,\end{align}

where $\sigma_i \in \bigcup_{n\in\mathbb{N}} \mathcal{K}_r( (\mathcal{P}\cup Q_1)|_{B(0,n)})$ are q-simplices, such that the boundary of the restriction of $\tau$ to $\mathcal{K}_r( (\mathcal{P}\cup Q_1)|_{B(0,R)})$ consists of two disjoint $(q-1)$ -cycles which are not boundaries. More precisely, set

(7) \begin{align} \tau_R \;:\!=\; \tau|_{B(0,R)} \;:\!=\; \sum_{i} \sigma_i\mathbb{1}\! \left\{\sigma_i \in \mathcal{K}_r((\mathcal{P}\cup Q_1)|_{B(0,R)}) \right\}. \end{align}

Then, for each $R\ge a^*(r)+\mu(r)$ , we have $\partial \tau_R = e_1 + e_{2,R}$ , where

(8) \begin{align} \begin{split} & e_1,e_{2,R} \in Z_{q-1}( \mathcal{K}_r( (\mathcal{P}\cup Q_1)|_{B(0,R)})) \setminus B_{q-1}( \mathcal{K}_r( (\mathcal{P}\cup Q_1)|_{B(0,R)})) \\[5pt] &\qquad\text{such that } e_{2,R} \subseteq B(0,R)\setminus B(0,R-2\mu(r)) \\[5pt] &\qquad\text{ and $e_1 \subseteq B(0,R_0)$ for a certain $R_0\in \mathbb{R}_+$}; \end{split}\end{align}

here the set inclusions for $e_{2,R}$ (resp. $e_1$ ) are to be understood as the usual inclusions between subsets of Euclidean space. The cycle $e_1$ becomes a boundary of $\tau|_{B(0,R)}$ when we include the additional points of $Q_2$ , i.e., $e_1 = \sum_j \nu_j$ , where the $\nu_j$ are $(q-1)$ -simplices in $\mathcal{K}_r((\mathcal{P}\cup Q_1\cup Q_2)|_{B(0,a^*(r)+\mu(r))})$ and $e_1 \in B_{q-1}( \mathcal{K}_r((\mathcal{P}\cup Q_1\cup Q_2)|_{B(0,a^*(r)+\mu(r))}) )$ . Consequently, $e_{2,R}$ becomes a boundary in this case as well, i.e., $e_{2,R} \in B_{q-1}(\mathcal{K}_r((\mathcal{P}\cup Q_1\cup Q_2)|_{B(0,R)}))$ . See also Figure 2 for an illustration in the special case of such a one-dimensional chain.

Figure 2. Illustration of a chain $\tau$ consisting of one-dimensional simplices (red, green) from Poisson points (black, blue, and green dots) and an additional point (black diamond) which is located inside Q(0). The 1-simplices between Poisson points are red; the 1-simplices between a Poisson point and the additional point are green. The layers depict two spheres of B(z, R) and $B(z,R-2\mu(r))$ ; $e_1$ corresponds to the two blue dots (to which the two green 1-simplices are attached), $e_{2,R}$ to the green dots shown between the two layers.

The existence of a $\tau$ as above is equivalent to $\widetilde\rho_q^{\,r}$ being infinite. We show in the remainder of the proof that for a homogeneous Poisson process modulo a finite point process such chains cannot occur. More precisely, we show that the cycle $e_{2,R}$ cannot exist for all $R\ge a^*(r)+\mu(r)$ . For this purpose, we can assume that $Q_1=\emptyset$ , because the question of whether such cycles $e_{2,R}$ exist for all $R\ge a^*(r)+\mu(r)$ is an asymptotic property of the Poisson process (this also follows from details given below). Moreover, it suffices to study the case where $Q_2$ consists of a single point, and by translation-invariance we can assume that this point is the origin, viz., $Q_2=\{0\}$ .

In the remainder of the proof, we study chains which generalize the chain $\tau$ given in (6) and show using a Burton–Keane argument that the existence of such chains contradicts the properties of the stationary Poisson process. To this end, we proceed in four steps. We introduce general maximal chains in the first step. In the second step, we show that the number of these maximal chains is a.s. constant. In the third and fourth steps, we prove that the number of these maximal chains is a.s. zero.

Step 1—maximal chains. We begin with q-chains $\tau$ of the form $\tau=\sum_{i}\sigma_i$ , where the simplices $\sigma_i$ are in $\bigcup_{n\in\mathbb{N}} \mathcal{K}_r( \mathcal{P}|_{B(0,n)})$ for all i, and there are $y\in\mathbb{R}^d$ (‘a shift of the origin’) and $R_0\in\mathbb{R}_+$ such that the following property (P) holds:

  1. (P) For all $R\ge R_0$ and with

    \begin{align*} \tau_{y,R} \;:\!=\; \sum_{i} \sigma_i \mathbb{1}\! \left\{ \sigma_i\in \mathcal{K}_r( \mathcal{P}|_{B(y,R)} )\right\},\end{align*}
    we have that $\partial \tau_{y,R} = e_1 + e_{2,R}$ , where
    (9) \begin{align} e_1 \in & Z_{q-1}( \mathcal{K}_r( \mathcal{P}|_{B(y,R)})) \setminus B_{q-1}( \mathcal{K}_r( \mathcal{P}|_{B(y,R)})) \nonumber\\[5pt] &\text{ but } e_{1} \in B_{q-1}( \mathcal{K}_r( (\mathcal{P}\cup \{y\} )|_{B(y,a^*(r)+\mu(r))}))\end{align}
    and
    (10) \begin{align}e_{2,R} &\subseteq B(y,R)\setminus B(y,R-2\mu(r)) \nonumber\\[5pt] &\text{ and } e_{2,R} \in Z_{q-1}( \mathcal{K}_r( \mathcal{P}|_{B(y,R)})) \setminus B_{q-1}( \mathcal{K}_r( \mathcal{P}|_{B(y,R)}). \end{align}

So, loosely speaking, $\tau_{y,R}$ is a shifted analogue of $\tau_R$ from (7). We call a chain $\tau$ satisfying (P) a chain of type (P).

Now, we might have two chains $\tau$ and $\tau'$ of type (P) that have the same local cycle, i.e., their cycles $e_1, e'_1$ (as characterized by (9)) are homologous (meaning they differ by a boundary only). In this case we say that $\tau$ and $\tau'$ are equivalent and write $\tau\sim\tau'$ ; we can then consider the chain $\tau^*$ which consists of the union of the simplices of $\tau$ and $\tau'$ .

This leads to the following notion of maximality: consider an arbitrary but fixed chain $\tau$ of type (P) with local cycle $e_1$ . Then take the union over all chains $\tau'$ of type (P) equivalent to $\tau$ and call the resulting chain the maximal chain $\tau_{\max}$ . Formally, given $\tau$ with corresponding $e_1,$ the maximal chain is

\begin{align*} \tau_{max} = \sum_{\sigma \in I} \sigma, \quad \text{ where } \quad I = \bigcup_{\substack{\tau': \tau' \sim \tau} } \, \bigcup_{\sigma\in\tau'} \{\sigma\}.\end{align*}

Plainly, if $\tau$ and $\tau'$ are equivalent, then $\tau_{\max} = \tau'_{\max}$ .

Step 2—ergodicity and invariance of maximal chains. We show that the number of maximal chains, which we denote by $M(\mathcal{P})$ , is a.s. constant. More precisely, we show that

\begin{align*} M(\mathcal{P}) = m \ \qquad\text{ a.s., for some } m \in \mathbb{N} \cup \{0, +\infty\}.\end{align*}

To this end, consider the standard decomposition of the stationary Poisson process $\mathcal{P}$ into a countable sum of independent Poisson processes restricted to the cubes $Q_z$ , viz.,

(11) \begin{align} \mathcal{P} = \sum_{z\in\mathbb{Z}^d} \mathcal{P}|_{Q_z} = \sum_{n\in\mathbb{Z}} \bigg\{ \sum_{z^{\prime}\in\mathbb{Z}^{d-1}} \mathcal{P}|_{Q_{(n,z^{\prime})}} \bigg\}.\end{align}

Next, for $m\in\mathbb{N}\cup\{0,+\infty\},$ define the events

\begin{align*} A_{m} := \{ \omega\in\Omega \ | \ M( \mathcal{P}(\omega)) = m \}.\end{align*}

Consider the shift operator T acting on the first coordinate only. Applying T to a set $P\subseteq\mathbb{R}^d$ gives $T(P) = \{ y+ (1,0,\ldots,0)^t | y\in P\}$ . Define the ‘kth translation of $A_{m}$ ’ by

\begin{align*} T^k(A_{m} ) := \{ \omega\in\Omega \ | \ M( T^k(\mathcal{P}(\omega))) = m \}, \quad k\in\mathbb{Z}.\end{align*}

Then $M(\mathcal{P}) = M( T(\mathcal{P}))$ , and consequently $T(A_{m}) = A_{m}$ for each m, i.e., $A_m$ is an invariant event. A standard argument relying on the decomposition (11) into i.i.d. random variables now yields that $\mathbb{P}(A_{m}) \in \{0,1\}$ for each m. We refer to the book of Klenke [Reference Klenke20, Example 20.26] for a formal proof.

Step 3—insertion-tolerance, $M(\mathcal{P}) \notin\mathbb{N}$ a.s. Assume that $M(\mathcal{P}) = m$ with probability 1 for some $m\in\mathbb{N}$ . Define the generic ‘annulus’ (with respect to the maximum norm) $A_{t,s} = [-t,t ]^d \setminus (-s,s)^d$ for $s<t$ .

We rely on the following decomposition of $\mathcal{P}$ : we fix $n\in\mathbb{N}$ large enough—see the paragraph below (12) for details. Denote the restriction of $\mathcal{P}$ to $Q(0,n)=[-n,n]^d$ by $\mathcal{P}^\circ_n$ and its restriction to $\mathbb{R}^d\setminus [-n,n]^d$ by $\mathcal{P}^\dagger_n$ . Then $\mathcal{P}^\circ_n$ and $\mathcal{P}^\dagger_n$ are independent. Assume $\mathcal{P}^\circ_n$ is defined on the generic probability space $(\Omega^\circ,\mathcal{A}^\circ,\mathbb{P}^\circ)$ and $\mathcal{P}^\dagger_n$ is defined on $(\Omega^\dagger,\mathcal{A}^\dagger,\mathbb{P}^\dagger)$ . Next, for $n\in\mathbb{N}$ and $\varepsilon \in (0,\mu(r))$ , consider the event

(12) \begin{align} D_{n,\varepsilon} & = \Big\{\text{for each $\tau_{max}$: $\partial (\tau_{max}|_{A_{n+\mu(r),n} })$ contains a cycle \textit{c} inside $A_{n+\mu(r)-\varepsilon,n}$ with } \nonumber\\[5pt] &\qquad \text{$c\in Z_{q-1}( \mathcal{K}_r( \mathcal{P}|_{Q(0,n+\mu(r))})) \setminus B_{q-1}( \mathcal{K}_r( \mathcal{P}|_{Q(0,n+\mu(r))})$}\Big\}. \end{align}

(Here $\tau_{max}|_{A_{n+\mu(r),n} }$ is the restriction of $\tau_{max}$ to $A_{n+\mu(r),n}$ .) We choose $n\in\mathbb{N}$ sufficiently large so that $D_{n,0}$ occurs with positive probability $\eta>0$ . Next, define the event

\begin{align*} E_{n,\varepsilon} = \{ \omega^\dagger \in \Omega^\dagger \ | \ \exists\, \widetilde\omega^\circ \in \Omega^\circ \;:\; ( \widetilde\omega^\circ,\omega^\dagger) \in D_{n,\varepsilon} \}. \end{align*}

Note that $D_{n,\varepsilon}$ , and thus also $E_{n,\varepsilon}$ , is decreasing in $\varepsilon$ , and clearly $\mathbb{P}^\dagger(E_{n,0})>0$ as well. Moreover, $E_{n,0} \setminus E_{n,\varepsilon}\downarrow N$ for a $\mathbb{P}^\dagger$ -null set N as $\varepsilon\downarrow 0$ ; hence, we can fix some $\varepsilon^*>0$ sufficiently small so that $\mathbb{P}^\dagger(E_{n,\varepsilon^*})>0$ .

For the technical details underlying the remaining arguments in Step 3, which follow, we refer to the proof of Theorem 5(1). First, partition $A_{n,n-\delta}$ with subcubes $(C_i)_{i\in I}$ of edge length $\delta$ for $\delta \le \mu(r)(1-1/\sqrt{2})/\sqrt{d}$ . We can assume that $n/\delta\in\mathbb{N}$ . Let $G_n$ denote the event that each $C_i$ contains at least d points of $\mathcal{P}^\circ_n$ . Then $\mathbb{P}^\circ(G_n)>0$ , and by construction, $E_{n,\varepsilon^*}$ and $G_n$ are independent. Consequently, $\mathbb{P}^\circ\otimes\mathbb{P}^\dagger(E_{n,\varepsilon^*} \cap G_n ) = \mathbb{P}^\circ(E_{n,\varepsilon^*})\mathbb{P}^\dagger(G_n) > 0$ . However, if both $E_{n,\varepsilon^*}$ and $G_n$ occur, there are no maximal chains because each potential feature associated to some $\tau_{max}$ already terminates in $A_{n,n-\delta}.$ This is because the points of the Poisson process are sufficiently dense inside $A_{n,n-\delta}.$ Hence we arrive at a contradiction, and thus $M(\mathcal{P}) \notin\mathbb{N}$ a.s.

Step 4—the Burton–Keane argument, $M(\mathcal{P})\neq +\infty$ a.s. We begin with general considerations for the volume-boundary argument applied to cycles. The d-dimensional Lebesgue measure of $A_{s+\Delta,s}$ is $ 2^d \big( (s+\Delta )^d - s ^d \big) = 2^d \Delta s^{d-1} + o\big( s^{d-1} \big)$ if $s\to\infty$ and if $\Delta$ is bounded above by a constant.

Moreover, from the definition of the Čech and Vietoris–Rips filtrations over a point cloud, we have the following: for each $(q-1)$ -cycle $\widetilde e$ that is contained in the complex corresponding to the filtration parameter r and that is not a boundary (i.e., a non-trivial cycle), there exists a convex set $\mathcal{J}_r$ intersecting the convex hull of $\widetilde e$ such that on the one hand $\mathcal{J}_r \cap \mathcal{P} = \emptyset$ and on the other hand the d-dimensional volume of $\mathcal{J}_r$ is positive and bounded away from zero, i.e., $V(\mathcal{J}_r)\ge \delta_r$ for some $\delta_r>0$ depending on the filtration type but not on the cycle $\widetilde e$ . Hence, the total number of such cycles that are not boundaries but are located in $A_{s+\Delta,s}$ is of order

(13) \begin{align} \frac{2^d \Delta s^{d-1} + o\big( s^{d-1} \big)}{V(\mathcal{J}_r)} = O(s^{d-1})\end{align}

for $s\to\infty$ and $\Delta$ bounded above.

The remainder of this step follows ideas very similar to those in the proof of Theorem 4.6 in Meester and Roy [Reference Meester and Roy23], where it is shown that the number of vacant components in the standard Boolean model is a.s. not equal to $\infty$ . To facilitate the comparison, we adopt similar notation. Also, the remainder of this step crucially relies on Proposition 3 below, which considers the existence of what we call encounter chains (see Proposition 3 for their definition).

It follows from Proposition 3 that (under the assumption of infinitely many maximal chains) the event $E_m$ (detailed in the proposition) has positive probability of at least $\eta>0$ , say, for all $m\in\mathbb{N}$ sufficiently large. We assume that $\mathbb{P}(E_m) \ge \eta$ for all m sufficiently large and show that this leads to a contradiction. We can translate $E_m$ by a vector 2mz for $z\in\mathbb{Z}^d$ and call this event $E_m^{2mz}$ . Then for each $L\in\mathbb{N}$

(14) \begin{align}\begin{split} &\mathbb{E}\left [ \sum_{z\in\mathbb{Z}^d} \mathbb{1}\! \left\{E^{2mz}_m \text{ occurs and } Q(2mz,m)\subseteq Q(0,Lm) \right\} \right ] \\[5pt] &\ge \eta \cdot \#\{ z\in\mathbb{Z}^d \;:\; Q(2mz,m)\subseteq Q(0,Lm) \} \ge \eta (L-1)^d.\end{split}\end{align}

Let $\mathfrak{R}$ be the set of the following encounter configurations in Q(0,Lm): an encounter configuration r lies in $\mathfrak{R}$ if and only if $ Q(2mz,m)\subseteq Q(0,Lm)$ such that $r = \tau^* |_{Q(2mz,m)}$ is an encounter configuration for some encounter chain $\tau^*$ . Then $\mathbb{E}\left [ \#\mathfrak{R} \right ]\ge \eta (L-1)^d$ .

We consider the branch $b=b_{r}^{(i)}$ for each $r\in \mathfrak{R}$ and $i\in\{1,2,3\}$ . The boundary $\partial b$ intersected with $A_{Lm+\mu(r),Lm}$ consists of $(q-1)$ -cycles $e_i$ that are not boundaries themselves. We can assume that each $e_i$ is minimal in the sense that we cannot decompose the chain $e_i$ into two or more disjoint $(q-1)$ -cycles. We define $\mathfrak{V}_b$ as the set that contains all these minimal $(q-1)$ -cycles. Furthermore, set $\mathfrak{V} = \bigcup_{r\in \mathfrak{R}} \bigcup_{i=1}^3 \mathfrak{V}_{b_r^{(i)}}$ ; this is the union of all minimal $(q-1)$ -cycles that are not boundaries and that are contained in $A_{Lm+\mu(r),Lm}$ . Hence, for a suitable $c\in\mathbb{R}_+$ , we have $c L^{d-1} \ge \# \mathfrak{V}$ by (13). Furthermore, for $r\in \mathfrak{R}$ and $i\in\{1,2,3\},$ define

\begin{align*} \mathfrak{C}_r^{(i)} = \{r'\in \mathfrak{R}\;:\; r' \subseteq b_r^{(i)} \} \cup \mathfrak{V}_{b_r^{(i)} }\end{align*}

as a set of chains. Then clearly $\# \mathfrak{C}_r^{(i)} \ge \# \mathfrak{V}_{b_r^{(i)}} \ge 1 \;=\!:\; \mathfrak{K}$ , and the following relation holds for each $r,r'\in \mathfrak{R}$ : either $\big[{r} \cup \bigcup_i \mathfrak{C}_r^{(i)} \big] \cap \big[{r'} \cup \bigcup_j \mathfrak{C}_{r'}^{(j)} \big]=\emptyset$ , or there are i, j which satisfy

\begin{align*} \mathfrak{C}_r^{(i)}\supseteq \{r'\} \cup \bigcup_{\ell\neq j} \mathfrak{C}_{r'}^{(\ell)} \text{ and } \mathfrak{C}_{r'}^{(j)}\supseteq \{r\} \cup \bigcup_{\ell\neq i} \mathfrak{C}_{r}^{(\ell)}.\end{align*}

Define $\mathfrak{S} \;:\!=\; \mathfrak{R} \cup \bigcup_{r\in R} \bigcup_{i=1}^3 \mathfrak{C}_r^{(i)}$ . Then the assumptions of Lemma 3.2 in Meester and Roy [Reference Meester and Roy23] are satisfied. Consequently, $\# \mathfrak{S} \ge \mathfrak{K} ( \#\mathfrak{R}+2)+\#\mathfrak{R}$ . Since $\# \mathfrak{S} = \# \mathfrak{R} + \# \mathfrak{V}$ , it follows that $\# \mathfrak{V} \ge \# \mathfrak{R} +2$ . We therefore arrive at the following inequalities:

\begin{align*} c L^{d-1} \ge \mathbb{E}\left [ \# \mathfrak{V} \right ] &\ge \mathbb{E}\left [ \# \mathfrak{V} \mathbb{1}\! \left\{\mathfrak{R} \neq \emptyset\right\} \right ]\ge \mathbb{E}\left [ (\# \mathfrak{R} + 2) \mathbb{1}\! \left\{\mathfrak{R} \neq \emptyset\right\} \right ] \\[5pt] &= \mathbb{E}\left [ \# \mathfrak{R} + 2 \mathbb{1}\! \left\{\mathfrak{R} \neq \emptyset\right\} \right ] \ge (L-1)^d \eta\end{align*}

for all $L\in\mathbb{N}$ . This leads to a contradiction if L is sufficiently large. Therefore, $\eta=0$ and $M(\mathcal{P})$ equals $\infty$ with probability 0. This completes the fourth step.

All in all, this contradicts the initial assumption that with positive probability the chain $\tau$ from (6) exists and satisfies (P). Consequently, $\widetilde \rho^{\,r}_q$ is a.s. finite.

To state and prove the upcoming proposition, we rely once more on a suitable decomposition of the homogeneous Poisson process $\mathcal{P}$ with unit intensity. Fix $n\in\mathbb{N}$ . We choose a Poisson process $\mathcal{P}^\dagger_n$ on $\mathbb{R}^d\setminus [\!-\!n,n]^d$ which is defined on the probability space $(\Omega^\dagger,\mathcal{A}^\dagger,\mathbb{P}^\dagger)$ . We also choose a complementary Poisson process $\mathcal{P}^\circ_n$ on $Q(0,n)=[\!-\!n,n]^d$ which is defined on $(\Omega^\circ,\mathcal{A}^\circ,\mathbb{P}^\circ)$ . Obviously, $\mathcal{P}^\dagger_n$ and $\mathcal{P}^\circ_n$ are independent.

Proposition 3. (Encounters of maximal chains) For $m \in \mathbb{N}$ , let $E_m$ denote the event given in the next paragraph. If the numbers of maximal chains is infinite (as is assumed in Step 4 of the proof of Theorem 4), we have that $\liminf_{m\to\infty} \mathbb{P}(E_m)>0$ .

Define $n = m - 2 \lceil{\mu(r)}\rceil$ and decompose $\mathcal{P}$ into $\mathcal{P}^\circ_n$ and $\mathcal{P}^\dagger_n$ as above. Set

$E_m \;:\!=\; \Big\{ (\omega^\circ,\omega^\dagger)\in \Omega^\circ\times \Omega^\dagger \mid \mathcal{P}^\dagger_n(\omega^\dagger) $ admits disjoint chains $b^{(1)}, b^{(2)},b^{(3)}$ which fulfil the following conditions:

  1. (a) there is an $\widetilde\omega^\circ\in\Omega^\circ$ such that each $b^{(i)}$ can be completed to a maximal chain with elements of $\mathcal{P}^\circ_n(\widetilde\omega^\circ)$ and $\mathcal{P}^\dagger_n(\omega^\dagger)$ ;

  2. (b) each $\partial(b^{(i)}|_{A_{m,n}})$ decomposes into a disjoint union of $e^{(i)}_1$ and $e^{(i)}_2$ that are $(q-1)$ -cycles but not boundaries in the complex generated by $\mathcal{P}^\dagger_n(\omega^\dagger)$ , and $e_1^{(i)} \subseteq A_{n+\mu(r)-\varepsilon,n}$ ;

  3. (c) there exist a chain r in the complex generated by the elements of $\mathcal{P}^\circ_n(\omega^\circ)$ and a chain c in $A_{n+\mu(r),n-\mu(r)}$ generated by the elements of $\mathcal{P}^\circ_n(\omega^\circ) \cup \mathcal{P}^\dagger_n(\omega^\dagger)$ such that $\tau \;:\!=\; r + c + \displaystyle{\sum_{i=1}^3 b^{(i)}}$ satisfies $\partial (\tau|_{Q(0,m)} ) = \displaystyle{\sum_{i=1}^3 e_2^{(i)} \Big\}.}$

If $E_m$ occurs, we call Q(0,m) an encounter box, Q(0,n) a central box, r an encounter configuration, c an intermediate configuration, $b^{(1)}, b^{(2)}, b^{(3)}$ branches, and $\tau$ an encounter chain.

Moreover, we can translate $E_m$ over the vector $y=2mz$ ( $y\in\mathbb{Z}^d$ ) and denote this event by $E_m^{y}$ . If $E_m^y$ occurs, then Q(y, m) is an encounter box and Q(y, n) a central box.

Proof. We assume the existence of infinitely many maximal chains with probability one, and then we build an event $H_m$ which is contained in $E_m$ . First let $F_m$ be the event that there are (at least) three disjoint maximal chains $\tau^{(1)},\tau^{(2)},\tau^{(3)}$ with corresponding $e^{(1)}_1,e_1^{(2)},e^{(3)}_1$ (as in (9)) which are all located inside Q(0,n), $n=m-2\lceil{\mu(r)}\rceil$ . (If there are more than three such maximal chains, we choose three of them at random.) Then $\liminf_{m\to\infty} \mathbb{P}(F_m)=1>0$ by monotone convergence.

Figure 3. Illustration of encounters of maximal chains in two dimensions (in a reduced set-up and not true to scale). The left panel depicts several encounters inside boxes Q(y, m) (for certain $y\in\mathbb{Z}^d$ ). The (blue) central boxes are located inside the (black) encounter boxes, which are part of the (black) lattice which partitions the plane. Each (green) encounter configuration merges three branches (red and partly in green and orange) through the corresponding (orange) intermediate configuration. The right panel shows a specific central box Q(y, n) (blue). Here a suitable configuration (violet) inside the central box converts the corresponding branches to maximal chains.

Now, fix $m\in\mathbb{N}$ such that $F_m$ has positive probability. If $(\omega^\circ,\omega^\dagger)\in F_m$ , we define the three disjoint branches $b^{(i)} = \tau^{(i)}|_{\mathbb{R}^d\setminus Q(0,n)}$ , $1\le i\le 3$ . Then, for all $\varepsilon>0$ sufficiently small, the event

\begin{align*} G_m &=\{ (\omega^\circ,\omega^\dagger) \in F_m \ | \ \forall i: \ \partial( b^{(i)}|_{A_{m,n}} ) = f_1 + f_2 \text{ such that $f_1$ and $f_2$ are cycles }\\[5pt] &\qquad\qquad \text{ but not boundaries with respect to $\mathcal{K}_r(\mathcal{P}|_{ Q(0,m)\setminus Q(0,n)})$ and $f_1\subseteq A_{n+\mu(r)-\varepsilon,n}$ } \}\end{align*}

has positive probability, too.

Now let $H_m$ consist of all $(\omega^\circ,\omega^\dagger)\in \Omega^\circ\times \Omega^\dagger$ such that on the one hand, for this very $\omega^\dagger$ , there is an $\widetilde\omega^\circ\in\Omega^\circ$ with $(\widetilde\omega^\circ,\omega^\dagger)\in G_m$ (for short, ‘ $\omega^\dagger$ enjoys the property $P_1$ ’) and on the other hand there are the following chains: a chain r, constructed from the elements of $\mathcal{P}^\circ_n(\omega^\circ)$ , and a chain c, constructed from the elements of $[\mathcal{P}^\circ_n(\omega^\circ)\cup \mathcal{P}^\dagger_n(\omega^\dagger)] \cap A_{m,n}$ with the property that the chain $\tau = r + c + \sum_{i=1}^3 b^{(i)}$ satisfies $\partial( \tau|_{Q(0,R)} ) \subseteq A_{R,R-\mu(r)}$ for all $R\ge m$ (for short, ‘the pair $(\omega^\circ,\omega^\dagger)$ enjoys the property $P_2$ ’).

Then $H_m$ can be formulated in an abstract way as

\begin{align*} H_m = \big\{\big(\omega^\circ,\omega^\dagger\big)\in \Omega^\circ\times \Omega^\dagger \;:\; \text{ $\omega^\dagger$ has $P_1$ and $(\omega^\circ,\omega^\dagger)$ has $P_2$}\big\}.\end{align*}

We show that $H_m$ has positive probability. Since $G_m$ has positive probability, we have on the one hand

\begin{align*}\mathbb{P}^{\dagger}\big(\big\{\omega^\dagger \;:\; \omega^\dagger \text{ has } P_1 \big\}\big) \ge \mathbb{P}^\circ\otimes\mathbb{P}^\dagger(G_m) > 0\end{align*}

and on the other hand

\begin{align*} \mathbb{P}(H_m) &= \int_{\Omega^\dagger} \mathbb{1}\! \big\{\omega^\dagger \text{ has } P_1 \big\} \mathbb{P}^{\circ}(\{ \omega^\circ \;:\; (\omega^\circ,\omega^\dagger) \text{ has } P_2 \} ) \ \mathbb{P}^{\dagger}(\textrm{d} \omega^\dagger).\end{align*}

Consequently, it remains to show that $\mathbb{P}^{\circ}( \{ \omega^\circ \;:\; (\omega^\circ,\omega^\dagger) \text{ has } P_2 \} ) > 0$ for almost all $\omega^\dagger$ which have $P_1$ .

To this end, we choose an $\omega^\dagger$ which has $P_1$ . Conditionally on $\omega^\dagger$ and relying on classical results for triangulations, there is a finite set P inside Q(0,n) (with all elements in general position) such that r, c, and $\tau$ exist as laid out, and all simplices involved in the chain r have a filtration time of at most $\mu(r)/2$ . This implies that we can move the vertices of a specific q-simplex $\sigma$ in $\mathcal{K}_r(P)$ by at most $\mu(r)/(4(q+1))$ and still have a filtration time of $\sigma$ of at most $3\mu(r)/4$ .

Moreover, as $P\cup \mathcal{P}^\dagger_n(\omega^\dagger)|_{Q(0,n+\mu(r))}$ is finite, there is a $\delta>0$ such that all vertices of P can be moved by at least $\delta>0$ without adding (resp. removing) another simplex to (resp. in) the complex $\mathcal{K}_r(P \cup \mathcal{P}^\dagger_n(\omega^\dagger)|_{Q(0,n+\mu(r))} )$ . Note that the probability that the elements of $\mathcal{P}^\dagger_n|_{Q(0,n+\mu(r))}$ do not entail a sharp filtration time of exactly $\mu(r)$ is 1, which is what we implicitly assume in the choice of $\omega^\dagger$ . (This effect of local constancy is also studied by Chazal and Divol [Reference Chazal and Divol8, Lemma 13] for filtration functions generated by (random) point clouds.)

Thus, for this specific $\delta>0$ , the probability of the following event is positive: there is a realization of $\mathcal{P}^\circ_n$ which has $\# P$ elements and for each $p\in P$ there is an element of $\mathcal{P}^\circ_n$ in the $\delta$ -neighborhood of p.

This proves that $\mathbb{P}^{\circ}( \{ \omega^\circ \;:\; (\omega^\circ,\omega^\dagger) \text{ has } P_2 \} ) > 0$ for almost all $\omega^\dagger$ which have $P_1$ . Thus, $H_m$ occurs with positive probability and we arrive at $\mathbb{P}(E_m) \ge \mathbb{P}(H_m) > 0$ . This completes the proof.

Proof of theorem 1. Let $q\in\{0,\ldots,d-1\}$ be fixed. For simplicity, we write $\mathcal{P}=\mathcal{P}(\lambda)$ and $S=S^{(r,s)}_q$ . (Recall that S is defined in (5).) S is a.s. finite by Theorem 4. By the definition of $\widetilde\rho^{\,r}_q$ and $\widetilde\rho^{\,s}_{q+1}$ , the following two functionals do not change when changing the configuration outside B(0,S):

(15) \begin{align} A \mapsto \dim \frac{Z_q (\mathcal{K}_r(\mathcal{P}\cup\{0\}\cup A))}{Z_q (\mathcal{K}_r(\mathcal{P}\cup A))} \text{ and } A \mapsto \dim \frac{B_q (\mathcal{K}_s(\mathcal{P}\cup\{0\}\cup A))}{B_q (\mathcal{K}_s(\mathcal{P}\cup A))},\end{align}

where $A\subseteq\mathbb{R}^d\setminus B(0,S)$ is finite.

In order to conclude the case for the persistent Betti function $\beta^{r,s}_q$ , we show that

(16) \begin{align} A \mapsto \dim \frac{Z_q (\mathcal{K}_r(\mathcal{P}\cup\{0\}\cup A)) + B_q (\mathcal{K}_s(\mathcal{P}\cup\{0\}\cup A)) }{Z_q (\mathcal{K}_r(\mathcal{P}\cup A)) + B_q (\mathcal{K}_s(\mathcal{P}\cup A))},\end{align}

$A\subseteq\mathbb{R}^d\setminus B(0,S)$ finite, is constant, too. This implies the case for the persistent Betti function by making use of the dimension formula ( $\dim (U+V) + \dim (U\cap V) = \dim U + \dim V$ for two finite-dimensional linear spaces U, V) in conjunction with (15).

On the one hand, assume that the map in (16) increases when going from a set $A_1$ to another set $A_2$ outside B(0,S). Then there is a q-chain $c^*$ which is not 0 modulo $Z_q (\mathcal{K}_r(\mathcal{P}\cup\{0\}\cup A_2))$ and not 0 modulo $B_q (\mathcal{K}_s(\mathcal{P}\cup\{0\}\cup A_2))$ , but is 0 modulo $Z_q (\mathcal{K}_r(\mathcal{P}\cup A_2))$ or 0 modulo $B_q (\mathcal{K}_s(\mathcal{P}\cup A_2))$ . This contradicts the conclusion for the mappings in (15).

On the other hand, assume that the map decreases, when going from a set $A_1$ to $A_2$ . This time there is a q-chain $c^*$ which is 0 modulo $Z_q (\mathcal{K}_r(\mathcal{P}\cup\{0\}\cup A_2))$ or 0 modulo $B_q (\mathcal{K}_s(\mathcal{P}\cup\{0\}\cup A_2))$ , but not 0 modulo $Z_q (\mathcal{K}_r(\mathcal{P}\cup A_2))$ and not 0 modulo $B_q (\mathcal{K}_s(\mathcal{P}\cup A_2))$ . This is again a contradiction.

Consequently, the persistent Betti number $\beta^{r,s}_q$ does not change with the configuration outside B(0,S).

Proof of Lemma 1. Clearly, if the right-hand side is infinite, there is nothing to prove. So assume that it is finite. Define $\mathcal{K}_{s,a} = \mathcal{K}_s( P\cap B(z_Q,a))$ and $\mathcal{K}'_{s,a} = \mathcal{K}_s((P\cup Q)\cap B(z_Q,a))$ . We show

\[ \dim Z_q( \mathcal{K}'_{r,a}) - \dim Z_q( \mathcal{K}_{r,a}) = \text{const. } \text{ and } \dim B_q(\mathcal{K}'_{s,a}) - \dim B_q(\mathcal{K}_{s,a}) = \text{const. }\]

for all $a \ge S_q^{(r,s)}(P,Q)$ . By definition of $\widetilde\rho_q^{\,r}(P,Q)$ , this is true for differences involving the cycle groups $Z_q( \mathcal{K}'_{r,a})$ and $Z_q( \mathcal{K}_{r,a})$ and $a\ge \widetilde\rho_q^{\,r}(P,Q)$ . Moreover, again by the definition of $S_q^{(r,s)}(P,Q)$ , the difference $ \dim B_q \mathcal{K}'_{s,a} - \dim B_q \mathcal{K}_{s,a}$ is constant for all $a\ge S_q^{(r,s)}(P,Q)$ .

Using the dimension formula $\dim (U+V) + \dim (U\cap V) = \dim U + \dim V$ , it only remains to consider the difference $\dim ( Z_q (\mathcal{K}'_{r,a}) + B_q (\mathcal{K}'_{s,a}) ) - \dim( Z_q (\mathcal{K}_{r,a}) + B_q (\mathcal{K}_{s,a}) )$ .

First, assume that this difference increases at an $a \ge S_q^{(r,s)}(P,Q)$ . So, there is a q-chain $c^*$ such that $c^*\neq 0$ mod $B_q (\mathcal{K}'_{s,a-})$ and $c^*\neq 0$ mod $Z_q (\mathcal{K}'_{r,a-})$ but $c^*=0$ mod $B_q (\mathcal{K}_{s,a-})$ or $c^*= 0$ mod $Z_q (\mathcal{K}_{r,a-})$ . This is a contradiction. Second, if the difference decreases at an $a \ge S_q^{(r,s)}(P,Q)$ , there is again a q-chain $c^*$ such that $c^*\neq 0$ mod $B_q (\mathcal{K}_{s,a-})$ and $c^*\neq 0$ mod $Z_q (\mathcal{K}_{r,a-})$ but $c^*=0$ mod $B_q (\mathcal{K}'_{s,a-})$ or $c^*= 0$ mod $Z_q (\mathcal{K}'_{r,a-})$ . Again, this is a contradiction.

6.2. Asymptotic normality for the Poisson process

Recall that $\mathcal{P}$ and $\mathcal{P}'$ denote independent Poisson processes with unit intensity on $\mathbb{R}^d$ and $B_n = [-2^{-1} n^{1/d},2^{-1} n^{1/d}]^d$ . For $z \in \mathbb{Z}^d$ , set $\mathcal{P}''(z) = (\mathcal{P}\setminus Q(z) )\cup (\mathcal{P}'\cap Q(z))$ , and let

\begin{align*}\Delta^{r,s}_z (B_n) = \beta^{r,s}_q(\mathcal{K}( \mathcal{P} \cap B_n)) - \beta^{r,s}_q(\mathcal{K}( \mathcal{P}''(z) \cap B_n)).\end{align*}

Lemma 3. For each $(r,s)\in\Delta$ and each $z\in \mathbb{Z}^d$ , there are random variables $\Delta^{r,s}_z(\infty)$ and $N_0=N_0(z,(r,s))\in\mathbb{N}$ such that $\Delta^{r,s}_z(B_n) \equiv \Delta^{r,s}_z(\infty)$ a.s. for all $n\ge N_0$ .

Proof. Let $z\in\mathbb{Z}^d$ and $(r,s)\in\Delta$ be arbitrary but fixed. We utilize Lemma 3.1 in [Reference Penrose and Yukich32] to obtain a random variable $\Delta^{r,s}_z(\infty)$ such that

\begin{align*} \lim_{n\to\infty} \Delta^{r,s}_z(B_n) = \Delta^{r,s}_z(\infty) \text{ with probability 1}.\end{align*}

The existence of $N_0$ follows from the fact that Betti numbers and consequently the $\Delta^{r,s}_z(B_n)$ are integer-valued.

Proposition 4. For any two pairs $(r,s),(u,v)\in\Delta$ ,

\begin{align*} \gamma((u,v),(r,s)) &:= \mathbb{E}\left [ \mathbb{E}\left [ \Delta^{r,s}_0(\infty) | \mathcal{F}_0 \right ] \ \mathbb{E}\left [ \Delta^{u,v}_0(\infty) | \mathcal{F}_0 \right ] \right ] \\[5pt] &= \lim_{n \rightarrow \infty } n^{-1} \operatorname{Cov} \left( \beta^{u,v}_q(\mathcal{K}( \mathcal{P} \cap B_n)) , \beta^{r,s}_q(\mathcal{K}( \mathcal{P} \cap B_n)) \right) .\end{align*}

Proof. Let (r, s) and (u, v) be arbitrary but fixed. Define the two functionals

\begin{align*} h_1 := \beta^{r,s}_q(\mathcal{K}( \cdot )) \text{ and } h_2 := \beta^{u,v}_q(\mathcal{K}( \cdot )).\end{align*}

First, observe that

(17) \begin{align}\begin{split} &2 n^{-1} \operatorname{Cov} \left( h_1( \mathcal{P} \cap B_n) , h_2( \mathcal{P} \cap B_n) \right) \\[5pt] &= n^{-1} \ \operatorname{Var}\left [ h_1( \mathcal{P} \cap B_n) + h_2( \mathcal{P} \cap B_n) \right ] \\[5pt] &\quad - n^{-1} \operatorname{Var}\left [ h_1( \mathcal{P} \cap B_n) \right ] - n^{-1} \operatorname{Var}\left [ h_2( \mathcal{P} \cap B_n) \right ] . \end{split}\end{align}

In the following we prove the convergence of each of the three terms on the right-hand side to an appropriate limit as $n\to\infty$ .

Using the established strong stabilization (Theorem 1), we obtain the convergence of each term in question from [Reference Penrose and Yukich32, Theorem 3.1] once the Poisson bounded moments condition is satisfied, i.e., for $i\in\{1,2\}$

\begin{align*} \sup_{A \in \mathcal{B}: 0\in A} \mathbb{E}\left [ | h_i( [\mathcal{P}\cap A] \cup \{0\} ) - h_i(\mathcal{P}\cap A) |^4 \right ] < \infty,\end{align*}

where $\mathcal{B} = \{ B_n + x\;:\; x\in \mathbb{R}^d, n\ge 1\}$ . The validity of the Poisson bounded moment condition follows immediately from the geometric lemma. We skip the details and refer to [Reference Yogeshwaran, Subag and Adler42] (proof of Lemma 4.1) instead.

Thus, using Theorem 3.1 of [Reference Penrose and Yukich32],

\begin{align*} \lim_{n\to\infty} n^{-1} \operatorname{Var}\left [ h_1( \mathcal{P} \cap B_n) \right ] = \mathbb{E}\left [ \mathbb{E}\left [ \Delta^{r,s}_0(\infty) | \mathcal{F}_0 \right ]^2 \right ], \\[5pt] \lim_{n\to\infty} n^{-1} \operatorname{Var}\left [ h_2( \mathcal{P} \cap B_n) \right ] = \mathbb{E}\left [ \mathbb{E}\left [ \Delta^{u,v}_0(\infty) | \mathcal{F}_0 \right ]^2 \right ].\end{align*}

Moreover, using the additivity of the add-one cost, the functional $h_1 + h_2$ enjoys the strong stabilization just as $h_1$ and $h_2$ . Hence, applying Theorem 3.1 of [Reference Penrose and Yukich32] once more, we get

\begin{align*} \lim_{n\to\infty} n^{-1} \operatorname{Var}\left [ h_1( \mathcal{P} \cap B_n) + h_2( \mathcal{P} \cap B_n) \right ] = \mathbb{E}\left [ \mathbb{E}\left [ \Delta^{r,s}_0(\infty) + \Delta^{u,v}_0(\infty) | \mathcal{F}_0 \right ]^2 \right ].\end{align*}

Consequently, by (17),

\begin{align*} \lim_{n\to\infty} 2 n^{-1} \operatorname{Cov} \left( h_1( \mathcal{P} \cap B_n) , h_2( \mathcal{P} \cap B_n) \right) = 2 \mathbb{E}\left [ \mathbb{E}\left [ \Delta^{r,s}_0(\infty) | \mathcal{F}_0 \right ] \mathbb{E}\left [ \Delta^{u,v}_0(\infty) | \mathcal{F}_0 \right ] \right ]\end{align*}

and the claim follows.

Proof of theorem 2. We show the multivariate asymptotic normality by considering finite linear combinations of persistent Betti numbers. For $a_1,\ldots,a_\ell\in\mathbb{R}$ and $n\in\mathbb{N},$ let $H( n^{1/d} \mathcal{P}_n) = \sum_{i=1}^\ell a_i \beta^{r_i,s_i}_q(\mathcal{K}( n^{1/d} \mathcal{P}_n))$ . We will verify that

(18) \begin{align}\begin{split} &n^{-1} \ \operatorname{Var}\left [ H( n^{1/d} \mathcal{P}_n) \right ] \to \sigma^2 \\[5pt] &\text{ and } n^{-1/2} \big( H( n^{1/d} \mathcal{P}_n) - \mathbb{E}[H( n^{1/d} \mathcal{P}_n)] \big) \Rightarrow \mathcal{N}(0,\sigma^2) \text{ as } n\to\infty,\end{split}\end{align}

where $\sigma^2 = \int_{[0,1]^d} \overline \sigma^2( \kappa(x)) \ \textrm{d}x$ ; here $\overline \sigma^2(\lambda)$ is the corresponding limiting variance if the Poisson point process $n^{1/d} \mathcal{P}_n$ in the functional H is replaced by a homogeneous Poisson process $\mathcal{P}(\lambda)$ (with intensity $\lambda$ ) restricted to $B_n$ . Now, we have

\begin{align*} \overline \sigma^2(\lambda) &= \lim_{n\to\infty} n^{-1} \operatorname{Var}\left [ H(\mathcal{P}(\lambda)|_{B_n}) \right ] \\[5pt] &= \sum_{i,j=1}^\ell a_i a_j \ \lim_{n\to\infty} n^{-1} \textrm{Cov}[ \beta^{r_i,s_i}_q( \mathcal{K}( \mathcal{P}(\lambda)|_{B_n}) ), \beta^{r_j,s_j}_q( \mathcal{K}( \mathcal{P}(\lambda)|_{B_n}) ) ] \\[5pt] &= \lambda \sum_{i,j=1}^\ell a_i a_j \lim_{n\to\infty} (\lambda n)^{-1} \textrm{Cov}[ \beta^{\lambda^{1/d} r_i,\lambda^{1/d} s_i}_q( \mathcal{K}( \mathcal{P}(1)|_{\lambda^{1/d}B_n}) ), \\[5pt] & \, \,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\, \beta^{\lambda^{1/d} r_j, \lambda^{1/d}s_j}_q( \mathcal{K}( \mathcal{P}(1)|_{\lambda^{1/d}B_n}) ) ] \\[5pt] &= \lambda \sum_{i,j=1}^\ell a_i a_j \gamma( \lambda^{1/d} (r_i, s_i), \lambda^{1/d} (r_j, s_j) ),\end{align*}

where the last equality follows from from Proposition 4, and where $\gamma$ is the limiting covariance function of an underlying homogeneous Poisson process with unit intensity on $\mathbb{R}^d$ . Hence,

(19) \begin{align}\begin{split} \sigma^2 &= \sum_{i,j=1}^\ell a_i a_j \ \int_{[0,1]^d} \gamma( \kappa(x)^{1/d} (r_i,s_i),\kappa(x)^{1/d} (r_j,s_j) ) \ \kappa(x) \ \textrm{d}x.\end{split}\end{align}

Consequently, once the statements in (18) are verified, the proof is complete.

By Theorem 3.3 of [Reference Trinh37], the statements in (18) hold if, apart from the strong stabilization (Theorem 1), the following two moment conditions are satisfied for cubes of the type $W = z + [0,a)^d \subseteq \mathbb{R}^d$ with $z\in\mathbb{R}^d,a\in\mathbb{R}_+$ , and for each pair (r, s) with $r\le s$ :

  1. (1) the Poisson bounded moments condition: for some $p>2$

    \[ \sup_n \sup_{y\in\mathbb{R}^d} \sup_{y\in W: \text{ cube}} \mathbb{E}\left [ | \mathfrak{D}_0 \beta^{r,s}_q( \mathcal{K}( n^{1/d} \mathcal{P}_n |_W - y ) ) |^p \right ] <\infty; \]
  2. (2) the locally bounded moments condition: for each cube W as above, there is a $p>2$ such that

    \[ \sup_n \sup_{y\in\mathbb{R}^d} \mathbb{E}\left [ | \beta^{r,s}_q( \mathcal{K}( n^{1/d} \mathcal{P}_n |_{y+W} ) ) |^{p} \right ] <\infty. \]

Both conditions (1) and (2) are immediate consequences of the geometric lemma (Lemma 2). Indeed, to see (1), observe that the geometric lemma gives

\begin{align*} | \mathfrak{D}_0 \beta^{r,s}_q( \mathcal{K}( n^{1/d} \mathcal{P}_n |_W - y ) ) | &\le \sum_{j=q}^{q+1} \mathcal{K}_j( ( n^{1/d} \mathcal{P}_n |_W ) \cup \{y\} , s) \setminus \mathcal{K}_j( ( n^{1/d} \mathcal{P}_n |_W ) ,s ) \\[5pt] &\le 2 | n^{1/d} \mathcal{P}_n \cap B(y,\mu(s) ) |^{q+1}.\end{align*}

This last upper bound is stochastically dominated by $| \mathcal{P}( \| \kappa \|_\infty )\cap B(0,\mu(s)) |^{q+1}$ , which does not depend on n, y, or W, and the moment $\mathbb{E}\left [ | \mathcal{P}( \|\kappa\|_\infty )\cap B(0,\mu(s)) |^{p(q+1)} \right ]$ is finite for each $p,q\ge 0$ .

Regarding (2), we have again by Lemma 2 and the translation-invariance that $\beta^{r,s}_q( \mathcal{K}(n^{1/d} \mathcal{P}_n |_{y+W} ))$ is stochastically dominated by $2 |\mathcal{P}(\| \kappa \|_\infty)|_W|^{q+2}$ , which does not depend on y or n. Moreover, for each cube W the moment $\mathbb{E}\left [ |\mathcal{P}(\| \kappa\|_\infty)|_W|^{p(q+2)} \right ]$ is finite for all $p,q\ge 0$ . This completes the proof.

6.3. Asymptotic normality for the binomial process

For each $n\in\mathbb{N}$ , let $(U_{m,n}\colon m\in\mathbb{N})$ be a sequence of binomial processes such that $U_{m,n} = ( Y_{1,n},\ldots,Y_{m,n} )$ for i.i.d. sequences $(Y_{i,n}\colon i\in\mathbb{N})$ with common marginal density $\kappa$ .

Proof of theorem 3. Let $\ell\in\mathbb{N}$ , $a_1,\ldots,a_\ell\in\mathbb{R}$ . We apply the abstract Theorem 3.9 in Trinh [Reference Trinh37] to the functional $H( n^{1/d} \mathbb{X}_n) = \sum_{i=1}^\ell a_i \beta^{r_i,s_i}_q(\mathcal{K}( n^{1/d} \mathbb{X}_n))$ . Since we established the Poisson bounded moments condition and the locally bounded moments condition in the proof of Theorem 2, and since the qth persistent Betti number obtained from a finite set is polynomially bounded in the cardinality of this set, it is enough to verify for the add-one cost function that

\begin{align*} &\sup_{ n\in\mathbb{N}} \sup_{ m\in [n(1-\eta),n(1+\eta)]} \mathbb{E}\left [ | \beta^{r,s}_q(\mathcal{K}( n^{1/d}U_{m+1,n} )) - \beta^{r,s}_q(\mathcal{K}( n^{1/d}U_{m,n} ))|^4 \right ] < \infty\end{align*}

for some $\eta>0$ . This can be verified using the geometric lemma (Lemma 2), similarly to Lemma 4.1 in Yogeshwaran et al. [Reference Yogeshwaran, Subag and Adler42]; we omit the details.

Hence the conditions of Theorem 3.9 in Trinh [Reference Trinh37] are satisfied, and

\begin{align*} &\frac{ \operatorname{Var}\left [ H(n^{1/d} \mathbb{X}_n ) \right ] }{n} \to \tau^2, \qquad \frac{H(n^{1/d} \mathbb{X}_n) - \mathbb{E}\left [ H(n^{1/d} \mathbb{X}_n) \right ]}{\sqrt{n}} \Rightarrow N(0,\tau^2),\end{align*}

where $\tau^2$ is given by the following relation: $ \tau^2 = \sigma^2 - \Big(\int_{[0,1]^d} \mathbb{E}\left [ \overline\Delta( \kappa(x)) \right ] \kappa(x) \ \textrm{d}x \Big)^2$ for $\sigma^2$ given in (19) and

\begin{align*} \mathbb{E}\left [ \overline\Delta( \lambda ) \right ] &= \sum_{i=1}^\ell a_i \ \mathbb{E}\left [ \lim_{n\to\infty} \mathfrak{D}_0 \beta^{r_i,s_i}_q( \mathcal{K}( \mathcal{P}(\lambda)|_{B_n} )) \right ] \\[5pt] &= \sum_{i=1}^\ell a_i \ \mathbb{E}\left [ \mathfrak{D}_0 \beta^{r_i,s_i}_q( \mathcal{K}( \mathcal{P}(\lambda)\cap B(0,S^{(r_i,s_i)}_q(\lambda) ))) \right ] \\[5pt] &= \sum_{i=1}^\ell a_i \ \mathbb{E}\left [ \mathfrak{D}_0 \beta^{\lambda^{1/d}r_i, \lambda^{1/d} s_i}_q( \mathcal{K}( \mathcal{P}(1)\cap B(0,S^{(\lambda^{1/d} r_i,\lambda^{1/d} s_i)}_q(1) ))) \right ] \\[5pt] &= \sum_{i=1}^\ell a_i \ \alpha(\lambda^{1/d} (r_i,s_i) ). \end{align*}

Thus, for a random variable X distributed with density $\kappa$ , we have

\begin{align*} \tau^2 &= \sum_{i,j=1}^\ell a_i a_j \ \bigg\{ \mathbb{E}\left [ \gamma( \kappa(X)^{1/d} (r_i,s_i), \kappa(X)^{1/d} (r_j,s_j) ) \right ] \\[5pt] &\qquad\qquad\qquad\quad - \mathbb{E}\left [ \alpha(\kappa(X) ^{1/d} (r_i,s_i) ) \right ] \mathbb{E}\left [ \alpha(\kappa(X) ^{1/d} (r_j,s_j) ) \right ] \bigg\}.\end{align*}

This completes the proof.

Appendix A

First we give the proof of Theorem 5.

Proof of theorem 5. Recall that $\mu(r)$ is an upper bound on the diameter of a simplex with filtration time at most $r \ge 0$ . Note that the statements for the radius of strong stabilization $\widetilde\rho_q^{\,r}$ , together with the relation

\[ \rho^{(r,s)}(P,Q) \le S^{(r,s)}_q (P,Q) \qquad\text{for each }(r,s)\in \Delta, q\in\{0,\ldots,d-1\} \]

from Lemma 1, allow us to conclude the results for the radius of weak stabilization. So it remains to prove the statement for the strong stabilization property. We proceed for each q separately; clearly this is no restriction.

In the following, if $Q_1$ , $Q_2$ , r, and q are fixed, we just write $\widetilde\rho(\kappa)$ for $ \widetilde\rho_q^{\,r}(\mathcal{P}(\kappa)\cup Q_1,Q_2) $ if the Poisson process $\mathcal{P}(\kappa)$ has intensity $\kappa$ .

In the remainder of the proof, we assume without loss of generality that the Poisson process $\mathcal{P}(\kappa)$ on $\mathbb{R}^d$ is coupled to a homogeneous Poisson process $\mathcal{P}$ on $\mathbb{R}^{d+1}$ with intensity 1 via a space–time coupling as follows:

(20) \begin{align} \mathcal{P}(\kappa) = \{x|\, \exists\; t\;:\; 0 \le t \le \kappa(x), (x,t) \in \mathcal{P}\}.\end{align}

Proof of (1): Let $r,\varepsilon>0$ and $q\in\{0,\ldots, d-1\}$ be arbitrary but fixed. We first consider the case of adding exactly one additional point to the Poisson process, so $Q_1 = \emptyset$ and $Q_2=\{0\}$ . We show that there is an $L>0$ such that $\mathbb{P}( \widetilde\rho(\lambda) > L)\le 2\varepsilon$ for each $\lambda\in\mathbb{R}_+$ . The generalization then works along the same lines.

We rely on the following ideas: if the intensity $\lambda$ is sufficiently small, there are no points of the Poisson process in the neighborhood of the additional point 0 with high probability; hence, adding 0 does not create a new cycle. Moreover, if the intensity $\lambda$ is sufficiently large, then an annulus around the origin is covered by the points of the Poisson process with high probability; hence the impact of adding 0 is only local. Finally, we rely on the space–time coupling for the intermediate intensities.

First, there is a $\underline\kappa\in\mathbb{R}_+$ such that

\[ \mathbb{P}( |\mathcal{P}(\lambda) \cap B(0,\mu(r)) | = 0 ) \ge 1-\varepsilon \text{ for all } \lambda\le \underline\kappa.\]

This means that with high probability and for all $q \ge 1$ , we have, for all $\lambda$ below this threshold, that including $\{0\}$ does not create an additional q-simplex. Hence, if $\lambda\le \underline\kappa$ , then $\mathbb{P}( \widetilde\rho(\lambda) > L)\le \varepsilon$ for $L\ge \mu(r)$ .

Also, there is an intensity $\overline\kappa$ such that with high probability all changes in the add-one cost function which are caused by including the origin are limited to a deterministic neighborhood; we carry out the argument simultaneously for the Čech and Vietoris–Rips complexes. Indeed, let $\delta \le \mu(r)(1-1/\sqrt{2})/\sqrt{d}$ be sufficiently small so that $4\mu(r) /\delta \in\mathbb{N}$ , and consider a partition of $A\;:\!=\;A_{2\mu(r)+\delta,2\mu(r)} = [- 2\mu(r)-\delta, 2\mu(r)+\delta]^d \setminus (- 2\mu(r), 2\mu(r))^d$ with subcubes $(C_i)_{i\in I}$ of edge length $\delta$ . We study $\mathcal{U} = \bigcap_{i\in I} \{\#(\mathcal{P}(\lambda) \cap C_i) \ge d\}$ (‘each subcube contains at least d Poisson points’). Then, there is a $\overline\kappa\in\mathbb{R}_+$ depending on r and $\delta$ such that

\[ \mathbb{P}(\mathcal{U}) = \prod_{i \in I} \mathbb{P}(\#(\mathcal{P}(\lambda) \cap C_i) \ge d) \ge 1-\varepsilon \text{ for all } \lambda\ge \overline\kappa.\]

Define $L_0 := (2\mu(r)+\delta)\sqrt{d}$ . We show $\{ \widetilde\rho(\lambda) \le L_0 \} \supseteq \mathcal{U}$ , which in turn implies $\mathbb{P}( \widetilde\rho(\lambda) > L) \le \varepsilon$ for $L \ge L_0$ if $\lambda \ge \overline{\kappa}$ .

Assume $\mathcal{U}$ . Let $\sigma^r_{q,1},\ldots,\sigma^r_{q,m_q}$ be the q-simplices which contain the origin. Clearly these simplices are all contained in $[- \mu(r), \mu(r)]^d$ . In particular, they do not intersect A, which itself is homeomorphic to a $(d-1)$ -cycle.

Plainly, we can triangulate A with $(d-1)$ -simplices with filtration time at most $ 2\delta \sqrt{d} < \mu(r)$ . Indeed, consider two adjacent cubes $C_1,C_2$ each containing a $(d-1)$ -simplex, say $\sigma_1=\{x_0,\ldots,x_{d-1}\}$ , $\sigma_2=\{y_0,\ldots,y_{d-1}\}$ . (Here adjacent means that the closures of the cubes have nonempty intersection.) Then we can connect $\sigma_1$ and $\sigma_2$ with the $(d-1)$ -simplices $\{x_0,\ldots,x_{i-1},y_{i},\ldots,y_{d-1}\}$ ( $1\le i\le d-1$ ), and each of these simplices has a filtration time of at most $2\delta\sqrt{d}$ .

Moreover, let $\sigma=\{x_0,\ldots,x_{k-1}\}$ be a $(k-1)$ -simplex in $A_{2\mu(r) + \mu(r)/2, 2\mu(r)-\mu(r)/2}$ for a generic $k\in\mathbb{N}$ . Then there is a $(k-1)$ -simplex $\sigma^*=\{y_0,\ldots,y_{k-1}\}$ in A such that the Euclidean distance between any two elements of $\sigma$ and $\sigma^*$ is at most $\sqrt{ (\mu(r)/2)^2 + (\mu(r)/2)^2 } + \sqrt{d} \delta$ , which in turn is at most $\mu(r)$ . Consequently, all k-simplices of the type $\{x_0,\ldots,x_i,y_i,\ldots,y_{k-1}\}$ have a filtration time at most $\mu(r)$ .

This shows that, conditional on $\mathcal{U}$ , any cycle in $A_{2\mu(r) + \mu(r)/2, 2\mu(r)-\mu(r)/2}$ is equivalent to a cycle in $A_{2\mu(r)+\delta,2\mu(r)}$ and thus is a boundary as well. Hence, $\widetilde\rho(\lambda) \le L_0$ in this case. In particular, we have $\mathbb{P}( \widetilde\rho(\lambda)> L)\le\varepsilon$ for all $\lambda \ge \overline\kappa$ and all $L\ge L_0 = \sqrt{d}(2r+\delta)$ .

It remains to check intensities $\lambda\in[\underline\kappa,\overline\kappa]$ . Assume there is an $\varepsilon>0$ such that

\[ \limsup_{L\rightarrow \infty} \sup_{\lambda\in[\underline\kappa,\overline\kappa]} \mathbb{P}( \widetilde\rho(\lambda) > L ) > 2\varepsilon.\]

Then we can find sequences $(L_n)_n$ and $(\lambda_n)_n$ such that $L_n\rightarrow\infty$ and $\lambda_n\rightarrow\lambda^*\in[\underline\kappa,\overline\kappa]$ with the property that $\mathbb{P}( \widetilde\rho(\lambda_n)>L_n) > \varepsilon$ for all $n\in\mathbb{N}$ . However, there is an $L^*\in\mathbb{R}_+$ such that $\mathbb{P}(\widetilde\rho(\lambda^*)> L^*) < \varepsilon/4$ as $\widetilde\rho$ is a.s. finite by Theorem 4. Also, from the coupling of the Poisson processes (20), and because $\mathcal{P}(\lambda)$ is a simple point process, there are random $\overline{\delta}, \underline{\delta} > 0$ (depending on the choice of $L^*$ ) such that $\mathcal{P}(\lambda)$ does not contain any points in $B(0,L^* + \mu(r))\times [\lambda^* - \underline\delta, \lambda^* + \overline\delta]$ , viz.,

\[ \underline\delta = \big(\lambda^* - \inf\{ \lambda<\lambda^*: \mathcal{P}(\lambda)( B(0,L^* + \mu(r))\times [\lambda,\lambda^*] ) = 0 \}\big)/2\]

and $\overline\delta = \big( \sup\{ \lambda>\lambda^*: \mathcal{P}(\lambda)( B(0,L^* + \mu(r))\times [\lambda^*,\lambda] ) = 0 \} - \lambda^* \big)/2$ . This means, for all $\lambda\in [\lambda^* -\underline\delta,\lambda^* +\overline\delta]$ ,

\[ \mathcal{P}(\underline\lambda)|_{B(0,L^* + \mu(r)) } \equiv \mathcal{P}(\lambda)|_{B(0,L^* + \mu(r)) } \equiv \mathcal{P}(\overline\lambda)|_{B(0,L^* + \mu(r)) }.\]

Note that $\{ \lambda^*-\underline\lambda \ge \delta' \} \cap \{ \overline\lambda - \lambda^* \ge \delta' \} \supseteq \{ \mathcal{P}( B(0,L^* + \mu(r))\times [\lambda^*-2\delta',\lambda^*+2\delta'] ) = 0 \}$ for $\delta'>0$ . Consequently, there is an $\delta'\in\mathbb{R}_+$ such that the event $\{ \lambda^*-\underline\lambda>\delta'\} \cap \{ \overline\lambda - \lambda^* > \delta' \}$ has probability at least $1-\varepsilon/4$ . Consequently, for all n large enough so that $L_n\ge L^*$ and $|\lambda_n - \lambda^*|\le \delta'$ , we have

\begin{align*} \mathbb{P}( \widetilde\rho(\lambda_n)>L_n) &\le \mathbb{P}\big( \widetilde\rho(\lambda_n)>L_n, \widetilde\rho(\lambda^*) \le L^*, \lambda^*-\underline\lambda>\delta', \overline\lambda - \lambda^* > \delta' \big) \\[5pt] &\quad + \mathbb{P}\big(\widetilde\rho(\lambda^*) > L^* \big) + \mathbb{P}\big( \{\lambda^*-\underline\lambda\le \delta'\} \cup \{\overline\lambda - \lambda^*\le \delta'\} \big) \le \varepsilon/2,\end{align*}

because $ \{ \widetilde\rho(\lambda_n)>L_n, \widetilde\rho(\lambda^*) \le L^*, \lambda^*-\underline\lambda>\delta', \overline\lambda - \lambda^* > \delta' \} = \emptyset$ . This contradicts the fact that $\mathbb{P}( \widetilde\rho(\lambda_n)>L_n) > \varepsilon$ for all $n\in\mathbb{N}$ . Thus, the laws of $\{\widetilde\rho(\mathcal{P}(\lambda),\{0\})):\;\lambda\in\mathbb{R}_+\}$ are tight.

A similar argument shows also that the laws of $\{\widetilde\rho(\mathcal{P}(\lambda)\cup Q_1,Q_2), \lambda \in \mathbb{R}_+, Q_1,Q_2\in \mathfrak{Q}_m\}$ are tight. At this point, it is essential that we have a uniform upper bound on the parameter $a^*(r)$ , as follows:

(21) \begin{align} a^*(r) = L_{Q_2} + \mu(r) \le \sqrt{d} + \mu(\overline r)\end{align}

for each $r \le \overline r$ and each $Q_2\subseteq Q(0)$ . So, instead of computing the radius of strong stabilization by taking the infimum in (4) over all $\{ R: R \ge a^*(r)\}$ , we can first take the infimum over the smaller set $\{R: R \ge \sqrt{d} + \mu(\overline r) \}$ , which does not depend on $Q_2$ and r, in order to obtain a modified radius of strong stabilization, which is not smaller than the original one in (4). Verifying the claim for this modified radius then implies the claim for the original radius. For the rest of the proof of (1) we argue with this modified radius, which we also denote by $\widetilde\rho_q^{\,r}$ , abusing the notation slightly.

We now sketch the remaining steps. Using the same techniques, we easily see that there are upper and lower bounds $\underline\kappa$ and $\overline\kappa$ such that intensities $\lambda\notin[\underline\kappa,\overline\kappa]$ have only a local effect, in the same sense as in the special case for $\{0\}$ ; i.e., for each $\varepsilon>0 $ there are $\underline\kappa,\overline\kappa$ and an $L>0$ such that

\begin{align*} \sup_{Q_1,Q_2\in \mathfrak{Q}_m} \quad \sup_{\lambda\notin [\underline\kappa,\overline\kappa] } \mathbb{P}( \widetilde\rho(\mathcal{P}(\lambda)\cup Q_1,Q_2) > L ) \le \varepsilon.\end{align*}

For intensities $\lambda\in [\underline\kappa,\overline\kappa]$ , we can repeat the argument from the special case treated above. Indeed, assume the contrary, namely,

\begin{align*} \limsup_{L\rightarrow \infty} \quad \sup_{Q_1,Q_2\in\mathfrak{Q}_m} \quad \sup_{\lambda\in[\underline\kappa,\overline\kappa]} \mathbb{P}( \widetilde\rho(\mathcal{P}(\lambda)\cup Q_1,Q_2) > L ) > 2\varepsilon,\end{align*}

for some $\varepsilon>0$ . Then there are sequences $(Q_{n,1})_n, (Q_{n,2})_n, (\lambda_n)_n, (L_n)_n$ with the properties $Q_{n,1}\rightarrow Q^*_1, Q_{n,2}\rightarrow Q^*_2$ (considered as vectors whose entries are elements in $[\!-\!2^{-1},2^{-1}]^d$ ) for two admissible elements $Q^*_1,Q^*_2 \in \mathfrak{Q}_m$ , as well as $\lambda_n\rightarrow \lambda^*\in [\underline\kappa,\overline\kappa]$ and $L_n\rightarrow\infty$ . These sequences satisfy

\[ \mathbb{P}( \widetilde\rho(\mathcal{P}(\lambda_n)\cup Q_{n,1},Q_{n,2}) > L_n ) > \varepsilon\qquad \text{for all \textit{n}}.\]

Now we can argue as before in the special case to obtain a contradiction. We arrive at the following result: for all $\varepsilon>0$ , $m\in\mathbb{N}$ , and $r\in \mathbb{R}_+$ , there is an $L>0$ such that

\[ \max_{q\in\{0,\ldots,d-1\}} \quad \sup_{\lambda\in\mathbb{R}_+} \quad \sup_{Q_1,Q_2\in\mathfrak{Q}_m} \mathbb{P}( \widetilde\rho_q^{\,r}(\mathcal{P}(\lambda)\cup Q_1,Q_2) > L ) \le \varepsilon.\]

So far, we have been considering a fixed $r\le \overline r$ . We now prove the general statement given in (1). To this end, we rely once more on (21) and the induced modification of $\widetilde\rho_q^{\,r}$ . Then, for $r\le \overline r$ and $\alpha\in\mathbb{R}_+$ arbitrary but fixed,

(22) \begin{align}\begin{split} \mathbb{P}( \widetilde\rho_q^{\,\alpha r}( \mathcal{P}(\lambda)\cup Q_1, Q_2) > L ) &= \mathbb{P}( \widetilde\rho_q^{\,r}( \alpha^{-1} \mathcal{P}( \lambda)\cup \alpha^{-1}Q_1, \alpha^{-1}Q_2) > \alpha^{-1} L ) \\[5pt] &= \mathbb{P}( \widetilde\rho_q^{\,r}( \mathcal{P}(\alpha^d \lambda)\cup \alpha^{-1}Q_1, \alpha^{-1}Q_2) > \alpha^{-1} L ),\end{split}\end{align}

using the scale-invariance for the first equation and $\mathcal{L}( \alpha^{-1} \mathcal{P}(\lambda)) = \mathcal{L}( \mathcal{P}(\alpha^d \lambda))$ for the second equation. Consequently, for all $\overline\alpha,\underline\alpha,\varepsilon\in\mathbb{R}_+$ arbitrary but fixed, with $\underline\alpha\le \overline\alpha$ , there is an $L\in\mathbb{R}_+$ such that

(23) \begin{align} \sup_{\alpha\in [\underline\alpha,\overline\alpha]} \quad \sup_{\lambda\in\mathbb{R}_+} \quad \sup_{Q_1,Q_2\in\mathfrak{Q}_m} \mathbb{P}( \widetilde\rho_q^{\,\alpha r}( \mathcal{P}(\lambda)\cup Q_1, Q_2) > L ) \le \varepsilon.\end{align}

This completes the considerations of Part (1).

Proof of (2): We use a suitable space–time coupling of Poisson processes. Let $r\in\mathbb{R}_+$ and $q\in\{0,\ldots,d-1\}$ be arbitrary but fixed. Write $\widetilde\rho$ for $\widetilde\rho_q^{\,r}$ . Note that the law of $n^{1/d}\mathcal{P}(n \nu)$ equals the law of $\mathcal{P}( \nu(\cdot/n^{1/d}))$ .

First we show that for all $\varepsilon>0$ , there exist a $b>0$ and an $L>0$ such that

\[ \sup_{n\in\mathbb{N}} \sup_{z \in B''_{\!\!\!n,L}} \mathbb{P}( \widetilde\rho( \mathcal{P}( \nu(\cdot/n^{1/d})\cup Q_1, Q_2 ) > L) \le \varepsilon\]

for all densities $\nu$ satisfying $\|\nu - \kappa\|_\infty \le b,$ and for all $Q_1,Q_2\in z+\mathfrak{Q}_m$ , where $m\in\mathbb{N}$ is arbitrary but fixed.

From Part (1) of the theorem, for each $\varepsilon>0$ there is an $L>0$ such that, for the homogeneous Poisson process $\mathcal{P}( \nu(z/n^{1/d})),$ we have that

\[ \mathbb{P}( \widetilde\rho( \mathcal{P}( \nu(z/n^{1/d}))\cup Q_1, Q_2 ) > L ) \le \varepsilon\]

uniformly in $z\in [0,n^{1/d}]^d$ , $Q_1,Q_2\in z+\mathfrak{Q}_m$ , and $n\in\mathbb{N}$ , and for all densities $\nu$ . Moreover, with $K > 0,$ define the set

\begin{align*} A_{\nu,n}(K,z) = \Big\{x\in B(z,K) \ \Big| & \ \exists t\in \Big[\nu\Big( \frac{x}{n^{1/d}}\Big)\wedge \nu\Big( \frac{z}{n^{1/d}}\Big), \nu\Big( \frac{x}{n^{1/d}}\Big)\vee \nu\Big( \frac{z}{n^{1/d}}\Big) \Big] \\[5pt] &\qquad \text{ and } (x,t)\in\mathcal{P} \Big\}.\end{align*}

By assumption, the function $\kappa$ is uniformly continuous with a certain modulus of continuity $\omega(\cdot)$ . Hence, $|\kappa(x_1) - \kappa(x_2)|\le \omega(\delta)$ whenever $\|x_1-x_2\| \le \delta$ and $\omega(\delta)\to 0$ as $\delta\to 0$ . Let $\nu$ be a density satisfying $\| \nu - \kappa\|_\infty \le b$ . Then

(24) \begin{align} &\mathbb{P}( A_{\nu,n}(K,z) \neq \emptyset ) \nonumber \\[5pt] &\le \mathbb{P}\Big( \exists x\in B(z,K) \ \Big| \ \exists t \in \Big[\kappa\Big( \frac{x}{n^{1/d}}\Big)\wedge \kappa\Big( \frac{z}{n^{1/d}}\Big) - b, \kappa\Big( \frac{x}{n^{1/d}}\Big)\vee \kappa\Big( \frac{z}{n^{1/d}}\Big) + b \Big] \nonumber\\[5pt] &\qquad \qquad \qquad \qquad\qquad \text{ and } (x,t)\in\mathcal{P} \Big) \nonumber \\[5pt] &\le \mathbb{P}\Big( \exists x\in B(z,K):\;\exists t \in \Big[0, \omega(K/n^{1/d}) + 2b\Big] \text{ and } (x,t)\in\mathcal{P} \Big), \end{align}

where the last inequality uses the stationarity of $\mathcal{P}$ . Clearly, given a value for K, there are $b>0$ and $n_0\in\mathbb{N}$ such that (24) is small uniformly in $z\in B''_{\!\!\!n,K}$ , $\nu$ in a b-neighborhood of $\kappa,$ and $n\ge n_0$ .

We now come to the conclusion. Let $\varepsilon>0$ be arbitrary but fixed. We apply the result from Part (1) and choose $L^*\in\mathbb{R}_+$ such that $\mathbb{P}( \widetilde\rho( \mathcal{P}(\lambda)\cup Q_1, Q_2) > L^* ) \le \varepsilon/2$ is satisfied uniformly in $\lambda\in\mathbb{R}_+$ , $Q_1,Q_2\in z+\mathfrak{Q}_m$ , and $z\in\mathbb{R}^d$ .

Next, let $K^* = L^* + 2\mu(r)$ . Choose $b>0$ and $n_0\in\mathbb{N}$ such that $\mathbb{P}( A_{\nu,n}(K^*,z) \neq \emptyset ) \le \varepsilon/2$ for all $n\ge n_0$ , for all $z\in B''_{n,K^*}$ , and for all $\nu$ such that $\|\nu - \kappa \|_\infty \le b$ . Since by assumption $z\in B''_{n,K^*}$ , this implies

(25) \begin{align}\begin{split} &\Big\{ \widetilde\rho( \mathcal{P}( \nu( \cdot / n^{1/d} ) )\cup Q_1, Q_2) > K^*, \\[5pt] &\qquad \widetilde\rho( \mathcal{P}(\nu(z/n^{1/d}))\cup Q_1, Q_2) \le L^*, A_{\nu,n}(K^*,z) = \emptyset \Big\} = \emptyset.\end{split}\end{align}

Consequently,

(26) \begin{align}\begin{split} &\mathbb{P}( \widetilde\rho( \mathcal{P}( \nu( \cdot / n^{1/d} ) ) \cup Q_1, Q_2) > K^* )\\[5pt] &\le \mathbb{P}\big( \widetilde\rho( \mathcal{P}( \nu( \cdot / n^{1/d} ) ) \cup Q_1, Q_2) > K^*, \\[5pt] &\qquad \widetilde\rho( \mathcal{P}(\nu(z/n^{1/d}))\cup Q_1, Q_2) \le L^*, A_{\nu,n}(K^*,z) = \emptyset \big) + \varepsilon = \varepsilon,\end{split}\end{align}

for all $z\in B''_{\!\!\!n,K^*}$ , for $n\ge n_0$ , and for all $\nu$ such that $\sup |\nu - \kappa| \le b$ .

The generalization to an entire parameter range for the filtration parameter now follows from the result in (23) (note that $\overline\lambda = \sup \kappa$ is an admissible choice in this equation) and by using a similar ansatz as in the derivation of (26). So, for each $\varepsilon>0$ , $\overline\alpha\ge \underline\alpha>0$ , there are $L\in \mathbb{R}_+$ , $b>0$ , and $n_0\in\mathbb{N}$ such that for each $0\le q \le d-1$

(27) \begin{align} \sup_{\underline\alpha\le \alpha \le \overline\alpha} \quad \sup_{n \ge n_0} \quad \sup_{Q_1,Q_2\in\mathfrak{Q}_{m}} \mathbb{P}( \widetilde\rho_q^{\,\alpha r} ( \mathcal{P}( \nu( \cdot / n^{1/d} ) ) \cup Q_1, Q_2) > L ) \le \varepsilon,\end{align}

for all densities $\nu$ in a b-neighborhood of $\kappa$ with respect to the sup-norm. This yields the first result given in Part (2). The second result is now immediate: there is an $m\in\mathbb{N}$ such that with high probability, the number of Poisson points inside Q(z) is at most m. This means we can apply the previous result.

Proof of (3): This time we couple the binomial process to a suitable Poisson process. Again let $0\le q\le d-1$ and r be arbitrary but fixed. We first study the radius $\widetilde\rho_q^{\,r}(n^{1/d}\mathbb{X}_m,n^{1/d}X')$ . First, note that for all $\varepsilon>0$ and for all $L>0$ there is an $n_0\in\mathbb{N}$ such that $\mathbb{P}(n^{1/d} X'\notin B''_{\!\!\!n,L}) \le \overline\kappa n^{(d-1)/d} L n^{-1} \le \varepsilon$ for all $n\ge n_0$ . So, by independence, for each $s>0,$

\[ \mathbb{P}( \widetilde\rho_q^{\,s}( n^{1/d} \mathbb{X}_m, n^{1/d} X') >L ) \le \sup_{z\in B''_{\!\!\!n,L}} \mathbb{P}\big( \widetilde\rho_q^{\,s}( n^{1/d} \mathbb{X}_m, \{z\} ) >L \big) + \varepsilon, \quad \forall n\ge n_0.\]

For each $n\in\mathbb{N}$ , let $V_{1,n},V_{2,n},\ldots$ be i.i.d. with density $\kappa(\cdot/n^{1/d})$ . Let $\mathcal{P}( \kappa(\cdot/n^{1/d})) = \{Z_{1,n},\ldots, Z_{N_n,n}\}$ be the Poisson process from (20) for the intensity function $\kappa(\cdot/n^{1/d})$ and $N_n\sim \operatorname{Poi}(n)$ . Then $n^{1/d}\mathbb{X}_m$ has the same distribution as the process

\[ U_{m,n} = \Big[\mathcal{P}( \kappa(\cdot/n^{1/d})) \setminus \{Z_{m+1,n},\ldots, Z_{N_n,n}\} \Big]\cup \{ V_{1,n},\ldots,V_{m-N_n,n} \},\]

where, by convention, $\{Z_{m+1,n},\ldots, Z_{N_n,n}\}$ is empty if $N_n\le m,$ and $\{ V_{1,n},\ldots,V_{m-N_n,n} \}$ is empty if $N_n\ge m$ .

By (27), for each $\overline\alpha\ge\underline\alpha>0$ and for each $\varepsilon>0$ , there is an $L>0$ such that

\[ \sup_{\underline\alpha\le \alpha\le \overline\alpha} \sup_{n\in\mathbb{N}} \sup_{z\in B''_{\!\!\!n,L} } \mathbb{P}( \widetilde\rho_q^{\,\alpha r} ( \mathcal{P}( \kappa(\cdot/n^{1/d})) , \{z\} ) > L) \le \varepsilon.\]

Note that here we use the fact that we are studying only one density function, namely, $\kappa$ , so we choose $L>0$ individually for values of n, $n\le n_0$ , and take the maximum at the end.

Also, for all $\varepsilon>0$ , for all $z\in B_n,$ and for all $K>0,$ there is an $n_0\in\mathbb{N}$ such that for all $n\ge n_0$ , $\mathbb{P}( A'_n(K,z) ) \ge 1 - \varepsilon$ , where

\begin{align*} A'_n(K,z) = \{ [\{Z_{m+1,n},\ldots, Z_{N_n,n}\} \cup \{ V_{1,n},\ldots,V_{m-N_n,n} \} ] \cap B(z,K) = \emptyset \}.\end{align*}

Indeed, this result follows from standard calculations, as

\begin{align*}\mathbb{E}\left [ |m-N_n| \right ] \le |m-n| + \mathbb{E}\left [ |N_n-n|^2 \right ]^{1/2} \le h(n) + n^{1/2}\end{align*}

and as the probability that a single point falls in B(z, K) is bounded above by a constant times $n^{-1}$ .

In the last step, we combine these observations as follows. First, let $\varepsilon>0$ be arbitrary but fixed. Then there is an $L^*>0$ such that

\[ \mathbb{P}(n^{1/d} X'\notin B''_{\!\!\!n,L^*}) \le \frac{\varepsilon}{3} \]

and

\[ \sup_{\underline\alpha\le \alpha\le \overline\alpha} \quad \sup_{n\in\mathbb{N}} \quad \sup_{z\in B''_{\!\!\!n,L^*} } \mathbb{P}( \widetilde\rho_q^{\,\alpha r}( \mathcal{P}( \kappa(\cdot/n^{1/d})) , \{z\} ) > L^*) \le \frac{\varepsilon}{3}.\]

Moreover, with $K^* = L^* + 2\mu(r)$ , there is an $n_0\in\mathbb{N}$ such that for $z\in B_n$ and $n\ge n_0$ , $\mathbb{P}( A'_n(K^*,z)^c ) \le \varepsilon/3$ . Consequently, similarly to (25), if $n\ge n_0$ and if $L\ge L^*$ , then

\begin{align*} &\mathbb{P}(\widetilde\rho_q^{\,\alpha r} (n^{1/d} \mathbb{X}_m, n^{1/d} X') >L ) \\[5pt] &\le \sup_{z\in B''_{\!\!\!n,L^*}} \mathbb{P}( \widetilde\rho_q^{\,\alpha r}( U_{m,n}, \{z\} ) > L) + \frac{\varepsilon}{3} \\[5pt] &\le \sup_{z\in B''_{\!\!\!n,L^*}} \mathbb{P}( \widetilde\rho_q^{\,\alpha r}( U_{m,n}, \{z\})>L, A'_n(K^*,z), \widetilde\rho_q^{\,\alpha r}( \mathcal{P}( \kappa(\cdot/n^{1/d})) , \{z\} ) \le L^* ) + \varepsilon = \varepsilon,\end{align*}

uniformly in $m\in J_n$ and $\underline\alpha\le \alpha \le \overline\alpha$ . This shows (3) and completes the proof.

Throughout the remainder of the appendix we consider a more general filtration as in [Reference Hiraoka, Shirai and Trinh16]. Examples of these filtrations are the Čech and the Vietoris–Rips filtrations. The following principle will be important.

Lemma 4. Let $\mathcal{K}$ and $\mathcal{K}'$ be two simplicial complexes. Then $C_q(\mathcal{K})\cap C_q(\mathcal{K}') = C_q(\mathcal{K}\cap \mathcal{K}')$ . Moreover, $Z_q(\mathcal{K})\cap Z_q(\mathcal{K}') = Z_q(\mathcal{K}\cap\mathcal{K}')$ and $B_q(\mathcal{K})\cap B_q(\mathcal{K}') \supseteq B_q(\mathcal{K}\cap\mathcal{K}')$ .

Proof. First, we consider the claim concerning the spaces $C_q$ . The inclusion $\supseteq$ is clear, so we prove only $\subseteq $ . This inclusion can be deduced from the fact that $C_q$ is a free module over $\mathbb{F}_2$ generated by the corresponding q-simplices in the filtration. We can write $c\in C_q(\mathcal{K})\cap C_q(\mathcal{K}')$ as $\sum_{i} a_i \sigma_i$ , where $\sigma_i$ are q-simplices in $\mathcal{K}$ , $a_i\in\mathbb{F}_2$ , and also as $\sum_{j} b_j \widetilde\sigma_j,$ where $\widetilde\sigma_j$ are q-simplices in $\mathcal{K}'$ , $b_j\in\mathbb{F}_2$ . Hence, $\sum_{i} a_i \sigma_i - \sum_{j} b_j \widetilde\sigma_j =0$ . If $\sigma_i\in \mathcal{K}\setminus\mathcal{K}'$ , the coefficient $a_i$ is zero, as this basis element cannot occur in the filtration $\mathcal{K}'$ . The same holds in the other direction, if $\widetilde\sigma_j\in\mathcal{K}'\setminus\mathcal{K}$ , $b_j$ is zero.

The statement $Z_q(\mathcal{K})\cap Z_q(\mathcal{K}') = Z_q(\mathcal{K}\cap\mathcal{K}')$ follows immediately. Again the inclusion $\supseteq$ is clear and we prove only $\subseteq$ . If $c\in Z_q(\mathcal{K})\cap Z_q(\mathcal{K}')$ , then by the above $c\in C_q(\mathcal{K}\cap\mathcal{K}')$ and by assumption $\partial c = 0$ . Thus, $c\in Z_q(\mathcal{K}\cap\mathcal{K}') $ as desired. The inclusion concerning the boundary groups is immediate.

We remark that $B_q(\mathcal{K})\cap B_q(\mathcal{K}') \not\subseteq B_q(\mathcal{K}\cap\mathcal{K}')$ is possible. For instance, consider a situation where $\beta_0(\mathcal{K})=\beta_0(\mathcal{K}')=1$ and where $\mathcal{K} \cap \mathcal{K}' = \{a,b\}$ for two zero-dimensional simplices a, b such that $\mathcal{K}\cap \mathcal{K}'$ is a strict subset of $\mathcal{K}$ and $\mathcal{K}'$ . (For example, we can take two ‘arc-like’ connected components represented by $\mathcal{K}$ and $\mathcal{K}'$ which intersect only at their endpoints.) Then we have $ B_0(\mathcal{K}\cap\mathcal{K}') = \{0\}$ , but $B_0(\mathcal{K})\cap B_0(\mathcal{K}')$ contains $a+b$ as a nontrivial element.

In the following, assume that P is a simple point cloud on $\mathbb{R}^d$ without accumulation points and Q is a finite subset of $\mathbb{R}^d$ such that $P\cap Q = \emptyset$ and $Q\subseteq Q(z,L)$ for some $z\in\mathbb{R}^d$ and $L\in\mathbb{R}_+$ . Define

\begin{align*} \mathcal{K}_{s,a} &= \mathcal{K}_s( P\cap B(z,a)), \qquad \mathcal{K}'_{\!\!s,a} = \mathcal{K}_s((P\cup Q)\cap B(z,a)).\end{align*}

Set $a^*= a^*(s) = \mu(s) + L$ , where $\mu(s)$ is the upper bound on the diameter of a simplex in the filtration at time s, which is guaranteed by the assumptions of [Reference Hiraoka, Shirai and Trinh16] on the filtration. Choose $a_1,a_2 \in \mathbb{R}$ with $a^*\le a_1 \le a_2$ such that $C_0(\mathcal{K}_{s,a_2}\setminus\mathcal{K}_{s,a_1})$ contains exactly one additional basis element (a point) from P, and write

\[ C_{q+1}( \mathcal{K}'_{s,a_2} \setminus \mathcal{K}'_{\!\!s,a_1}) = \langle \sigma_1,\ldots,\sigma_n \rangle.\]

We can assume without loss of generality that the simplices are already in the right order, i.e.,

(28) \begin{align} B_q (\mathcal{K}'_{s,a_2}) = B_q( \mathcal{K}'_{s,a_1}) \oplus \langle \partial\sigma_1,\ldots,\partial\sigma_i \rangle,\end{align}

so that $\partial\sigma_j \neq 0$ mod $ B_q( \mathcal{K}'_{s,a_1}) \oplus \langle \partial\sigma_1,\ldots,\partial\sigma_{j-1} \rangle$ for $j=1,\ldots,i$ and $\partial\sigma_j = 0$ mod $ B_q( \mathcal{K}'_{s,a_1}) \oplus \langle \partial\sigma_1,\ldots,\partial\sigma_{i} \rangle$ for $j=i+1,\ldots, n$ . As $a_1$ is sufficiently large, we have that each of the simplices $\sigma_j$ is also contained in $\mathcal{K}_{s,a_2}$ . Hence, as $B_q (\mathcal{K}_{s,a})$ is a subspace of $B_q (\mathcal{K}'_{s,a})$ , we have that

(29) \begin{align}\begin{split} &B_q (\mathcal{K}_{s,a_2})= B_q( \mathcal{K}_{s,a_1}) \oplus \langle \partial\sigma_1,\ldots,\partial\sigma_i \rangle \oplus \langle \partial\sigma_{j}: \text{ for } j\in J \rangle,\end{split}\end{align}

where $J \subseteq \{i+1,\ldots,n\}$ can be empty. In particular,

\[ \dim B_q (\mathcal{K}'_{s,a_2}) - \dim B_q (\mathcal{K}_{s,a_2}) = \dim B_q (\mathcal{K}'_{s,a_1}) - \dim B_q (\mathcal{K}_{s,a_1}) + (i - i - \# J).\]

That is, the map $a\mapsto \dim B_q( \mathcal{K}'_{s,a}) / B_q (\mathcal{K}_{s,a})$ is non-increasing if $a\ge a^*$ .

Finiteness of the radius of weak stabilization $\rho^{(r,s)}(P,Q)$ follows directly from Lemma 1 and Theorem 4. This argument relies on the finiteness of the radius of strong stabilization (Theorem 4), which, however, is not necessary for the finiteness of $\rho^{(r,s)}(P,Q)$ . The following lemma gives a direct proof of the finiteness of $\rho^{(r,s)}(P,Q)$ , using ideas similar to those in the proof of Lemma 5.3 of Hiraoka et al. [Reference Hiraoka, Shirai and Trinh16].

Lemma 5. The radius $\rho^{(r,s)}(P,Q)$ from (3) is well-defined.

Proof. It is sufficient to consider $\rho^{(r,s)}_q(P,Q)$ for each $0\le q\le d-1$ . One can use the geometric lemma to show that the nonnegative, integer-valued mappings

(30) \begin{align} a\mapsto \dim \frac{ Z_q (\mathcal{K}'_{r,a})}{ Z_q (\mathcal{K}_{r,a})} \text{ and } a\mapsto \dim \frac{ Z_q (\mathcal{K}'_{r,a}) \cap B_q (\mathcal{K}'_{s,a})}{ Z_q (\mathcal{K}_{r,a})\cap B_q (\mathcal{K}_{s,a})}\end{align}

are bounded above. Moreover, the map $ Z_q (\mathcal{K}'_{r,a_1}) / Z_q (\mathcal{K}_{r,a_1}) \hookrightarrow Z_q (\mathcal{K}'_{r,a_2}) / Z_q (\mathcal{K}_{r,a_2})$ is injective for all $0\le a_1\le a_2$ ; this follows from Lemma 4. Thus, there is an $a^*_1\in\mathbb{R}_+$ such that the first mapping in (30), which is integer-valued, is constant for all $a\ge a^*_1$ .

Next we show that the second mapping in (30) also becomes constant as $a\rightarrow \infty$ . To this end, we first show that $a\mapsto \dim B_q (\mathcal{K}'_{s,a}) / B_q (\mathcal{K}_{s,a})$ is constant for all $a\ge a^*_2$ for a certain $a^*_2\in\mathbb{R}_+$ . This follows from the non-increasing property (see the paragraph right after (29)) and the boundedness from below of this mapping. Now, returning to the second mapping in (30), one can argue as in the proof of Lemma 1 to show that the difference $\dim Z_q( \mathcal{K}'_{r,a}) \cap B_q(\mathcal{K}'_{s,a}) - \dim Z_q( \mathcal{K}_{r,a}) \cap B_q(\mathcal{K}_{s,a})$ is constant for all $a \ge a^*_1 \vee a^*_2$ .

Acknowledgements

The authors are very grateful to the associate editor and two referees for their careful reading and their suggestions, which greatly improved the manuscript. Moreover, the authors thank Khanh Duy Trinh for his interest in this paper and for pointing out a mistake in a preliminary version.

Funding information

J. Krebs was supported by the German Research Foundation (DFG), grant numbers KR-4977/1-1 and KR-4977/2-1. W. Polonik was partially supported by the National Science Foundation (NSF), grant number DMS-2015575.

Competing interests

There were no competing interests to declare which arose during the preparation or publication process of this article.

References

Aizenman, M., Kesten, H. and Newman, C. (1987). Uniqueness of the infinite cluster and related results in percolation. In Percolation Theory and Ergodic Theory of Infinite Particle Systems, Springer, New York, pp. 1320.CrossRefGoogle Scholar
Aizenman, M., Kesten, H. and Newman, C. M. (1987). Uniqueness of the infinite cluster and continuity of connectivity functions for short and long range percolation. Commun. Math. Phys. 111, 505531.CrossRefGoogle Scholar
Baryshnikov, Y. and Yukich, J. E. (2005). Gaussian limits for random measures in geometric probability. Ann. Appl. Prob. 15, 213253.CrossRefGoogle Scholar
Baszczyszyn, B., Yogeshwaran, D. and Yukich, J. E. (2019). Limit theory for geometric statistics of point processes having fast decay of correlations. Ann. Prob. 47, 835–895.CrossRefGoogle Scholar
Bobrowski, O. and Kahle, M. (2018). Topology of random geometric complexes: a survey. J. Appl. Comput. Topology 1, 331364.CrossRefGoogle Scholar
Burton, R. M. and Keane, M. (1989). Density and uniqueness in percolation. Commun. Math. Phys. 121, 501505.CrossRefGoogle Scholar
Carlsson, G. (2009). Topology and data. Bull. Amer. Math. Soc. 46, 255308.CrossRefGoogle Scholar
Chazal, F. and Divol, V. (2019). The density of expected persistence diagrams and its kernel based estimation. J. Comput. Geom. 10, 127153.Google Scholar
Chazal, F. and Michel, B. (2021). An introduction to topological data analysis: fundamental and practical aspects for data scientists. Frontiers Artificial Intellig. 4, 128.CrossRefGoogle ScholarPubMed
Divol, V. and Polonik, W. (2019). On the choice of weight functions for linear representations of persistence diagrams. J. Appl. Comput. Topology 3, 249283.CrossRefGoogle Scholar
Edelsbrunner, H., Letscher, D. and Zomorodian, A. (2000). Topological persistence and simplification. In Proc. 41st Annual Symposium on Foundations of Computer Science, Institute of Electrical and Electronics Engineers, Piscataway, NJ, pp. 454463.CrossRefGoogle Scholar
Gidea, M. (2017). Topological data analysis of critical transitions in financial networks. In 3rd International Winter School and Conference on Network Science (NetSci-X 2017), Springer, Cham, pp. 4759.CrossRefGoogle Scholar
Gidea, M. et al. (2020). Topological recognition of critical transitions in time series of cryptocurrencies. Physica A 548, article no. 123843.CrossRefGoogle Scholar
Gidea, M. and Katz, Y. (2018). Topological data analysis of financial time series: landscapes of crashes. Physica A 491, 820834.CrossRefGoogle Scholar
Goel, A., Trinh, K. D. and Tsunoda, K. (2019). Strong law of large numbers for Betti numbers in the thermodynamic regime. J. Statist. Phys. 174, 865892.CrossRefGoogle Scholar
Hiraoka, Y., Shirai, T. and Trinh, K. D. (2018). Limit theorems for persistence diagrams. Ann. Appl. Prob. 28, 27402780.CrossRefGoogle Scholar
Kahle, M. (2011). Random geometric complexes. Discrete Comput. Geom. 45, 553573.CrossRefGoogle Scholar
Kahle, M. and Meckes, E. (2013). Limit theorems for Betti numbers of random simplicial complexes. Homology Homotopy Appl. 15, 343374.CrossRefGoogle Scholar
Kesten, H. and Lee, S. (1996). The central limit theorem for weighted minimal spanning trees on random points. Ann. Appl. Prob. 6, 495527.CrossRefGoogle Scholar
Klenke, A. (2013). Probability Theory: A Comprehensive Course. Springer, Cham.Google Scholar
Lee, Y. et al. (2017). Quantifying similarity of pore-geometry in nanoporous materials. Nature Commun. 8, article no. 15396.Google ScholarPubMed
McGivney, K. and Yukich, J. (1999). Asymptotics for Voronoi tessellations on random samples. Stoch. Process. Appl. 83, 273288.CrossRefGoogle Scholar
Meester, R. and Roy, R. (1996). Continuum Percolation. Cambridge University Press.CrossRefGoogle Scholar
Nakamura, T. et al. (2015). Persistent homology and many-body atomic structure for medium-range order in the glass. Nanotechnology 26, article no. 304001.CrossRefGoogle ScholarPubMed
Oudot, S. Y. (2015). Persistence Theory: From Quiver Representations to Data Analysis. American Mathematical Society, Providence, RI.CrossRefGoogle Scholar
Owada, T. (2018). Limit theorems for Betti numbers of extreme sample clouds with application to persistence barcodes. Ann. Appl. Prob. 28, 28142854.CrossRefGoogle Scholar
Owada, T. and Adler, R. J. (2017). Limit theorems for point processes under geometric constraints (and topological crackle). Ann. Prob. 45, 20042055.CrossRefGoogle Scholar
Owada, T. and Thomas, A. (2020). Limit theorems for process-level Betti numbers for sparse and critical regimes. Adv. Appl. Prob. 52, 131.CrossRefGoogle Scholar
Penrose, M. (2003). Random Geometric Graphs. Oxford University Press.CrossRefGoogle Scholar
Penrose, M. D. (2005). Multivariate spatial central limit theorems with applications to percolation and spatial graphs. Ann. Prob. 33, 19451991.CrossRefGoogle Scholar
Penrose, M. D. (2007). Gaussian limits for random geometric measures. Electron. J. Prob. 12, 9891035.CrossRefGoogle Scholar
Penrose, M. D. and Yukich, J. E. (2001). Central limit theorems for some graphs in computational geometry. Ann. Appl. Prob. 11, 10051041.CrossRefGoogle Scholar
Penrose, M. D. and Yukich, J. E. (2003). Weak laws of large numbers in geometric probability. Ann. Appl. Prob. 13, 277303.CrossRefGoogle Scholar
Sarkar, A. (1997). Co-existence of the occupied and vacant phase in boolean models in three or more dimensions. Adv. Appl. Prob. 29, 878889.CrossRefGoogle Scholar
Seversky, L. M., Davis, S. and Berger, M. (2016). On time-series topological data analysis: new data and opportunities. In 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops, Institute of Electrical and Electronics Engineers, Piscataway, NJ, pp. 59–67.CrossRefGoogle Scholar
Steele, J. M. (1988). Growth rates of Euclidean minimal spanning trees with power weighted edges. Ann. Prob. 16, 17671787.CrossRefGoogle Scholar
Trinh, K. D. (2019). On central limit theorems in stochastic geometry for add-one cost stabilizing functionals. Electron. Commun. Prob. 24, article no. 76.CrossRefGoogle Scholar
Umeda, Y. (2017). Time series classification via topological data analysis. Inf. Media Technol. 12, 228239.Google Scholar
Wasserman, L. (2018). Topological data analysis. Ann. Rev. Statist. Appl. 5, 501532.CrossRefGoogle Scholar
Yao, Y. et al. (2009). Topological methods for exploring low-density states in biomolecular folding pathways. J. Chem. Phys. 130, 04B614.CrossRefGoogle Scholar
Yogeshwaran, D. and Adler, R. J. (2015). On the topology of random complexes built over stationary point processes. Ann. Appl. Prob. 25, 33383380.CrossRefGoogle Scholar
Yogeshwaran, D., Subag, E. and Adler, R. J. (2017). Random geometric complexes in the thermodynamic regime. Prob. Theory Relat. Fields 167, 107142.CrossRefGoogle Scholar
Yukich, J. (2000). Asymptotics for weighted minimal spanning trees on random points. Stoch. Process. Appl. 85, 123138.CrossRefGoogle Scholar
Zomorodian, A. and Carlsson, G. (2005). Computing persistent homology. Discrete Comput. Geom. 33, 249274.CrossRefGoogle Scholar
Figure 0

Figure 1. The persistent Betti number $\beta^{r,s}_q(\mathcal{K}(P))$ equals the number of points in the gray-shaded rectangle; the point on the dashed red line is not counted, whereas the point on the solid red line is.

Figure 1

Figure 2. Illustration of a chain $\tau$ consisting of one-dimensional simplices (red, green) from Poisson points (black, blue, and green dots) and an additional point (black diamond) which is located inside Q(0). The 1-simplices between Poisson points are red; the 1-simplices between a Poisson point and the additional point are green. The layers depict two spheres of B(z, R) and $B(z,R-2\mu(r))$; $e_1$ corresponds to the two blue dots (to which the two green 1-simplices are attached), $e_{2,R}$ to the green dots shown between the two layers.

Figure 2

Figure 3. Illustration of encounters of maximal chains in two dimensions (in a reduced set-up and not true to scale). The left panel depicts several encounters inside boxes Q(y, m) (for certain $y\in\mathbb{Z}^d$). The (blue) central boxes are located inside the (black) encounter boxes, which are part of the (black) lattice which partitions the plane. Each (green) encounter configuration merges three branches (red and partly in green and orange) through the corresponding (orange) intermediate configuration. The right panel shows a specific central box Q(y, n) (blue). Here a suitable configuration (violet) inside the central box converts the corresponding branches to maximal chains.