1. Introduction
Let (X, d) be a separable metric space and $(T,\mu)$ a transformation preserving a Borel, probability measure. The classical Poincaré lemma in such a setting gives that
The historically first attempt at strengthening this result came in a paper by M. Boshernitzan [Reference Boshernitzan1], who proved that $d(x,T^n(x))\approx n^{-1/\alpha}$, where α is the Hausdorff dimension of X. Precisely speaking, he gave two results, which we state now.
For a dynamical system (X, T) preserving a probability Borel measure µ:
The second result from that paper states, that if the preserved probability measure $\mu=H_\alpha$, then
There has been a lot of development in the area, for an introduction into quantitative recurrence, see e.g. [Reference Barreira and Saussol3].
In this paper, we will be interested in showing some new bounds on the recurrence speed. We will prove a generalisation of Boshernitzan’s result, but the main new idea is to show how to use this improved result to get an estimate from below of the Hausdorff measure of a fractal set. We discuss this on an easy example. An upcoming paper with M. Urbański [Reference Pawelec and Urbański4] shows a more interesting application, namely for Cantor sets defined by the so-called Denjoy maps (i.e. we show a bound from below on the Hausdorff measure of the minimal set occurring for a $\mathcal{C}^{1+\alpha}$ diffeomorphism on the circle which is only semi-conjugate to a rotation).
The idea of the method comes from the author’s PhD Thesis.
The paper is organised as follows. In the next section, we give the needed definitions, state the relevant theorems and sketch the new technique. In $\S$ 3, we show the method of estimating the Hausdorff measure on an example. $\S$ 4 is filled with additional comments, improvements and limitations of the method. Finally, $\S$ 5 is devoted to the proof of Theorem 3.
2. Definitions and theorems
Throughout this paper, we will assume that (X, d) is a metric space and $T\colon~X\to~X$ a Borel measurable map; µ is a T-invariant, ergodic, probability, Borel measure on X.
As we are working with subtle measure estimates it seems prudent to put here the precise definitions in use in this paper.
We will use the (most common) version of the definition of the Hausdorff measure.
Definition 1. The outer Hausdorff measure is the following
where the infimum is take over all countable covers of Y satisfying the conditions as stated. By Carathéodory’s extension this gives the (typical) Hausdorff measure.
The next definition is also standard.
Definition 2. The Hausdorff dimension of the set Y is given by the formula
We will now state a new version of Boshernitzan’s estimate (1.3). In contrast to his result we do not assume that the preserved measure $\mu=H_\alpha$.
Theorem 3. With the assumptions on the dynamical system as above, for any α > 0 for which Hα is σ-finite on X and for µ-almost every $x\in X$ we have
Remark. Note that g(x) may be equal to 0 or $+\infty$. The statement still holds. Also, due to the way this theorem is used below, the σ-finiteness assumption is not restrictive at all. (We want to give a bound from below on the $H_\alpha(X)$, so infinite measure only helps us.)
The rather simple proof utilises the idea by M. Boshernitzan and some techniques from ergodic theory. We postpone it till the last section.
This result shows that the behaviour of the recurrence is (may be) governed by the Hausdorff measure of the space. We will try to apply this in a reverse manner: if we could compute/estimate the lower limit of the speed of recurrence, then this would give us some information on the Hausdorff measure.
More precisely, if we can show that the lower limit on the LHS of (2.1) is positive for some α > 0, then either $H_\alpha(X)=+\infty$ or we will get the lower bound on the density (and so on the α-Hausdorff measure of the space). Also, both cases trivially give $\dim_H(X)\geq \alpha$.
Regarding the dimension, note that there is a unique value $\alpha^*\in [0,+\infty]$ such that
Theorem 3 now gives that $\dim_H(X)\geq \alpha^*$.
Note that a priori we may take any map on the space, as long as it preserves some Borel, probability, ergodic measure µ. However, we ought to take a map with poor mixing properties because of a result that requires another well-known definition.
Definition 4. We say that a dynamical system has an exponential decay of correlations in Lipschitz–continuous functions (denoted by $\mathcal{L}$), if there exist $\gamma \in (0,1)$ and $C \lt +\infty$, such that for all $g\in \mathcal{L}$, all $f \in L_1(\mu)$ and every $n\in {\mathbb N}$, we have
where $||\cdot||_\mathcal{L}$ denotes the typical norm of the space of Lipschitz functions.
The simplified version (stronger assumptions) of Theorem 3.1. from [Reference Pawelec6] states that
Theorem 5. With the assumptions on the dynamical system as above, if $\mu \approx H_\alpha$ and the system has an exponential decay of correlation in Lipschitz–continuous functions, then
which is the opposite of what we want (namely a positive lower limit). Thus, for the map to be useful to our method it needs to be slowly mixing. Typical examples of such maps include the irrational rotations on $\mathcal{S}^1$, Feigenbaum maps or the adding machine map, which we utilise below.
3. Example
Our example will be arguably the simplest of fractal sets – the one-third Cantor set. We will estimate from below the dimensional density g(x) for all values of α. As it turns out, we will get a meaningful result for α equal to the Hausdorff dimension of the Cantor set, leading to a bound on the Hausdorff dimension and the Hausdorff measure, both from below.
As mentioned, we will utilise a so-called adding machine map. We recall the definition now.
Every point x in the Cantor set C has a unique coding $(x_n)_{n=1}^{\infty}$ using symbols 0 and 1. The first symbol is 0 if the point is to the left of $1/2$ and 1 if it is to the right. The second symbol decides if the point is on the left or on the right of the second level segments, etc. The relation between coding and the point on the real line is $\displaystyle x=\sum\limits_{n=1}^{+\infty} \frac{2x_n}{3^n}$. It follows that the (Euclidean) distance between points x and y is given by a formula $\displaystyle |x-y|= 2\left|\sum\limits_{n=1}^{+\infty} \frac{x_n-y_n}{3^n}\right|$.
The map T on the coding space, is defined by an inductive scheme:
A) Start with the first symbol: n = 1.
B) If the symbol $x_n= 0$, then add 1 to it (new $(Tx)_n=1$) and finish.
C) If the symbol $x_n=1$, then make it equal to 0 (new $(Tx)_n=0$), increase n by 1, and return to (B).
In other words – we scan the code for the first digit of $(x_n)$ equal to 0, set it to 1 and set all the previous digits (i.e. $(x_k)$ for k < n) to 0.
Note that this ‘program’ will run indefinitely, if our point x has code $[111\ldots]$ (i.e. if x = 1), but mathematically this is not an issue as we may set $T(1)=0$.
This map is called an adding machine, because it is equivalent to adding 1 to the first digit of a binary number, where the digits are written in reverse order (Figure 1). This transformation is a piecewise isometry and it preserves the Cantor measure µ (defined to be equally distributed on the cylinders of the same level/size).
Let us start computing the recurrence rate by taking the point $z^0=0=[0000\ldots]$ and denote the forward iterates as $T^n(z^0)=z^n$.
To calculate the lower limit (LHS) of (2.1), we only need to look at the subsequent closest returns, i.e. we can ignore all n for which there exists k < n such that $|T^k(z)-z|\leq|T^n(z)-z|.$ For our point z 0 (and in fact any starting point) it is obvious that those returns will occur for the iterates being powers of 2. More precisely,
Taking any α > 0, we get the following
Obviously, z 0 is not a typical point in this system. However, the general calculation is not that different. Take any point $x\in C$ and look at its code – $[x_1x_2x_3\ldots]$. As before, we only need to look at iterates that are of form 2n. The point $T^{2^n}(x)$ will have the first n symbols identical and the $(n+1)$st symbol will be different. What we do not control/know are the later symbols, which can lower the distance slightly, e.g. the distance between $[100\ldots]$ and $[010\ldots]$ is equal to $4/9$. However, it is easy to write down all the possibilities.
To sum up – the worst case is when we add 1 at the place where there is a symbol 1 followed by a 0.
Repeating (3.1) for a general point we get a slightly worse estimate
So if we take any $\alpha \lt \log_32$, we see that the lower limit is infinite so by using Boshernitzan’s result (1.1) we know that the Hausdorff measure $H_\alpha(C)$ is infinite, so the Hausdorff dimension $\mathrm{HD}(C)\geq \log_32$.
Take $\alpha=\log_32$ and the Cantor measure µ. Now Thm. 3 gives that either $H_{\log_32}$ is not σ-finite on C (thus $H_{\log_32}(C)=+\infty$) or $g(x)\geq \big(\frac{4}{9}\big)^\alpha$ for all x (where $g(x)=\frac{dH_\alpha}{d\mu}$).So
This is not a very strong result – in reality $H_{\log_32}(C)=1$, but on the other hand, the estimate has been acquired with little effort. The next section is dedicated to comments on improving this lower bound.
Note that, it is easy to apply this technique to other self-similar sets, which allow symbolic coding, e.g. the Sierpiński triangle. Unfortunately, the unoptimality of the lower bound may (and typically will) remain.
On the other hand, the coding is not strictly necessary. If a system has slow recurrence properties, then one could get meaningful results as well. An example of this is in a paper with M. Urbański [Reference Pawelec and Urbański4], where the underlying dynamics is that of an irrational rotation on a circle (which for numbers with bad Diophantine properties is in fact slowly recurrent).
4. Improvements and comments
4.1. Changing the metric
In the calculation above we used the Euclidean metric on the real line. However, on the Cantor set, there is another natural metric, coming from the symbolic representation. Define
Then, the diameter of C stays equal to 1. Also, the diameters of all the cylinder sets in this metric is equal to the diameters in the Euclidean one. And the Hausdorff measure (and dimension) are exactly as it was in the Euclidean case.
Let us check what happens to our recurrence estimates if we take this metric.
For any $z\in C$ we trivially get
Inserting this into the liminf estimates yields
Now, setting $\alpha=\log_32$ we get the estimate on the density $g(x)\geq1$, which in turns gives $H_\alpha(C)\geq1$.
We see that using this metric we get the optimal estimate.
4.2. Irremovable obstacle
Let us return to the Euclidean metric. One could ask a very natural question – would some different map yield a better estimate?
And while it is possible that there exists a map with even slower recurrence, there does not seem to be any chance of improving up to the optimal lower bound. This is shown by a result of Boshernitzan and Delecroix, [Reference Boshernitzan and Delecroix2], which we will utilise below.
To see the problem, let us try to apply our method to a circle S of length 1. To get the best bound we need to find a map T on the circle (preserving some probability measure µ) for which:
for µ-a.e. $x\in S$. This would prove that $H_1(S)\geq 1$.
First, let us see what should we assume on the measure. Its support needs to be the entire circle (we get nonsense otherwise). Also, the dimension of the measure needs to be 1 (reason as before). Finally, as the circle is geometrically identical at any point so should be the measure – leaving as only with the Lebesgue measure. The last argument is not precise at all, but we are not actually proving anything here, so for the sake of clarity let us leave it like that.
There is still plenty of maps preserving the Lebesgue measure. The simplest of those are the rotations by angle γ, denoted by Rγ. Then the recurrence speed does not depend on the starting point, but on the continued fraction expansion of γ. It is very well studied subject. By the classic result of Khinchin we know that the slowest return speed happens for the rotation by the golden mean (minus one) $\varphi = \frac{\sqrt{5}-1}{2}$. Khinchin’s Theorem also states that
with the equality for φ.
This shows that taking only the rotations, we have no chance of realising (4.2). And Boshernitzan and Delecroix generalise this to prove (in [Reference Boshernitzan and Delecroix2]) that inequality (4.3) is true for all maps preserving the Lebesgue measure. This shows that the method shown here has an irremovable obstacle in achieving the best bound. At least on the circle, but their proof suggest this would happen on every space.
Actually, their proof indicates that there exists a constant correction term (depending on the dimension of the space, and perhaps slightly on the geometry of the space) which one could apply to get the correct measure (for the circle this would be $\frac{1}{\sqrt{5}}$). Unfortunately, making this argument precise would require very general results on the optimal packing of points in rather arbitrary sets.
4.3. Dependence on the dimension
Their proof also suggests that the scale of the unoptimality of the lower bound (i.e. the difference between the obtained result and the true Hausdorff measure) depends on the dimension. In fact, it should shrink to zero as the dimension goes to zero.
We cannot prove this general result here. What we can do, however, is show the phenomenon for basic Cantor sets.
Let us compute the lower bound on the measure of the Cantor sets of varying dimensions. Fix $0 \lt a \lt \frac{1}{2}$. The Cantor set Ca in question is given by the maps:
The Hausdorff dimension of this set is trivially computed $\dim_H(C_a)=\frac{\log(2)}{-\log(a)}$.
We define the coding as before, take the same adding machine map and repeat the calculation as in the example. We get for any $x\in C_a$ (where $1-2a$ is the size of the gap between intervals of the same order, to which we may add the length of the next-level cylinder a 2):
where the inequality becomes equality for those n’s when $x_n=1$ and $x_{n+1}=0$ (exactly as before). Putting this into the lower limit yields
Put $\alpha=\frac{\log(2)}{-\log(a)}$, so that the sequence is constant. Then, we see
This expression goes to one as a goes to zero (note that then so does α). Thus the difference between the true Hausdorff measure (equal to one in this case) and our estimate does disappear in the limit.
5. Proofs
The proof of Theorem 3 is divided into a few steps. First we prove
Proposition 6. With the assumptions on the dynamical system as in Theorem 3, in addition suppose that $H_\alpha \ll \mu$ for some α > 0, and denote the corresponding density by $g:=\frac{dH_\alpha}{d\mu}$. Then for µ-almost every $x\in X$ we have
Remark. Note that g is the inverse of the usually taken density.
Proof. First, if g is unbounded, then the inequality is trivial. Denote $\beta:=\frac{1}{\alpha}$ and $s:=\operatorname{esssup}\, g \lt +\infty$. In this notation we need to show that $\mu(D)=0$, where
Take any ɛ > 0 and define
It suffices to show that for any ɛ > 0, this set has µ-measure zero.
Assume the opposite, i.e. $\mu(D_\varepsilon) \gt 0$ for some fixed ɛ. We will prove that this implies $H_\alpha(D_\varepsilon) \gt 0$.
Define $\tau(x)$ as the first return map of a point x into Dɛ. This map preserves the conditional measure ν, defined as
Now, if $H_\alpha(D_\varepsilon) =0$, then by a result of Boshernitzan cited in the introduction (1.2), we have
Denote by $n_k(x)$ the time of k-th return of x to Dɛ. Then $\tau^k(x) = T^{n_k(y)}(x)$ and also
for ν-a.e. y because of the ergodic theorem. Combining the two limits above we get
which is contradicts the definition of Dɛ. Thus we know that $H_\alpha(D_\varepsilon) \gt 0$.
As the Hausdorff measure is positive, there must exist a measurable, non-empty subset $U\subset D_\varepsilon$ with $\operatorname{diam} U \lt \varepsilon$ satisfying $ (1-\varepsilon)(\operatorname{diam} U)^\alpha \leq H_\alpha(U)$. (If all subsets of Dɛ satisfy the opposite inequality, then this trivially violates the definition of the Hausdorff measure). Put $u=\mu(U)$ and $r=\operatorname{diam} U$. From the definition of density g
Combining the two inequalities on U gives
Since T preserves µ, we can show that
Indeed, if all those intersections were empty, then U, $T^{-1}(U)$ … $T^{-[1/u]}(U)$ would be pairwise disjoint, and so
a contradiction.
Take n for which the intersection is non-empty and any $x\in T^{-n}U \cap U$. Then
so this n satisfies the condition in the definition of $D(\varepsilon)$. Using (5.9) and (5.10), then (5.8), we get
which contradicts the definition of Dɛ and ends the proof.
As the next step we will ‘localize’ the result above obtaining:
Proposition 7. With the assumptions as in Proposition 6, we have for µ-almost every $x\in X$
Remark 8. The density g is defined only almost everywhere, so g(x) really means
Remark 9. This result for $X=[0,1]$ (α = 1), has been proved in [Reference Ho Choe5]. That proof, however, works only in a 1-dimensional space.
Proof of Proposition 7
We will use basic ergodic properties as in the part of the previous proof. Fix x and r > 0 and consider S(y) – the first return function to the ball $B(x,r)$. S preserves the conditional measure ν, defined as
The density of this new measure is related to the old density:
Using Proposition 6 for a system $\left(B(x,r), \mathcal{F}|_{B(x,r)}, \nu, d|_{B(x,r)}, S\right)$ we get
Denote by $n_k(y)$ the time of k-th return of y to $B(x,r)$. Then $S^k(y) = T^{n_k(y)}(y)$ and from ergodic theorem
Note that the closest returns (to itself) of a point $y\in B(x,r)$ for the original system have to occur within the sequence $n_k(y)$. Thus, the limit in (5.14) transforms to
It remains to compile (5.13), (5.14) and (5.16) obtaining
Letting r → 0 we finish the proof.
We may finally conclude the proof of the theorem.
Proof of Theorem 3
As the result is proved µ-almost everywhere we may assume that all the points $x\in\operatorname{supp} \mu$. Take any α > 0 and the Hausdorff measure Hα. Recall that we assume that Hα is σ-finite. There exists a decomposition $H_\alpha = H^s + H^c$, where $H^s\bot \mu$ and $H^c \ll \mu$.
If $H^c=0$, then by the definition of singular measures there exists a set A such that $\mu(A)=1$ and $H_\alpha(A)=H^s(A)=0$. Define a new measure preserving system only on A, that is consisting only of points whose entire orbit stays in A. As A has full measure this new system has all the properties of the original system. Then by the result of Boshernitzan cited in the introduction (1.2)
Thus the limit is smaller than g(x) whatever the latter would be.
If $H^s=0$, then the result follows from Proposition 7.
Finally, if both the singular part and the absolutely continuous part exist, then – as above – take a set A of full µ-measure for which $H^s(A)=0$ and define the system restricted to A. Within this new system $H_\alpha\ll \mu$ and this case has already been solved above.
Acknowledgements
The research of Łukasz Pawelec was supported in part by National Science Centre, Poland, Grant OPUS21 ‘Holomorphic dynamics, fractals, thermodynamic formalism’, 2021/41/B/ST1/00461. The author would like to thank the referee for helpful comments and spotting the omission in one of the proofs.