1. Introduction
A $C^1$ map $f:S^1\rightarrow S^1$ is called expanding if $\min _{x\in S^1}|f'(x)|\geq \unicode{x3bb} _f>1$ . We call $\unicode{x3bb} _f$ the minimum expansion rate. Let $\operatorname {\mathrm {Exp}}^r(S^1)$ ( $r\geq 1$ ) be the subspace of all $C^r$ uniformly expanding maps, and for $\gamma>1$ , let $\operatorname {\mathrm {Exp}}^r_\gamma (S^1)$ be the space of all $C^r$ expanding maps whose minimum expansion rate is greater than or equal to $\gamma $ . Given any continuous map $f:S^1\rightarrow S^1$ , recall that the degree of f is defined to be value $F(x+1)-F(x)$ , where $F:\mathbb {R}\rightarrow \mathbb {R}$ is any lift of f. In addition to being well-defined independent of the choice of lift and the point $x\in S^1$ , it was proved by Shub in [Reference ShubShu69] that the degree is a complete topological conjugacy invariant for expanding maps on the circle.
Theorem 1.1. Let $f,g:S^1\rightarrow S^1$ be continuous expanding maps. Then there exists a homeomorphism $h:S^1\rightarrow S^1$ such that $h\circ f=g\circ h$ if and only if $\deg (f)=\deg (g)$ .
It is easy to check that h is Hölder continuous, but there is an obstruction to h having higher regularity at the periodic orbits of f. By formally differentiating the conjugacy equation, it is clear that h will not be differentiable if $(f^n)'(x)\neq (g^n)'(h(x))$ for at least one periodic point $x\in \operatorname {\mathrm {Fix}}(f^n)$ . As the next theorem shows, satisfying this obstruction at all periodic points is sufficient to conclude differentiability of h.
Theorem 1.2. Suppose $f,g:S^1\rightarrow S^1$ are $C^{1+\alpha }$ ( $\alpha>0$ ) expanding maps of the same degree, and fix a conjugacy h such that $h\circ f=g\circ h$ . Then, the map h is $C^{1+\alpha }$ if and only if for every point $p\in \operatorname {\mathrm {Fix}}(f^n)$ , $n\in \mathbb {N}$ , we have $(f^n)'(p)=(g^n)'(h(p))$ .
While not directly stated in this form, a proof of Theorem 1.2 can be found from de la Llave [Reference de la LlaveLla92]. A closely related result is the theorem of Shub and Sullivan [Reference Shub and SullivanSS85], which proves that $C^r$ ( $r\geq 2$ ) expanding maps of the circle which are conjugated by an absolutely continuous homeomorphism $h_1$ are in fact conjugated by a $C^r$ diffeomorphism $h_2,$ but it may be that $h_1\neq h_2$ . Martens and de Melo prove a more general version of Theorem 1.2 [Reference Martens and de MeloMM99, Corollary 2.9] applying to all $C^r$ Markov maps in one dimension. In particular, the main theorem of [Reference Martens and de MeloMM99] applies to unimodal maps (with critical points) without wild and solenoidal attractors and establishes periodic data rigidity for these systems. The picture for general unimodal maps is more complicated: Moreira and Smania proved [Reference Moreira, Smania, Griffiths, Mather and SteinMS14] that unimodal maps with Cantor set attractor, such as Feigenbaum maps and Fibonacci maps with high order at the critical point, are always absolutely continuously conjugated, but not necessarily smoothly. Thus, Shub and Sullivan’s theorem does not carry over to these maps. It would be interesting to study finite data rigidity (Theorem 1.3 below) for unimodal and Markov maps as well.
The goal of the present paper is to relax the conditions of Theorem 1.2 to hold at only finitely many periodic orbits, and then see how close f and g are to being smoothly conjugated.
Theorem 1.3. Let $\gamma>1$ be fixed and let $\mathcal {W}\subset \operatorname {\mathrm {Exp}}^2_\gamma (S^1)$ be a bounded subset. Then, there exist constants $K>0$ and $0<\unicode{x3bb} <1$ depending only on $\gamma>1$ and $\mathcal {W}$ such that the following holds: if $f,g\in \mathcal {W}$ are conjugated by a homeomorphism h ( $h\circ f=g\circ h$ ) and if there exists $N\in \mathbb {N}$ such that $(f^N)'(p)=(g^N)'(h(p))$ for every $p\in \operatorname {\mathrm {Fix}}(f^N)$ , then there exists a diffeomorphism ${h}_N\in C^2(S^1)$ such that $d_{C^0}(h,{h}_N)\leq K\unicode{x3bb} ^N$ . Moreover, for every $0<\unicode{x3bb} ^{1/2}<\unicode{x3bb} _0<1$ , there exists a constant $K'>0$ , such that if we let $f_N={h}_N^{-1}\circ g\circ {h}_N$ , then $d_{C^1}(f,{f}_N)\leq K'\unicode{x3bb} _0^N$ .
Remark 1. The conjugacy $h_N$ we construct will not explicitly depend on the parameter N. Instead, the notation is meant to emphasize how much of the periodic data we can use for our estimates. A more descriptive notation would be $h_{f,g,N}$ to emphasize the dependence on the expanding maps f and g. However, for the sake of brevity (as well as to avoid confusing notation for $f_N$ ), we will use the simplified notation $h_N$ .
A key step in the proof of Theorem 1.3 is to prove an effective version of Bowen’s equidistribution theorem (see Theorem 2.2 below), which allows us to estimate the difference between h and ${h}_N$ in terms of the periodic orbits of order N. The convergence rate $\unicode{x3bb} $ comes from the effective equidistribution rate and depends on the degree of the expanding maps and $\gamma $ . The proof of effective equidistribution is postponed until §3 and relies on the technique of Birkhoff cones for subshifts of finite type, which we recall in Appendix A. This technique is well known in the case of expanding maps; see for instance Baladi [Reference BaladiBal00].
2. Finite data rigidity
The goal of this section is to generalize Theorem 1.2 to allow for the derivatives of f and g to agree only at finitely many periodic points. Of course, f and g will not be $C^1$ conjugate, but we can find a map $C^1$ close to f which is $C^1$ conjugate to g. Moreover, this new map converges exponentially to f as the number of periodic points that the derivatives agree on increases.
Definition 2.1. For a function $f\in C^k(S^1)$ , $k\in \mathbb {N}$ , let $|f|_{C^k}=\sup |D^kf|$ denote the $C^k$ seminorm. Let $\|f\|_{\infty }$ denote the supremum norm of f and let $\|f\|_{C^k}=\|f\|_{\infty }+\sum _{i=1}^k|f|_{C^i}$ denote the $C^k$ norm. If $f:S^1\rightarrow S^1$ is $\alpha $ -Hölder continuous ( $0<\alpha \leq 1$ ), define
to be the Hölder seminorm and define $\|f\|_{\alpha }=\|f\|_{\infty }+|f|_{\mathrm {Lip}}$ to be the Hölder norm. For $\alpha =1$ , we get the Lipschitz seminorm and Lipschitz norm $|\cdot |_{\mathrm {Lip}}$ and $\|\cdot \|_{\mathrm {Lip}}$ , respectively.
It well known that for a $C^{1+\alpha }$ expanding map f on a compact manifold M, there exists a unique invariant probability measure $\mu _f$ which is absolutely continuous with respect to Lebesgue measure, whose density $\rho _f(x)$ is $C^\alpha $ and strictly positive. See Baladi [Reference BaladiBal00] for full details. Moreover, $\mu _f$ is the unique equilibrium state corresponding to the geometric potential $\psi _f=-\log (|f'(x)|)$ . For the uniformity claims of Theorem 1.3, it will be important to have uniform bounds on the densities $\rho _f$ .
Lemma 2.1. Let $\mathcal {W}$ be a bounded set in $\operatorname {\mathrm {Exp}}^2_\gamma (S^1)$ , $\gamma>1$ . Then, for every $0<\alpha <1$ , there exists $C>1$ such that for all $f\in \mathcal {W}$ and every $x\in S^1$ , $C^{-1}\leq \rho _f(x)\leq C$ and $|\rho _f|_{C^\alpha }<C$ .
Proof. We prove this using a Birkhoff cone argument. See Baladi [Reference BaladiBal00, §2.2] for details and Appendix A of the present paper for a similar argument for subshifts of finite type. For $L>0$ , consider the cone
Then, Baladi shows in [Reference BaladiBal00] that there exists $L>0$ depending only on $\gamma>1$ and $0<\alpha <1$ such that $\rho _f\in \Lambda _L$ . Since $\rho _f$ is the smooth density of a probability measure, there exists a point $y\in S^1$ such that $\rho _f(y)=1$ . Since $\rho \in \Lambda _L$ , for any $x\in S^1$ , we have
and we likewise get the lower bound for $\rho (x)$ by swapping the roles of x and y. Finally, notice that $\rho _f\in \Lambda _L$ is equivalent to $\log (\rho _f)\in C^\alpha (S^1)$ with $|\log (\rho _f)|_\alpha \leq L$ . Then, $\rho _f=\exp (\log (\rho _f))$ and
Remark 2. One can prove uniform bounds on the densities without a Birkhoff cone argument by instead carefully going through the arguments of Sacksteder’s proof [Reference Sacksteder and VidalSac74] of the existence of invariant densities.
We are now ready to introduce the main technical tools needed to prove Theorem 1.3. We will use the notation $S_{f,n}\psi (x)$ to denote the nth ergodic sum of $\psi $ at the point x with respect to the dynamics of f:
Definition 2.2. Let $(X,f)$ be a dynamical system and $\psi :X\rightarrow [0,\infty )$ a non-negative potential function. Whenever $\operatorname {\mathrm {Fix}}(f^n)\not =\emptyset $ , we can define a measure $\mu ^n_{f,\psi }$ by
where
is a normalization constant so that $\mu ^n_{f,\psi }$ is a probability measure. We call $\mu ^n_{f,\psi }$ the nth weighted discrete measure associated to the dynamics of f and the potential $\psi $ .
The measure $\mu ^n_{f,\psi }$ depends on three ingredients: the length n of periodic orbits under consideration, the dynamical system $f:X\rightarrow X$ , and the potential function $\phi :X\rightarrow [0,\infty )$ . In our setting, we will be considering a smooth expanding map $f:S^1\rightarrow S^1$ together with the associated geometric potential $\psi _f(x)=-\log (|f'(x)|)$ . When this is the case, since the potential depends on the dynamics, we will use the more compact notation $\mu ^n_f$ in place of $\mu ^n_{f,\psi _f}$ .
Let us recall the following theorem of Bowen [Reference BowenBow74].
Theorem 2.2. (Bowen’s equidistribution theorem)
Let $(X,d)$ be a compact metric space, $f:X\rightarrow X$ an expansive homeomorphism with the specification property, and $\psi \in C^f(X)$ . Then, there exists a unique equilibrium state $\mu _{\psi }\in \mathcal {M}(f)$ given by
where the limit converges in the weak $^*$ -topology.
Remark 3. Theorem 2.2 is sufficient for proving Theorem 1.2 (though Bowen’s theorem is stated for invertible systems, the same result is true for uniformly expanding maps on the circle with essentially the same proof; see also Theorem 2.3 below). Indeed, from the assumption on periodic data, it follows that $\mu _f^n=h^*\mu ^n_g$ for all $n\in \mathbb {N}$ , where $h^*\mu _g$ denotes the pullback measure by h. Then, taking weak $^*$ -limits and using Theorem 2.2, we conclude that $\mu _f=h^*\mu _g$ . Defining the functions
and integrating $\mu _f=h^*\mu _g$ from $0$ to x, we find that $I_f(x)=I_g(h(x))$ or
Equation (2.1) provides us with insight as to how we should define $h_N$ in the proof of Theorem 1.3.
To estimate $d_{C^0}(h,h_N)$ in Theorem 1.3, we will need the following effective version of Theorem 2.2.
Theorem 2.3. (Effective equidistribution)
Let $\mathcal {W}$ be as bounded set in $\operatorname {\mathrm {Exp}}^2_\gamma (S^1)$ , $\gamma>1$ . Let $f\in \mathcal {W}$ , $\psi _f:S^1\rightarrow \mathbb {C}$ be the geometric potential of f with unique equilibrium state $\mu _f$ , and let $\mu ^N_{f}$ be the Nth weighted discrete measure associated to f and $\psi _f$ . Then, there exist constants $C'>0$ and $0<\tau <1$ , depending only on $\mathcal {W}$ , such that for every Lipschitz function $\phi :S^1\rightarrow \mathbb {C}$ , we have
We will defer the proof of Theorem 2.3 to §3. See Kadyrov [Reference KadyrovKad16, Theorem 1.5] for a more general result for the measure of maximal entropy of a subshift of finite type, and Rühr [Reference RührRüh21] for a version applying to equilibrium states of countable state shifts.
To prove Theorem 1.3, we will need to apply a version of Theorem 2.3 to the characteristic functions $\chi _{[0,x]}$ for every $x\in S^1$ .
Lemma 2.4. There exist constants $K>0$ and $0<\unicode{x3bb} <1$ such that for every $f\in \mathcal {W}$ , every $x\in S^1$ , and every $N\in \mathbb {N}$ ,
Here and throughout the paper, we write $\int _0^x\,d\mu _f^N$ as shorthand notation for $\int \chi _{[0,x]}\,d\mu _f^N=\mu _f^N([0,x])$ .
Proof. We would like to apply Theorem 2.3 but cannot do so directly since the characteristic function $\chi _{[0,x]}$ is not Lipschitz continuous. Instead, we will approximate $\chi _{[0,x]}$ by a Lipschitz function $\phi _x$ and show that the effective equidistribution in equation (2.2) of $\phi _x$ can be passed on to $\chi _{[0,x]}$ , albeit with a slower rate of convergence. To find an appropriate choice of $\phi _x$ , we will construct a one-parameter family of Lipschitz functions $\phi _x^s$ for $s\in [0,\tau ^{N/2}]$ , where $\tau $ is as in Theorem 2.3, satisfying the following properties:
-
(1) the family $\phi ^s_{x}$ varies continuously with s in the $C^0$ -topology;
-
(2) $\phi ^0_{x}\leq \chi _{[0,x]}$ and $\phi ^{\tau ^{N/2}}_{x}\geq \chi _{[0,x]}$ ;
-
(3) for every $s\in [0,\tau ^{N/2}]$ , $|\phi ^s_{x}|_{\mathrm {Lip}}\leq \tau ^{-N/2}$ ;
-
(4) for every $s\in [0,\tau ^{N/2}]$ , $|\phi ^s_{x}-\chi _{[0,x]}|=0$ except on a set $\Omega _N$ of Lebesgue measure $m(\Omega _N)\leq 2\tau ^{N/2}$ .
The definition of the families $\phi ^s_x$ depends on the point $x\in S^1$ , though the typical graph of the functions are all similar and illustrated in Figure 1. We split the construction of the $\phi _x^s$ into three general cases.
Case 1. Assume that $2\tau ^{N/2}\leq x\leq 1-2\tau ^{N/2}$ : Fix $2\tau ^{N/2}\leq x\leq 1-2\tau ^{N/2}$ and define a family of continuous functions as follows:
It can be easily shown that the above family has the stated properties. Next, we estimate
Let us consider each term on the right side of equation (2.4) separately. The first term can be rewritten as
By property (4) of the family $\phi _x^s$ , $|\chi _{[0,x]}(y)-\phi _x^s(y)|=0$ except on a set of Lebesgue measure less than $2\tau ^{N/2}$ (depending on s), and is otherwise bounded above by $1$ . Furthermore, by Lemma 2.1, there exists $C>1$ such that the density $\rho _f$ of $\mu _f$ is bounded above by C. Hence, $|\chi _{[0,x]}(y)-\phi _x^s(y)|=0$ except on a set of $\mu _f$ -measure less than $2C\tau ^{N/2}$ , and is otherwise bounded above by $1$ . Therefore,
For the second term on the right side of equation (2.4), we observe that by property (3), we have $\|\phi _x^s\|_{\mathrm {Lip}}=1+\tau ^{-N/2}$ . We can therefore apply Theorem 2.3 to find
We claim that the final term on the right side of equation (2.4) is zero for some choice of s. Define
Then by property (2), since $\Phi (0)$ is the integral of a strictly negative function with respect to a positive measure, we have $\Phi (0)\leq 0$ , and likewise $\Phi (\tau ^{N/2})\geq 0$ . We claim that $\Phi (s)$ is a continuous function of s. Indeed, by property (1), fixing $\varepsilon>0$ , we can find a $\delta>0$ such that if $|s_1-s_2|<\delta $ , then $\|\phi _x^{s_1}-\phi _x^{s_2}\|_{\infty }<\varepsilon $ . Then,
By the intermediate value theorem, we can therefore choose some $0<s<\tau ^{N/2}$ so that $\Phi (s)=0$ . Therefore, combining these three bounds, we have the following bound on equation (2.4):
which is exactly equation (2.3), with $\unicode{x3bb} =\tau ^{1/2}$ .
Case 2. Assume that $x>1-2\tau ^{N/2}$ : The proof is nearly identical as in Case 1, but we use a slightly different family of Lipschitz functions:
for $0\leq s\leq ({1+x})/{2}$ , and
for $({1+x})/{2}\leq s\leq \tau ^{N/2}$ . The remainder of the proof is identical to Case 1.
Case 3. Assume that $x\leq 2\tau ^{N/2}$ : The proof is again identical to Case 1 but with the following family:
for $0\leq s\leq \tau ^{N/2}-{x}/{2}$ , and
for $\tau ^{N/2}-{x}/{2}\leq s\leq \tau ^{N/2}$ . Observe that for every $0\leq s\leq \tau ^{N/2}$ , $\|\phi _x^s\|_{\infty }\leq 1$ , so that $\|\phi _x^s\|_{\mathrm {Lip}}\leq 1+ \tau ^{-N/2}$ , so the remainder of the argument in Case 1 carries over verbatim.
Proof of Theorem 1.3
We begin by recalling that when we have matching of all periodic data, the conjugacy h could be expressed as the smooth composition in equation (2.1). Observe that the latter expression $I_g^{-1}\circ I_f$ is a well-defined $C^{1+\alpha }$ function without any hypotheses on the periodic data. For this reason, we define $h_N:=I_g^{-1}\circ I_f$ .
Observe that since $(f^N)'(p)=(g^N)'(h(p))$ for every $p\in \operatorname {\mathrm {Fix}}(f^N)$ , we have that $\mu _f^N=h^*\mu _g^N$ :
We now calculate
where $\sup |(I_g^{-1})'|=\sup |\rho _g^{-1}|\leq C$ . By symmetry, it suffices to show that
uniformly in $x\in S^1$ . This is precisely the content of Lemma 2.4. Therefore, we have shown that for every $x\in S^1$ , $|{h}_N(x)-h(x)|\leq K\unicode{x3bb} ^{N}$ , where $\unicode{x3bb} =\tau ^{N/2}$ . This proves the first statement of Theorem 1.3.
As a first consequence, we obtain a bound on the $C^0$ distance between f and $f_N$ :
Note that $Lip({h}_N^{-1}\circ g)=\sup |D({h}_N^{-1}\circ g)|=\sup (|D(I_f^{-1}\circ I_g\circ g)|{\kern-1pt}\leq{\kern-1pt} ({\max \rho _g}/{\min \rho _f}) \max |g'|$ , which is uniformly bounded in $\mathcal {W}$ . Absorbing these uniform constants into K, we get
To finish the proof, it remains to establish the stated $C^1$ exponential closeness of f and $f_N$ . We will do so by interpolating between the $C^0$ exponential bound in equation (2.5) and a uniform $C^{1+\alpha }$ bound we will establish below. The following interpolation lemma we will use is elementary, but we include the proof for completeness.
Lemma 2.5. Fix $M>0$ and $0<\alpha \leq 1$ . Let $\phi :S^1\rightarrow S^1$ be a $C^{1+\alpha }$ function with $|\phi '|_{C^\alpha }$ $<M$ , and let $\varepsilon ,\delta>0$ be such that
then $|\phi |_{C^1}<({M}/({\alpha +1}))\delta ^\alpha +\varepsilon $ .
Proof of Lemma 2.5
Suppose first that $\phi (0)=0$ and $\phi '(0)=\sup |\phi '|=:\varepsilon '$ , and take $|x|>\delta $ . Then, since the $\alpha $ -Hölder seminorm of $\phi '$ is bounded by M, we have
Putting this together with
we find
It follows that
where in the last line, we let $|x|\rightarrow \delta $ . Finally, if $|\phi |_{C^0}$ is attained at some other point $x_0\in S^1$ , then simply apply the preceding argument to $\tilde {\phi }(x)=\phi (x+x_0)-\phi (0)$ .
We now return to finish the proof of Theorem 1.3. We will apply Lemma 2.5 to the function $F={f}_N-f$ . Let $\varepsilon =2K'\unicode{x3bb} ^{N/2}$ and $\delta =\unicode{x3bb} ^{N/2}$ . Then,
It remains to prove that $|F'|_{C^\alpha }$ is uniformly bounded for $f\in \mathcal {W}$ for every $\alpha <1$ . We have
and since f is uniformly bounded in the $C^2$ seminorm, it will be uniformly bounded in the $C^{1+\alpha }$ seminorm. So it remains to uniformly bound $|{f}_N'|_{C^\alpha }=|(h_N^{-1}\circ g\circ h_N)'|_{C^\alpha }=|((h_N^{-1})'\circ g\circ h_N)(g'\circ h_N)h_N'|_{C^\alpha }$ . By the product rule for the $\alpha -$ Hölder seminorm and symmetry between $h_N=I_g^{-1}\circ I_f$ and $h_N^{-1}=I_f^{-1}\circ I_g$ , it suffices to uniformly bound $|h_N'|_{C^\alpha }=|{\rho _f}/({\rho _g\circ h_N})|_{C^\alpha }$ . By Lemma 2.1, we can uniformly bound $|\rho _f|_{C^\alpha }$ for $f\in \mathcal {W}$ and hence, by properties of the $\alpha -$ Hölder seminorm, we can uniformly bound $|{\rho _f}/({\rho _g\circ h_N})|_{C^\alpha }.$ Therefore, we may apply Lemma 2.5 and we find that $|F|_{C^1}\leq ({M}/({\alpha +1}))\delta ^\alpha +\varepsilon = ({M}/({\alpha +1}))(2K'\unicode{x3bb} ^{\alpha N/2})+\unicode{x3bb} ^{N/2}=K"\unicode{x3bb} ^{\alpha N/2}$ . For $\alpha <1$ such that $\unicode{x3bb} ^{\alpha /2}=\unicode{x3bb} _0$ , we get the desired conclusion.
Corollary 2.6. Let $f,g\in \operatorname {\mathrm {Exp}}^{r+1+\alpha }_\gamma (S^1)$ for $r\in \mathbb {N}$ , $r\geq 2$ , and suppose that $d_{C^{r+1+\alpha }}(f,g)<C_0$ . Under the same hypotheses of Theorem 1.3, we have that there exists a constant $K_r>0$ independent of f such that $d_{C^r}(f,{f}_N)\leq K_r\unicode{x3bb} ^{2^{-r}N}$ .
Proof. We proceed by induction. The base case $r=2$ follows exactly as in Theorem 1.3, except the added assumption that our systems are $C^{2+\alpha }$ allow us to get uniform bounds on $|F'|_{C^1}$ (using the argument of Lemma 2.1) and apply Lemma 2.5 with $\alpha =1$ . Letting $F={f}_N-f$ , we assume by induction that we have proven $|F|_{C^r}\leq K_r\unicode{x3bb} ^{2^{-r}N}$ . To obtain a similar estimate for $|F|_{C^{r+1}}$ , we apply Lemma 2.5 to the function $F^{(r)}$ , with $\varepsilon =2K_r\unicode{x3bb} ^{2^{-r-1}N}$ and $\delta =\unicode{x3bb} ^{2^{-r-1}N}.$ For these choices, we find
The assumption that maps are $C^{r+2+\alpha }$ is so that we can compactly embed the set $\mathcal {W}$ in $C^{r+2}$ , thereby getting uniform bounds on $|F^{(r+1)}|_{C^1}$ . The conclusion now follows Lemma 2.5 exactly as in the proof of Theorem 1.3.
The next corollary establishes a similar estimate on the exponential decay of $d_{C^r}(f,{f}_N)$ without such a loss of exponent under the stronger assumption that f and g are close in the $C^k$ -topology for all k.
Corollary 2.7. Let $f,g\in \operatorname {\mathrm {Exp}}^{\infty }_\gamma (S^1)$ and suppose that $\sup _{k\geq 0}d_{C^k}(f,g)<C_0$ . Then, for any $r\in \mathbb {N}$ and any $0<\unicode{x3bb} ^{1/2}<\unicode{x3bb} _0<1$ , there exists a constant $K_r'>0$ such that under the hypotheses of Theorem 1.3, we have $d_{C^r}(f,{f}_N)\leq K_r'\unicode{x3bb} _0^N$ .
Proof. The proof follows from interpolation theory on the spaces $C^r(S^1)$ (see Lunardi [Reference LunardiLun18, Remark 1.22]). For $k_1<m<k_2\in \mathbb {N}$ , we have that $\|\phi \|_{C^m}\leq C_{k_1,k_2,m}\|\phi \|_{C^{k_1}}^{1-t}\|\phi \|_{C^{k_2}}^t$ for any $\phi \in C^{k_2}(S^1)$ , where $t=({m-k_1})/({k_2-k_1})$ . We will apply this with $\phi =F={f}_N-f, k_1=1$ , and by taking $k_2$ sufficiently large, we have that t can be made arbitrarily close to $0$ . We choose $k_2$ so that $\unicode{x3bb} ^{({1-t})/{2}}\leq \unicode{x3bb} _0$ . We want to bound the term $\|F\|_{C^{k_2}}$ using the bound $d_{C^{k_2+2}}(f,g)<C_0$ . Applying Corollary 2.1 gives us $\|F\|_{C^{k_2}}\leq K_{k_2}\unicode{x3bb} ^{{tN}/{2^{k_2}}}\leq K_{k_2}$ . Hence, by our interpolation inequality (collecting all constants into C),
3. Effective equidistribution
We begin by reviewing the basic definitions of subshifts of finite type and transfer operators. For a more detailed treatment, see Baladi [Reference BaladiBal00], and Parry and Pollicott [Reference Parry and PollicottPP90]. Let A be an irreducible and aperiodic $0,1$ -matrix, which we will refer to as a transition matrix, and consider the set
We interpret $\Sigma _A^+$ as the set of all sequences in m-symbols that are allowed by the transition matrix A. Consider the left shift map $\sigma _A^+:\Sigma _A^+\rightarrow \Sigma _A^+$ defined by $(\sigma _A^+(x))_n=x_{n+1}$ . We refer to the dynamical system $(\Sigma _A^+,\sigma _A^+)$ as a one-sided subshift of finite type. We can analogously define two-sided subshifts of finite types. When our transition matrix is clear, we shall denote the left shift map simply as $\sigma $ .
We topologize $\Sigma _A^+$ with the metric $d_{\theta }(x,y)=\theta ^{\max \{n\geq 0|x_i=y_i, 0\leq i<n\}}$ , where $0<\theta <1$ is a fixed constant. Notice that with respect to this metric, $\sigma $ is a $\theta $ -expansion. Let $\mathcal {F}_\theta ^+$ denote the Banach space of all functions $\phi :\Sigma _A^+\rightarrow \mathbb {C}$ which are Lipschitz continuous with respect to this metric. Denote by $|\cdot |_\theta $ and $\|\cdot \|_\theta $ the Lipschitz seminorm and Lipschitz norm with respect to this metric, respectively. For $1\leq i\leq m,$ let $[i]=\{x\in \Sigma _A^+|x_0=i\}$ , which we call the cylinder set at i. We will use the following important but simple estimate.
Lemma 3.1. If $\phi \in \mathcal {F}_\theta $ and $[i]$ is a cylinder set, then $\|\chi _{[i]}\phi \|_\theta <2\|\phi \|_\theta $ .
Proof. Note that we always have $\|\chi _{[i]}\phi \|_{\infty }\leq \|\phi \|_\infty \leq \|\phi \|_\theta $ , so it remains to prove that $|\chi _{[i]}\phi |_\theta \leq \|\phi \|_\theta $ . Suppose that $x\neq y$ . If $x,y\in [i]$ , we have
If $x\not \in [i]$ and $y\not \in [i]$ , then
Finally, if $x\in [i]$ and $y\not \in [i]$ , then since $d_\theta (x,y)=1$ , we have
Taking the supremum over $x\neq y$ yields $|\chi _{[i]}\phi |_\theta \leq \|\phi \|_\theta ,$ as desired.
Our primary tool in this section will be the Ruelle transfer operator.
Definition 3.1. Fix a weight function $\psi \in \mathcal {F}_\theta ^+$ and define the Ruelle transfer operator $\mathcal {L}_\psi :\mathcal {F}_\theta ^+\rightarrow \mathcal {F}_\theta ^+$ by the formula
By the Ruelle–Perron–Frobenius theorem (see [Reference BaladiBal00, Theorem 1.5]), the operator $\mathcal {L}_\psi $ is quasi-compact, with a unique maximal positive simple eigenvalue $\unicode{x3bb} =e^{P(\psi )}$ corresponding to a strictly positive eigenfunction $\rho $ , and all other points of the spectrum lie in a strictly smaller disc. Let us further assume that the transfer operator is normalized so that $\mathcal {L}_\psi (1)=e^{P(\psi )}$ (which can always be accomplished by replacing the weight $\psi $ with $\overline {\psi }=\psi +\log (\rho )-\log (\rho )\circ \sigma $ , and observing that $P(\psi )=P(\overline {\psi }))$ . Then the eigenmeasure $\mu _\psi $ corresponding to the eigenvalue $e^{P(\psi )}$ of the dual operator $\mathcal {L}_\psi ^*$ is the unique equilibrium state of the potential $\psi $ . Observe that
What follows is the analog of Theorem 3.2 for subshifts of finite type.
Theorem 3.2. (Effective equidistribution for equilibrium states)
Let $(\Sigma _A^+,\sigma _A^+)$ be a subshift of finite type, where the $m\times m$ transition matrix A is irreducible and aperiodic, and let $\psi \in \mathcal {F}_\theta ^+$ be a Lipschitz continuous potential. Then, there exists constants $C>0$ and $0<\tau <1$ such that for any $\phi \in \mathcal {F}_\theta ^+$ and all $n\in \mathbb {N}$ ,
where $\mu _\psi $ is the unique equilibrium state of $\psi $ .
Remark 4. To be consistent with our previous notation in §2, we should write $\mu ^n_{\sigma ,\psi }$ for the weighted discrete measure. However, since the dynamics on the shift space will always be the left shift map $\sigma $ , we do not need to specify in our notation and will instead simply write $\mu ^n_\psi $ .
Proof. By replacing $\phi $ by $\phi -\int \phi \,d\mu _\psi $ , we may assume that $\int \phi \,d\mu _\psi =0$ . We need to show that
where $Z_n$ is the normalization constant. By Katok and Hasselblatt [Reference Katok and HasselblattKH95, Proposition 20.3.3], there exists a constant $D>0$ such that $({1}/{D})e^{nP(\psi )}\leq Z_n\leq De^{nP(\psi )}$ (an inspection of the proof reveals that this constant D can be made uniform). Let $[i]=\{x\in \Sigma _A^+|x_0=i\}$ , and for a string $\underline {i}=(i_0,\ldots ,i_{n-1})$ , let us denote its length by $|\underline {i}|=n$ and its cylinder set by $[\underline {i}]=\{x\in \Sigma _A^+|x_0=i_0,\ldots ,x_{n-1}\}$ . For each $1\leq i\leq m$ , fix any point $x_i\in [i]$ , and for each string $\underline {i}$ , fix a point of period n, $x_{\underline {i}}\in [\underline {i}]$ , if one exists, and let $x_{\underline {i}}\in [\underline {i}]$ be arbitrary otherwise. We first claim that
To see this, we expand out each term in the right sum:
where $\underline {i}x=(i_0,\ldots ,i_{n-1},x_0,x_1,\ldots )$ denotes the only inverse branch of $\sigma ^n$ that contributes to the sum due to the characteristic function. Since each point of $\operatorname {\mathrm {Fix}}(\sigma ^n)$ lies in a unique cylinder set $[\underline {i}]$ , and each such cylinder set contains at most one period-n point, we see that all periodic points are accounted for in the sum over $|\underline {i}|=n$ . Consider the estimate
To estimate both terms on the right, we will first decompose the transfer operator as a sum of its projection onto the eigenspace of $e^{P(\psi )}$ , and the orthogonal projection: $\mathcal {L}_\psi =\mathcal {P}+\mathcal {N}$ , where $\mathcal {P}(\phi )=e^{P(\psi )}\int \phi \,d\mu _\psi $ , and $\mathcal {N}$ has spectral radius $re^{P(\psi )}$ with $0<r<1$ . The second term can thus be estimated as
using Lemma 3.1 and the spectral radius formula (see also Lemma 3.3 for the uniform version we will need later in the case of the symbolic coding of expanding maps), where $\varepsilon>0$ is arbitrary and $C>0$ depends on $\varepsilon $ (recall that m is the size of our alphabet and is independent of n).
It remains to estimate
If $\underline {i}=(i_0,\ldots ,i_{n-1})$ , we let $\underline {j}(\underline {i})=(i_0,\ldots ,i_{n-2})$ . We now telescope our above series:
(The preceding expansion was not novel, and can be found for instance in [Reference Pollicott and SharpPS01, Lemma 3].) We now take the absolute value of both sides and observe that $d_\theta (x_{\underline {i}},x_{\underline {j}(\underline {i})}) =\theta ^{m-1}$ :
In this last inequality, we made use of the fact that $\mathcal {N}\mathcal {P}=0$ to get the bound
We next estimate the term $\|\mathcal {L}_\psi ^{m}(\chi _{[\underline {i}]}\phi ))\|_\theta =\|e^{(S_m\psi )\circ \sigma _{\underline {i}}^{-1}}(\phi \circ \sigma _{\underline {i}}^{-1})\|_\theta $ , where $\sigma _{\underline {i}}^{-1}(x)=\underline {i}x$ is an inverse branch of $\sigma ^n$ :
To calculate the seminorms $|\phi \circ \sigma _{\underline {i}}^{-1}|_\theta $ and $|S_m\psi \circ \sigma _{\underline {i}}^{-1}|_\theta $ , we use the fact that $\sigma _{\underline {i}}^{-1}$ is a $\theta ^m$ -contraction:
A similar (and simpler) calculation shows that $|\phi \circ \sigma _{\underline {i}}^{-1}|_\theta \leq \theta ^m\|\phi \|_\theta $ . Putting this all together, we find that
where $C'=\max \{1,{|\psi |_\theta }/({1-\theta })\}$ . Thus, using the spectral radius bound for $\mathcal {N}$ , we find that
where $\kappa =\max \{\theta ,r+\varepsilon \}<1$ . We have seen previously that $|S_m\psi \circ \sigma _{\underline {i}}^{-1}(x)-S_m\psi \circ \sigma _{\underline {i}}^{-1}(y)|\leq ({|\psi |_\theta }/({1-\theta }))\,d_\theta (x,y)\leq K<\infty $ (since the diameter of $\Sigma _A^+$ is finite). Taking the exponential of both sides, we obtain the following bounded distortion estimate:
for any $x,y$ in the domain of $\sigma _{\underline {i}}^{-1}$ . Notice that the domain of $\sigma _{\underline {i}}^{-1}$ is completely determined by the last symbol in the string $\underline {i}$ . For each $\underline {i}$ , let $y_{\underline {i}}$ be such that $e^{(S_m\psi )\circ \sigma _{\underline {i}}^{-1}(y_{\underline {i}})}=|e^{(S_m\psi )\circ \sigma _{\underline {i}}^{-1}}|_{\infty }$ , and let $z_{\underline {i}}$ be any point in the domain of $\sigma _{\underline {i}}^{-1}$ that only depends on the last symbol $i_{m}$ . Then,
Therefore,
Combining the estimates in equations (3.2), (3.3), and (3.4), we have shown
with $\kappa +\varepsilon <1$ . We therefore have
Finally, letting $\tau =(\kappa +\varepsilon )e^\varepsilon <1$ for sufficiently small $\varepsilon>0$ , this gives equation (3.1).
Remark 5. By standard arguments, it is easy to deduce the same equidistribution result for two-sided shifts.
To pass to the proof of Theorem 2.3, we will use Markov partitions. It is well known that repellers and Axiom A diffeomorphisms admit finite Markov partitions; see Przytycki and Urbanski [Reference Przytycki and UrbanskiPU10, Theorem 3.5.2] in the case of expanding maps and Bowen [Reference BowenBow70] for the case of Axiom A diffeomorphisms. More precisely, if $(J,T)$ is a repeller or $(\Omega (T),T)$ is an Axiom A diffeomorphism, then there exists a subshift of finite type $(\Sigma _A^+,\sigma )$ and a semiconjugacy $\pi :\Sigma _A^+\rightarrow J$ (respectively $(\Sigma _A,\sigma )$ for $(\Omega (T),T)$ ). If the transformation T is mixing, then the transition matrix A is irreducible and aperiodic. For an appropriately chosen $\theta $ , Lipschitz functions f defined on J or $\Omega (T)$ can be lifted to a Lipschitz function $f\circ \pi $ . We proceed to prove Theorem 2.3 in the case of one-dimensional expanding maps, where the passage to a Markov partition is simplest. However, to obtain a uniform effective equidistribution for expanding maps, we need uniform effective equidistribution of corresponding lifted systems in Theorem 3.2. Uniformity is lost in Theorem 3.2 when we apply the spectral radius formula to get bounds on $\|\mathcal {N}^n\|_\theta $ . However, for the particular systems we are considering, we can obtain uniform bounds on $\|\mathcal {N}^n\|_\theta $ by using the Birkhoff cone technique, adapted to subshifts of finite type by Naud in [Reference NaudNau04].
Lemma 3.3. There exists $C_{\mathcal {W}}>0$ and $0<\tau _{\mathcal {W}}<1$ such that the following is true. Given $f\in \mathcal {W}$ , let $\pi _f$ be the associated semiconjugacy to the full shift on $\deg (f)$ -symbols, and let $\psi _f=-\log (f'\circ \pi _f)$ . If $\mathcal {L}_{\psi _f}h_f= e^{P(\psi _f)}h_f$ , we consider the normalized potential $\overline {\psi }_f=\psi _f+\log (h_f)-\log (h_f\circ \sigma )$ . Then, for all $n\in \mathbb {N}$ , $\|\mathcal {N}_{\overline {\psi }_f}^n\|_\theta < C_{\mathcal {W}}\tau _{\mathcal {W}}^ne^{nP(\overline {\psi }_f)}$ .
We review the Birkhoff cone construction and prove Lemma 3.3 in Appendix A.
Proof of Theorem 2.3
Consider a degree k expanding map $f:S^1\rightarrow S^1$ . Then, the Markov partition of $(f,S^1)$ consists of k closed intervals which are each mapped under f to the entirety of $S^1$ . Consequentially, $(f, S^1)$ is semiconjugated to the full shift on k symbols, $(\sigma , \Sigma ^+)$ . Let $\phi :S^1\rightarrow \mathbb {C}$ be a Lipschitz function. We can then lift $\phi $ to a Lipschitz function $\phi \circ \pi :\Sigma ^+\rightarrow \mathbb {C}$ . Likewise, let $\psi _f:S^1\rightarrow \mathbb {C}$ be the geometric potential with equilibrium state $\mu _f$ , and let $\psi _f\circ \pi :\Sigma ^+\rightarrow \mathbb {C}$ be the lifted potential with equilibrium state $\mu _{\psi _f\circ \pi }$ . We claim that the pushforward of $\mu _{\psi _f\circ \pi }$ under $\pi $ is $\mu _{f}$ , that is, $\pi _*\mu _{\psi _f\circ \pi }=\mu _{f}$ . Unfortunately, it is not true that $\pi _*\mu _{\psi _f\circ \pi }^n=\mu _{f}^n$ . Since the semiconjugacy is not injective, we may have multiple distinct periodic orbits for $(\Sigma ^+,\sigma )$ get mapped under $\pi $ to the same periodic orbit of $(S^1,f)$ .
We know that $|\operatorname {\mathrm {Fix}}(\sigma ^n)|=k^n$ and $|\operatorname {\mathrm {Fix}}(f^n)|=k^n-1$ . Moreover, distinct periodic orbits of f can be lifted to distinct periodic orbits of $\sigma $ with the same period, so we have that for each n, only two distinct points of $\Sigma ^+$ of period n get mapped under $\pi $ to the same point. Let $\nu _{\psi _f\circ \pi }^n$ be any measure on $\Sigma ^+$ such that $\pi ^*\nu _{\psi _f\circ \pi }^n=\mu _{f}^n$ . Then,
The first term in equation (3.5) can be bounded by $C\|\phi \circ \pi \|_\theta \tau ^n$ using Theorem 3.2. To estimate the last term of equation (3.5), we write
where $A_n$ is any subset of $\operatorname {\mathrm {Fix}}(\sigma ^n)$ that is mapped bijectively to $\operatorname {\mathrm {Fix}}(f^n)$ , and $Z_n({\psi })$ is a normalization constant. Then, if we let $y\in \operatorname {\mathrm {Fix}}(\sigma ^n)/A_n$ , we have
In the case of expanding maps, we have that $P(\psi _f)=0$ and $S_n(\psi _f)(\pi (y))\leq -n\log \unicode{x3bb} _f$ , where $\unicode{x3bb} _f>1$ is the expansion constant for f, that is, $|f'(x)|\geq \unicode{x3bb} _f$ for all $x\in S^1$ . This proves that
for some $0<\tau <1$ (not necessarily the same $\tau $ as in Theorem 3.2). Clearly, $\|\phi \circ \pi \|_{\infty }\leq \|\phi \|_{\infty }$ . Moreover,
so that $\|\phi \circ \pi \|_\theta \leq \max \{1,|\pi |_{\theta }\}\|\phi \|_{\mathrm {Lip}}.$ Thus, absorbing all constants into C, we have
as desired.
Remark 6. One can prove effective equidistribution for general expanding repellers and Axiom A diffeomorphism using Markov partitions in the same way, but more care is needed to handle the difference between the measure $\nu _{\psi _f\circ \pi }^n$ and $\mu _{\psi _f\circ \pi }^n$ .
Acknowledgments
The author would like to express his sincerest thanks to Andrey Gogolev for suggesting the problem, and for his patience and invaluable guidance. The author would also like to thank James Marshall Reber for many useful discussions and for his help in proving Theorem 3.2. Thanks to Daniel Smania for providing valuable comments on the draft of this paper. Finally, the author would like to express his gratitude to the referee for their comments which greatly helped with the readability of the present paper.
A. Appendix. Birkhoff cones for subshifts of finite type
Rather than deducing our desired bounds on $\|\mathcal {N}_{\psi _f}^n\|_\theta $ as a consequence of quasi-compactness of the transfer operator (the standard approach of the Ruelle–Perron–Frobenius theorem), we use the technique of Birkhoff cones. The idea is to show that the transfer operator contracts a certain cone of Lipschitz functions with respect to a ‘pseudo-metric’ and to then establish the leading eigenfunction as a fixed point with respect to this pseudo-metric. The benefit to this approach is that we can establish explicit bounds on $\|\mathcal {N}_{\psi _f}^n\|_\theta $ which will be uniform in our set $\mathcal {W}$ . Then, one can actually deduce quasi-compactness as a consequence of this bound. This approach is standard for uniformly expanding maps; see Baladi [Reference BaladiBal00, §2.2]. For subshifts of finite type, we follow Naud [Reference NaudNau04] closely, applied to the specific case of the full shift that we need, and for which certain technical difficulties vanish.
Definition A.1. A subset $\Lambda \subset \mathcal {B}/\{0\}$ of a Banach space $\mathcal {B}$ is called a cone if $\unicode{x3bb} \phi \in \Lambda $ for all $\phi \in \Lambda $ and all $\unicode{x3bb}>0$ . The cone is called closed if $\Lambda \cup \{0\}$ is closed, and $\Lambda $ is called convex if $\psi _1+\psi _2\in \Lambda $ for every $\psi _1,\psi _2\in \Lambda $ . A cone $\Lambda $ induces a partial order $\leq _\Lambda $ on $\mathcal {B}$ by defining $\psi \leq _\Lambda \phi \iff \phi -\psi \in \Lambda \cup \{0\}$ .
Definition A.2. For $\psi $ and $\phi $ in a cone $\Lambda $ , define
Then, we define the Hilbert pseudo-metric $\Theta _\Lambda $ on $\unicode{x3bb} $ by
Theorem A.1. (Birkhoff’s inequality)
Let $\Lambda $ be a convex cone in a Banach space $\mathcal {B}$ . If $T:\mathcal {B}\rightarrow \mathcal {B}$ is a linear operator such that $T(\Lambda )\subset \Lambda $ , then for each $\phi ,\psi \in \Lambda $ , we have
Lemma A.2. Let $\Lambda $ be a closed convex cone in a Banach space $\mathcal {B}$ endowed with two (not necessarily equivalent norms $\|\cdot \|_i$ , $i=1,2,$ and assume that for all $\phi ,\psi \in \mathcal {B}$ ,
Then, for any $\phi ,\psi \in \Lambda $ with $\|\phi \|_1=\|\psi \|_1$ , we have
If $\Sigma ^+$ is the one-sided full shift on k-symbols, and if $\mathcal {F}_\theta ^+$ is the Banach space of Lipschitz continuous functions on $\Sigma ^+$ with respect to the $d_\theta $ -metric, then given any $L>0$ , we have a cone in $\mathcal {F}_\theta ^+$ given by
To apply Birkhoff’s inequality, we will need the following lemmas.
Lemma A.3. Fix $0<\xi <1$ . Then, for every $\phi ,\psi \in \mathcal {C}_{\xi L}$ with $\phi ,\psi>0$ , we have
See Naud [Reference NaudNau04, Proposition 5.3] for the proof, which is unchanged in our setting.
Lemma A.4. Fix $\theta <\xi <1$ . Then, for every $L\geq {\theta |\psi |_\theta }/({\xi -\theta })$ , we have $\mathcal {L}_\psi (\mathcal {C}_L)\subset \mathcal {C}_{\xi L}$ and we have
Proof. Let $\phi \in \mathcal {C}_L$ and let $x,y\in \Sigma ^+$ be such that $d_\theta (x,y)\leq \theta $ . We obtain
The condition $\theta |\psi |_\theta +L\theta \leq \xi L$ holds if and only if $L\geq {\theta |\psi |_\theta }/({\xi -\theta })$ . Notice that $\phi \in \mathcal {C}_L$ implies that there exists at least one cylinder set $C_i=\{x\in \Sigma ^+ | x_0=i\}$ such that $\phi |_{C_i}>0$ . Thus, $\mathcal {L}_\psi \phi (x)\geq e^{\psi (ix)}\phi (ix)>0$ . We may therefore apply Lemma A.3 to functions $\mathcal {L}_\psi \phi _1$ and $\mathcal {L}_\psi \phi _2$ :
For $\phi \in \mathcal {F}^+$ , define the seminorm
and set $\|\phi \|_L:=\max (\|\phi \|_\infty , ({1}/{2L})V(\phi ))$ . The next lemma gives the essential properties of the norm $\|\cdot \|_L$ .
Lemma A.5. The norm $\|\cdot \|_L$ is equivalent to $\|\cdot \|_\theta $ , and for all $\phi ,\psi \in \mathcal {F}^+$ , we have that $-\phi _2\leq _{\mathcal {C}_L}\phi _1\leq _{\mathcal {C}_L} \phi _2$ implies that $\|\phi _1\|_L\leq \|\phi _2\|_L$ .
Proof. Given $\phi \in \mathcal {C}_L$ , $\varepsilon>0,$ and $x,y\in \Sigma ^+$ such that $d_\theta (x,y)\leq \theta $ , we have
Letting $\varepsilon \rightarrow 0$ and taking the supremum, we get the estimate $V(\phi )\leq \|\phi \|_\infty $ . This gives us that $\|\phi \|_L\leq C\|\phi \|\theta $ for some $C>0$ . Likewise, it is easy to see that $|\phi |_\theta \leq 2\|\phi \|_L$ , so we have that the norms are equivalent.
Now suppose that $-\phi _2\leq _L\phi _1\leq _L\phi _2$ . Then, $\phi _2-\phi _1\geq 0$ and $\phi _2+\phi _1\geq 0.$ In other words, for every $x\in \Sigma ^+$ , $-\phi _2(x)\leq \phi _1(x)\leq \phi _2(x)$ , which implies that $\|\phi _1\|_\infty \leq \|\phi _2\|_\infty $ . To prove that $\|\phi _1\|_L\leq \|\phi _2\|_L$ , it suffices to prove that $V(\phi _1)\leq 2L\|\phi _2\|_\infty $ . We have
Observe that for $\phi \in \mathcal {F}^+, \phi \geq 0$ , and $\alpha=|\phi|_\theta/L>0$ . we have for all $x,y\in \Sigma ^+$ ,
Hence, $\phi +\alpha \in \mathcal {C}_L$ .
Proof of Lemma 3.1
Note that for every $f\in \mathcal {W}$ , if we take $\theta>{1}/{\gamma }$ , we have $|\pi _f|_\theta \leq 1$ . Thus, for uniformity in Lemma 3.3, we need to take
where $M:=\max \{|\log (f')|_{C^1}\mid f\in \mathcal {W}\}$ . Let $h_f$ be such that $\mathcal {L}_{\psi _f}h_f=e^{P(\psi _f)}h_f$ , and let $\nu _f$ be the corresponding measure such that $\mathcal {L}_{\psi _f}^*\nu _f= e^{P(\psi _f)}\nu _f.$ Note that for all $f\in \mathcal {W}$ , $P(\psi _f)=0$ . It can be shown that $h_f\in \mathcal {C}_L$ for L taken as above. Hence, for any $x,y\in \Sigma ^+$ , we have
and hence $|\log (h_f)|_\theta \leq \theta (L+M)$ , and so $|\overline {\psi }_f|_\theta \leq M+\theta (L+M)+L+M= (2+\theta )M+(1+\theta )L$ . Thus, for the normalized operators $\mathcal {L}_{\overline {\psi }_f}$ , we take
Moreover, one can show that $-\phi \leq _{\mathcal {C}_L}\psi \leq _{\mathcal {C}_L} \phi $ implies that
where $\mu _f$ is the equilibrium state corresponding to $\overline {\psi }_f$ .
Observe that for every $n\in \mathbb {N}$ ,
Therefore, we may apply Lemma A.2 with $\|\cdot \|_1=\|\cdot \|_{L^1}$ , $\|\cdot \|_2=\|\cdot \|_L$ , $\phi =\mathcal {L}^n_{\overline {\psi }_f}\phi $ , and $\psi =\int \phi \,d\mu _f=\mathcal {L}^n_{\overline {\psi }_f}(\int \phi \,d\mu _f)$ :
Let $\Delta =\operatorname {\mathrm {diam}}_{\Theta _L}(\mathcal {L}_{\overline {\psi }_f}(\mathcal {C}_L))$ , and observe that Birkhoff’s inequality implies that for $\phi \in \mathcal {C}_L$ ,
for uniform $\tau _{\mathcal {W}}$ . Therefore, for $\phi \in \mathcal {C}_L$ , we have
where $C_{\mathcal {W}}$ is uniform. Since the norms are equivalent, we may replace $\|\cdot \|_L$ by $\|\cdot \|_\theta $ . It remains to extend this bound to all $\phi \in \mathcal {F}_\theta ^+$ . If $\phi \geq 0$ , then $\phi +{|\phi |_\theta }/{L}\in \mathcal {C}_L$ , so
for a different (but still uniform) $C_{\mathcal {W}}$ . For general real-valued $\phi \in \mathcal {F}_\theta ^+$ , we decompose $\phi $ as $\phi =\phi ^+-\phi ^-$ , where $\phi ^+,\phi ^-\geq 0$ . Then,
The case of complex-valued $\phi $ is handled similarly.