1 Introduction
Let X be a metrizable space and $T\colon X\to X$ be a Borel measurable map. We denote by $\mathcal {M}(X)$ the set of all Borel probability measures on X endowed with weak- ${}^{\ast }$ topology, by $\mathcal {M}_T(X)\subset \mathcal {M}(X)$ the set of all T-invariant ones, and by $\mathcal {M}^e_T(X)\subset \mathcal {M}_T(X)$ the set of ergodic ones. We say that $(X,T)$ satisfies the (level-2) large deviation principle with a reference measure $m\in \mathcal {M}(X)$ if there exists a lower semi-continuous function $\mathcal {J}\colon \mathcal {M}(X)\to [0,\infty ]$ , called a rate function, such that
holds for any closed set $\mathcal {K}\subset \mathcal {M}(X)$ and
holds for any open set $\mathcal {U}\subset \mathcal {M}(X)$ . Here, $\delta _n^T\colon X\to \mathcal {M}(X)$ is defined by $\delta _n^T(x):={(1/n)}\sum _{j=0}^{n-1}\delta _{T^j(x)}$ , where $\delta _y$ signifies the Dirac mass at a point $y\in X$ . We say that $\mu \in \mathcal {M}(X)$ is a periodic measure if there exist $x\in X$ and $n>0$ such that $T^n(x)=x$ and $\mu =\delta _n^T(x)$ hold. Then it is clear that $\mu \in \mathcal {M}_T^e(X)$ . We denote by $\mathcal {M}_T^p(X)\subset \mathcal {M}_T^e(X)$ the set of all periodic measures on X.
Henceforth, let $X=[0,1]$ be the unit interval, and $T\colon X\to X$ be a piecewise monotonic map; that is, there exist an integer $k>1$ , and $0=a_0<a_1<\cdots <a_k=1$ , which we call critical points, such that $T|_{(a_{j-1},a_j)}$ is continuous and strictly monotone for each $1\le j\le k$ . If $T|_{(a_{j-1},a_j)}$ is increasing (respectively decreasing) for $1\le j\le k$ , then we call T a piecewise increasing (respectively decreasing) map. Throughout this paper, we further assume the following conditions for a piecewise monotonic map T.
-
• $\bigcup _{n\ge 0}T^{-n}\{a_0,a_1,\ldots ,a_k\}$ is dense in X. In other words, the partition $\{(a_{j-1},a_j):1\le j\le k\}$ by the monotone intervals is a generator for the dynamical system $(X,T)$ .
-
• The topological entropy ${h_{\mathrm {top}}(X,T)}$ of T is positive (see [Reference Brucks and Bruin2, Ch. 9] for the definition and basic properties of topological entropy for piecewise monotonic maps).
It is proved in [Reference Hof bauer11] that there exists a measure $m=m_T\in \mathcal {M}_T^e(X)$ of maximal entropy for T; that is, the metric entropy of m coincides with $h_{\mathrm {top}} (X, T)$ . In this study, we investigate whether the large deviation principle holds for a piecewise monotonic map with a measure of maximal entropy as reference. In such a situation, it was shown in [Reference Takahashi24, Reference Young27] that the large deviation principle holds if the map has the specification property (see [Reference Buzzi3, §1] for the definition of the specification property). Moreover, Pfister and Sullivan have proved in [Reference Pfister and Sullivan20] that all $\beta $ -transformations satisfy the large deviation principle, while the specification property holds only for a set of $\beta>1$ of Lebesgue measure zero.
The purpose of this paper is to provide a criterion for satisfying the large deviation principle using common concepts for piecewise monotonic maps. More precisely, we assume the following natural conditions for piecewise monotonic maps $T:X\to X$ .
-
• Irreducibility (IR). The Markov diagram $(\mathcal {D}_T,\rightarrow )$ of T is irreducible (see §2 for the definitions).
-
• Density of periodic measures (DP). The set $\mathcal {M}_T^p(X)$ of periodic measures is dense in $\mathcal {M}_T^e(X)$ .
Condition (IR) implies transitivity of the map, and that the measure of maximal entropy is unique (see [Reference Hof bauer11]). Moreover, condition (DP) holds for any map with the specification property (see [Reference Sigmund22]). Our main result is the following theorem.
Theorem A. Let $T\colon X\to X$ be a piecewise monotonic map satisfying (IR) and (DP), and let m be the unique measure of maximal entropy. Then $(X,T)$ satisfies the large deviation principle with m as reference, and the rate function $\mathcal {J}\colon \mathcal {M}(X)\to [0,\infty ]$ is expressed by (3.5). Moreover, if we assume that T is piecewise increasing and either left or right continuous, then
Here, $h_T(\mu )$ denotes the metric entropy of $\mu \in \mathcal {M}_T(X)$ .
We remark that Theorem A can be generalized for the case where the reference measure m is the equilibrium state of a potential discussed in [Reference Hof bauer and Keller14]. It will be treated in a forthcoming paper.
The contraction principle [Reference Dembo and Zeitouni7] gives the following formula for fluctuations of time averages of continuous observables.
Corollary B. Let $T: X\to X$ be as in Theorem A. Then, for any continuous function $\varphi : X \to \mathbb R$ and a closed interval $J\subset \mathbb {R}$ , we have
where $\mathcal {J}_{\varphi }\colon \mathbb {R}\to [0,\infty ]$ denotes the (level-1) rate function for $\varphi $ given by $\mathcal {J}_{\varphi } (\alpha )= \inf \{\mathcal {J}(\mu ): \int \varphi \, d\mu =\alpha \} $ .
Remark 1.1. If T is continuous, then condition (IR) implies that (some iteration of) the map has the specification property (see, for example, [Reference Buzzi3, Theorem 1.1]), and then the large deviation principle holds (see [Reference Takahasi25, Reference Young27]). By contrast, in the case of discontinuity, we emphasize that there are many examples of piecewise monotonic maps that satisfy (IR) and (DP) but do not have the specification property (see applications below).
Remark 1.2. Condition (IR) is slightly stronger than transitivity. It is highly likely that Theorem A remains true if we assume transitivity instead of (IR). However, (IR) simplifies the argument used in this study for lifting measures from a subshift with a finite alphabet to a Markov shift with a countable alphabet (see the proof of Proposition 3.1). Hence, we avoided this generalization in this study. We provide a sufficient condition for (IR) in §4.
Remark 1.3. The density of periodic measures (DP) assumed in Theorem A has been studied by many researchers independently of the theory of large deviations. The condition holds for a broad class of piecewise monotonic maps (see [Reference Hof bauer13, Reference Hof bauer and Raith15, Reference Sigmund23], for example). However, as of this writing, there is no known transitive piecewise monotonic map without the density of periodic measures (this is an open problem posed by Hof bauer and Raith in [Reference Hof bauer and Raith15]). Therefore, we hope that the result of our study will contribute to a complete description of the large deviation principle for transitive piecewise monotonic maps.
Theorem A can be applied to demonstrate the large deviation principle for the following classes of piecewise monotonic maps.
-
• $\beta $ -transformations (piecewise increasing). The $\beta $ -transformation $T_{\beta }\colon X\to X$ with $\beta>1$ was introduced by Rényi [Reference Rényi21] and defined by
$$ \begin{align*}T_{\beta}(x)= \begin{cases} \beta x\ (\text{mod}\ 1), \quad& x\not=1,\\ \displaystyle\lim_{y\rightarrow 1-0}(\beta y\ (\text{mod}\ 1)), \quad& x=1. \end{cases} \end{align*} $$Then $T_{\beta }$ satisfies (IR) and (DP) for any $\beta>1$ (see [Reference Góra9, Reference Sigmund23]). -
• Linear mod 1 transformations (piecewise increasing). The linear mod one transformation $T_{\alpha ,\beta }\colon X\to X$ with $\beta>1$ and $0\le \alpha <1$ was introduced by Parry [Reference Parry19] and defined by
(1.2) $$ \begin{align} T_{\alpha,\beta}(x)= \begin{cases} \beta x+\alpha\ (\text{mod}\ 1), \quad & x\not=1,\\ \displaystyle\lim_{y\rightarrow 1-0}(\beta y+\alpha\ (\text{mod}\ 1)), \quad& x=1. \end{cases} \end{align} $$In Example 4.1, we prove that $T_{\alpha ,\beta }$ satisfies (IR) and (DP) for any $0\le \alpha <1$ and $\beta>2$ . -
• $(-\beta )$ -transformations (piecewise decreasing). The $(-\beta )$ -transformation $T_{-\beta }\colon X\to X$ with $\beta>1$ was introduced by Ito and Sadahiro [Reference Keller16] and defined by
(1.3) $$ \begin{align} T_{-\beta}(x)=-\beta x+ \lfloor \beta x \rfloor +1, \end{align} $$where $\lfloor \xi \rfloor $ denotes the largest integer that is no more than $\xi $ . We show that $T_{-\beta }$ satisfies (IR) and (DP) for any $({1+\sqrt {5}})/{2} < \beta <2$ in Example 4.2. -
• Maps with two monotonic pieces (may be neither piecewise increasing nor decreasing). A piecewise monotonic map $T\colon X\to X$ has two monotonic pieces if there is $0<a<1$ such that both $T|_{(0,a)}$ and $T|_{(a,1)}$ are strictly monotonic and continuous. In this case, condition (DP) was demonstrated in [Reference Hof bauer and Raith15, Theorem 2] for when the map satisfies condition (IR). A typical example which has two monotonic pieces is the class of one-dimensional Lorenz maps. For condition (IR) for Lorenz maps, we refer to [Reference Araujo and Pacifico1, Ch. 3] and [Reference Oprocha, Potorski and Raith18].
The graphs of linear mod one transformations $T_{\alpha ,\beta }$ and $(-\beta )$ -transformations $T_{-\beta }$ are plotted in Figure 1. These transformations do not satisfy the specification property for Lebesgue almost every parameter (see [Reference Buzzi3]), and it was not known whether the large deviation principle holds. As mentioned before, we apply Theorem A to these transformations.
The remainder of this paper is organized as follows. In §2 we establish our definitions and prepare several lemmas. Subsequently, we present a proof of Theorem A in §3 and apply it to concrete examples in §4.
2 Preliminaries
2.1 Symbolic dynamics
For a finite or countable set A, we denote by $A^{\mathbb {N}}$ the one-sided infinite product of A equipped with the product topology of the discrete topology of A. Let $\sigma $ be the shift map on $A^{\mathbb N}$ (that is, $(\sigma (\omega ))_i= \omega _{i+1}$ for each $i\in \mathbb N$ and $\omega = ( \omega _i)_{i\in \mathbb N} \in A^{\mathbb N}$ ). When a subset $\Sigma $ of $A^{\mathbb N}$ is $\sigma $ -invariant and closed, we call it a subshift and call A the alphabet of $\Sigma $ . When $\Sigma $ is of the form
with a matrix $M= (M_{ij})_{(i,j)\in A^2}$ , each entry of which is $0$ or $1$ , we call $\Sigma $ a Markov shift. To emphasize the dependence of $\Sigma $ on M, it is denoted by $\Sigma _M$ , and M is called the adjacency matrix of $\Sigma _M$ . In a similar manner, we define the shift map, subshift, and Markov shift for the two-sided infinite product $A^{\mathbb Z}$ of A.
For a subshift $\Sigma $ on an alphabet A, let $[u] = \{ (\omega _i)\in \Sigma : u=\omega _1\cdots \omega _n\}$ for each $u\in A^n$ , $n\geq 1$ , and set $ \mathcal L(\Sigma ) =\{ u\in \bigcup _{n\geq 1} A^n : [u] \neq \emptyset \}. $ We also denote $\mathcal {L}_n(\Sigma ):=\{u\in \mathcal {L}(\Sigma ):|u|=n\}$ for $n\ge 1$ , where $|u|$ denotes the length of u that is, $|u|=n$ if $u =u_1\cdots u_n \in A^n$ . For $u=u_1\cdots u_m$ and $v=v_1\cdots v_n$ in $\mathcal {L}(\Sigma )$ , we denote $uv=u_1\cdots u_mv_1\cdots v_n$ . Finally, we say that $\Sigma $ is transitive if, for any $u,v\in \mathcal {L}(\Sigma )$ , we can find $w\in \mathcal {L}(\Sigma )$ such that $uwv\in \mathcal {L}(\Sigma )$ holds. For the rest of this paper, we denote by $h_{\mathrm {top}}(\Sigma ,\sigma )$ the topological entropy of the restriction $\sigma : \Sigma \to \Sigma $ of the shift map $\sigma $ to a subshift $\Sigma $ , and by $h_{\sigma }(\mu )$ the metric entropy of $\mu \in \mathcal {M}_{\sigma }(\Sigma )$ .
2.2 Markov diagram
Let $X =[0,1]$ and $T\colon X\to X$ be a piecewise monotonic map with critical points $0\kern-1pt =a_0<a_1<\cdots <a_k=1$ . Let $X_T\kern-1.5pt :=\bigcap _{n=0}^{\infty }\kern-1ptT^{-n}(\bigcup _{j=1}^k (a_{j-1},a_j))$ , and define the coding map $\mathcal {I}\colon X_T\to \{1,\ldots ,k\}^{\mathbb {N}}$ by
which is injective since the partition $\{(a_{j-1},a_j):1\le j\le k\}$ is a generator. We denote the closure of $\mathcal {I}(X_T)$ in $\{1,\ldots ,k\}^{\mathbb {N}}$ by $\Sigma _T^+$ . Then $\Sigma _T^+$ is a subshift, and $(\Sigma _T^+,\sigma )$ is called the coding space of $(X,T)$ .
In what follows, we define the Markov diagram, introduced by Hof bauer [Reference Hof bauer11], which is a countable oriented graph with subsets of $\Sigma _T^+$ as vertices. Let $C\subset \Sigma _T^+$ be a closed subset with $C\subset [j]$ for some $j\in \{ 1,\ldots ,k\}$ . We say that a non-empty closed subset $D\subset \Sigma _T^+$ is a successor of C if $D=[l]\cap \sigma (C)$ for some $l\in \{ 1,\ldots ,k\}$ . The expression $C\rightarrow D$ denotes that D is a successor of C. We now define a set $\mathcal {D}_T$ of vertices by induction. First, we set $\mathcal {D}_0:=\{[1],\ldots ,[k]\}$ . If $\mathcal {D}_n$ is defined for $n\ge 0$ , then we set
We note that $\mathcal {D}_n$ is a finite set for each $n\ge 0$ since the number of successors of any closed subset of $\Sigma _T^+$ is at most k by definition. Finally, we set
We call the oriented graph $(\mathcal {D}_T,\rightarrow )$ the Markov diagram of T. The Markov diagram $(\mathcal {D}_T,\rightarrow )$ is irreducible if, for any $C,D\in \mathcal {D}_T$ , there exist $C_1,\ldots ,C_n\in \mathcal {D}_T$ such that $C=C_1\rightarrow \cdots \rightarrow C_n=D$ . For notational simplicity, we use the expression $\mathcal {D}$ instead of $\mathcal {D}_T$ if no confusion arises. We remark that
holds, and that $\sigma ^{|u|}[uj]$ is a successor of $\sigma ^{|u|-1}[u]$ for each pair $u\in \mathcal {L}(\Sigma _T^+)$ and $j \in \{ 1,\ldots , k\}$ with $uj\in \mathcal {L}(\Sigma _T^+)$ .
For a subset $\mathcal {C}\subset \mathcal {D}$ , we define a matrix $M(\mathcal {C})=(M(\mathcal {C})_{C,D})_{(C,D)\in \mathcal {C}^2}$ by
Then $\Sigma _{M(\mathcal {C})}=\{(C_i)_{i\in \mathbb {Z}}\in \mathcal {C}^{\mathbb {Z}}:C_i\rightarrow C_{i+1},i\in \mathbb {Z}\}$ is a two-sided Markov shift with a countable alphabet $\mathcal {C}$ and an adjacency matrix $M(\mathcal {C})$ . For notational simplicity, we write $\Sigma _{\mathcal {C}}$ instead of $\Sigma _{M(\mathcal {C})}$ for the remainder of this paper. It is clear that $\Sigma _{\mathcal {D}}$ is transitive if and only if $(\mathcal {D}_T,\rightarrow )$ is irreducible, and it is known that the equality
holds (see [Reference Faller8]). Moreover, there is a relationship between the countable Markov shift $\Sigma _{\mathcal {D}}$ and a natural extension of the coding shift $\Sigma _T^+$ . To be more precise, let us define a natural extension $\Sigma _T$ of $\Sigma _T^+$ by
and a map $\Psi \colon \Sigma _{\mathcal {D}}\to \{1,\ldots ,k\}^{\mathbb {Z}}$ by
where $\omega _i \in \{ 1, \ldots , k\}$ is a unique integer such that $C_i\subset [\omega _i]$ holds for each $i\in \mathbb {Z}$ . The following lemma, proved in [Reference Hof bauer11], states that $(\Sigma _{\mathcal {D}},\sigma )$ is topologically conjugate to $(\Sigma _T,\sigma ) ,$ except for ‘small’ sets (see also [Reference Buzzi4, Appendix]).
Lemma 2.1. [Reference Hof bauer11, Lemmas 2 and 3]
The map $\Psi \colon \Sigma _{\mathcal {D}}\to \{1,\ldots ,k\}^{\mathbb {Z}}$ is continuous and satisfies $\sigma \circ \Psi =\Psi \circ \sigma $ . Moreover, there exist two shift-invariant subsets $\overline {N}\subset \Sigma _{\mathcal {D}}$ and $N\subset \Sigma _T$ satisfying the following properties.
-
• We have $\Psi (\Sigma _{\mathcal {D}}\setminus \overline {N})=\Sigma _T\setminus N$ .
-
• The restriction map $\Psi \colon \Sigma _{\mathcal {D}}\setminus \overline {N}\to \Sigma _T\setminus N$ is bijective and bi-measurable.
-
• There is no periodic point in N.
-
• For any invariant measure $\mu \in \mathcal {M}_{\sigma }(\Sigma _T)$ with $\mu (N)=1$ , we have ${h_{\sigma }(\mu )}=0$ .
-
• For any invariant measure $\overline {\mu }\in \mathcal {M}_{\sigma }(\Sigma _{\mathcal {D}})$ with $\overline {\mu }(\overline {N})=1$ , we have ${h_{\sigma }(\overline {\mu })}=0$ .
For the rest of this section, we assume that $(\mathcal {D}_T,\rightarrow )$ is irreducible. Then the following lemma holds.
Lemma 2.2. ([Reference Hof bauer11, Theorem 2(iii)] and [Reference Hof bauer12, p. 377, Corollary 1(ii)])
There is a unique pair of $\overline {m} \in \mathcal {M}_{\sigma }^e (\Sigma _{\mathcal D})$ and $m^+ \in \mathcal {M}_{\sigma }^e (\Sigma _T^+)$ of maximal entropy; that is,
Moreover, the following properties hold for $\overline {m}$ and $m^+$ .
-
(1) $m^+=\overline {m}\circ (\Psi ^+)^{-1}$ , where $\Psi ^+:=\pi \circ \Psi $ and $\pi \colon \Sigma _T\to \Sigma _T^+$ is a natural projection; $(\omega _i)_{i\in \mathbb {Z}}\mapsto (\omega _i)_{i\in \mathbb {N}}$ .
-
(2) There are two families $\{L(C)\}_{C\in \mathcal {D}}$ and $\{R(C)\}_{C\in \mathcal {D}}$ of real positive numbers satisfying the following properties:
-
• $\overline {m}[C_1\cdots C_n]=L(C_1)R(C_n)\exp \{-n{h_{\mathrm {top}}(\Sigma _T^+,\sigma )}\}$ for any $n\ge 1$ and $C_1\cdots C_n \in \mathcal {L}_n(\Sigma _{\mathcal {D}})$ ;
-
• both $\sum _{C\in \mathcal {D}}L(C)$ and $\sup _{C\in \mathcal {D}}R(C)$ are finite.
-
Finally, we prove a weak Gibbs property for the unique measure of maximal entropy on $\Sigma _T^+$ , which has a key role in obtaining the large deviation bound in §3.
Lemma 2.3. Let $m^+\in \mathcal {M}_{\sigma }^e(\Sigma _T^+)$ be as in Lemma 2.2. Then, for any finite set $\mathcal {F}\subset \mathcal {D}$ , we can find $K=K_{\mathcal {F}}>1$ such that
Proof. Let $\overline {m}$ , $\{L(C)\}_{C\in \mathcal {D}}$ and $\{R(C)\}_{C\in \mathcal {D}}$ be as in Lemma 2.2. For a finite set $\mathcal {F}\subset \mathcal {D}$ , we put
where we set $L:=\sum _{C\in \mathcal {D}}L(C)$ and $R:=\sup _{C\in \mathcal {D}}R(C)$ . Note that $K<\infty $ by Lemma 2.2. Let $n\ge 1$ and $u=u_1\cdots u_n\in \mathcal {L}_n(\Psi ^+(\Sigma _{\mathcal {F}}))$ . Then we can find a word $C_1\cdots C_n\in \mathcal {L}_n(\Sigma _{\mathcal {F}})$ such that $ [C_1\cdots C_n]\subset (\Psi ^+)^{-1}[u]$ . Hence, it follows from Lemma 2.2 that
We note that $\Psi ^+(\Sigma _{\mathcal {D}})=\Sigma _T^+$ (see [Reference Hof bauer12], for instance). For $n\ge 1$ and $u\in \mathcal {L}_n(\Sigma _T^+)$ , we set
Then, for any $C\in \mathcal {C}$ , we can find a unique word $P(C)=C_1\cdots C_n\in \mathcal {L}_n(\Sigma _{\mathcal {D}})$ such that $C_1=C$ and $[C]\cap (\Psi ^+)^{-1}[u]=[P(C)]$ . It is easy to see that $\bigcup _{C\in \mathcal {D}}[P(C)]=(\Psi ^+)^{-1}[u]$ . Therefore, we have
which proves the lemma.
3 Proof of Theorem A
In this section we present a proof of Theorem A. First, we show the following analogous result of Theorem A on coding spaces.
Theorem C. Let $T\colon X\to X$ be as in Theorem A, $(\Sigma _T^+,\sigma )$ be a coding space of $(X,T)$ , and $m^+$ be the unique measure of maximal entropy on $\Sigma _T^+$ . Then $(\Sigma _T^+,\sigma )$ satisfies the large deviation principle with $m^+$ as reference, and the rate function $\mathcal {J}\kern1.5pt^+\colon \mathcal {M}(\Sigma _T^+)\to [0,\infty ]$ is expressed by
Proof. To prove the theorem, it is sufficient to show
for any closed set $\mathcal {K}\subset \mathcal {M}(\Sigma _T^+)$ , and
for any open set $\mathcal {U}\subset \mathcal {M}(\Sigma _T^+)$ . The upper bound (3.2) follows from the first inequality in Lemma 2.3 and [Reference Pfister and Sullivan20, Theorem 3.2] (see also [Reference Climenhaga, Thompson and Yamamoto6, §4]).
In what follows, we will show the lower bound (3.3). Let $\epsilon>0$ , $\mu ^+\in \mathcal {M}_{\sigma }(\Sigma _T^+)$ and $\mathcal {U}$ be an open neighborhood of $\mu ^+$ in $\mathcal {M}(\Sigma _T^+)$ . To show (3.3), it is enough to show that
In the following proposition, we use condition (DP) to show the entropy-density of ergodic measures, that is, any invariant measure can be approximated by ergodic ones with similar entropies. The entropy-density plays a key role in estimating the lower bounds on large deviations (see, for example, [Reference Pfister and Sullivan20, §1]).
Proposition 3.1. There exist a finite set $\mathcal {F}\subset \mathcal {D}$ and $\rho ^+\in \mathcal {M}_{\sigma }^e(\Psi ^+(\Sigma _{\mathcal {F}}))$ such that $\rho ^+\in \mathcal {U}$ and ${h_{\sigma }(\rho ^+)}\ge {h_{\sigma }(\mu ^+)}-\epsilon $ .
Remark 3.2. Recently, Takahasi [Reference Takahasi25] proved the entropy-density of ergodic measures with compact supports for countable Markov shifts. From this result, Proposition 3.1 follows if every invariant measure on $\Sigma _T^+$ can be lifted to $\Sigma _{\mathcal {D}}$ . Unfortunately, there are unliftable measures in general (see, for example, [Reference Hof bauer10, Reference Ito and Sadahiro17]). In the proof, to overcome this difficulty, we show that the set of liftable measures is entropy-dense in the set of invariant measures by using (DP).
Proof of Proposition 3.1
It follows from condition (DP) and [Reference Yamamoto26, Theorem A] that $\mathcal {M}_{\sigma }^p(\Sigma _T^+)$ is dense in $\mathcal {M}_{\sigma }(\Sigma _T^+)$ . If ${h_{\sigma }(\mu ^+)}-\epsilon \le 0$ , then we choose $\rho ^+\in \mathcal {M}_{\sigma }^p(\Sigma _T^+)\cap \mathcal {U}$ and take a periodic point $\omega \in \Sigma _T^+$ in the support of $\rho ^+$ . It follows from [Reference Hof bauer12, Theorem 8] that there are finite vertices $C_1,\ldots , C_n\in \mathcal {D}$ such that $\Psi ^+(\cdots C_1\cdots C_nC_1\cdots C_n\cdots\kern-1pt)=\omega $ holds. We set $\mathcal {F}:=\{C_1,\ldots ,C_n\}$ . Then it is clear that $\rho ^+\in \mathcal {M}_{\sigma }^e(\Psi ^+(\Sigma _{\mathcal {F}}))$ and ${h_{\sigma }(\rho ^+)}=0\ge {h_{\sigma }(\mu ^+)}-\epsilon $ . This suffices for the case ${h_{\sigma }(\mu ^+)}-\epsilon \le 0$ .
Henceforth, assume that ${h_{\sigma }(\mu ^+)}-\epsilon>0$ . First, we define two pushforward maps $\pi _{\ast }\colon \mathcal {M}_{\sigma }(\Sigma _T) \to \mathcal {M}_{\sigma }(\Sigma _T^+)$ and $\Psi _{\ast }\colon \mathcal {M}(\Sigma _{\mathcal {D}})\to \mathcal {M}(\Sigma _T)$ by $\pi _{\ast }(\vartheta ):=\vartheta \circ \pi ^{-1}$ for $\vartheta \in \mathcal {M}_{\sigma }(\Sigma _T)$ , and $\Psi _{\ast }(\overline {\vartheta }):=\overline {\vartheta }\circ \Psi ^{-1}$ for $\overline {\vartheta }\in \mathcal {M}(\Sigma _{\mathcal {D}})$ . Then $\pi _{\ast }$ is a homeomorphism and ${h_{\sigma }(\vartheta )}={h_{\sigma }(\pi _{\ast }(\vartheta ))}$ holds for any $\vartheta \in \mathcal {M}_{\sigma }(\Sigma _T)$ (see [Reference Faller8, Proposition 5.8]). Moreover, it follows from Lemma 2.1 that $\Psi _{\ast }$ is continuous, the restriction map $\Psi _{\ast }\colon \mathcal {M}_{\sigma }(\Sigma _{\mathcal {D}}\setminus \overline {N})\to \mathcal {M}_{\sigma }(\Sigma _T\setminus N)$ is well defined and bi-measurable, and ${h_{\sigma }(\overline {\vartheta })}={h_{\sigma }(\Psi _{\ast }(\overline {\vartheta }))}$ holds for any $\overline {\vartheta }\in \mathcal {M}_{\sigma }(\Sigma _{\mathcal {D}}\setminus \overline {N})$ .
We set $\mu :=\pi _{\ast }^{-1}(\mu ^+)\in \pi _{\ast }^{-1}(\mathcal {U})$ . Then we can find $0\le c\le 1$ and $\mu _1,\mu _2\in \mathcal {M}_{\sigma }(\Sigma _T)$ such that $\mu =c\mu _1+(1-c)\mu _2$ , $\mu _1(\Sigma _T\setminus N)=1$ and $\mu _2(N)=1$ hold. We note that ${h_{\sigma }(\mu _2)}=0$ by Lemma 2.1. Recall that $\mathcal {M}_{\sigma }^p(\Sigma _T^+)$ is dense in $\mathcal {M}_{\sigma }(\Sigma _T^+)$ . By the definition of $\pi _{\ast }$ , we have $\mathcal {M}^p_{\sigma }(\Sigma _T^+)=\pi _{\ast }(\mathcal {M}^p_{\sigma }(\Sigma _T)),$ and hence, $\mathcal {M}^p_{\sigma }(\Sigma _T)$ is also dense in $\mathcal {M}_{\sigma }(\Sigma _T)$ . Therefore, we can find $\nu _2\in \mathcal {M}_{\sigma }^p(\Sigma _T)$ such that $\nu :=c\mu _1+(1-c)\nu _2\in \pi _{\ast }^{-1}(\mathcal {U})$ and ${h_{\sigma }(\nu )}={h_{\sigma }(\mu )}$ hold. Since N has no periodic point, $\nu _2(\Sigma _T\setminus N)=1$ , which also implies that $\nu (\Sigma _T\setminus N)=1$ . Hence, we can define $\overline {\nu }:=\Psi _{\ast }^{-1}(\nu )\in (\pi _{\ast }\circ \Psi _{\ast })^{-1}(\mathcal {U})$ . Note that $\overline {\nu }$ is supported on a transitive countable Markov shift $\Sigma _{\mathcal {D}} ,$ and $(\pi _{\ast }\circ \Psi _{\ast })^{-1}(\mathcal {U})$ is open. Therefore, it follows from [Reference Takahasi25, Main Theorem] that there exist a finite set $\mathcal {F}\subset \mathcal {D}$ and an ergodic measure $\overline {\rho }\in (\pi _{\ast }\circ \Psi _{\ast })^{-1}(\mathcal {U})$ supported on $\Sigma _{\mathcal {F}}$ such that ${h_{\sigma }(\overline {\rho })}\ge {h_{\sigma }(\overline {\nu })}-\epsilon>0$ . Since ${h_{\sigma }(\overline {\rho })}>0$ , we have $\overline {\rho }(\overline {N})=0$ by Lemma 2.1. We define an ergodic measure $\rho ^+$ on $\Sigma _T^+$ by $\rho ^+:=\pi _{\ast }(\Phi _{\ast }(\overline {\rho }))$ . Clearly, we have $\rho ^+\in \mathcal {M}_{\sigma }^e(\Psi ^+(\Sigma _{\mathcal {F}}))\cap \mathcal {U}$ . Moreover, since $\overline {\rho }\in \mathcal {M}_{\sigma }(\Sigma _{\mathcal {D}}\setminus \overline {N})$ , we have ${h_{\sigma }(\overline {\rho })}={h_{\sigma }(\rho ^+)}$ , which implies that ${h_{\sigma }(\rho ^+)}\ge {h_{\sigma }(\mu ^+)}-\epsilon $ .
We continue the proof of Theorem C. Note that $\Psi ^+(\Sigma _{\mathcal {F}})\subset \Sigma _T^+$ is a subshift, and $\rho ^+(\Psi ^+(\Sigma _{\mathcal {F}}))=1$ . Hence, from the estimates in [Reference Pfister and Sullivan20], we have the following lemma.
Lemma 3.3. [Reference Pfister and Sullivan20, Propositions 2.1 and 4.1]
There exists $n_0\in \mathbb {N}$ such that, for any $n\ge n_0$ ,
For notational simplicity, for each $n\ge n_0$ , we set $\mathcal {L}_{n,\rho ^+}:=\{u\in \mathcal {L}_n(\Psi ^+(\Sigma _{\mathcal {F}})):[u]\subset (\delta _n^{\sigma })^{-1}(\mathcal {U})\}.$ Since $\mathcal {F}$ is finite, it follows from Lemma 2.3 that there is $K>1$ such that $m^+[u]\ge K^{-1}\exp \{-n{h_{\mathrm {top}}(\Sigma _T^+,\sigma )}\}$ for any $u\in \mathcal {L}_{n,\rho ^+}$ and $n\ge n_0$ . Therefore, by $\bigcup _{u\in \mathcal {L}_{n,\rho ^+}}[u]\subset (\delta _n^{\sigma })^{-1}(\mathcal {U})$ , we have
which proves Theorem C.
Proof of Theorem A
Let $T\colon X\to X$ and m be as in Theorem A with critical points $0=a_0<a_1<\cdots <a_k=1$ . Then it follows from Theorem C that $(\Sigma _T^+,\sigma )$ satisfies the large deviation principle with the unique measure of maximal entropy $m^+$ on $\Sigma _T^+ ,$ and the rate function $\mathcal {J}\kern1.5pt^+\colon \mathcal {M}(\Sigma _T^+)\to [0,\infty ]$ is expressed by equation (3.1).
Since the coding map $\mathcal {I}\colon X_T\to \Sigma _T^+$ is injective (see §2), the set
consists of a unique element for any $(\omega _i)_{i\in \mathbb {N}}\in \Sigma _T^+$ . Here, $\mathrm {cl}(A)$ denotes the closure of a set A. We define a map $\Phi \colon \Sigma _T^+\to X$ by $\Phi ((\omega _i)_{i\in \mathbb {N}})=y$ , where y is a unique element of the set (3.4), and let $\Phi _{\ast }\colon \mathcal {M}(\Sigma _T^+)\to \mathcal {M}(X)$ be a pushforward map; $\mu ^+\mapsto \mu ^+\circ \Phi ^{-1}$ . Then the following properties hold (see [Reference Faller8], for instance):
-
• $\Phi $ is a continuous surjection, $\Phi (\mathcal {I}(X_T))=X_T ,$ and the restriction map $\Phi \colon \mathcal {I}(X_T)\to X_T$ is bijective;
-
• $\Phi \circ \sigma (\omega )=T\circ \Phi (\omega )$ for $\omega \in \mathcal {I}(X_T)$ ;
-
• $m=\Phi _{\ast }(m^+)$ and $m(X_T)=m^+(\mathcal {I}(X_T))=1$ ;
-
• $h_{\sigma }(\mu ^+)=h_T(\Phi _{\ast }(\mu ^+))$ if $\Phi _{\ast }(\mu ^+)\in \mathcal {M}_T(X)$ .
From the aforementioned properties, we have $m\circ (\delta _n^T)^{-1}=m^+\circ (\delta _n^{\sigma })^{-1}\circ \Phi _{\ast }^{-1}$ . Hence, it follows from the contraction principle [Reference Dembo and Zeitouni7, Theorem 4.2.1] that $(X,T)$ satisfies the large deviation principle with $m,$ and the rate function $\mathcal {J}\colon \mathcal {M}(X)\to [0,\infty ]$ is expressed by
Hereafter, assume that T is piecewise increasing and either right or left continuous. Since $\mathcal {M}_{\sigma }^p(\Sigma _T^+)$ is dense in $\mathcal {M}_{\sigma }(\Sigma _T^+)$ , it follows from [Reference Yamamoto26, Theorem B and Lemma 2.7] that $\Phi _{\ast }(\mathcal {M}_{\sigma }(\Sigma _T^+))=\mathcal {M}_T(X)$ . Hence, $\mathcal {J}(\mu )=\infty $ if $\mu \not \in \mathcal {M}_T(X)$ , and
if $\mu \in \mathcal {M}_T(X)$ . This completes the proof of Theorem A.
4 Applications
In this section, we apply Theorem A to demonstrate the large deviation principle for the concrete examples presented in §1. First, we provide a sufficient condition for (IR).
Let $T\colon X\to X$ be a piecewise monotonic map with critical points $0=a_0<a_1<\cdots <a_k=1,$ and we consider the following condition.
Exactness (EX). For any open interval $I\subset X$ , there exist a positive integer $\tau $ , integers $1\le i_0,\ldots ,i_{\tau -1}\le k$ , and an open subinterval $L\subset I$ such that $T^j(L)\subset (a_{i_j-1},a_{i_j})$ for all $0\le j\le \tau -1$ and $T^{\tau }(L)=(0,1)$ .
Lemma 4.1. Condition (EX) implies (IR).
Proof. Let $C,D\in \mathcal {D}$ . To prove the lemma, it is sufficient to show that there are $n\ge 1$ and $C_1,\ldots ,C_n\in \mathcal {D}$ such that $C_1=C$ , $C_n=D$ and $C_i\rightarrow C_{i+1}$ for each $1\le i\le n-1$ . Since $C,D\in \mathcal {D}$ , there are $u=u_1\cdots u_l$ and $v=v_1\cdots v_m$ in $\mathcal {L}(\Sigma _T^+)$ such that $C=\sigma ^{l-1}[u]$ and $D=\sigma ^{m-1}[v]$ . If we set $I:=\bigcap _{j=1}^lT^{-(j-1)}((a_{u_j-1},a_{u_j}))$ , then I is a non-empty open interval since $u\in \mathcal {L}(\Sigma _T^+)$ . By condition (EX), there exist an open subinterval $L\subset I$ , $\tau \ge 1$ , and $1\le i_0,\ldots ,i_{\tau -1}\le k$ such that $T^j(L)\subset (a_{i_j-1},a_{i_j})$ for $0\le j\le \tau -1$ and $T^{\tau }(L)=(0,1)$ . Since $T^{l-1}(I)\subset (a_{u_{l-1}},a_{u_l})$ , we have $l\le \tau $ . We set
where $\lambda $ denotes the empty word. Then it is not difficult to see that $\sigma ^{\tau }[uw]=\Sigma _T^+$ . Hence, we have $\sigma ^{\tau +m-1}[uwv]=D$ , which proves the lemma.
Example 4.1. Let $0\le \alpha <1$ , $\beta>2$ , and $T=T_{\alpha ,\beta }\colon X\to X$ be as in (1.2). Then it follows from [Reference Hof bauer13, Theorem 2] that $\mathcal {M}_{\sigma }^p(\Sigma _T^+)$ is dense in $\mathcal {M}_{\sigma }(\Sigma _T^+)$ , and therefore $\mathcal {M}_T^p(X)$ is also dense in $\mathcal {M}_T^e(X)$ (see also [Reference Yamamoto26, Theorem A]). Moreover, it is proved in [Reference Carapezza, López and Robertson5, Proposition 3.14] that T satisfies condition (EX). Hence, by Lemma 4.1 and Theorem A, $(X,T)$ satisfies the large deviation principle with the unique measure of maximal entropy m. Since T is piecewise increasing and right continuous, the rate function $\mathcal {J}\colon \mathcal {M}(X)\to [0,\infty ]$ is expressed by (1.1).
Example 4.2. Let $(1+\sqrt {5})/2<\beta <2$ and $T=T_{-\beta }\colon X\to X$ be as in (1.3). Since $\beta>(1+\sqrt {5})/2$ , we can prove that T satisfies condition (EX) in a similar manner to the proof of [Reference Góra9, Proposition 8]. Hence, by Lemma 4.1, T satisfies (IR). Moreover, it is clear that T has two monotonic pieces and is transitive. Hence, it follows from [Reference Hof bauer and Raith15, Theorem 2] that $\mathcal {M}^p_{\sigma }(\Sigma _T^+)$ is dense in $\mathcal {M}_{\sigma }(\Sigma _T^+)$ , and hence, $\mathcal {M}_T^p(X)$ is also dense in $\mathcal {M}^e_T(X)$ . Therefore, by Theorem A, $(X,T)$ satisfies the large deviation principle with the unique measure of maximal entropy m.
Acknowledgements
The authors would like to express their gratitude to Hiroki Takahasi for fruitful discussions. They also thank the anonymous referee for his/her careful reading of the manuscript and critical comments. The first author was partially supported by JSPS KAKENHI grant number 20K03631 and the second author was partially supported by JSPS KAKENHI grant number 18K03359 and 21K03321.