1. Introduction
Let $m\ge 0$ and $n\ge 0$ be integers. A time-homogeneous $(m+n)$ -dimensional Markov processes $\{X_t\,{:}\, t\ge0\}= \{(Y_t,Z_t)\,{:}\, t\ge 0\}$ taking values in $D\,{:\!=}\, \mathbb{R}_{+}^m\times\mathbb{R}^n$ is called an affine Markov process if its characteristic function satisfies
where $\phi$ and $\psi$ satisfy certain generalized Riccati differential equations. The affine property means roughly that the logarithm of the characteristic function is affine with respect to the initial state. In this case, it is known that the m-dimensional process $\{Y_t\,{:}\,t\ge0\}$ is a continuous-state branching process with immigration (CBI process). The n-dimensional process $\{Z_t\,{:}\,t\ge0\}$ can be regarded as an Ornstein–Uhlenbeck-type process (OU-type process) depending on $\{Y_t\,{:}\,t\ge0\}$ . Then the above formulation includes as special cases both the CBI process and the OU-type process. A one-dimensional CBI process first appeared in the scaling limit theorem for discrete Galton–Watson branching processes with immigration established in Kawazu and Watanabe [Reference Kawazu and Watanabe12]. Compared with the discrete model, the CBI process is easier to deal with because its time and state spaces are both smooth, and the distributions that appear are infinitely divisible. For general treatments and backgrounds on branching processes in continuous state spaces, the reader may refer to Kyprianou [Reference Kyprianou14] and Li [Reference Li15, Reference Li17]. The affine processes involve rich common mathematical structures and have found interesting connections and applications in several areas.
The general theory of finite-dimensional affine Markov processes, including several equivalent characterizations and common financial applications, was given by Duffie et al. [Reference Duffie, Filipović and Schachermayer6] under a regularity assumption. The regularity problem asks whether this property holds automatically for stochastically continuous affine processes. This property was established in Dawson and Li [Reference Dawson and Li4] under the first moment condition. The problem was finally settled in Keller-Ressel et al. [Reference Keller-Ressel, Schachermayer and Teichmann13], where it was proved that any stochastically continuous affine Markov process is regular. The connection of the regularity problem with Hilbert’s fifth problem is explained in Keller-Ressel et al. [Reference Keller-Ressel, Schachermayer and Teichmann13].
The strong Feller and ergodic properties of the CBI and OU-type processes have been studied by a number of authors. In particular, a sufficient and necessary integrability condition for the ergodicity of a one-dimensional subcritical or critical CBI process was announced in Pinsky [Reference Pinsky23]; see Li [Reference Li15] for a proof of the result. The strong Feller property and exponential ergodicity in the total variation distance of one-dimensional CBI processes were established in Li and Ma [Reference Li and Ma16] by using a coupling process constructed using strong solutions of a stochastic equation; see also Li [Reference Li17]. The analytic properties of a finite-dimensional stable jump-type CBI process were studied by Friesen and Jin [Reference Friesen and Jin7], who proved that the transition kernel of the process satisfies an a priori bound in a weighted anisotropic Besov norm. From this regularity they deduced the strong Feller property and proved in the subcritical case the exponential ergodicity in the total variation distance; see also Jin et al. [Reference Jin, Kremer and Rüdiger10]. The strong Feller and ergodic properties of Dawson–Watanabe superprocesses with or without immigration were proved in the recent work of Li [Reference Li18] using coupling methods, which generalize the work of Li and Ma [Reference Li and Ma16].
It was proved in Sato and Yamazato [Reference Sato and Yamazato26] that a finite-dimensional OU-type process is ergodic if and only if the eigenvalues of its coefficient matrix have strictly negative real parts. The coupling property and strong Feller property of finite-dimensional OU-type processes were studied in Priola and Zabczyk [Reference Priola and Zabczyk24] and Wang [Reference Wang28]. The ergodicity and exponential ergodicity of such processes in the total variation distance were proved in Schilling and Wang [Reference Schilling and Wang27] and Wang [Reference Wang29].
Barczy et al. [Reference Barczy, Döring, Li and Pap3] studied the existence and uniqueness of a stationary distribution for a special subcritical two-factor affine process, where the first factor was an $\alpha$ -stable CBI process and the second one was driven by a Brownian motion. The exponential ergodicity of the process in the total variation distance was established in Jin et al. [Reference Jin, Kremer and Rüdiger9]. For a subcritical two-factor affine process driven by Lévy stable processes, the exponential ergodicity in the $L^1$ -Wasserstein distance was established in Bao and Wang [Reference Bao and Wang2] by a coupling approach. For general finite-dimensional affine Markov processes, Jin et al. [Reference Jin, Kremer and Rüdiger11] proved a sufficient condition for the ergodicity in weak convergence, which covers partially the results of Pinsky [Reference Pinsky23] and Sato and Yamazato [Reference Sato and Yamazato26]. The necessity of the condition of Jin et al. [Reference Jin, Kremer and Rüdiger11] was still an open problem. The exponential ergodicities in two suitably chosen Wasserstein distances for the process were established in Friesen et al. [Reference Friesen, Jin and Rüdiger8] by coupling methods. Some results on the ergodicity and exponential ergodicity in the total variation distance for affine processes driven by Brownian motions and compound Poisson processes were given by Zhang and Glynn [Reference Zhang and Glynn30]. For general affine processes on cones, the exponential ergodicity was studied by Mayerhofer et al. [Reference Mayerhofer, Stelzer and Vestweber19] under certain irreducibility, aperiodicity, and finite second moment assumptions.
In this paper, we prove some estimates for the variations of transition probabilities of the (1+1)-affine process. From those estimates we deduce the strong Feller and the ergodic properties in the total variation distance of the process. The key strategy is to construct several Markov couplings using strong solutions of stochastic equations, which naturally extend those of the CBI process and the OU-type process introduced by Li and Ma [Reference Li and Ma16], Schilling and Wang [Reference Schilling and Wang27], and Wang [Reference Wang29]. The stochastic equations established by Dawson and Li [Reference Dawson and Li4, Reference Dawson and Li5] provide an efficient method for the pathwise construction and analysis of the couplings, which are of interest in themselves; see, e.g., Friesen et al. [Reference Friesen, Jin and Rüdiger8, p. 2170] and Jin et al. [Reference Jin, Kremer and Rüdiger9, p. 1145]. For simplicity, here we only discuss the (1+1)-dimensional process. The method can be modified to treat general finite-dimensional affine processes by some extra work, which will be addressed separately.
The paper is organized as follows. In Section 2, we give the definition and some basic properties of the (1+1)-affine process. The key estimates for the variations of the transition probabilities are established in Section 3, where the strong Feller property and the exponential ergodicity are also deduced. In Section 4, an ergodicity result is proved under a weaker condition.
2. The affine process
Let us introduce more precisely the (1+1)-affine process. Here we adopt the framework of Duffie et al. [Reference Duffie, Filipović and Schachermayer6]; see also Dawson and Li [Reference Dawson and Li4]. Let $D= \mathbb{R}_{+}\times\mathbb{R}$ be endowed with its Borel $\sigma$ -algebra.
Definition 2.1. A set of parameters $(a, (\alpha_{ij}), (b_1,b_2), (\beta_{ij}), \mu, \nu)$ is called admissible if the following hold:
-
(i) $a\in\mathbb{R}_{+}$ is a constant;
-
(ii) $\alpha= (\alpha_{ij})$ is a symmetric nonnegative definite $(2\times2)$ matrix;
-
(iii) $b= (b_1,b_2)\in D$ is a vector;
-
(iv) $\beta= (\beta_{ij})$ is a $(2\times2)$ matrix with $\beta_{12}=0$ ;
-
(v) $\mu(\textrm{d} v)= \mu(\textrm{d} v_1,\textrm{d} v_2)$ is a $\sigma$ -finite measure on D, supported on $D\setminus\{0\}$ , such that
\begin{align*}\int_D \big(v_1\wedge v_1^2+|v_2|\wedge|v_2|^2\big) \mu(\textrm{d} v)<\infty; \end{align*} -
(vi) $\nu(\textrm{d} v)= \nu(\textrm{d} v_1,\textrm{d} v_2)$ is a $\sigma$ -finite measure on D, supported on $D\setminus\{0\}$ , such that
\begin{align*}\int_D\big(v_1+|v_2|\wedge|v_2|^2\big) \nu(\textrm{d} v)<\infty. \end{align*}
Let $U= \mathbb{C}_{-}\times i\mathbb{R}$ , where $\mathbb{C}_{-}= \{a+ib\,{:}\, a\in\mathbb{R}_{-},b\in\mathbb{R}\}$ and $i\mathbb{R}= \{ia\,{:}\, a\in \mathbb{R}\}$ . Given a set of admissible parameters $(a,(\alpha_{ij}), (b_1,b_2), (\beta_{ij}), \mu, \nu)$ , we define the functions F and R on U by the Lévy–Khintchine type representations
and
The (1+1)-affine process $\{X_t\,{:}\,t\ge 0\}$ is a Markov process with state space D and Feller transition semigroup $(P_t)_{t\ge 0}$ defined by
where $(\phi,\psi)$ is the unique solution of the system of equations
The uniqueness of the solution implies that
Clearly, we can rewrite (2.3) as
The Feller property implies that $\{X_t\,{:}\,t\ge 0\}$ has a càdlàg realization.
Suppose that $\{X_t\,{:}\,t\ge 0\}= \{(Y_t,Z_t)\,{:}\,t\ge 0\}$ is a (1+1)-affine process with transition semigroup $(P_t)_{t\ge 0}$ defined by (2.3) and (2.4). Then $\{Y_t\,{:}\,t\ge 0\}$ is a Markov process on $\mathbb{R}_+$ with Feller transition semigroup $(P^{(1)}_t)_{t\ge 0}$ defined by
where $\lambda\ge 0$ . It is known that $\{Y_t\,{:}\,t\ge 0\}$ is a continuous-state branching process with immigration (CBI process) with branching mechanism $\lambda\mapsto -R(\!-\lambda,0)$ and immigration mechanism $\lambda\mapsto -F(\!-\lambda,0)$ . It is known that
where
see, e.g., the formula (79) in Li [Reference Li17].
In particular, when $F(\!-\lambda,0)\equiv 0$ for all $\lambda\ge 0$ , the CBI process $\{Y_t\,{:}\,t\ge 0\}$ reduces to a continuous-state branching process (CB process) with branching mechanism $\lambda\mapsto R(\!-\lambda,0)$ . Then a CB process has Feller transition semigroup $(Q_t)_{t\ge 0}$ defined by
As a special case of (2.8) we have
Then Jensen’s inequality implies
From (2.9) it is easy to see that zero is a trap for the CB process. For a càdlàg realization of the CB process $\{Y_t\,{:}\,t\ge 0\}$ , we define its extinction time
The reader may refer to Kyprianou [Reference Kyprianou14] and Li [Reference Li15, Reference Li17] for compact treatments of CB and CBI processes.
Condition 2.2. (Grey’s condition.) There exists a constant $\lambda_0> 0$ such that
Proposition 2.3. Suppose that $F(\!-\lambda,0)\equiv 0$ and Condition 2.2 holds. Let $\{Y_t\,{:}\,t\ge 0\}$ be a càdlàg realization of the CB process with $Y_0= y$ . Then we have
where $t\mapsto \bar{v}_t\,{:\!=}\, -\lim_{\lambda\to \infty}\psi_1(t,-\lambda,0)$ is the unique positive solution of
The above proposition follows from Theorem 3.4 and Corollary 3.14 in Li [Reference Li17]. By (2.5) and (2.11), for any $t\ge \delta> 0$ we have
Then for $\beta_{11}< 0$ the probability in (2.12) decays exponentially fast as $t\to \infty$ .
A càdlàg realization of the general (1+1)-affine process can be constructed as the unique strong solution to a system of stochastic integral equations. Let $\sigma_0= \sqrt{a}$ , and let $(\sigma_{ij})$ be a $(2\times2)$ matrix satisfying $(\sigma_{ij})= (\alpha_{ij})(\alpha_{ij})^{\tau}$ . Suppose that $({\Omega},\mathscr{F},\mathscr{F}_t,\mathbb{P})$ is a filtered probability space satisfying the usual hypotheses. Let $W_0(t)$ be a standard $(\mathscr{F}_t)$ -Brownian motion. Let $W_i(\textrm{d} s,\textrm{d} u)$ , $i=1,2$ , be $(\mathscr{F}_t)$ -Gaussian white noises on $(0,\infty)^2$ with intensity $\textrm{d} s\textrm{d} u$ . Let $M(\textrm{d} s,\textrm{d} u,\textrm{d} v)$ be an $(\mathscr{F}_t)$ -Poisson random measure on $(0,\infty)^2\times D$ with intensity $\textrm{d} s\textrm{d} u\mu(\textrm{d} v)$ , and let $N(\textrm{d} s,\textrm{d} v)$ be an $(\mathscr{F}_t)$ -Poisson random measure on $(0,\infty)\times D$ with intensity $\textrm{d} s\nu(\textrm{d} v)$ . The corresponding compensated measures are denoted by $\tilde{M}(\textrm{d} s,\textrm{d} u,\textrm{d} v)$ and $\tilde{N}(\textrm{d} s,\textrm{d} v)$ . We assume these random elements are independent of each other. Given an $\mathscr{F}_0$ -measurable random variable $(Y_0,Z_0)\in D$ , we consider the following system of stochastic integral equations:
and
Here and in the sequel, we understand that, for any $a\leq b\in\mathbb{R}$ ,
The existence and pathwise uniqueness of the solution to (2.14) follows from Theorem 8.5 in Li [Reference Li17]. A weakly equivalent stochastic equation was first introduced by Dawson and Li [Reference Dawson and Li4]; see also Dawson and Li [Reference Dawson and Li5]. The existence and pathwise uniqueness of the solution to (2.15) are straightforward. In fact, by (2.14)–(2.15), one can see using integration by parts that
By Theorem 6.2 of Dawson and Li [Reference Dawson and Li4], the process $\{(Y_t,Z_t)\,{:}\,t\ge 0\}$ defined by (2.14)–(2.15) is a (1+1)-affine process with transition semigroup $(P_t)_{t\ge 0}$ given by (2.3) and (2.4).
Let us remark that $\{Z_t\,{:}\,t\ge 0\}$ reduces to a one-dimensional OU-type process if $\sigma_{21}= \sigma_{22}= \beta_{21}= 0$ and $\mu$ is supported on $(0,\infty)\times \{0\}$ . For instance, in this case, from (2.16) we have
A number of moment estimates for general finite-dimensional affine processes were given in Friesen et al. [Reference Friesen, Jin and Rüdiger8]. Since more accurate estimates are needed in this work, we here present the following result.
Proposition 2.4. Let $\{(Y_t,Z_t)\,{:}\,t\ge 0\}$ be a (1+1)-affine process with $Y_0= y\in \mathbb{R}_+$ and $Z_0= z\in \mathbb{R}$ . Let $D_1= \mathbb{R}_+\times [\!-1,1]$ and $D_1^c= D\setminus D_1$ . Then we have
Proof. We may assume that $\{(Y_t,Z_t)\,{:}\,t\ge 0\}$ is defined by (2.14)–(2.15). In view of (2.16) we have
Then (2.18) follows.
Proposition 2.5. Suppose that $\beta_{11}< 0$ and $\beta_{22}< 0$ . Then the transition semigroup $(P_t)_{t\ge 0}$ defined by (2.3) and (2.4) has a unique stationary distribution $\pi$ , which is given by
Moreover, the distribution $\pi$ has finite first moment; that is,
Proof. By Theorem 2.7 of Jin et al. [Reference Jin, Kremer and Rüdiger11], the affine process has a unique stationary distribution $\pi$ given by (2.19). In particular, we have
By differentiating both sides of the above equality at $\lambda= 0+$ and using (2.1), one may see that
By (2.8) and (2.18) there is a constant $C\ge 0$ such that
Since $\pi$ is a stationary distribution for $(P_t)_{t\ge 0}$ , it follows that
Then, letting $t\to \infty$ and $k\to \infty$ , we obtain
This proves the result.
Proposition 2.6. For $i=1,2$ , let $x_i= (y_i,z_i)\in D$ , and let $\{X_i(t)\,{:}\,t\ge 0\}= \{(Y_i(t),Z_i(t))\,{:}\,t\ge 0\}$ be defined by (2.14) and (2.15) with $(Y_i(0),Z_i(0))= x_i$ . Then, for any $t\ge 0$ ,
Proof. Without loss of generality, we may assume $y_1\ge y_2$ . By Theorem 10.1 in Li [Reference Li17] one can see that $\{Y_1(t)\ge Y_2(t)$ for every $t\ge 0\}= 1$ and $\{Y_1(t)-Y_2(t)\,{:}\,t\ge 0\}$ is a CB process with transition semigroup $(Q_t)_{t\ge 0}$ . Then we apply (2.16) to $Z_1(t)$ and $Z_2(t)$ and take the difference to see
It follows that
where we have used Doob’s $L^2$ inequality for the square-integrable martingales; see, e.g., Revuz and Yor [Reference Revuz and Yor25, Theorem 1.7, p. 54]. Then the desired estimate follows by (2.10).
A Markov process $\{(X_1(t),X_2(t))\,{:}\,t\ge 0\}$ with state space $D^2$ is called a Markov coupling of the (1+1)-affine process with transition semigroup $(P_t)_{t\ge 0}$ defined by (2.3) and (2.4) with coupling time $\tau\,{:\!=}\, \inf\{t\ge 0\,{:}\, X_1(t)= X_2(t)\}$ if both $\{X_1(t)\,{:}\,t\ge 0\}$ and $\{X_2(t)\,{:}\,t\ge 0\}$ are Markov processes with transition semigroup $(P_t)_{t\ge 0}$ and $X_1(\tau+t)= X_2(\tau+t)$ for every $t\ge 0$ .
The method of couplings provides an efficient way to estimate the variations of the transition probabilities of the affine process. Let $\|\cdot\|_{\textrm{var}}$ denote the total variation norm of signed measures. Let $\mathscr{B}_1$ be the set of Borel functions f on D satisfying $|f|\le 1$ . Then we have
Let $\{(X_1(t),X_2(t))\,{:}\,t\ge 0\}$ be a Markov coupling of the affine process with initial state $(X_1(0),X_2(0))= (x_1,x_2)$ and coupling time $\tau$ . From (2.20) it follows that
By the pathwise uniqueness for (2.14)–(2.15), the process $\{(X_1(t),X_2(t))\,{:}\,t\ge 0\}$ defined in Proposition 2.6 is a Markov coupling of the affine process with transition semigroup $(P_t)_{t\ge 0}$ . Based on this coupling, several different couplings of the affine process will be given in the next two sections. We shall see that the stochastic equations (2.14)–(2.15) and (2.16) provide an efficient method for the pathwise construction and analysis of those couplings. The approach of stochastic equations has also played an important role in other recent developments concerning branching processes in continuous state spaces; see, e.g., Bansaye and Méléard [Reference Bansaye and Méléard1], Li [Reference Li17], Pardoux [Reference Pardoux22], and the references therein.
3. Estimates for variations of probabilities
In this section, we study the strong Feller property and the exponential ergodicity of the total variation distance of the affine process. Let $\|\cdot\|_{\textrm{var}}$ denote the total variation norm of signed measures. Our strategy is to establish some estimates for the differences of the transition probabilities in the form (2.20). The proofs of the estimates are based on couplings of the affine process constructed in terms of strong solutions of stochastic equations. From those estimates we deduce the strong Feller property and the exponential ergodicity under natural conditions.
We first consider the case $\sigma_0> 0$ . Write $x_1= (y_1,z_1)$ and $x_2= (y_2,z_2)$ , where $y_1,y_2\in \mathbb{R}_+$ and $z_1,z_2\in \mathbb{R}$ . For $i=1,2$ let $(Y_i(t),Z_i(t))$ be defined by (2.14)–(2.16) with $(Y_i(0),Z_i(0))= (y_i,z_i)$ , and write $X_i(t)= (Y_i(t),Z_i(t))$ . Then $\{(X_1(t),X_2(t))\,{:}\,t\ge 0\}$ is a Markov coupling of the affine process. The pathwise uniqueness of the solution for (2.14) implies $Y_1(\tau_0+t)= Y_2(\tau_0+t)$ for $t\ge 0$ . In fact, by Theorem 10.1 in Li [Reference Li17] it is easy to see that $\{|Y_1(t)-Y_2(t)|\,{:}\,t\ge 0\}$ is a CB process with transition semigroup $(Q_t)_{t\ge 0}$ . Let $\tau_0= \inf\{t\ge 0\,{:}\, Y_1(t)= Y_2(t)\}$ be the extinction time of the process. By Proposition 2.3 we have
Let $a(\tau_0)= [Z_1(\tau_0) - Z_2(\tau_0)]/2\sqrt{2}\sigma_0$ and
Then we define the process $\{Z_2'(t)\,{:}\,t\ge 0\}$ by
It is clear that $Z_2'(t\land \tau_0)= Z_2(t\land \tau_0)$ for $t\ge 0$ . Write $X_2'(t)= (Y_2(t),Z_2'(t))$ . Then $\{(X_1(t),X_2'(t))\,{:}\,t\ge 0\}$ is also a Markov coupling of the affine process. For $t\ge 0$ let
Since $Y_1(\tau_0+t)= Y_2(\tau_0+t)$ for $t\ge 0$ , by the construction of $Z_1(t)$ and $Z_2'(t)$ we have
It follows that \P $\tau= \inf\{t\ge 0\,{:}\, \zeta(t)= 0\}$ , and so
Then $\tau_0+\tau$ is the coupling time of $\{(X_1(t),X_2'(t))\,{:}\,t\ge 0\}$ .
Theorem 3.1. Suppose that $\sigma_0> 0$ . Then there is a constant $C\ge 0$ such that
Proof. Let $\{(X_1(t),X_2'(t))\,{:}\,t\ge 0\}$ be the coupling of the affine process constructed as above. Then we have
where $\mathbb{P}(\tau_0> t/2)\le |y_1-y_2|\bar{v}_{t/2}$ by (3.1). In view of (3.2), there is a standard Brownian motion $\{B(t)\,{:}\,t\ge 0\}$ independent of $\mathscr{F}_{\tau_0}$ such that
where
Since $t-\tau_0$ is measurable relative to $\mathscr{F}_{\tau_0}$ , by the reflection principle for the Brownian motion we get
Corollary 3.2. Suppose that $\sigma_0> 0$ . Then $(P_t)_{t\ge 0}$ is a strong Feller transition semigroup.
Corollary 3.3. Suppose that $\beta_{11}< 0$ , $\beta_{22}< 0$ , and $\sigma_0> 0$ . Let $\pi$ be the unique stationary distribution for $(P_t)_{t\ge0}$ . Then for every $\delta>0$ there is a constant $C_\delta\ge 0$ such that
where $\kappa= |\beta_{11}|\land |\beta_{22}|$ .
Proof. By Proposition 2.5, the stationary distribution $\pi$ possesses a finite first moment. It is well known that
By Theorem 3.1, there is a constant $C\ge 0$ such that, for $x_i= (y_i,z_i)\in D$ , $i= 1,2$ ,
Then the desired estimate follows by (2.13).
Now let us consider the case where $\nu(D)> 0$ . For $\varepsilon> 0$ let $D_\varepsilon= \mathbb{R}_+\times [\!-\varepsilon,\varepsilon]$ and $D_\varepsilon^c= D\setminus D_\varepsilon$ . By choosing sufficiently small $\varepsilon\in (0,1]$ we have $0< \nu(D_\varepsilon^c)< \infty$ . Let $\nu_\varepsilon$ be the finite measure on D defined by
where $A\in \mathscr{B}(D)$ . Let $\hat{\nu}_\varepsilon= \nu_\varepsilon(D)^{-1} \nu_\varepsilon$ .
Condition 3.4. There exists $\varepsilon\in (0,1]$ such that
The above condition is a slight modification of (10) in Wang [Reference Wang29] for OU-type processes. As in Wang [Reference Wang29, p. 996], one may see that the above condition implies
Theorem 3.5. Suppose that Condition 3.4 is satisfied for some $\varepsilon\in (0,1]$ . Then we have
Proof. Step 1. Consider the case where $y_1= y_2= y\in \mathbb{R}_+$ . Let $\{Y_t\,{:}\,t\ge 0\}$ be the solution of (2.14) with $Y_0= y$ . Let $z_0= 0$ . For $i=0,1,2$ let $\{Z_i(t)\,{:}\,t\ge 0\}$ be defined by (2.16), with $Z_i(0)= z_i$ . It is easy to see that
Let $\{\eta_\varepsilon(t)\,{:}\,t\ge 0\}$ be the compensated compound Poisson process defined by
Let $\tau_1= \inf\{t\ge 0\,{:}\, \eta_\varepsilon(t)\neq \eta_\varepsilon(t-\!)\}$ be the first jump time of this process. For any $f\in \mathscr{B}_1$ we have
where $\mathbb{P}(\tau_1> t)= \textrm{e}^{-\nu(D_\varepsilon^c)t}$ and
By the strong Markov property and (2.16),
It follows that
Then we can use the Markov property and the representation (3.6) to get
Step 2. In the general case, we have $x_1= (y_1,z_1)$ and $x_2= (y_2,z_2)$ , where $y_1,y_2\in \mathbb{R}_+$ and $z_1, z_2\in \mathbb{R}_+$ . It suffices to consider the case of $y_1\ge y_2$ . Let $\{(Y_i(t),Z_i(t))\,{:}\,t\ge 0\}$ be defined by (2.14)–(2.15) with $(Y_i(0),Z_i(0))= x_i$ , $i=1,2$ . Then
with $\mathbb{P}(\tau_0> t/3)\le |y_1-y_2|\bar{v}_{t/3}$ and
where we have used (3.7) for the first inequality. Then the desired estimate follows by (2.20) and Proposition 2.6.
Corollary 3.6. Suppose that Condition 3.4 is satisfied for a sequence $\{\varepsilon_n\}\subset (0,1]$ and $\lim_{n\to \infty}\nu(D_{\varepsilon_n}^c)= \infty$ . Then $(P_t)_{t\ge 0}$ is a strong Feller transition semigroup.
Proof. Suppose that $\{x_k\}\in D$ is a sequence such that $\lim_{k\to \infty} x_k= x_0\in D$ . By Theorem 3.5, for $t>0$ and $n\ge 1$ we have
The left-hand side vanishes since $\lim_{n\to \infty}\nu(D_{\varepsilon_n}^c)= \infty$ .
Corollary 3.7. Suppose that $\beta_{11}< 0$ , $\beta_{22}< 0$ , and Condition 3.4 is satisfied. Then there is a constant $C_\varepsilon\ge 0$ such that
where $\kappa_\varepsilon= |\beta_{11}|\land |\beta_{22}|\land \nu(D_\varepsilon^c)$ .
Proof. By Theorem 3.5 there is a constant $C_\varepsilon\ge 0$ such that, for $t> 0$ and $x_i= (y_i,z_i)\in D$ , $i= 1,2$ ,
Then the result follows as in the proof of Corollary 3.3.
Theorems 3.1 and 3.5 and their corollaries are natural extensions of the existing results on CBI and OU-type processes in the literature. In fact, one may see that some parts of the proofs given above essentially follow the ideas of Li and Ma [Reference Li and Ma16] and Wang [Reference Wang29]; see also Wang [Reference Wang28]. For general affine processes on cones, Mayerhofer et al. [Reference Mayerhofer, Stelzer and Vestweber19] studied the exponential ergodicity in the total variation distance under certain irreducibility, aperiodicity, and finite second moment assumptions. Their techniques were based on the theory of stochastic stability of Markov processes; see Meyn and Tweedie [Reference Meyn and Tweedie20, Reference Meyn and Tweedie21] and the references therein. While the results of Mayerhofer et al. [Reference Mayerhofer, Stelzer and Vestweber19] were formulated in an abstract framework, it seems a delicate task to check their conditions for the process discussed here. Moreover, the finite second moment condition of Mayerhofer et al. [Reference Mayerhofer, Stelzer and Vestweber19] rules out some natural examples.
4. A weaker condition for ergodicity
Throughout this section, we assume $\beta_{11}< 0$ and $\beta_{22}< 0$ . We shall establish the ergodicity of the affine process under a condition on the Lévy measure $\nu$ weaker than Condition 3.4. The proof of the result is based on a coupling similar to those used in the last section. Suppose that $\nu(D)> 0$ and choose $0< \varepsilon< 1$ so that $0< \nu(D_\varepsilon)< \infty$ , where $D_\varepsilon=\mathbb{R}_+\times [\!-\varepsilon,\varepsilon]$ . Let $\nu_\varepsilon$ and $\hat{\nu}_\varepsilon$ be defined as in the last section. Let $\gamma^{\varepsilon}_z= \hat{\nu}_\varepsilon\land (\delta_{(0,z)}*\hat{\nu}_\varepsilon)$ for $z\in \mathbb{R}$ .
Condition 4.1. There are constants $\varepsilon\in (0,1]$ and $\delta> 0$ such that
The above condition was introduced for general finite-dimensional OU-type processes by Schilling and Wang [Reference Schilling and Wang27] and Wang [Reference Wang29]; see also Wang [Reference Wang28]. As observed in Wang [Reference Wang29, p. 992], the condition is weaker than Condition 3.4.
Lemma 4.2. Suppose that Condition 4.1 is satisfied. For $|z|\le \delta$ , let $\hat{\gamma}^{\varepsilon}_z= \gamma^{\varepsilon}_z(D)^{-1} \gamma^{\varepsilon}_z$ , and let $(\eta,\rho_1,\zeta)$ be a random vector such that, for $A\in \mathscr{B}(D)$ ,
Let $\rho_2= \rho_1+\zeta$ . Then we have
and
Proof. From (4.1) it is easy to see that $\zeta$ has distribution given by (4.2). Moreover, for any $A\in \mathscr{B}(D)$ we have
and
Then (4.3) holds.
Lemma 4.3. Suppose that Condition 4.1 is satisfied. Let $Q(z,\cdot)$ denote the joint distribution of $(\eta,\rho_1,\rho_2)$ . Then $Q(z,\cdot)$ is a probability kernel from $[\!-\delta,\delta]$ to $\mathbb{R}_+\times \mathbb{R}^2$ .
Proof. It is easy to see that $z\mapsto \delta_{(0,z)}*\hat{\nu}_\varepsilon$ is a Borel probability kernel from $[\!-\delta,\delta]$ to D. Let
denote the total variation of the signed measure $\hat{\nu}_\varepsilon - \delta_{(0,z)}*\hat{\nu}_\varepsilon$ . By the regularity of the measures, for any bounded positive continuous function f on D we have
where $\mathscr{C}_1$ is the set of continuous functions g on D satisfying $|g|\le 1$ . Then the mapping
is lower semicontinuous, so it is a Borel function on $[\!-\delta,\delta]$ . It follows that $m^{\varepsilon}(z,\cdot)$ and $\gamma^{\varepsilon}_z= \hat{\nu}_\varepsilon + \delta_{z}*\hat{\nu}_\varepsilon - m^{\varepsilon}(z,\cdot)$ are kernels from $[\!-\delta,\delta]$ to D. From (4.1) we see that $Q(z,\cdot)$ is a Borel probability kernel from $[\!-\delta,\delta]\setminus \{0\}$ to $\mathbb{R}_+\times \mathbb{R}^2$ .
Now let us define the first coupling in this section. The basic idea follows that of Schilling and Wang [Reference Schilling and Wang27] and Wang [Reference Wang29]. Here we give a pathwise construction of the coupling in terms of stochastic integrals. Let $x_1= (y,z_1)\in D$ and $x_2= (y,z_2)\in D$ , where $y\in \mathbb{R}_+$ and $z_1, z_2\in \mathbb{R}$ satisfy $z_1\neq z_2$ . Let $k= \lfloor\delta^{-1}|z_1-z_2|\rfloor+1$ . Then $k^{-1}|z_1-z_2|\le \delta$ . Let $G= \mathbb{R}_{+}\times \mathbb{R}^2$ be endowed with its Borel $\sigma$ -algebra. In addition to the noises in (2.14) and (2.15), let $N_0(\textrm{d} s,\textrm{d} u,\textrm{d} v_1,\textrm{d} v_2)$ be an $(\mathscr{F}_t)$ -Poisson random measure on $(0,\infty)\times G$ with intensity
We assume all of these noises are independent of each other. For $t\ge 0$ let
Then we have
where
Let $N_0(\textrm{d} s,\textrm{d} r)$ be the image of $N_0(\textrm{d} s,\textrm{d} u,\textrm{d} v_1,\textrm{d} v_2)$ under the mapping
We easily see that $N_0(\textrm{d} s,\textrm{d} r)$ is a Poisson random measure on $(0,\infty)\times \mathbb{R}$ with intensity $\nu_\varepsilon(D)\textrm{d} s\pi(s,\textrm{d} r)$ , where $\pi(s,\textrm{d} r)$ is the probability measure on $\mathbb{R}$ defined by
It follows that
is a Poisson process with parameter $q\nu_\varepsilon(D)$ . For $i\ge 1$ , let $\xi_i$ denote the size of the ith jump of the process $\{L(t)\,{:}\,t\ge 0\}$ . Then $\{\xi_i\,{:}\,i\ge 1\}$ are independent and identically distributed (i.i.d.) random variables with
Let $S(n)= \sum_{i=1}^n \xi_i$ . Then the processes $\{N(t)\,{:}\,t\ge 0\}$ and $\{S(n)\,{:}\, n\ge 0\}$ are independent and
This proves the following.
Lemma 4.4. The process $\{L(t)\,{:}\,t\ge 0\}$ is a continuous-time simple random walk with i.i.d. jumps $\{\xi_i\,{:}\,i\ge 1\}$ satisfying (4.7).
Let $\tau= \inf\{t\ge 0\,{:}\,\xi(t)= 0\}= \inf\{t\ge 0\,{:}\,L(t)= -1\}$ , and let $\{Y'(t)\,{:}\,t\ge 0\}$ be the solution of
Let $\{Z_1'(t)\,{:}\,t\ge 0\}$ and $\{Z_2'(t)\,{:}\,t\ge 0\}$ be defined by
and
Let $N_1(\textrm{d} s,\textrm{d} u,\textrm{d} v_1)$ and $N_2(\textrm{d} s,\textrm{d} u,\textrm{d} v_2)$ respectively denote the images of the random measure $N_0(\textrm{d} s,\textrm{d} u,\textrm{d} v_1,\textrm{d} v_2)$ under the mappings
Clearly, both $N_1(\textrm{d} s,\textrm{d} u,\textrm{d} w)$ and $N_2(\textrm{d} s,\textrm{d} u,\textrm{d} w)$ are Poisson random measures on $(0,\infty)\times D$ with intensity $\nu_\varepsilon(D)\textrm{d} s\hat{\nu}_\varepsilon(\textrm{d} u,\textrm{d} w)= \textrm{d} s\nu_\varepsilon(\textrm{d} u,\textrm{d} w)$ . It follows that both $\{(Y'(t),Z_1'(t))\,{:}\,t\ge 0\}$ and $\{(Y'(t),Z_2'(t))\,{:}\,t\ge 0\}$ are affine processes with transition semigroup $(P_t)_{t\ge 0}$ . From (4.8), (4.9), and (4.10) we see that
Then $\{(Y'(t),Z_1'(t)$ , $Y'(t),Z_2'(t))\,{:}\,t\ge 0\}$ is a coupling of the affine process with coupling time
Remark 4.5. The intensity (4.4) of the Poisson random measure $N_0(\textrm{d} s,\textrm{d} u,\textrm{d} v_1,\textrm{d} v_2)$ in (4.8) and (4.9)–(4.10) depends on the difference $Z_1'(0) - Z_2'(0)= z_1-z_2$ .
Lemma 4.6. Suppose that Condition 4.1 is satisfied. Then there exists a constant $C_\varepsilon> 0$ such that
Proof. By the reflection principle for the symmetric simple random walk, we have
see, e.g., Lemma 2.3 in Schilling and Wang [Reference Schilling and Wang27]. By the Berry–Esseen inequality, there is a universal constant $C_0> 0$ such that
Let $T= \inf\{n\ge 0\,{:}\,S(n)= -1\}$ . Then we have
where $C_1= 1\vee C_0$ . By the total probability formula and the independence of $\{S(n)\,{:}\,n\ge 0\}$ and $\{N(t)\,{:}\,t\ge 0\}$ , it follows that
Then (4.12) holds for some constant $C_\varepsilon\ge 0$ .
We next construct the main coupling of this section, through a concatenation of two couplings. Let $D([0,\infty),D^2)$ denote the space of càdlàg paths from $[0,\infty)$ to D. Let $\{w(t)\,{:}\,t\ge 0\}= \{(w_1(t),w_2(t)$ , $w_3(t),w_4(t))\,{:}\,t\ge 0\}$ denote the coordinate process of this space, and let $(\mathscr{F}_t\,{:}\,t\ge 0)$ be its natural filtration generated by the coordinate process. Let $\tau_0^w= \{t\ge 0\,{:}\,w_1(t)= w_3(t)\}$ and $\tau^w= \{t\ge \tau_0(w)\,{:}\,w_2(t)= w_4(t)\}$ . For $s\ge 0$ , let $\theta_s$ be the shifting operator on $D([0,\infty),D^2)$ defined by $\theta_sw(t)= w(s+t)$ , $t\ge 0$ . For $s\ge 0$ and $w\in D([0,\infty),D^2)$ , the stopped path $w^s\in D([0,\infty),D^2)$ is defined by $w^s(t)= w(s\land t)$ , $t\ge 0$ .
Let $x_1= (y_1,z_1)\in D$ and $x_2= (y_2,z_2)\in D$ , where $y_1,y_2\in \mathbb{R}_+$ and $z_1, z_2\in \mathbb{R}$ . For $i=1,2$ , let $\{Y_i(t)\,{:}\,t\ge 0\}$ be the solution of (2.14) with $Y_i(0)= y_i$ , and let $\{Z_i(t)\,{:}\,t\ge 0\}$ be defined by (2.16) with $Z_i(0)= z_i$ . Let $\mathbb{P}^{2,2}_{(x_1,x_2)}$ be the distribution on $D([0,\infty),D^2)$ of $\{(Y_1(t),Z_1(t)$ , $Y_2(t),Z_2(t))\,{:}\,t\ge 0\}$ . Let $\mathbb{P}^{1,2}_{(y,z_1,z_2)}$ be the distribution on $D([0,\infty),D^2)$ of the process $\{(Y'(t),Z_1'(t)$ , $Y'(t),Z_2'(t))\,{:}\,t\ge 0\}$ defined by (4.8)–(4.10). Let $\mathbb{P}_{(x_1,x_2)}$ be the probability measure on $D([0,\infty),D^2)$ defined by
where F and G are Borel functions on $D([0,\infty),D^2)$ , and the probability symbols are used to denote the corresponding expectations. Then under $\mathbb{P}_{(x_1,x_2)}$ the coordinate process $\{(w_1(t),w_2(t),$ $w_3(t),w_4(t))\,{:}\,t\ge 0\}$ evolves according to the transition law of $\{(Y_1(t),Z_1(t),$ $Y_2(t),Z_2(t))\,{:}\,t\ge 0\}$ up to time $\tau_0^w$ , after which it evolves according to the transition law of $\{(Y'(t),Z_1'(t),$ $Y'(t),Z_2'(t))\,{:}\,t\ge 0\}$ . It is clear that both $\{(w_1(t),w_2(t))\,{:}\,t\ge 0\}$ and $\{(w_3(t),w_4(t))\,{:}\,t\ge 0\}$ are affine processes with transition semigroup $(P_t)_{t\ge 0}$ . Thus they form a coupling of the affine process with coupling time $\tau^w$ .
Remark 4.7. One might wish to construct a coupling through stochastic equations by a direct concatenation of the two sets of stochastic equations. As above, one first constructs the process $\{(Y_1(t),Z_1(t),$ $Y_2(t),Z_2(t))\,{:}\,t\ge 0\}$ by (2.14)–(2.16) and defines the stopping time $\tau_0= \{t\ge 0\,{:}\,Y_1(t)= Y_2(t)\}$ . By Remark 4.5 one would need a Poisson random measure with intensity depending on the random variable $Z_1(\tau_0) - Z_2(\tau_0)$ to define the coupling process on the time interval $[\tau_0,\infty)$ by (4.8)–(4.10). We leave the details to the interested reader.
Theorem 4.8. Suppose that Condition 4.1 is satisfied. Then there is a constant $C_\varepsilon\ge 0$ such that
Proof. There is no loss of generality in assuming $y_1\ge y_2$ . Using the coupling of the affine process constructed above, we have
where $\mathbb{P}_{(x_1,x_2)}(\tau_0^w> t/2)\le |y_1-y_2|\bar{v}_{t/2}$ by (3.1). Since $t-\tau_0^w$ is measurable relative to $\mathscr{F}_{\tau_0^w}$ , we can use Lemma 4.6 to see
Corollary 4.9. Suppose that Condition 4.1 is satisfied. Then there is a constant $C_\varepsilon\ge 0$ such that
The above corollary gives an extension of Theorem 1.1 of Schilling and Wang [Reference Schilling and Wang27]; see also Theorem 2(i) of Wang [Reference Wang29].
Acknowledgements
We would like to express our gratitude to two anonymous referees for their very helpful comments and suggestions on the paper and on the literature. We are grateful to Peisen Li and Jian Wang for their helpful comments.
Funding information
This research was supported by the National Key R&D Program of China (No. 2020YFA0712900).
Competing interests
There were no competing interests to declare which arose during the preparation or publication process of this article.