1. Introduction
Hawkes processes were first introduced in [Reference Hawkes9, Reference Hawkes10] as an extension of the Poisson process, which has the so-called self-exciting effect; that is, the occurrence of an event will increase the probability of future events. Hawkes processes have been widely used to model various applications, for example, in finance [Reference Bacry, Mastromatteo and Muzy2, Reference Hawkes11] and internet traffic and queueing [Reference Chen4, Reference Daw and Pender6, Reference Gao and Zhu7, Reference Koops, Saxena, Boxma and Mandjes14]. There are extensive studies of Hawkes processes, including the exact distributional properties [Reference Hawkes and Oakes12, Reference Oakes16] for exponential-type kernel functions, and limit theorems in both conventional scaling [Reference Bacry, Delattre, Hoffmann and Muzy1, Reference Bacry, Mastromatteo and Muzy2, Reference Chen4, Reference Karabash and Zhu13] and large-intensity scaling [Reference Gao and Zhu7, Reference Gao and Zhu8, Reference Li and Pang15].
In this paper, we investigate the random splitting/sampling and aggregating/superposition of Hawkes processes. The random splitting/sampling of point processes has been an important topic in stochastic models. For example, one arrival stream of customers may require different types of services, and the same packet in a communication network may be sent out simultaneously on several outgoing links. Similarly, input into a service or communication system can come from aggregating several sources [Reference Sriram and Whitt17, Reference Sriram and Whitt18]. See further discussions in [Reference Whitt21, Chapter 9]. Splitting and aggregating of standard point processes are well understood in the literature [Reference Whitt19, Reference Whitt20, Reference Whitt21].
The Poisson branching representation [Reference Hawkes and Oakes12], also known as the immigration–birth representation, has been a fundamental tool in the analysis of Hawkes processes. This representation says that for each individual (generation), the number of individuals (children) produced by this individual over time, called the next generation, is a simple and conditional Poisson process with an intensity that is a functional of the counting process of the individual’s generation. It is known that random splitting/sampling of Poisson processes results in independent Poisson processes, each component with a rate equal to that of the original process multiplied by the splitting probability. By contrast, each sub-counting process resulting from the random splitting/sampling of a Hawkes process cannot be regarded as a Hawkes process itself, despite the conditional Poisson property for each generation. Intuitively, the jumping intensity of a sub-counting process depends on the history of the original Hawkes process, which requires information strictly greater than that provided by the sub-counting process. However, we show that the vector-valued splitting Hawkes process is a multidimensional Hawkes process (Propositions 3.2 and 5.1). It is also clear that the split processes are no longer independent, but their dependence structure is not at all obvious.
We thus aim to understand the split processes of a Hawkes process and their dependence structure. We provide two representations of the (scaled) split processes, one directly using the original Hawkes process, and the other using the non-homogeneous conditional Poisson processes in each generation from the Poisson branching representation. For ease of exposition, we start with the splitting of a one-dimensional Hawkes process; we discuss how the splitting schemes work and show the equivalence of the limits in the functional central limit theorems (FCLTs) derived from them, which rely on the existing results in Chapter 9.5 of Whitt [Reference Whitt21] and in [Reference Bacry, Delattre, Hoffmann and Muzy1]. (See the discussions in Section 3.) We next aim to understand the aggregation/superposition of a multivariate Hawkes process and show when the aggregated process can still be a Hawkes process (Section 4).
We then consider the scheme of first splitting and then aggregating a multivariate Hawkes process, which may transform it into a point process of a different dimension (Section 5). We identify conditions under which the transformed process is again Hawkes. We prove an FCLT for the transformed process, which is a Brownian motion limit with a surprisingly simple covariance function. We also discuss how that relates to the known result in the special case of a Hawkes process.
1.1. Notation
All random variables and processes are defined in a common complete probability space $\big(\Omega,\mathcal{F},\{\mathcal{F}_{t}\}_{t\geq 0},\mathbb{P}\big)$ . Throughout the paper, ${\mathbb N}$ denotes the set of natural numbers; ${\mathbb R} ({\mathbb R}_{+})$ denotes the space of real (nonnegative) numbers. Let ${\mathbb D}={\mathbb D}({\mathbb R}_{+},{\mathbb R})$ denote the ${\mathbb R}$ -valued function space of all càdlàg functions on ${\mathbb R}_+$ . The expression $({\mathbb D},J_{1})$ denotes the space ${\mathbb D}$ equipped with the Skorokhod $J_{1}$ topology (see [Reference Billingsley3]), which is complete and separable. ${\mathbb D}^{n}$ denotes an n-dimensional vector-valued càdlàg process endowed with the weak Skorokhod $J_{1}$ topology [Reference Whitt21], for which we write $({\mathbb D}^{n},J_{1})$ . $L^{2}(\mathbb{P})$ denotes the space of random variables with finite second moment. For an integrable function $f\;:\;{\mathbb R}\to{\mathbb R}$ , its $L^{1}$ norm is denoted by $\|f\|_{1}$ . The symbols $\to$ and $\Rightarrow$ mean convergence of real numbers and convergence in distribution, respectively. For a matrix $M=(M_{ij})_{i,j}$ , we denote by $\textrm{ent}_{ij}M=M_{ij}$ its (i,j)th entry, while $\textrm{row}_{k}M$ and $\textrm{col}_{m}M$ denote its kth row vector and the mth column vector, respectively. $M^{\textrm{T}}$ denotes the transpose of M; I denotes the identity matrix; e denotes the column vector of 1s with associated dimension. For a vector a, $\textrm{diag}(a)$ denotes the diagonal matrix with the elements of the vector a on the main diagonal. The expression $f*g(t)\;:\!=\;\int_{0}^{t}f(t-s)g(s)ds$ denotes the convolution of f and g on ${\mathbb R}_{+}$ . Additional notation is introduced in the paper whenever necessary.
2. Preliminaries on Hawkes processes
A d-dimensional Hawkes process, $N=\{N(t), t\geq0\}$ with $N=\big(N_{k}\big)_{k=1,\dots,d}$ , is formally defined as an ${\mathbb N}^{d}$ -valued simple counting process with conditional intensity
for every $k=1,\cdots,d$ , where $\tau_{kj}$ is the jth event time of $N_{k}$ ; $\lambda_{k0}\ge0$ is a constant called the baseline intensity of the kth subprocess; and $H_{kk^{\prime}}\;:\;{\mathbb R}_{+}\to{\mathbb R}_{+}$ is called the mutually exciting function or the kernel function, also known as the cross-exciting function for $k\neq k^{\prime}$ and the self-exciting function for $k=k^{\prime}$ .
Assumption A1. For all $k,k^{\prime}=1,\cdots,d$ , we have
and the spectral radius $\rho(\|H\|_{1})$ of the matrix
satisfies $\rho(\|H\|_{1})<1$ .
The condition in (2.2) is also called the non-explosion criterion in [Reference Bacry, Delattre, Hoffmann and Muzy1, Reference Daley and Vere-Jones5]. It is easy to calculate the mean of N(t):
Here $\lambda_{0}=\big(\lambda_{k0}\big)_{k}$ is the constant vector baseline intensity in (2.1), $\varphi*I(s)$ is a matrix with $\varphi_{kk^{\prime}}*1(s)=\int_{0}^{s}\varphi_{kk^{\prime}}(s-u)\,du$ at its (k, k ′)th entry (we abuse notation to let $1(\cdot)$ indicate a constant function equal to one), and $\varphi$ is a $d\times d$ matrix defined as an $L^{1}(dt)$ limit of the following series:
where $F*G$ is defined as the matrix having as each entry
for matrix-valued functions F, G; see e.g. [Reference Bacry, Delattre, Hoffmann and Muzy1, Theorem 2]. The function $\varphi$ can also be understood as the renewal density of the function H and satisfies the (matrix) renewal equation
In addition, under Assumption A1,
The functional law of large numbers for the Hawkes process N reads as follows [Reference Bacry, Delattre, Hoffmann and Muzy1, Theorem 1]: under Assumption A1,
almost surely and in $L^{2}(\mathbb{P})$ , where $\bar{N}_{T}(t)\;:\!=\;T^{-1}N(Tt)$ . The FCLT for the Hawkes process N is stated as follows [Reference Bacry, Delattre, Hoffmann and Muzy1, Theorem 2]: under Assumption A1,
as $T\to \infty$ , where
W is a d-dimensional standard Brownian motion, and
3. Splitting of a one-dimensional Hawkes process
We now describe the random splitting mechanism of Hawkes processes. We first focus on the splitting of a one-dimensional Hawkes process for the sake of exposition. The splitting of a d-dimensional Hawkes process can be derived similarly and will be discussed in Section 5 together with aggregation. We provide two representations of the split processes: the first using the method in Chapter 9.5 of Whitt [Reference Whitt21], and the second using the immigration–birth branching representation. Note that for the case $d=1$ , Assumption A1 reduces to $\|H\|_{1}\in(0,1)$ .
3.1. The first representation
Let N be the Hawkes process in (2.1) with $d=1$ and self-exciting function $H\;:\;{\mathbb R}_{+}\to{\mathbb R}_{+}$ . Denoting by $\{\xi_{j},j\ge1\}$ the splitting variables, whenever $\xi_{j}=m$ , the jth individual occurring at $\tau_j$ is assigned to the mth split process. Under this standard splitting, the process N splits into n sub-counting processes, denoted by $N^{(m)}$ :
We assume that $\{\xi_{j},j\ge1\}$ is a sequence of independent and identically distributed (i.i.d.) variables, independent of N, with
By the independence between N and $\{\xi_j\}_{j}$ , it is easy to see that
(Here $\varphi$ is an ${\mathbb R}_+$ -valued function.)
We consider the scaled process indexed by T, and all the variables are marked with an additional subscript T; that is, $N_{T}$ is a Hawkes process with intensity process $\lambda_{T}(\cdot)$ whose baseline intensity in (2.1) is $\lambda_{0,T}$ , and the kernel function H stays the same. The splitting variables are denoted by $\{\xi_{j,T},j\ge1\}$ with distribution $\{p^{(m)}_{T}\}_{m}$ .
Assumption A2. Assume that, for some $\lambda_{0},p^{(m)}>0$ with $\sum_{m=1}^{n}p^{(m)}=1$ ,
For every $m=1,2,\cdots,d$ , define
and the diffusion-scaled processes
where $\lfloor t \rfloor$ represents the largest integer no larger than $t\in{\mathbb R}_+$ . Then we have the first representation
Note that in the representation (3.5), the process $\hat{N}^{(m)}_{T}$ consists of two components: $\hat{S}^{(m)}_{T}$ represents the oscillation from the splitting scheme, and $p^{(m)}_{T}\hat{N}_{T}$ is the oscillation inherited from the original counting process and proportional to the splitting probability. Note that the processes $ \hat{S}^{(m)}_{T}$ and $ \hat{N}_{T}$ are independent, and in the limit we see that the two components $ \hat{S}^{(m)}_{T}\circ \bar{N}_{T}$ and $ p^{(m)}_{T}\hat{N}_{T}$ converge to two independent processes as $T\to \infty$ .
By applying [Reference Whitt21, Theorem 9.5.1], under Assumptions A1 and A2, provided with the limit for $\bar{N}_{T}$ in (2.6) and the limit for $\hat{N}_{T}$ in (2.8), we obtain the following.
Proposition 3.1. Let $(\hat{N}^{(m)}_{T})_{m}$ be the diffusion-scaled process in (3.4). Assume that $\|H\|_{1}\in(0,1)$ and Assumption A2 hold. We have
with
where $(\hat{S}^{(m)})_m$ is an n-dimensional Brownian motion with covariance function
and W is a standard Brownian motion, independent of $(\hat{S}^{(m)})_m$ .
Therefore, the limit $\big(\hat{N}^{(m)}\big)_{m}$ is an n-dimensional Brownian motion with covariance
3.2. The second representation
Recall the immigration–birth branching representation with chronological levels for the Hawkes process N described in (2.1), which generalizes the one proposed by [Reference Hawkes and Oakes12]. Basically, the points from N are categorized virtually into those from an exogenous arrival process, which are called migrants and/or the first generation, and those generated from existing points, which are called children, and whose chronological levels are obviously defined. Thus we rewrite
where $N_{1}(t)=\sup\{j\geq1, \tau_{1j}\leq t\}$ is a Poisson process with parameter $\lambda_{0}$ representing the arrival rate of the immigrants, and $\tau_{1j}$ denotes the arrival time of the jth immigrant. For $l\geq1$ , $N_{l+1}(t)=\sup\{j\geq1, \tau_{(l+1)j}\leq t\}$ , representing the individuals of $(l+1)$ th generation, is an inhomogeneous Poisson process with intensity, by conditioning on $\mathscr{F}_{l}(t)$ , as follows:
where $\mathscr{G}_{l}(t)=\sigma\{N_{l}(s), s\leq t\}$ and $\mathscr{F}_{l}(t)=\bigvee_{1\leq l'\leq l}\mathscr{G}_{l'}(t)$ , and where $\tau_{lj}$ denotes the birth time of the jth child of the lth generation, which is produced by some individual in the $(l-1)$ th generation. Here $\mathscr{G}_{l}(t)$ represents the information produced by the lth generation, $\mathscr{F}_{l}(t)$ represents the information up to the lth generation, and $\mathscr{F}_{\infty}(t)=\bigvee_{l\ge1}\mathscr{F}_{l}(t)$ collects the information of all the generations up to time t, which includes not only the occurrence times of events but also the virtually defined generation information and hence is strictly larger than the information generated by N itself. Under the non-explosion assumption (2.2), $\lambda_{l}$ and $N_{l}$ are finite-valued and can be constructed pathwise. By conditioning and the additive property of the intensity for the independent counting processes, N in (3.8) is a simple counting process with conditional intensity in (2.1).
To describe the split processes with the branching generations, we further let $\{\xi_{lj}, j\ge1\}$ be the i.i.d. splitting variables for the individuals from $N_{l}$ , which have the same distributional properties as $\{\xi_{j},j\geq1\}$ in (3.1). Recall $p^{(m)}$ in (3.2). Let
Then $N^{(m)}=\sum_{l\ge1} N^{(m)}_{l}$ .
Proposition 3.2. $\big(N^{(m)}\big)_{m}$ is an n-dimensional Hawkes process, where the baseline intensity vector is $\big(p^{(m)}\lambda_{0}\big)_{m}$ and the mutual exciting matrix at the (m, m′) entry is $p^{(m)}H$ ; that is, its intensity is given by
for every $m=1,2,\cdots,n$ , where $\lambda$ is the intensity for N in (2.1).
Remark 3.1. We remark that although the split process $\big(N^{(m)}\big)_{m}$ is a multivariate Hawkes process, the cross-exciting function $H_{mm'}$ in the definition (2.1) takes a special form, $p^{(m)}H$ , independent of m ′ in (3.11).
Proof. We show that $\big(N^{(m)}\big)_{m}$ is a counting process with conditional jumping intensity $\big(\lambda^{(m)}\big)_{m}$ by evaluating its conditional jumping intensity.
For every l, notice that given $\{\mathscr{F}_{l}(t)\}_{t\ge0}$ , $\big(N^{(m)}_{l+1}\big)_{m}$ is a Poisson process with intensity $\lambda^{(m)}_{l}=p^{(m)}\lambda_{l}$ and independent among m. Therefore, conditioning on $\mathscr{F}_{\infty}(t)$ , the jumping intensity of $N^{(m)}$ at t is
which is a process adapted to the natural filtration of $(N^{(m)})_{m}$ . This proves Proposition 3.2.
Recall $\hat{N}^{(m)}_T$ defined in (3.4). By applying [Reference Bacry, Delattre, Hoffmann and Muzy1, Theorem 2] to Proposition 3.2, we obtain the following result.
Proposition 3.3. Suppose that $\|H\|_{1}\in(0,1)$ and Assumption A2 hold. We have
where $\big(\hat{N}^{(m)}\big)_{m}$ is a standard n-dimensional Brownian motion with covariance matrix
where $\displaystyle\tilde{\Xi}=\big(p^{(m)}\|H\|_{1}\big)_{mm'}$ and $\displaystyle \tilde{\Sigma}=\textrm{diag}(p)\frac{\lambda_{0}}{1-\|H\|_{1}}$ .
Proof. To check the equivalence it suffices to show that the covariance functions in (3.7) coincide with the matrix in (3.12).
By definition we have
where $\text{ones}(n)$ denotes the n-dimensional square matrix with 1 for all its entries. Under Assumption A1, the spectral radius of $\tilde{\Xi}$ is $\|H\|_{1}\in(0,1)$ . Therefore,
where we use the fact $\sum_{m=1}^{n}p^{(m)}=1$ in the identity. Thus,
We then have
which coincides with the matrix in (3.7).
3.2.1. Another perspective via decomposed processes.
We have from [Reference Bacry, Delattre, Hoffmann and Muzy1, Lemma 4] that
where $X^{(m)}(t)=N^{(m)}(t)-\int_{0}^{t}\lambda^{(m)}(s)ds$ and
In addition to the process $N^{(m)}_{l}$ in the proof of Proposition 3.2, we further define
Then we also have $X^{(m)}=\sum_{l\ge1}X^{(m)}_{l}$ . By the conditional independence of the $X^{(m)}_{l}$ for $m=1,\dots,n$ and zero covariance for different l and l ′ by definition, we obtain immediately the following covariance function of $X^{(m)}$ : for $m,m'=1,\dots,n$ and $t, s\ge 0$ ,
where $\delta_{mm'}=1$ if $m=m'$ and $\delta_{mm'}=0$ if $m\neq m'$ . Notice that $\big(X^{(m)}\big)_{m}$ is a martingale with respect to the natural filtration $\big\{\sigma\big\{\big(N^{(m)}(s)\big)_{m}, s\leq t\big\}, t\ge0 \big\}$ , and its subprocesses $X^{(m)}_{l}$ do not jump at the same time.
Again we use the same scaling for the processes and quantities indexed by T. With the representations in the proposition, we can define $\big(N^{(m)}_{T}\big)_{m}$ and write
where for every $m=1,2,\cdots,n$ and $l\ge1$
with
Then we have from the immigration–birth representation that
It is clear that given $\|H\|_{1}\in(0,1)$ and Assumption A2, by (2.5), we obtain that this covariance converges as $T\to \infty$ to
We observe that the two components in the expression for $\hat{N}^{(m)}_{T}$ in (3.13) are intrinsically correlated, so that it appears to be more complicated than the first expression in (3.5); however, the martingale convergence method as in [Reference Bacry, Delattre, Hoffmann and Muzy1] can be applied and results in the following proposition (proof details are omitted for brevity).
Proposition 3.5. Under the conditions of $\|H\|_{1}\in(0,1)$ and Assumption A2, $\big(\hat{X}^{(m)}_{T}\big)_{m}\Rightarrow \big(\hat{X}^{(m)}\big)_{m}$ in $({\mathbb D}^{n}, J_1)$ as $T\to\infty$ , where $\big(\hat{X}^{(m)}\big)_{m}$ is an n-dimensional Brownian motion with covariance function
Thus, $\hat{Y}_{T}\Rightarrow \hat{Y}$ in $({\mathbb D}, J_1)$ as $T\to\infty$ , where
As a consequence, the process $\big(\hat{N}^{(m)}\big)_{m}$ can also be represented as
where $\big(W^{(m)}\big)_{m}$ is a standard n-dimensional Brownian motion, and $W=\sum_{m=1}^{n}\sqrt{p^{(m)}}W^{(m)}$ .
Remark 3.2. We observe that we can rewrite the expression for the limit $\hat{N}^{(m)}$ in (3.17) as
where $\hat{S}^{(m)}\;:\!=\;\sqrt{p^{(m)}}W^{(m)}- p^{(m)}W$ . It can be checked by direct calculations that the two components $(\hat{S}^{(m)})_{m}$ and W in the second expression in (3.18) are independent. Moreover, for every m and m ′,
which is exactly the identity in (3.6). This also proves the equivalence of the limit in (3.17) and that in Proposition 3.1, and thus, equivalence with that in Proposition 3.3.
4. Aggregating Hawkes processes
Splitting a Hawkes process results a multivariate Hawkes process with special exciting function. However, the aggregation of a multivariate Hawkes process is not necessarily a Hawkes process. In the following, we identify a necessary and sufficient condition for the aggregated multivariate Hawkes process to be a one-dimensional Hawkes process.
Let $N=(N_{k})_{k}$ be a multivariate Hawkes process with conditional intensity in (2.1). The associated aggregating process of N is denoted by
Then, similarly to the proof of Proposition 3.2, by conditioning on the natural filtration of $(N_{k})_{k}$ , the conditional jumping intensity is given by
which is a process adapted to the natural filtration of $\big(N_{k}\big)_{k}$ . Note that the filtration generated by the vector process N is strictly larger than that of A. Thus, to make $\lambda_{A}$ above adapted to the natural filtration of A, an immediate observation is the following property.
Proposition 4.1. $A=\{A(t), t\ge0\}$ is a one-dimensional Hawkes process if and only if $\tilde{H}\;:\!=\;\sum_{k=1}^{d}H_{kk^{\prime}}$ is a function independent of k′, under which the conditional intensity for A is
Remark 4.1. As a very special case, suppose that the $N_{k}$ are independent one-dimensional Hawkes processes with conditional intensity $\lambda_k(t) = \lambda_{k0} + \int_0^t H_k(t-s) N_k(ds)$ . Then
Thus, the aggregating process A is a Hawkes process with conditional intensity $\lambda_A(t) = e^{\textrm{T}}\cdot\lambda_{0}+\int_{0}^{t}H(t-s)A(ds)$ only if $H_k\equiv H$ .
Next we give another example to illustrate the property. Let N be a two-dimensional Hawkes process with positive baseline intensities $\lambda_{10},\lambda_{20}>0$ and with positive mutually exciting functions $(H_{ij})_{ij}$ . Then the associated aggregating process A is a Hawkes process if and only if $H_{11}+H_{21}=H_{12}+H_{22}$ .
Let $\hat{A}_{T}$ be the scaled aggregating process of the Hawkes process N,
Proposition 4.2. Under Assumption A1, we have
where W is a standard Brownian motion and $\Sigma=\textrm{diag}\big((I-\|H\|_{1})^{-1}\lambda_{0}\big)$ is the diagonal matrix in (2.9).
If, in addition, $\tilde{H}\;:\!=\;\sum_{k=1}^{d}H_{kk^{\prime}}$ is a function independent of k ′, then $\|\tilde{H}\|_{1}\in(0,1)$ and $\hat{A}$ is a Brownian motion with diffusion coefficient $\sum_{k=1}^{d}\lambda_{k0}\big(1-\|\tilde{H}\|_{1}\big)^{-3}$ .
Proof. The limit result for $\hat{A}_{T}$ in (4.2) is a direct consequence of (2.7) for $\hat{N}_{T}$ and the continuous mapping theorem, which is a one-dimensional Brownian motion with diffusion coefficient
For the special case that $\sum_{k}H_{kk^{\prime}}$ is independent of k ′, since it is a Hawkes process, we can directly apply [Reference Bacry, Delattre, Hoffmann and Muzy1, Theorem 2] to the one-dimensional Hawkes process A with the second expression for $\lambda_A(t)$ in Proposition 4.1 and obtain the diffusion coefficient as stated. We next verify that the expression can be also obtained from that in (4.2). We have $e^{\textrm{T}}\cdot H(t)=e^{\textrm{T}} \tilde{H}(t)$ and
which shows that $\|\tilde{H}\|_{1}$ and $e^{\textrm{T}}$ are the left eigenvalue and the associated eigenvector of the kernel matrix $\|H\|_{1}$ . Therefore, $\|\tilde{H}\|_{1}\in(0,1)$ by Assumption A1,
and the diffusion coefficient for $\hat{A}$ is
Here we make use of the fact that $\textrm{diag}(u)\cdot e=u$ for a vector u.
5. Splitting and aggregating multivariate Hawkes processes
Now we consider randomly splitting and aggregating a multivariate Hawkes process $N=\big(N_{k}\big)_{k}$ in (2.1). Let $\{\xi_{kj}\}_{k,j}$ be the splitting variables: whenever $\xi_{kj}=m$ , the jth individual of the kth component counting process $N_k$ (occurring at $\tau_{kj}$ ) is assigned to the mth sub-counting process. Then the $\mathbb{N}^{d}$ -valued Hawkes process $\big(N_{k}\big)_{k}$ splits into an $\mathbb{N}^{d\times n}$ -valued process $\big(N^{(m)}_{k}\big)_{k,m}$ and
where $\{\xi_{kj}\}_{j}$ are i.i.d. variables, independent of N, with distribution
We consider the following aggregated process:
By the procedure of splitting and then aggregating, we have transformed a d-dimensional Hawkes process into an n-dimensional counting process.
Before we proceed to study the properties of the split and aggregated processes, we discuss some potential applications. This often occurs when demands require re-categorization in order for them to be processed. In an insurance company, demands may arrive as a bivariate Hawkes process comprising home insurance and automobile insurance, each of which may be split into claims and new personal/commercial services, which will be processed by the corresponding service departments. In a remanufacturing facility, different products may arrive as a multivariate Hawkes process, then be split and aggregated, given the different component reprocessing needs, in order to be reprocessed at the corresponding machines. Similarly, in a data center, jobs may arrive as a multivariate Hawkes process, which must be regrouped in order to be processed at separate parallel servers because of computational requirements or constraints.
Similarly to Propositions 3.2 and 4.1, the Hawkes property is preserved for the splitting process indexed by (k, m), and the aggregated process $A^{(m)}$ is a Hawkes process under certain conditions.
Proposition 5.1. The following properties hold:
-
(i) $\big(N^{(m)}_{k}\big)_{k,m}$ is an $\mathbb{N}^{d\times n}$ -valued Hawkes process with conditional intensity
\begin{equation*}\lambda^{(m)}_{k}(t)=p^{(m)}_{k}\lambda_{k}(t)=p^{(m)}_{k}\lambda_{k0}+\sum_{k^{\prime},m'}\int_{0}^{t}\Big(p^{(m)}_{k}H_{kk^{\prime}}(t-s)\Big)N_{k^{\prime}}^{(m')}(ds),\end{equation*}where $(\lambda_{k})_{k}$ is the intensity for N in (2.1). -
(ii) $\big(A^{(m)}\big)_{m}$ is an n-dimensional Hawkes process if and only if $\tilde{H}^{(m)}\;:\!=\;\sum_{k=1}^{d}p^{(m)}_{k}H_{kk^{\prime}}$ is a function independent of $k^{\prime}\in\mathcal{L}^{(m)}\;:\!=\;\{k, p_{k}^{(m)}>0\}$ , under which the conditional intensity of $\big(A^{(m)}\big)_{m}$ is given by
\begin{equation*}\lambda^{(m)}_{A}(t)=\Big(\sum_{k=1}^{d}p^{(m)}_{k}\lambda_{k0}\Big)+\sum_{m'=1}^{n}\int_{0}^{t}\tilde{H}^{(m)}(t-s)A^{(m')}(ds).\end{equation*}
Remark 5.1. For a d-dimensional Hawkes process, it is always assumed to be irreducible, that is, $\mathbb{E}\big(N_{k}(\infty)\big)>0$ for every k. The independence of $\tilde{H}^{(m)}$ in Proposition 4.1 is required for every k ′. However, in the splitting and aggregating case, $\mathbb{P}\big(N^{(m)}_{k}(\infty)=0\big)=1$ for those $p_{k}^{(m)}=0$ (that is, $k\notin\mathcal{L}^{(m)}$ ); the necessity and sufficiency of the condition can be checked by evaluating the intensity for the second occurrence time of $A^{(m)}$ in Proposition 5.1(i), which should remain unchanged for different values of k ′.
For example, let $d=n$ and let $\{m_{k}\}_{k}$ be a permutation of $\{1,2,\cdots,d\}$ with $p^{(m_{k})}_{k}=1$ . Then we have from this assumption that $N^{(m)}_{k}(\infty)=0$ for $m\neq m_{k}$ and $A^{(m_{k})}=N_{k}$ for every k. This means $(A^{(m)})_{m}$ defines a rotated Hawkes process which switches the positions of individuals. It is of course Hawkes; however,
depends on k ′ for every $k_{0}$ .
We consider the scaled process indexed by T, and all the variables are marked with additional subscripts T; that is, $N_{T}=\big(N_{k,T}\big)_{k}$ is a Hawkes process whose baseline intensity is $\lambda_{0,T}=(\lambda_{k0,T})_{k}\in{\mathbb R}_{+}^{d}$ , and the kernel matrix function $H\in{\mathbb R}_{+}^{d\times d}$ stays the same. The splitting variables $\{(\xi_{kj,T})_{k}\}_{j}$ have subscripts T and distribution matrix $(p^{(m)}_{k,T})_{k,m}\in{\mathbb R}_{+}^{d\times n}$ , which results in the splitting process $\big(N^{(m)}_{k,T}\big)_{k,m}$ . The average process and the diffusion-scaled process are defined by
Assumption A3. Assume that for some $\lambda_{k0},p^{(m)}_{k}\ge0$ with $\sum_{m=1}^{n}p^{(m)}_{k}=1$ for every k,
We are interested in the FCLT of the aggregating process $(A^{(m)}_{T})_{m}$ , defined by
Theorem 5.1. Suppose Assumptions A1 and A3 hold. We have
where $(\hat{A}^{m})_{m}$ is an n-dimensional Brownian motion with covariance function
where $\Sigma=\textrm{diag}\big((I-\|H\|_{1})^{-1}\lambda_{0}\big)$ is the diagonal matrix in (2.8).
Proof. Applying [Reference Bacry, Delattre, Hoffmann and Muzy1, Lemma 4] to $\big(N^{(m)}_{k,T}\big)_{k,m}$ we have the representation
where
and where $\varphi$ is the matrix function defined in (2.4). Moreover, similarly to (3.15) and (3.16),
Therefore, we have
where, for some standard Brownian motion $(W^{(m)}_{k})_{k,m}$ ,
A direct calculation shows that
Summing over k and k ′ gives the covariance function for $\big(\hat{A}^{(m)}\big)_{m}$ .
Remark 5.2. From the proof, we have a second representation for $\big(\hat{N}^{(m)}_{k}\big)_{k,m}$ :
where $\big(W^{(m)}_{k}\big)_{k,m}$ is the standard Brownian motion in the proof, and
Then $\big(W_{k}\big)$ is an n-dimensional standard Brownian motion; $\big\{(\hat{S}^{(m)}_{k})_{m}, \,k=1,2,\cdots,d\big\}$ is a sequence of Brownian motions independent of $(W_{k})_{k}$ and over k; and
Under the conditions of Proposition 5.1(ii), $(A^{(m)}_{T})_{m}$ becomes a multivariate Hawkes process. Thus, [Reference Bacry, Delattre, Hoffmann and Muzy1, Theorem 2] can be applied directly and we have a second representation for its covariances.
Proposition 5.2. Suppose $\tilde{H}^{(m)}_{T}=\sum_{k=1}^{d}p^{(m)}_{k,T}H_{kk^{\prime}}$ is independent of k′, and Assumptions A1 and A3 hold. Then the covariance function for $\hat{A}=\big(\hat{A}^{(m)}\big)_{m}$ in Theorem 5.1 can also be represented as
where $\tilde{\Xi}=p^{\textrm{T}}\cdot\textrm{col}_{1}\big(\|H\|_{1}\big)\cdot e_{n}^{\textrm{T}}$ , and $e_{n}$ is the n-dimensional column vector of ones.
Proof. By assumption, $(A^{(m)}_{T})_{m}$ is a Hawkes process with conditional intensity
The representation for the covariance function in (5.3) follows from (2.7) by [Reference Bacry, Delattre, Hoffmann and Muzy1, Theorem 2]. In the following, we show that (5.3) coincides with the covariance in Theorem 5.1 in this special case.
Defining $\displaystyle \nu=p^{\textrm{T}}\cdot \textrm{col}_{1}\big(\|H\|_{1}\big)$ and $c_{0}=\nu^{\textrm{T}}\cdot e_{n}$ , we have
by the fact that $p\cdot e_{n}=e_{d}$ . Thus, $c_{0}$ and $e_{d}^{\textrm{T}}$ are the right eigenvalue and the associated eigenvector of $\|H\|_{1}$ , and $c_{0}\in(0,1)$ by Assumption A1. We further have, for $\tilde{\Xi}$ ,
and
Therefore, we have
where we make use of the fact that $\textrm{diag}(u)\cdot e=u$ for the vector u in the calculation.
On the other hand, for the covariance function in Theorem 5.1 in this special case, we have
where the fact from (5.4) is used. Thus, by the definition $\Sigma\cdot e_{d}=(I-\|H\|_{1})^{-1}\cdot\lambda_{0}$ , we have
where the fact that $e_{d}^{\textrm{T}}\cdot\|H\|_{1}=c_{0}\cdot e_{d}^{\textrm{T}}$ is used.
Therefore, by applying the identities in (5.6) and (5.7), we obtain
Comparing with the expression in (5.5), we conclude the equivalence.
Acknowledgements
We wish to thank the anonymous referees for their very helpful comments.
Funding information
Bo Li is supported by the National Natural Science Foundation of China (#11871289). Guodong Pang is partly supported by the National Science Foundation of the USA (DMS-2216765).
Competing interests
There were no competing interests to declare which arose during the preparation or publication process of this article.