1. Introduction
Insurance companies typically write policies in multiple lines of business and each line of business may cause claims of different types. For example, in auto insurance, as illustrated in Frees and Valdez (Reference Frees and Valdez2008), an accident can lead to any combination of the claims of the following three types: (1) injury to the third party, (2) property damage (PD) to the third party, and (3) injury/PDs to the policyholder. In insurance pricing, actuaries need to ensure that the aggregate premium level for a portfolio of business is fair and adequate and that the premium for each type of risk coverage reflects its contribution to the total risk of the portfolio. Therefore, it is important to accurately evaluate the risk of the aggregate losses and allocate the total capital requirement to different types of risks in the portfolio of business. To this end, effective modeling of the joint distribution of losses from different types of risks is essential.
Multivariate aggregate loss models with different types of dependence structures have been discussed extensively in the literature. Some of them, for example, Hesselager (Reference Hesselager1996), Vernic (Reference Vernic1999), Walhin and Paris (Reference Walhin and Paris2000), Cossette et al. (Reference Cossette, Mailhot and Marceau2012), and Kim et al. (Reference Kim, Jang and Pyun2019), assumed that claim frequencies are dependent but claim sizes are mutually independent and independent of the claim frequencies. Others, such as Sundt (Reference Sundt1999) and Sundt and Vernic (Reference Sundt and Vernic2004), assumed that the claim number is univariate, but each claim can generate several types of losses whose sizes are dependent. Models that allow dependence between claim frequencies and claim sizes have also been developed recently. For example, Gschlößl and Czado (Reference Gschlößl and Czado2007), Frees et al. (Reference Frees, Gao and Rosenberg2011), and Garrido et al. (Reference Garrido, Genest and Schulz2016) took a regression approach where the claim frequency is treated as an explanatory variable in the regression model for the claim sizes; Boudreault et al. (Reference Boudreault, Cossette, Landriault and Marceau2006), Cossette et al. (Reference Cossette, Marceau and Marri2008), and Marri and Furman (Reference Marri and Furman2012) assumed that the inter-claim times and claim sizes are dependent; Czado et al. (Reference Czado, Kastenmeier, Brechmann and Min2012), Frees et al. (Reference Frees, Lee and Yang2016), Cossette et al. (Reference Cossette, Marceau and Mtalai2019), and Oh et al. (Reference Oh, Shi and Ahn2020) employed bivariate copulas to model the dependency relationship between the number of claims and the average claim amount; Shi and Zhao (Reference Shi and Zhao2020) used a copula to model the relation between the frequency and the individual severity directly; Yang and Shi (Reference Yang and Shi2019) proposed a multivariate framework for pricing property insurance contracts with multiperil coverage in the longitudinal context by using copulas to capture the dependence within and between perils.
In this paper, following Cummins and Wiltbank (Reference Cummins and Wiltbank1983) and Frees and Valdez (Reference Frees and Valdez2008), we consider a hierarchical risk model where an accident can cause a combination of different types of claims, whose sizes could be dependent. In addition, the frequencies of accidents that cause the different combinations of claims are dependent. As pointed out in Cummins and Wiltbank (Reference Cummins and Wiltbank1983), this structure of multivariate compound distribution modeling of risk explicitly considers the intrinsic dependencies among the different components of the generating process. It is advantageous to the traditional approach of pooling the data from the entire portfolio of risks to obtain collective estimates of the frequency and severity parameters. Frees et al. (Reference Frees, Shi and Valdez2009) provided statistical tools to apply this hierarchical model to analyze the risk profile of either a single policy or a portfolio of risks. It was argued that the model allows actuaries to “unbundle” insurance contracts and price more primitive elements of insurance coverages.
It is usually challenging to compute the risk measures of compound distributions explicitly. However, some results exist in the actuarial literature. For example, Cossette et al. (Reference Cossette, Mailhot and Marceau2012) used a top-down approach to derive closed-form expressions for Tail Conditional Expectation (TCE) based capital allocation for multivariate compound distributions; Kim et al. (Reference Kim, Jang and Pyun2019) derived recursive algorithms to compute TCE for the sum of dependent compound mixed Poisson variables and to perform the associated capital allocation computation.
In this paper, we first derive formulas for evaluating TCE and Tail Variance (TV) of the aggregate loss amount in the hierarchical multivariate risk model. Then, we provide explicit expressions and computation methods for allocating the required capital to the different types of risks. Note that the multivariate loss model and the corresponding capital allocation studied here pertain to the portfolio of businesses level, not to the individual policyholder level. For the latter, data on the characteristics of individual policyholders are needed, as illustrated in Frees et al. (Reference Frees, Shi and Valdez2009).
In terms of methodology, we apply the method introduced by Furman and Landsman (Reference Furman and Landsman2005), Furman and Landsman (Reference Furman and Landsman2006), and Furman and Zitikis (Reference Furman and Zitikis2008), which showed that tail moments risk measures can be analyzed through the moment (size-biased) transform of distributions. The theory of moment transformation has a long history and is widely used in statistics. For details, one is referred to, for example, Patil and Ord (Reference Patil and Ord1976), Arratia and Goldstein (Reference Arratia and Goldstein2010), and references therein. Based on this method, Denuit (Reference Denuit2020) presented explicit expressions for TCE of some univariate compound distributions; Denuit and Robert (Reference Denuit and Robert2022) illustrated how to apply moment transform to analyze TCE of multivariate random variables. Ren (Reference Ren2022) derived formulas for TCE and TV of multivariate compound models based on Sundt (Reference Sundt1999), where claim frequency is one-dimensional, and one claim can yield multiple dependent losses; Jiang and Ren (Reference Jiang and Ren2022) provided methods for computing the TCE and TV of the multivariate aggregate losses, where the claim frequencies are dependent but the claim sizes are mutually independent and independent of the claim frequencies.
The remainder of the paper is organized as follows. Section 2 provides some preliminary results and definitions needed. Section 3 presents results for computing the hierarchical risk model’s TCE and TV and performing the corresponding capital allocations. Section 4 provides numerical examples with details of the computations and then extends the model by considering a case when the distribution of the claim counts and the claim size are dependent. Section 5 concludes.
2. Models and definitions
We first introduce the hierarchical multivariate compound aggregate loss model studied in this paper. Assume that an insurance policy covers $K$ categories of risks, denoted by $\mathcal{K}=\{1,\cdots, K\}$ . An accident can cause different types of claims in combinations, $\mathbf{h}=(h_{1},\ldots,h_{K})^\top$ , where the $k$ th coordinate $h_{k}$ equals to 1 if type $k$ claim occurs and 0 otherwise.
Let $\mathcal{M}=\{1,\cdots, M\}$ , where $M=2^K -1$ be the set of indexes of possible combinations that include at least one claim. For $m\in \mathcal{M}$ , let $N_m$ denote the number of accidents resulting in the $m$ th combination of claims for a portfolio of policies during a time period. The random variables $N_m$ , $m\in \mathcal{M}$ , can be dependent. Let $\mathbf{N}=(N_1, \cdots, N_M)^\top$ and denote its joint probability function by
where $\mathbf{n}=(n_1,\cdots,n_M)^\top \in \mathbb{N}^M$ .
For a given claim combination $m\in \mathcal{M}$ , let $\mathbf{X}_{(m)}=\left (X_{(m),1},\ldots,X_{(m),K}\right )^\top$ denote the random vector of claim sizes, where $X_{(m),k}$ for $k=1,\ldots, K$ represents the claim size of type $k$ risk in this combination. $X_{(m),k}=0$ if the $m$ th combination does not include a type $k$ claim. Since the claim size vector $\mathbf{X}_{(m)}$ is generated by one accident, its elements are stochastically dependent. However, since the loss size vectors $\mathbf{X}_{(1)},\ldots,\mathbf{X}_{(M)}$ result from different accidents, we assume that they are mutually independent and are independent of $\mathbf{N}$ . A slight extension of the model where $\mathbf{N}$ and $(\mathbf{X}_{(1)},\ldots,\mathbf{X}_{(M)})$ are dependent is considered in Section 4.2 of the paper.
For the portfolio of policies, let the aggregate amount of the $K$ types of claims resulting from the $m$ th combination be represented by the vector
where $\mathbf{X}_{(m)i}$ , $i\geq 1$ are independent copies of $\mathbf{X}_{(m)}$ .
Let
be the $K\times M$ dimensional compound loss matrix. Equivalently,
where the element $S_{N_m,k}$ represents the aggregate amount of the type $k$ claims that result from the claim combination $m$ . Then, the total amount of claims of all types for the portfolio of business is given by
Example 2.1. Suppose that an auto insurance policy covers two types of risks: PD and bodily injury (BI). An accident can cause a PD claim only, a BI claim only, or a claim that combines both PD and BI. Then we may denote the type of risk by $\mathcal{K}= \{1,2\}$ ; and the possible combinations of claims caused by an accident can be represented by three two-dimensional vectors $\mathbf{h}_1=(1,0)^\top$ , $\mathbf{h}_2=(0,1)^\top$ , and $\mathbf{h}_3=(1,1)^\top$ . Then we have $\mathcal{M}= \{1,2,3\}$ , and the numbers of claims of the three combinations of risk types, i.e., PD only, BI only, and both PD and BI, incurred in a time period is given by $\mathbf{N} = (N_1, N_2, N_3)^\top$ . The claim sizes are given by $\mathbf{X}_{(1)}=(X_{(1),1},0)^\top$ , $\mathbf{X}_{(2)}=(0,X_{(2),2})^\top$ and $\mathbf{X}_{(3)}=(X_{(3),1},X_{(3),2})^\top$ , respectively.
For this case, the multivariate aggregate loss matrix $\mathbf{S}_{\mathbf{N}}$ is given by
For $q\in (0, 1)$ , let $s_q$ denote the Value-at-Risk (VaR) of $S_{\bullet }$ at the $100q\%$ confidence level.
The TCE at level $q$ of $S_{\bullet }$ is defined by
According to the TCE-based capital allocation rule (Dhaene et al., Reference Dhaene, Henrard, Landsman, Vandendorpe and Vanduffel2008), the total capital requirement in a portfolio of business is $\text{TCE}_{S_{\bullet }}({q})$ for some $q$ and the part allocated to the type $k$ risk in the portfolio is
The capital required for the $m$ th combination of risk type is given by
Note that
Likewise, if the total required capital is determined by the TV of $S_{\bullet }$ at probability level $q$ , which is defined by (Furman and Landsman, Reference Furman and Landsman2006)
then, according to the TV-based capital allocation rule, the capital allocated to the type $k$ risk is given by
and the capital required for the $m$ th combination of risk type is given by
Notably,
For more details about the TCE- and TV-based capital allocation, one can refer to, for example, Cummins (Reference Cummins2000), Dhaene et al. (Reference Dhaene, Henrard, Landsman, Vandendorpe and Vanduffel2008), Furman and Zitikis (Reference Furman and Zitikis2008), and references therein. We note that other frameworks of risk measures exist in the literature. As an example, Furman et al. (Reference Furman, Wang and Zitikis2017) proposed Gini-type risk measures and developed the corresponding capital allocation rules. Notably, this framework requires only finiteness of the first moment of the underlying random variable.
In the next section, we derive formulas for computing the TCE and TV risk measures of the proposed multivariate compound loss model and performing the associated capital allocation. The main tool we use is the concept of moment (size-biased) transforms, which is widely used in statistics. For detailed studies of the moment transforms, one is referred to, for example, Patil and Ord (Reference Patil and Ord1976), Arratia and Goldstein (Reference Arratia and Goldstein2010), Furman and Landsman (Reference Furman and Landsman2005), Denuit (Reference Denuit2020), Denuit and Robert (Reference Denuit and Robert2022), Mohammed et al. (Reference Mohammed, Furman and Su2021), and Furman et al. (Reference Furman, Kye and Su2021).
For completeness of this paper, we provide definitions for the size-biased transform of univariate and multivariate random variables in the following.
Definition 2.1. Consider a non-negative random variable $X$ with the distribution function $F_X$ and moments $\mathbb{E}[X^{\alpha }]\lt \infty$ for some positive integer $\alpha$ . A random variable $\widetilde{X^{\alpha }}$ is said to be a copy of the $\alpha$ th moment transform of $X$ if its cumulative distribution function (c.d.f.) is given by
The first-moment transform of $X$ is commonly referred to as the size-biased transform. It is simply denoted by $\widetilde{X}$ .
Definition 2.2. Consider a random vector $\mathbf{X}=(X_1,\ldots,X_K)^\top$ with the c.d.f. $F_{\mathbf{X}}$ and moments $\mathbb{E}[X^{\alpha }_k]\lt \infty$ and $\mathbb{E}[X^{\alpha _1}_{k_1}X^{\alpha _2}_{k_2}]\lt \infty$ for some $k, k_1, k_2 \in \{1,\ldots,K\}$ and non-negative integers, $\alpha$ , $\alpha _1$ , and $\alpha _2$ .
The $k$ th component $\alpha$ th moment transform of $\mathbf{X}$ is any random vector $\widetilde{\mathbf{X}^{\alpha }}^{[k]}$ with the c.d.f.
where $\mathbf{x}=(x_1,\ldots, x_K)^\top$ .
The $(k_1,k_2)$ th component, $(\alpha _1,\alpha _2)$ th moment transform of $\mathbf{X}$ is any random vector $\widetilde{\mathbf{X}^{\alpha _1,\alpha _2}}^{[k_1,k_2]}$ with the c.d.f.
The $k$ th component first-moment transform of $\mathbf{X}$ is denoted as $\widetilde{\mathbf{X}}^{[k]}$ , and the $(k_1,k_2)$ th component $(1,1)$ th moment transform of $\mathbf{X}$ is denoted as $\widetilde{\mathbf{X}}^{[k_1,k_2]}$ .
For discrete distributions, following Patil and Ord (Reference Patil and Ord1976), we consider the factorial moment transform. For a positive integer $I$ , define
Then we have
Definition 2.3. Consider a non-negative discrete random variable $N$ with the probability mass function (p.m.f.) $p_N$ . A random variable $\widetilde{N^{(\alpha )}}$ is said to be a copy of the $\alpha$ th factorial moment transform of $N$ if its p.m.f. is given by
The first factorial moment transform of $N$ is denoted by $\widetilde{N}$ .
Definition 2.4. Consider a vector of discrete random variables $\mathbf{N}=(N_1,\ldots,N_K)^\top$ having the p.m.f. $p_{\mathbf{N}}(\mathbf{n})$ .
The $k$ th component, $\alpha$ th moment transform of $\mathbf{N}$ is any random vector $\widetilde{\mathbf{N}^{(\alpha )}}^{[k]}$ with p.m.f.
The $(k_1,k_2)$ th component, $(\alpha _1,\alpha _2)$ th moment transform of $\mathbf{N}$ is any random vector $\widetilde{\mathbf{N}^{(\alpha _1),(\alpha _2)}}^{[k_1,k_2]}$ with the p.m.f.
The $k$ th component, first-moment transform of $\mathbf{N}$ is denoted as $\widetilde{\mathbf{N}}^{[k]}$ , and the $(k_1,k_2)$ th component $(1,1)$ th moment transform of $\mathbf{N}$ is denoted as $\widetilde{\mathbf{N}}^{[k_1,k_2]}$ .
3. Risk measures and capital allocation for the multivariate compound loss model
In this section, we present results for calculating the risk measures and capital allocation for the multivariate compound loss model represented by the matrix $\mathbf{S}_{\mathbf{N}}$ defined in equation (2.1). To this purpose, we define the $(k,m)$ th ( $k$ th row, $m$ th column) component, first-moment transform of $\mathbf{S}_{\mathbf{N}}$ to be a matrix of random variables $\widetilde{\mathbf{S}_{\mathbf{N}^{[m]}}}^{[k]}$ , which has the same size as $\mathbf{S}_{\mathbf{N}}$ and the c.d.f.
where $\mathbf{s}$ is a matrix of non-negative constants
and the $\le$ operation is defined piecewisely.
Then, following Proposition 1 of Furman and Landsman (Reference Furman and Landsman2005), Proposition 3.1 of Denuit and Robert (Reference Denuit and Robert2022), or Lemma 2.1 in Jiang and Ren (Reference Jiang and Ren2022), we have, for $m\in \mathcal{M}$ and $k\in \mathcal{K}$ ,
where
and $\widetilde{S_{\mathbf{N}^{[m]}}}^{[k]}_{i,j}$ is the $i,j$ th element of $\widetilde{\mathbf{S}_{\mathbf{N}^{[m]}}}^{[k]}$ .
In addition, define the $[(k_1, m_1), (k_2, m_2)]$ th components joint moment transform $\mathbf{S}_{\mathbf{N}}$ to be a random matrix $\widetilde{\mathbf{S}_{\mathbf{N}^{[m_1,m_2]}}}^{[k_1,k_2]}$ with c.d.f.
Then, for $m_1,m_2\in \mathcal{M}$ and $k_1,k_2\in \mathcal{K}$ ,
where
and $\widetilde{S_{\mathbf{N}^{[m_1,m_2]}}}^{[k_1,k_2]}_{i,j}$ is the $(i,j)$ th element of $\widetilde{\mathbf{S}_{\mathbf{N}^{[m_1,m_2]}}}^{[k_1,k_2]}$ .
Therefore, together with Equations (2.2) to (2.5), it is seen that if we can compute the distribution function of $S_{\bullet }$ , $\widetilde{S_{\bullet }}^{m[k]}$ and $\widetilde{S_{\bullet }}^{m_1,m_2[k_1,k_2]}$ , then the TCE and TV of $S_{\bullet }$ can be determined and the associated capital allocation can be performed.
To determine the distribution of $S_{\bullet }$ , $\widetilde{S_{\bullet }}^{m[k]}$ , and $\widetilde{S_{\bullet }}^{m_1,m_2[k_1,k_2]}$ , we need the distribution functions of ${\mathbf{S}}_{\mathbf{N}}$ , $\widetilde{\mathbf{S}_{\mathbf{N}^{[m]}}}^{[k]}$ , and $\widetilde{\mathbf{S}_{\mathbf{N}^{[m_1,m_2]}}}^{[k_1,k_2]}$ , for which we have the following results.
Theorem 3.1. For $m\in \mathcal{M}$ , let $\mathbf{1}^{[m]}$ denote an $M$ dimensional vector with the $m$ th element being one and all others zero. Let
and
where
and $L_i^{[m]}$ , $i\in \mathcal{M}$ , is the $i$ th element of $\mathbf{L}^{[m]}$ . Then, for $k\in \mathcal{K}$ ,
where $\widetilde{\mathbf{X}_{(m)1}}^{[k]}$ is an independent copy of the $k$ th component, first-moment transform of $\mathbf{X}_{(m)}$ .
Further, let
then, for $k_1,k_2\in \mathcal{K}$ ,
where $\widetilde{\mathbf{X}_{(m)1}}^{[k_1]}$ and $\widetilde{\mathbf{X}_{(m)2}}^{[k_2]}$ are copies of the first-moment transform of $\mathbf{X}_{(m)}$ , and $\widetilde{\mathbf{X}_{(m)1}}^{[k_1,k_2]}$ is a copy of the $(k_1,k_2)$ th component (1,1)th moment transform of $\mathbf{X}_{(m)}$ . The random variables $\widetilde{\mathbf{X}_{(m)1}}^{[k_1]}$ , $\widetilde{\mathbf{X}_{(m)2}}^{[k_2]}$ , $\widetilde{\mathbf{X}_{(m)1}}^{[k_1,k_2]}$ , and $\mathbf{X}_{(m)}$ are mutually independent.
In addition, for $m_1,m_2\in \mathcal{M}$ and $m_1\neq m_2$ , let
then
The proof of this theorem is provided in the appendix of the paper.
Remark 3.1. Equation (3.4) shows that the distribution of $\widetilde{\mathbf{S}_{\mathbf{N}^{[m,m]}}}^{[k_1,k_2]}$ is a mixture of
and
with weights
and
respectively.
Remark 3.2. If one accident can only give rise to one type of claim, then $\mathcal{M}=\mathcal{K}$ and the claim size variables are univariate. In this case, Theorem 3.1 was reduced to Theorem 3.1 by Jiang and Ren (Reference Jiang and Ren2022). On the other hand, if the claim number random vectors $\mathbf{N}$ is univariate and the claim size random vector contains all risk types, Theorem 3.1 reduces to Theorem 3 in Ren (Reference Ren2022).
We summarize the procedures for performing the capital allocation computation as follows:
Computation Procedure 3.1.
-
Step 1. Determine the distributions of $\mathbf{N}$ , $\mathbf{L}^{[m]}$ , and $\mathbf{L}^{[m_1,m_2]}$ for $m,m_1,m_2 \in \mathcal{M}$ . Some commonly used distribution functions of $\mathbf{N}$ , such as multinomial, additive common shock, and common Poisson mixture, were studied in the literature by, for example, Hesselager (Reference Hesselager1996) and Kim et al. (Reference Kim, Jang and Pyun2019). In these cases, as shown by Jiang and Ren (Reference Jiang and Ren2022), the distributions of $\mathbf{L}^{[m]}$ and $\mathbf{L}^{[m_1,m_2]}$ are in fact mixture of some distributions in the same family as $\mathbf{N}$ and can be conveniently computed.
-
Step 2. Determine the distributions of $\mathbf{S}_{\mathbf{N}}$ , $\mathbf{S}_{\mathbf{L}^{[m]}}$ , and $\mathbf{S}_{\mathbf{L}^{[m_1,m_2]}}$ for $m,m_1,m_2 \in \mathcal{M}$ . When the distribution of $\mathbf{N}$ is as described in Step 1, this can be implemented by using the recursive methods introduced by Hesselager (Reference Hesselager1996) and Kim et al. (Reference Kim, Jang and Pyun2019). Alternatively, the Fast Fourier transform (FFT) method, as discussed by Robertson (Reference Robertson1992) and Wang (Reference Wang1998), can be applied if the characteristic function of $\mathbf{S}_{\mathbf{N}}$ , $\mathbf{S}_{\mathbf{L}^{[m]}}$ , and $\mathbf{S}_{\mathbf{L}^{[m_1,m_2]}}$ for $m,m_1,m_2 \in \mathcal{M}$ can be determined. This is possible if the characteristic functions of $\mathbf{N}$ (therefore $\mathbf{L}$ ’s) and $\mathbf{X}_{(m)}$ are known. For the cases discussed in this paper, both methods can be applied. We choose to use the FFT method since it can be implemented conveniently using software such as R and Matlab.
-
Step 3. Determine the distributions of $\mathbf{X}_{(m)}$ , $\widetilde{\mathbf{X}_{(m)1}}^{[k]}$ , and $\widetilde{\mathbf{X}_{(m)1}}^{[k_1,k_2]}$ for $m \in \mathcal{M}$ and $k,k_1,k_2\in \mathcal{K}$ . The probability density function (p.d.f.) of $\widetilde{\mathbf{X}_{(m)1}}^{[k]}$ and $\widetilde{\mathbf{X}_{(m)1}}^{[k_1,k_2]}$ for $m \in \mathcal{M}$ and $k,k_1,k_2\in \mathcal{K}$ can be determined by applying Definition 2.2 for the continuous case and Definition 2.4 for the discrete case.
-
Step 4. Determine the distributions of $\widetilde{\mathbf{S}_{\mathbf{N}^{[m]}}}^{[k]}$ and $\widetilde{\mathbf{S}_{\mathbf{N}^{[m_1,m_2]}}}^{[k_1,k_2]}$ for $m,m_1,m_2 \in \mathcal{M}$ and $k,k_1,k_2 \in \mathcal{K}$ . This can be done by applying Theorem 3.1 . The required convolutions can be computed using the FFT method.
-
Step 5. Determine the distributions of $S_{\bullet }$ , $\widetilde{S_{\bullet }}^{m[k]}$ , and $\widetilde{S_{\bullet }}^{m_1,m_2[k_1,k_2]}$ for $m,m_1,m_2 \in \mathcal{M}$ and $k,k_1,k_2 \in \mathcal{K}$ . The FFT of the distribution of $S_{\bullet }$ , $\widetilde{S_{\bullet }}^{m[k]}$ , and $\widetilde{S_{\bullet }}^{m_1,m_2[k_1,k_2]}$ is given by the diagonal term of the FFT of distributions of $S_{\mathbf{N}}$ , $\widetilde{\mathbf{S}_{\mathbf{N}^{[m]}}}^{[k]}$ , and $\widetilde{\mathbf{S}_{\mathbf{N}^{[m_1,m_2]}}}^{[k_1,k_2]}$ . Their distributions can be obtained using the one-dimensional inverse Fast Fourier transformation (IFFT).
-
Step 6. Determine the TCE- and TV-based capital allocations. The TCE- and TV-based capital requirement and the capital allocation can be determined by applying Equations (3.1) and (3.2).
Remark 3.3. Notice from equations (3.1) and (3.2) that in calculating TCE, TV, and the associate capital allocation, we only need the distribution of univariate random variables $S_{\bullet }$ , $\widetilde{S_{\bullet }}^{m[k]}$ , and $\widetilde{S_{\bullet }}^{m_1,m_2[k_1,k_2]}$ , not the whole joint distributions. Consequently, when computing their distributions by applying the FFT method, we do not need to apply multi-dimensional IFFT to the array of the FFT of $\mathbf{S}_{\mathbf{N}}$ , $\widetilde{\mathbf{S}_{\mathbf{N}^{[m]}}}^{[k]}$ , and $\widetilde{\mathbf{S}_{\mathbf{N}^{[m_1,m_2]}}}^{[k_1,k_2]}$ . Instead, we only need to apply one-dimensional IFFT to the diagonal terms to obtain the distribution of the sums.
4. Numerical examples
In this section, we illustrate how to apply the formulas derived in last section to compute the risk measures and to perform the capital allocations for the proposed multivariate aggregate loss models.
4.1 The multivariate aggregate claim model
In this subsection, we provide the general structure of the model that will be used in the numerical examples. Let $W$ be a counting random variable that follows $\text{NB}(r, \beta )$ distribution with probability mass function
Conditional on $W=w$ , let the claim number vector $\mathbf{N}=(N_1, \ldots, N_M)^\top$ follow a multinomial distribution with parameters $(w, q_1, \ldots, q_M)$ . That is,
This model was introduced by Hesselager (Reference Hesselager1996), and Jiang and Ren (Reference Jiang and Ren2022) denoted the unconditional distribution of $\mathbf{N}$ by $\text{HMN}(W, q_1, \ldots, q_M)$ . Regression analysis of this model was provided by Frees et al. (Reference Frees, Shi and Valdez2009).
We could easily obtain that
and
The joint p.g.f. of $\mathbf{N}$ is
As shown in Theorem 4.1 of Jiang and Ren (Reference Jiang and Ren2022), for $m,m_1,m_2\in \mathcal{M}$ and $m_1\neq m_2$ , we have
and
In addition, $\widetilde{W}-1$ follows $\text{NB}(r + 1, \beta )$ distribution, and $\widetilde{W}^{(2)}-2$ follows $\text{NB}(r + 2, \beta )$ distribution. Therefore, the distribution of $\mathbf{L}^{[m]}$ , ${\mathbf{L}^{(2)}}^{[m]}$ , and $\mathbf{L}^{[m_1,m_2]}$ is all in the same family as $\mathbf{N}$ .
For claim severity, we assume that if a claim combination $m$ only consists of a type $k\in \mathcal{K}$ claim, then its size follows a Poisson distribution with mean $a_k$ . Then we have
and
If a claim combination $m$ consists of non-zero claims of types $\{k_1,\ldots, k_h\}$ , then the joint distribution of the claim sizes is assumed to be a common Poisson mixture. That is, conditional on a mixing variable $\Lambda =\lambda$ , for $j = 1, \ldots, h$ , the size of type $k_j$ claim follows a Poisson distribution with mean $b_{k_j}\lambda$ . Further, we assume $\Lambda$ follows a gamma distribution with shape parameter $\alpha$ and p.d.f.
Consequently, ${\mathbb{E}}[\Lambda ]=1$ and for $i,j \in \{1,\ldots,h\}$ and $i\neq j$ , we have
In addition, for claim combinations $m, m_1, m_2\in \mathcal{M}$ , $m_1\neq m_2$ , and $k_1, k_2\in \mathcal{K}$ , we have
and
Let
where
denote the characteristic function (c.f.) of $\mathbf{S}_{\mathbf{N}}$ . Let $\mathcal{P}_{\mathbf{N}}(\cdot )$ denote the probability generating function (p.g.f.) of $\mathbf{N}$ and $\phi _{\mathbf{X_{(m)}}} (\cdot )$ the c.f. of $\mathbf{X_{(m)}}$ . Then
The c.f. of $\mathbf{S}_{\mathbf{L}^{[m]}}$ , $\mathcal{S}_{{\mathbf{L}^{(2)}}^{[m]}}$ , and $\mathcal{S}_{\mathbf{L}^{[m_1,m_2]}}$ can be derived similarly. Specifically,
and
With the above, all steps in computation procedure 3.1 can be carried out, and the risk analysis of $\mathbf{S}_{\mathbf{N}}$ can be performed.
We remark that the selection of the distribution of $\mathbf{N}$ and $\mathbf{X}_{(m)}$ is arbitrary in this section. Other distributions of $\mathbf{N}$ and $\mathbf{X}_{(m)}$ can be used as long as their moments, characteristic function, and moment transforms can be evaluated.
4.1.1 An example with two types of risks
We apply the setting in Example 2.1 where two types of claims, PD and BI, are considered. Recall that the claim number vector is $\mathbf{N} = (N_1, N_2, N_3)^\top$ and the claim sizes be $\mathbf{X}_{(1)}=(X_{(1),1},0)^\top$ , $\mathbf{X}_{(2)}=(0,X_{(2),2})^\top$ , and $\mathbf{X}_{(3)}=(X_{(3),1},X_{(3),2})^\top$ , respectively.
We assume that $\mathbf{N} \sim \text{HMN}(W, q_1=0.9, q_2 = 0.02, q_3= 0.08)$ and $W \sim \text{NB}(r=10, \beta =1)$ .
Let ${X_{(1),1}}\sim \text{Poi}(a_1=1)$ , ${X_{(2),2}}\sim \text{Poi}(a_2=5)$ , and $\mathbf{X}_{(3)}$ follow a common Poisson mixture, where conditional on $\Lambda =\lambda$ , ${X_{(3),1}}\sim \text{Poi}(1.2\lambda )$ and ${X_{(3),2}}\sim \text{Poi}(6\lambda )$ , and $\Lambda$ follows a gamma distribution with shape parameter $\alpha =2$ and mean one.
These parameter values are selected hypothetically to reflect the fact that accidents that cause only PDs usually have high frequency and low severity; it is unlikely ( $q_2 = 0.02$ ) that an accident causes BI but no PDs; accidents that cause both BI and PDs have low frequency and high severity. Note that we assume discrete distributions for the claim sizes for simplicity. If continuous distributions are assumed, they need to be discretized to apply the FFT or recursive methods.
The proportions of capital allocated to the two types of risks according to TCE with selected values of $q$ in $(0, 1)$ are plotted in Figure 1, panel (a). It shows that the proportion of risk capital allocated to PD (BI) claims decreases (increases) with $q$ . When $q$ is small, more capital is allocated to PD claims, whereas more risk capital is allocated to BI claims when $q$ is large.
The proportions of capital allocated according to TV are shown in panel (b) of Figure 1. We observe that when $q$ is small, the proportion allocated to BI claims is a decreasing function of $q$ , and when $q$ is large, the proportion allocated to BI claims increases with $q$ . The opposite pattern is observed for PD claims.
The amounts of capital allocated to the two types of risks according to both TCE and TV criteria increase with $q$ , as shown in panels (c) and (d) of Figure 1.
Figure 2 compares the proportions of capital allocated to the two types of risks according to TCE and TV criteria obtained by using the moment transform method proposed in this paper and those by using the Monte Carlo simulation (with $10^7$ runs). It shows that the results based on the moment transform are accurate. We note that the moment transform method takes much less computation time than the Monte Carlo simulation.
The capital allocation to the three combinations of risk types according to TCE and TV criteria can be performed following the same procedure. To avoid redundancy, we omit the analysis here.
Tables 1 and 2 show the numerical values of the amounts and proportions of capital allocated to the two types of risk according to TCE and TV criteria for the HMN models under different values of $\alpha$ , which is the parameter of the mixing random variable $\Lambda$ for $\mathbf{X}_{(3)}=(X_{(3),1}, X_{(3),2}$ ). Since $\text{Var}({X_{(3),1}})=b_1+b_1^2/\alpha$ , $\text{Var}({X_{(3),2}})=b_2+b_2^2/\alpha$ , and
we see that the variance of ${X_{(3),1}}$ and ${X_{(3),2}}$ and their correlation increases as $\alpha$ decreases. In particular, when $\alpha \rightarrow \infty$ , ${X_{(3),1}}$ , and ${X_{(3),2}}$ are uncorrelated; when $\alpha \rightarrow 0$ , the corelation coefficient approaches to one.
From Tables 1 and 2, we observe that larger variance of and stronger dependence between ${X_{(3),1}}$ and ${X_{(3),2}}$ lead to greater values of VaR and TCE of the total losses and higher proportion of capital allocated to BI risks.
4.1.2 An example with three types of risks
In this subsection, we consider the automobile insurance claim model discussed by Frees and Valdez (Reference Frees and Valdez2008), in which three types of claims, own damage (OD), third-party property (TPP), and third-party injury (TPI), are considered. An accident can cause any combination of the three types of claims with proportions shown in Table 3. Frees and Valdez (Reference Frees and Valdez2008) proposed a hierarchical, three-component (loss frequency, severity, and dependence) regression model to analyze this highly complex data structure.
In this example, we study the risk measure and capital allocation problem for the model. We assume that the vector of the number of claim combinations is given by
where $W\sim \text{NB}(r=10,\beta =1)$ , with $q_1=0.004$ , $q_2=0.732$ , $q_3=0.123$ , $q_4=0.003$ , $q_5=0.001$ , $q_6=0.135$ , $q_7=0.002$ . The claim size vectors are denoted by $\mathbf{X}_{(1)}=(X_{(1),1},0,0)^\top$ , $\mathbf{X}_{(2)}=(0,X_{(2),2},0)^\top$ , $\mathbf{X}_{(3)}=(0,0,X_{(3),3})^\top$ , $\mathbf{X}_{(4)}=(X_{(4),1},X_{(4),2},0)^\top$ , $\mathbf{X}_{(5)}=(X_{(5),1},0,X_{(5),3})^\top$ , $\mathbf{X}_{(6)}=(0,X_{(6),2},X_{(6),3})^\top$ , and $\mathbf{X}_{(7)}=(X_{(7),1},X_{(7),2},X_{(7),3})^\top$ .
Frees and Valdez (Reference Frees and Valdez2008) fitted the claim sizes by the generalized beta of the second kind (GB2) distribution and modelled their dependence by multivariate t-copula. Here, for illustration of our method, we simply assume that ${X_{(1),1}}\sim \text{Poi}(a_1)$ , ${X_{(2),2}}\sim \text{Poi}(a_2)$ , ${X_{(3),1}}\sim \text{Poi}(a_3)$ , and the non-zero elements of $\mathbf{X}_{(4)}$ , $\mathbf{X}_{(5)}$ , $\mathbf{X}_{(6)}$ , $\mathbf{X}_{(7)}$ follow common Poisson mixtures. Specifically, let $\Lambda _i$ for $i=1,2,3,4$ assumed to be independent; all follow a gamma distribution with shape parameter $\alpha =2$ and mean one. Conditional on $\Lambda _1=\lambda _1$ , ${X_{(4),1}}$ and ${X_{(4),2}}$ are independent Poisson variables $\text{Poi}(b_1\lambda _1)$ and $\text{Poi}(b_2\lambda _1)$ ; conditional on $\Lambda _2=\lambda _2$ , ${X_{(5),1}}$ and ${X_{(5),3}}$ are independent Poisson variables $\text{Poi}(b_1\lambda _2)$ and $\text{Poi}(b_3\lambda _2)$ ; conditional on $\Lambda _3=\lambda _3$ , ${X_{(6),2}}$ and ${X_{(6),3}}$ are independent Poisson variables $\text{Poi}(b_2\lambda _3)$ and $\text{Poi}(b_3\lambda _3)$ ; and conditional on $\Lambda _4=\lambda _4$ , ${X_{(7),1}}$ , ${X_{(7),2}}$ , and ${X_{(7),3}}$ are independent Poisson variables $\text{Poi}(b_1\lambda _4)$ , $\text{Poi}(b_2\lambda _4)$ , and $\text{Poi}(b_3\lambda _4)$ . The parameter values are set to be $a_1=5$ , $a_2=1$ , $a_3=0.8$ , $b_1=6$ , $b_2=1.2$ , and $b_3=0.96$ .
The proportions and amounts of capital allocated to the three types of risks according to TCE and TV with selected values of ${q}$ are plotted in Figure 3. As we can see, the pattern for TPI (OD) is similar to BI (PD) in the last example, and the pattern for TPP is somewhat in the middle.
Remark 4.1. Applying simulation methods to estimate risk measures and compute capital allocations for this complex hierarchical risk model can be time-consuming and/or inaccurate. Our proposed method, based on moment transform and FFT, can solve the problem efficiently. This is especially true because, as pointed out in Remark 3.3, we do not need to apply multivariate IFFT to obtain the joint distribution of the seven possible combinations of the three types of losses. Instead, we only need to perform one-dimensional IFFT to the diagonal terms to get the distribution of the total.
4.2 A model with dependent claim frequency and size
In this subsection, we study a model in which the claim frequency and size are dependent through a common mixing variable, $\Xi$ defined on $(0,\ \infty )$ . Similar to the example in Section 4.1.1, we suppose that insurance policies cover two types of claims, PD and BI. The claim frequency vector $\mathbf{N} = (N_1, N_2, N_3)^\top$ follows the $\text{HMN}(W, q_1, q_2, q_3)$ distribution defined in Section 4.1, where $W\sim \text{NB}(r\xi, \beta )$ . The claim sizes are denoted by $\mathbf{X}_{(1)}=(X_{(1),1}, 0)^\top$ , $\mathbf{X}_{(2)}=(0, X_{(2),2})^\top$ , and $\mathbf{X}_{(3)}=(X_{(3),1}, X_{(3),2})^\top$ , respectively. We assume that, conditional on $\Xi =\xi$ , ${X_{(1),1}}\sim \text{Poi}(a_1\xi )$ , ${X_{(2),2}} \sim \text{Poi}(a_2\xi )$ , and $\mathbf{X}_3$ follows a common Poisson mixture, where conditional on $\Lambda =\lambda$ , ${X_{(3),1}}\sim \text{Poi}(b_1\lambda \xi )$ and ${X_{(3),2}}\sim \text{Poi}(b_2\lambda \xi )$ . Finally, we assume that $\Xi$ and $\Lambda$ are independent and follow gamma distribution with shape parameters $\alpha _1$ and $\alpha _2$ , respectively. Both have unit mean.
The characteristic function of $\mathbf{S}_{\mathbf{N}}$ is given by
Since it is difficult to calculate the explicate expression of integral in Equation (4.1), even with the simple assumptions for the distributions of claim frequency and severity, in computation, we discretize the distribution of $\Xi$ and compute the expectation in Equation (4.1) numerically.
The computation for capital allocation can be performed by using the following equations. For $m,m_1,m_2=1,2,3$ and $k,k_1,k_2=1,2$
and
Conditional on $\Xi =\xi$ , the above quantities can be calculated by applying Theorem 3.1. Specifically, for $m=1,2,3$ and $k_1,k_2=1,2$ , we have the following:
where
For $m_1,m_2=1,2,3$ , $k_1,k_2=1,2$ , and $m_1\neq m_2$ , we have
where
Then, the expectation with regard to $\Xi$ can be computed numerically by discretizing the distribution of $\Xi$ .
In the following, we set the value of the shape parameter of the gamma distributed variable $\Xi$ to $\alpha _1=10$ and assume that all other parameters are the same as those in Section 4.1.1.
The proportions and amounts of capital allocated to the two types of risks according to TCE and TV are shown in Figure 4. It shows a similar pattern to that in Figure 1 in Section 4.1.1. That is, according to TCE, the proportion of risk capital allocated to PD (BI) claims decreases (increases) with ${q}$ ; and according to TV, the proportion allocated to BI(PD) claims decreases (increases) with ${q}$ when ${q}$ is small and increases (decreases) when ${q}$ is large. Also, the amounts of capital allocated to the two types of risks increase with ${q}$ in all cases.
Tables 4 and 5 show the numerical values of the amounts of capital allocated to the two types of risk according to TCE and TV criteria for different values of $\alpha _1$ , the coefficient of the mixing random variable $\Xi$ . Recall that a smaller value of $\alpha _1$ indicates a larger variance of $\Xi$ , and a stronger dependence between the claim frequency and severity. When $\alpha _1\rightarrow \infty$ , the loss frequency and severities are independent.
From Tables 4 and 5, we conclude that larger variance of the claim frequency and size, and stronger dependence between them, leads to greater values of VaR, TCE and TV of the total losses. The total capital allocated to each type of risk also increases.
5. Conclusions
This paper presents formulas for computing TCE and TV and performing corresponding capital allocation for a hierarchical multivariate compound model introduced by Cummins and Wiltbank (Reference Cummins and Wiltbank1983) and Frees and Valdez (Reference Frees and Valdez2008), where both the claim frequencies and the claim sizes are dependent. The main methodology we used is the multivariate moment transform.
Future research will study the risk measures and capital allocation problems for multivariate compound models with more complicated dependence structures between claim frequencies and claim sizes.
Funding statement
The second author of the paper gratefully acknowledges the financial support of the Natural Sciences and Engineering Research Council of Canada, grant number RGPIN-2019-06561.
Data availability statement
Data availability is not applicable to this article as no new data were created or analyzed in this study.
Competing interests
No known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Appendix: Proof of Theorem 3.1
Proof. We firstly assume that $\mathbf{N}$ takes fixed values $\mathbf{N}=\mathbf{n}=(n_1,\ldots,n_M)^\top$ . For $m\in \mathcal{M}$ , let
and
Then, similar to the results in Denuit and Robert (Reference Denuit and Robert2022) and Ren (Reference Ren2022), for $m\in \mathcal{M}$ , $i\in \{1,\ldots,n_m\}$ , and $k\in \mathcal{K}$ , we have
Since $\mathbf{X}_{(m)i}$ ’s are assumed to be i.i.d.,
Further, for $i,j\in \{1,\ldots,n_m\}$ , $i\neq j$ , and $k_1,k_2\in \mathcal{K}$ , we have
and
Then,
Since $\mathbf{S}_{n_1}, \ldots, \mathbf{S}_{n_M}$ are mutually independent, we have
and
In addition, for $m_1,m_2\in \mathcal{M}$ , $m_1\neq m_2$ ,
Therefore, applying the law of total probability to Equation (A.1) leads to
which leads to Equation (3.3).
Similarly, applying the law of total probability to Equations (A.2) and (A.3), respectively, yields
which leads to Equation (3.4).
In addition,
which leads to Equation (3.5). This ends the proof.