1. Introduction
The idea that individuals act to maximize their heterogeneous preferences in a context of uncertainty and scarcity of resources is the very reason for the existence of exchange markets. A practically relevant example of such markets for actuaries and insurance professionals are the markets for sharing risks between agents with differing preferences, beliefs, and levels of risk aversion. Broadly speaking, markets for exchanging risks can be classified into two categories: centralized risk-sharing markets and decentralized risk-sharing markets. In centralized risk-sharing markets, a central agent (or collection thereof) acts as the supplier of insurance. This is the traditional model of insurance, of which the basic theoretical component is the notion of an insurance contract (a premium and an indemnity function). Decentralized risk-sharing markets consist of a pool of agents who wish to directly insure each other, without recourse to a central insurance provider. The agents need to determine a priori a way to allocate the aggregate risk among the pool. Examples of these markets include peer-to-peer (P2P) insurance arrangements, which have gained significant popularity in recent years.
The study of Pareto-optimal insurance contracting within centralized insurance markets has its roots in the two-agent market consisting of one policyholder and one insurer. In early foundational work, Arrow (Reference Arrow1974) and Borch (Reference Borch1960) show that at a Pareto optimum, the indemnity function in a two-agent market is full coverage above a constant deductible, provided that the insurer is risk-neutral and the policyholder’s preference admits a risk-averse expected-utility representation. Numerous extensions of the classical model were subsequently proposed, mostly aiming at introducing more realistic models of policyholder behavior into the theory (see Gollier, Reference Gollier and Dionne2013 or Schlesinger, Reference Schlesinger and Dionne2000 for an overview). For instance, the effect of belief heterogeneity between the agents was examined by Ghossoub, (Reference Ghossoub2019c, Reference Ghossoub2017, Reference Ghossoub2019b), Boonen and Ghossoub (Reference Boonen and Ghossoub2019, Reference Boonen and Ghossoub2020), and Ghossoub et al. (Reference Ghossoub, Jiang and Ren2023). In the literature on ambiguity in optimal insurance design, the majority of the literature considered ambiguity on the side of the insured, as in Bernard et al. (Reference Bernard, He, Yan and Zhou2015) and Xu et al. (Reference Xu, Zhou and Zhuang2018). Several extensions were proposed in Amarante et al. (Reference Amarante, Ghossoub and Phelps2015), Ghossoub, (Reference Ghossoub2019c, Reference Ghossoub2019a), and Birghila et al. (Reference Birghila, Ghossoub and Boonen2023), for instance. Insurance market models in which the insurer and/or policyholders have preferences represented by risk measures were also examined, such as value-at-risk (VaR) and expected shortfall (ES) (e.g., Cai et al. Reference Cai, Tan, Weng and Zhang2008; Chi and Tan Reference Chi and Tan2013; Cheung et al. Reference Cheung, Chong and Yam2015, and Asimit et al. Reference Asimit, Boonen, Chi and Chong2021), and distortion risk measures (DRMs) (e.g., Cui et al. Reference Cui, Yang and Wu2013; Assa Reference Assa2015, and Zhuang et al. Reference Zhuang, Weng, Tan and Assa2016). Extensions of these results to a centralized market with multiple suppliers of insurance have been examined by Boonen et al. (Reference Boonen, Tan and Zhuang2016), Asimit and Boonen (Reference Asimit and Boonen2018), Boonen and Ghossoub (Reference Boonen and Ghossoub2019, Reference Boonen and Ghossoub2021), and Zhu et al. (Reference Zhu, Ghossoub and Boonen2023), for instance.
However, the focus on identifying Pareto-optimal allocations often neglects the impact of market forces in a given insurance market. In particular, the asymmetry of bargaining power between the insurer and the policyholder can invalidate certain Pareto-optimal allocations as likely outcomes of this market. Furthermore, the study of Pareto efficiency takes prices (or premia) in the market as given, and it does not address the question of how premia are determined at a market equilibrium. A popular equilibrium concept that takes into account the aforementioned asymmetry between the insurer and the policyholder is the Stackelberg equilibrium (or Bowley optimum), which applies to a monopolistic insurance market where the insurer and the policyholder move sequentially in an economic game, with the insurer as the leader and the policyholder as the follower. Specifically, the insurer is able to set the prices of insurance before the policyholder has a chance to purchase insurance. The contracts that are expected to emerge in markets with this leader–follower structure are Stackelberg equilibria, which were first examined in an insurance context by Chan and Gerber (Reference Chan and Gerber1985) for agents with exponential utility functions. These results were further extended to the case of DRMs by Cheung et al. (Reference Cheung, Yam and Zhang2019). However, of particular economic relevance is the relationship between these Stackelberg equilibria and Pareto optimality of allocations. To this end, Boonen and Ghossoub (Reference Boonen and Ghossoub2022) show that when the insurer uses a linear pricing rule and agents’ preferences are represented by DRMs, Stackelberg equilibria represent only a subset of Pareto-optimal allocations. Moreover, in every Stackelberg equilibrium, the insurer’s first-mover advantage allows it to charge prices so high that the policyholder is left with no incentive to purchase insurance. In other words, Stackelberg equlibria do not induce a welfare gain to the policyholder. Recent results from Ghossoub and Zhu (Reference Ghossoub and Zhu2024b) show that this situation occurs even in markets with multiple policyholders, and Zhu et al. (Reference Zhu, Ghossoub and Boonen2023) show that introducing competition on the supply side of the insurance market alleviates this problem.
As an alternative to centralized markets of insurance, decentralized insurance markets offer the agents the possibility of sharing risks directly among themselves, thereby avoiding interaction with traditional insurance providers. This would circumvent the potential no-welfare-gain equilibria described above, as there is no longer any danger of an insurance provider exploiting an advantage in bargaining power. A primary concern of decentralized risk-sharing markets is the Pareto-efficiency of the allocations of the aggregate risk. The seminal work of Borch (Reference Borch1962) and Wilson (Reference Wilson1968) showed that when agents have risk-averse expected-utility (EU) preferences, each individual’s allocation of risk is a nondecreasing deterministic function of the aggregate risk in the market at a Pareto optimum. This, in turn, leads to a complete characterization of efficient allocations (through the so-called Borch rule) and shows that Pareto optima (PO) are in fact comonotone. The subsequent literature has then examined several extensions of the classical model beyond EU. Notably, the effect of ambiguity-sensitive agents on optimal allocations has attracted significant attention in the literature. For instance, Dana (Reference Dana2002, Reference Dana2004) and De Castro and Chateauneuf (Reference De Castro and Chateauneuf2011) consider economies with maximum expected utility multiple-prior preferences à la Gilboa and Schmeidler (Reference Gilboa and Schmeidler1989). Chateauneuf et al. (Reference Chateauneuf, Dana and Tallon2000), Dana (Reference Dana2004), De Castro and Chateauneuf (Reference De Castro and Chateauneuf2011), and Beissner and Werner (Reference Beissner and Werner2023) study economies with non-probabilistic uncertainty as in the Choquet-Expected Utility model of Schmeidler (Reference Schmeidler1989). The more general class of variational preferences of Maccheroni et al. (Reference Maccheroni, Marinacci and Rustichini2006) was examined by Dana and Le Van (Reference Dana and Le Van2010) and Ravanelli and Svindland (Reference Ravanelli and Svindland2014), for instance. In the context of risk management, Pareto-optimal allocations have been studied when preferences satisfy the properties of translation invariance, risk aversion, convexity, coherence, or a combination thereof: we refer to Jouini et al. (Reference Jouini, Schachermayer and Touzi2008), Filipović and Svindland (Reference Filipović and Svindland2008), Mao and Wang (Reference Mao and Wang2020), Ghossoub and Zhu (Reference Ghossoub and Zhu2024a), and references therein for an overview of the topic. Of particular recent interest is the risk-sharing problem for agents with quantile-based risk measures that are not necessarily convex, as studied by Embrechts et al. (Reference Embrechts, Liu and Wang2018, Reference Embrechts, Liu, Mao and Wang2020), Liu (Reference Liu2020). In contrast with the literature on risk-averse preferences, optimal allocations can be counter-monotone rather than comonotone, as shown in Lauzier et al. (Reference Lauzier, Lin and Wang2024) and Ghossoub et al. (Reference Ghossoub, Ren and Wang2024).
In practice, decentralized markets of insurance are often found in the form of risk-sharing pooling arrangements, in which every agent pays a contribution fee to a pool in return for partial coverage of their monetary risk. For more details on real-world examples of peer-to-peer risk sharing, we refer to Abdikerimova and Feng (Reference Abdikerimova and Feng2022). Decentralized insurance is also prevalent in markets for catastrophe risk: we refer to Bollmann and Wang (Reference Bollmann and Wang2019) for an overview, as well as Feng et al. (Reference Feng, Liu and Taylor2023) for the flood risk market with proportional insurance. The allocation of the ex post risk to each agent is generally performed according to a predetermined risk-sharing rule. Examples of these rules previously studied in the literature include the conditional mean risk-sharing rule (e.g., Denuit and Dhaene, Reference Denuit and Dhaene2012), as well as rules based on actuarial fairness and Pareto optimality as in Feng et al. (Reference Feng, Liu and Taylor2023). Our focus in this paper is on the Pareto optimality of allocations. For more on desirable properties of risk-sharing rules and recent developments, we refer to Denuit et al. (Reference Denuit, Dhaene and Robert2022).
In this paper, we study Pareto optimality in a decentralized P2P risk-sharing market where agents’ preferences are represented by robust distortion risk measures that are not necessarily convex. In our setting, robustness is with respect to the distortion function used, not with respect to the probability measure on the space. This is in contrast to Bernard et al. (Reference Bernard, Pesenti and Vanduffel2023), but it is in line with Wang and Xu (Reference Wang and Xu2023), who argue that these risk measures can be motivated by the ambiguity present in each agent’s preference. This type of robustness also emerges naturally in the context of coherent risk measures (e.g., Dana, Reference Dana2005). Moreover, we allow for heterogeneity both in the baseline probability measures used by the agents and in the sets of distortion functions used by the agents. The set of feasible allocations is assumed to consist of comonotone allocations, that is, allocations that are 1-Lipschitz functions of the aggregate risk in the market. This is related to the so-called no-sabotage condition of Carlier and Dana (Reference Carlier and Dana2003, Reference Carlier and Dana2005) typically assumed in centralized insurance markets, which guarantees that no agent has an incentive to misreport their actual realized loss ex post.
Our main result (Theorem 3.3) provides a characterization of Pareto-optimal allocations of the aggregate risk in the market. In particular, we show that the shape of the allocations depends primarily on each agent’s assessment of the tail of the aggregate risk. We quantify the latter via an index of probabilistic risk aversion. In a special case of our result where the set of distortion functions for each agent is in fact a singleton (Corollary 3.4), we recover the explicit form of optimal allocations under DRMs previously attained by Liu (Reference Liu2020). We then provide several illustrations of our results using concrete examples of popular families of distortion functions.
As an application of our results, we revisit the market for flood risk insurance in the United States through the lens of P2P insurance. We present the decentralized risk-sharing arrangement as an alternative to the current centralized market structure, and we characterize the optimal allocations in a numerical study with historical flood data. Specifically, we revisit the numerical study of Boonen et al. (Reference Boonen, Chong and Ghossoub2024), which examines a centralized market with the federal government acting as the sole provider of insurance. However, as argued by Ghossoub and Zhu (Reference Ghossoub and Zhu2024b), this centralized structure provides an incentive for the central authority to charge extremely high prices at the expense of the policyholders. We therefore present the decentralized risk-sharing scheme as an alternative, and we characterize the optimal allocations thereof using historical data. We find that while the decentralized risk-sharing market avoids Stackelberg equilibria, the trade-off is that the average welfare gain is lower than that of the centralized market.
The remainder of this paper is structured as follows. Section 2 formulates the optimal allocation problem for decentralized markets. Section 3 provides the characterization of Pareto optima in this setting, as well as some general results for common families of DRMs. We apply our results to the market for flood risk insurance in the United States in Section 4, in which we also discuss the advantages and disadvantages of a decentralized market scheme. Section 5 concludes. Most proofs are relegated to Appendix A.
2. Problem formulation
2.1. The market
Let
$\mathcal{X}$
be the set of bounded real-valued measurable functions on a given measurable space
$\left(\Omega,\mathcal{F}\right)$
, and let
$\mathcal{X}_+$
denote its nonnegative elements. There are
$n\in\mathbb{N}$
agents wishing to share their endowed risks among themselves without any central authority involvements. For each
$i \in \mathcal{N} \,:\!=\, \{1,\ldots,n\}$
, let
$X_i \in \mathcal{X}_+$
denote the nonnegative loss of the ith agent. We consider a one-period, risk-sharing economy, where all risks are realized at the end of the period. The risk-sharing mechanism in this market is as follows.
For each
$i \in \mathcal{N}$
, agent i pays ex ante the contribution amount
$\pi_i\in\mathbb{R}$
to a pool. At the end of the period, the aggregate loss
$S\,:\!=\, \sum_{i=1}^{n}X_i$
is covered by the aggregate amount
$\sum_{i=1}^{n}\pi_i$
in the pool. The residual aggregate loss to be shared among the agents is thus given by
$\widetilde S \,:\!=\, \sum_{i=1}^{n}X_i-\sum_{i=1}^{n}\pi_i$
. If
$\widetilde S \geq 0$
, the pool is subject to a residual aggregate loss; whereas if
$\widetilde S \leq 0$
, the pool has a monetary surplus to be shared among the agents at the end of the period. Let
$Y_i$
be the shared loss by the ith agent,
$i\in\mathcal{N}$
, from the residual aggregate loss
$\widetilde S$
. For each
$i \in \mathcal{N}$
, the end-of-period, post-transfer risk exposure of agent i is thus given by
$Y_i+\pi_i$
.
Remark 2.1. Note that this formulation slightly generalizes the usual formulation in the literature by incorporating deterministic contributions from the agents to the pool at the beginning of the period. This can be compared to the usual formulation by defining
$Z_i\,:\!=\, Y_i+\pi_i$
to be the resulting allocation to agent i. Note that for each
$i\in\mathcal{N}$
, the allocation
$Z_i$
is almost surely bounded by below.
We assume that the market is a comonotone risk-sharing market, as previously studied by Boonen et al. (Reference Boonen, Liu and Wang2021). This is a market in which the admissible contracts are only those that are comonotone with the aggregate initial risk. Under this restriction, the contracts all satisfy the no-sabotage condition of Carlier and Dana (Reference Carlier and Dana2003, Reference Carlier and Dana2005), which guarantees that no agent has an incentive to misreport their actual realized loss. This is one possible justification for the existence of such a market, since preventing ex-post moral hazard is in the best interest of every agent. An alternative justification arises from the situation where each agent’s preference is monotone with respect to the convex order. In this case, it is well known that Pareto optima are comonotone (e.g., Ghossoub and Zhu, Reference Ghossoub and Zhu2024a).
The set of ex ante admissible decision variables is therefore given by

Remark 2.2. Note that
$\{Y_i+\pi_i\}_{i=1}^n$
is comonotone if and only if
$\{Y_i\}_{i=1}^n$
is comonotone. Recall that
$\sum_{i=1}^n(Y_i+\pi_i)=\sum_{i=1}^nX_i=S$
is the aggregate risk present in the insurance market. By a standard result (e.g., Denneberg, Reference Denneberg1994, Proposition 4.5), if
$\{Y_i+\pi_i\}_{i=1}^n$
is comonotone, then there exist increasing 1-Lipschitz functions
$f_i$
such that
$Y_i+\pi_i=f_i(S)$
.
The preferences of each agent i, for
$i\in\mathcal{N}$
, are represented by given risk measures
$\rho_i$
. Consequently, each agent i measures their pre-transfer risk exposure by
$\rho_i\left(X_i\right)$
, and their post-transfer risk exposure by
$\rho_i\left(Y_i+\pi_i\right)$
. We recall some properties of risk measures below.
Definition 2.3.
A risk measure
$\varrho\,:\,\mathcal{X}\to\mathbb{R}$
is said to be:
-
• Monotone if
$\varrho(Z_1)\le\varrho(Z_2)$ , for all
$Z_1,Z_2\in\mathcal{X}$ such that
$Z_1\le Z_2$ .
-
• Translation invariant if
$\varrho(Z+c)=\varrho(Z)+c$ , for all
$Z\in\mathcal{X}$ and
$c\in\mathbb{R}$ .
-
• Law-invariant if for all
$Z_1,Z_2\in\mathcal{X}$ with the same distribution under
$\mathbb{P}$ , we have
$\varrho(Z_1)=\varrho(Z_2)$ .
We will assume throughout that all risk measures are monotone.
Definition 2.4. A contract
$\left(\left\{Y^*_i\right\}_{i=1}^{n},\left\{\pi^*_i\right\}_{i=1}^{n}\right)\in\mathcal{A}$
is said to be:
-
• Individually Rational (IR) if it incentivizes the parties to participate in the market, that is,
\begin{align*}\rho_i\left(Y^*_i+\pi^*_i\right)\leq\rho_i\left(X_i\right),\quad \forall \, i \in \mathcal{N}.\end{align*}
We denote the set of all IR allocations by
$\mathcal{IR}$ .
-
• Pareto Optimal (PO) if it is IR and there does not exist any other IR contract
$\left(\left\{Y_i\right\}_{i=1}^{n},\left\{\pi_i\right\}_{i=1}^{n}\right)$ such that
\begin{align*}\rho_i\left(Y_i+\pi_i\right)\leq\rho_i\left(Y^*_i+\pi^*_i\right),\quad \forall \, i \in \mathcal{N},\end{align*}
with at least one strict inequality. We denote the set of all PO allocations by
$\mathcal{PO}$ .
2.2. Pareto optimality in peer-to-peer arrangements
First, we show that all comonotone allocations are translations of a suitable non-decreasing function of the aggregate endowment. To this end, define the set
$\mathcal{G}$
by the following.

Note that if
$\{g_i\}_{i=1}^n\in\mathcal{G}$
, then each
$g_i$
is 1-Lipschitz. Furthermore, since each
$g_i$
is non-negative, we have
$g_i(0)=0$
for all
$i\in\mathcal{N}$
.
Lemma 2.5. Let
$(\{Y_i\}_{i=1}^n,\{\pi_i\}_{i=1}^n)\in\mathcal{A}$
. Then there exist functions
$\{g_i\}_{i=1}^n\in\mathcal{G}$
and constants
$\{c_i\}_{i=1}^n\in\mathbb{R}^n$
such that

and
$\sum_{i=1}^nc_i=0$
.
In the case where every agent uses a translation invariant risk measure, we have a characterization of PO in terms of solutions of a minimization problem. Let
$\mathcal{S}$
denote the set of all solutions to the problem

Proposition 2.6. If, for each
$i\in\mathcal{N}$
,
$\rho_i$
is translation invariant, then
$\mathcal{PO}=\mathcal{S}$
.
3. Pareto optima for robust distortion risk measures
3.1. Robust distortion risk measures
In this section, we provide an explicit characterization of Pareto-optimal allocations when agents use robust DRMs. These are defined below.
Definition 3.1. A risk measure
$\varrho\,:\,\mathcal{X}\to\mathbb{R}$
is a robust distortion risk measure if there exists a set of distortion functions
$\mathcal{T}$
and a probability measure
$\mathbb{P}$
on
$(\Omega,\mathcal{F})$
such that for all random variables Z on the probability space
$(\Omega,\mathcal{F},\mathbb{P})$
,

Robustness in the context of Definition 3.1 refers to the consideration of different distortion functions by the decision maker, not different probability distributions on the underlying measurable space. Our definition is in line with that of Wang and Xu (Reference Wang and Xu2023), who argue that these risk measures can be motivated by the ambiguity present in each agent’s preference. We note that the use of the term robust distortion risk measure is not consistent throughout the literature. For instance, Bernard et al. (Reference Bernard, Pesenti and Vanduffel2023) use this term to refer to the case where the robustness relates to uncertainty in the distribution of the risk. In contrast, Definition 3.1 assumes that the distribution of a given risk is fixed with respect to a reference probability measure.
Lemma 3.2. For a robust distortion risk measure with respect to a set of distortion functions
$\mathcal{T}$
, we have

where
$\overline{\mathrm{co}}(\mathcal{T})$
is the closed convex hull of
$\mathcal{T}$
with respect to pointwise convergence.
We now assume that for each
$i\in\mathcal{N}$
, the risk measure
$\rho_i$
is a robust distortion risk measure. We allow for each agent to have heterogeneous beliefs, represented by different probability measures
$\mathbb{Q}_i$
on the same measurable space. Specifically, we assume that for each
$i\in\mathcal{N}$
, there exists a set of distortion functions
$\mathcal{T}_i$
such that

for all risks
$Z\in\mathcal{X}$
. By Lemma 3.2, we may assume without loss of generality that each
$\mathcal{T}_i$
is convex and closed with respect to pointwise convergence.
Note that DRMs are translation invariant. For translation-invariant risk measures, a characterization of Pareto-optimal allocations is given by Proposition 2.6. However, in the case of robust DRMs, a more explicit characterization of the set
$\mathcal{S}$
is possible, as shown by the following result.
Theorem 3.3. Suppose that for each
$i\in\mathcal{N}$
, the risk measure
$\rho_i$
is given by a robust distortion risk measure as in (3.1). Then the following hold:
-
(i) There exists a solution to the problem
(3.2)\begin{equation} \sup_{\{T_i\}_{i=1}^n\,\in\,\prod_{i=1}^n\mathcal{T}_i}\,\int_0^\infty\min_{i\in\mathcal{N}}\,\{T_i(\mathbb{Q}_i(S\unicode{x003E}x))\}\,dx . \end{equation}
-
(ii) A necessary condition for the allocation
$(\{Y_i^*\}_{i=1}^n,\{\pi_i^*\}_{i=1}^n)$ to be Pareto optimal is that
\[ Y_i^*+\pi^*_i=g_i^*(S)+c_i^*, \]
$\{c_i^*\}_{i=1}^n\in\mathbb{R}^n$ is chosen such that
$\sum_{i=1}^nc_i^*=0$ and
$\{g_i^*(S)+c_i^*\}_{i=1}^n\in\mathcal{IR}$ , and
$\{g_i^*\}_{i=1}^n\in\mathcal{G}$ can be written in terms of the integrals of suitable functions
$h_i$ . Specifically, for each
$i\in\mathcal{N}$ , we can write
$g_i^*(x)=\int_0^xh_i(z)\,dz$ , where each
$h_i\,:\,\mathbb{R}_+\to[0,1]$ is a function such that for almost every
$x\in\mathbb{R}_+$ ,
\[ \sum_{i\in L_x}h_i(x)=1\mbox{ and }\sum_{i\in L_x^C}h_i(x)=0, \]
\begin{align*} L_x&\,:\!=\, \left\{i\in\mathcal{N}\,:\, T_i^*(\mathbb{Q}_i(S\unicode{x003E}x))=\min_{j\in\mathcal{N}} \, \{T_j^*(\mathbb{Q}_j(S\unicode{x003E}x))\}\right\}, \\ L_x^C&=\mathcal{N}\setminus L_x, \end{align*}
$(T_1^*,\ldots,T_n^*)$ is a solution to (3.2).
In other words, Theorem 3.3 provides a necessary condition for an allocation to be Pareto-optimal allocation. This result is similar to the characterization obtained by Ghossoub and Zhu (Reference Ghossoub and Zhu2024a, Theorem 4.15). However, a key difference is that the present result of Theorem 3.3 does not require concavity of the distortion function (or equivalently, the convexity of the risk measure). The present result also holds under a setting of heterogeneous beliefs, as represented by the different probability measures
$\mathbb{Q}_i$
. These stronger conclusions can be obtained in the setting of this paper because the admissible allocations are constrained to be comonotone, which was not the case in Ghossoub and Zhu (Reference Ghossoub and Zhu2024a). The assumption that the risk-sharing market is comonotone is necessary for Theorem 3.3, since there does not necessarily exist a comonotone Pareto-optimal allocation when risk measures are not convex.
Theorem 3.3 can also be seen as an extension of previous characterizations of Pareto optima for DRMs: see Liu (Reference Liu2020, Theorem 3.3) and Ghossoub and Zhu (Reference Ghossoub and Zhu2024a, Corollary 5.3). Indeed, if the set
$\mathcal{T}_i$
is a singleton, then a robust distortion risk measure reduces to an ordinary distortion risk measure. The main technical difficulty introduced by robustness is that a solution
$(T_1^*,\ldots,T_n^*)$
to problem (3.2) must be obtained, and there is no closed-form expression for such a solution. Nonetheless, once
$(T_1^*,\ldots,T_n^*)$
is determined, the characterization for Pareto optima can be obtained explicitly in terms of
$(T_1^*,\ldots,T_n^*)$
.
In particular, Pareto-optimal allocations are obtained as translations of suitable functions of the aggregate risk. Part (ii) of Theorem 3.3 decomposes the optimal risk allocation for agent i into a constant
$c_i^*$
and a risky portion
$g_i^*(S)$
, which is normalized so that
$g_i^*(0)=0$
. Additionally, for any fixed
$\{g_i^*\}_{i=1}^n$
given by Theorem 3.3, it is always possible to choose
$\{c_i^*\}_{i=1}^n\in\mathbb{R}^n$
such that
$\{g_i^*(S)+c_i^*\}_{i=1}^n\in\mathcal{IR}$
. One such choice is given by

Then by translation invariance of each
$\rho_i$
, we have
$\rho_i(X_i)-\rho(g_i^*(S)+c_i^*)=0$
for
$i\in\{1,\ldots,n\}$
. Furthermore,

where the last inequality follows since
$\{g_i^*(S)+c_i^*\}_{i=1}^n\in\mathcal{S}$
and the no-risk-sharing arrangement
$\{X_i\}$
is individually rational.
More generally, the role of the constants
$c_i^*$
is to determine the distribution of the aggregate welfare gain from P2P insurance among the agents in the market. We define the aggregate welfare gain as the difference between the aggregate valuation of the individuals’ initial risks, given by
$\sum_{i=1}^n\rho_i^*(X_i)$
, and the aggregate valuation of the agents’ post-transfer risks, given by
$\sum_{i=1}^n\rho_i(g_i^*(S)+c_i^*)$
:

Note that
$\sum_{i=1}^n\rho_i(g_i^*(S)+c_i^*)$
is the value of problem (2.1), and therefore does not depend on the choice of the Pareto-optimal allocation
$\{g_i^*(S)+c_i^*\}_{i=1}^n$
. Hence, the aggregate welfare gain W is well-defined, and
$W\ge0$
. Now let
$\{w_i\}_{i=1}^n\in\mathbb{R}_n$
be such that
$\sum_{i=1}^nw_i=W$
and
$w_i\ge0$
for all
$i\in\mathcal{N}$
. Fix
$\{g_i^*\}_{i=1}^n$
satisfying Theorem 3.3 and define

By construction, we have
$\sum_{i=1}^nc_i^*=\left(\sum_{i=1}^n\rho_i(X_i)\right)-\left(\sum_{i=1}^n\rho(g_i^*(S))\right)-W=0$
. The allocation
$\{g_i^*(S)+c_i^*\}$
is the risk-sharing arrangement that provides a welfare gain of
$w_i$
to each agent i, in the sense that

Hence,
$\{g_i^*(S)+c_i^*\}\in\mathcal{IR}$
. Note that the above example (3.3) corresponds to the arrangement that allocates the entire welfare gain to agent n: that is,
$(w_1,\ldots,w_n)=(0,0,\ldots,W)$
.
3.2. Common parametric families of distortion functions
When each set of distortion functions
$\mathcal{T}_i$
is a singleton, there is no robustness in the risk measure of each agent, and we recover the simpler setting of P2P risk-sharing with DRMs. Suppose that
$\mathcal{T}_i=\{T_i\}$
for all
$i\in\mathcal{N}$
. As a special case of Theorem 3.3, we obtain a closed-form solution for Pareto optima that is known in the literature (e.g., Liu, Reference Liu2020, Theorem 3.3). This result is restated below.
Corollary 3.4. Suppose that for each
$i\in\mathcal{N}$
, the risk measure
$\rho_i$
is given by a distortion risk measure. Then an allocation
$(\{Y_i^*\}_{i=1}^n,\{\pi_i^*\}_{i=1}^n)$
is Pareto optimal if and only if

where
$\{c_i^*\}_{i=1}^n\in\mathbb{R}^n$
is chosen such that
$\sum_{i=1}^nc_i^*=0$
and
$\{g_i^*(S)+c_i^*\}_{i=1}^n\in\mathcal{IR}$
, and
$\{g_i^*\}_{i=1}^n\in\mathcal{G}$
can be written in terms of the integrals of suitable functions
$h_i$
. Specifically, for each
$i\in\mathcal{N}$
, we can write
$g_i^*(x)=\int_0^xh_i(z)\,dz$
, where each
$h_i \,:\,\mathbb{R}_+\to[0,1]$
is a function such that for almost every
$x\in\mathbb{R}_+$
,

where

Proof. We apply Theorem 3.3 in the special case where for
$i\in\mathcal{N}$
, each
$\mathcal{T}_i$
is given by the singleton set
$\{T_i\}$
. However, note that we obtain a stronger result by showing sufficiency of the characterization as well as necessity. The result follows from the same proof as that of Theorem 3.3, after applying the fact that problems (A4) and (A5) are identical problems in this special case.
In this subsection, we examine the consequences of Corollary 3.4 when agents’ preferences are determined by common parametric families of probability distortion functions. First, suppose that each
$T_i$
is an inverse S-shaped distortion function from a parametric family, and let
$\alpha_i$
denote the value of the parameter that determines
$T_i$
. It can be verified that certain values of the parameter
$\alpha_i$
are “more S-shaped,” which indicates a larger deviation from risk neutrality. To facilitate an economically meaningful comparison, we provide the following definition, which can be interpreted as a proxy for the level of risk aversion for a distortion function.
Definition 3.5 (Probabilistic Risk Aversion Index). For a given distortion
$T_i$
assumed to be twice differentiable, we define the Probabilistic Risk Aversion Index of
$T_i$
to be

This index is similar to the index of ambiguity aversion proposed by Carlier and Dana (Reference Carlier and Dana2008). Indeed, if
$T_2=g\circ T_1$
where g is a concave function, then it can be verified that
$PRA_2\ge PRA_1$
. Hence,
$T_2$
is more risk averse than
$T_1$
, which is consistent with the notion that the risk aversion of DRMs is due to concavity of the distortion function (see Yaari Reference Yaari1987 for more on this interpretation). Alternatively, we may use an index of relative probabilistic risk aversion, as defined below.
Definition 3.6 (Relative Probabilistic Risk Aversion Index). For a given distortion
$T_i$
assumed to be twice differentiable, we define the Relative Probabilistic Risk Aversion Index of
$T_i$
to be

One example of inverse S-shaped distortion functions is the parametric family of Prelec-1 distortion functions introduced by Prelec (Reference Prelec1998), who showed that these distortion functions are inverse-S shaped with a unique inflection point at
$(e^{-1},e^{-1})$
. These distortion functions are of the form

where
$\alpha_i\in(0,1)$
.
In the following, suppose that each
$\rho_i$
is a distortion risk measure with respect to a Prelec-1 distortion function with parameter
$\alpha_i$
. Let
$j,k\in\mathcal{N}$
such that
$\alpha_j=\max_{i\in\mathcal{N}}\alpha_i$
and
$\alpha_k=\min_{i\in\mathcal{N}}\alpha_i$
.
Proposition 3.7. For
$t\in(0,e^{-1})$
, the probabilistic risk aversion index
$PRA_i(t)$
is larger for smaller values of the parameter
$\alpha_i$
. In particular,

This result can be interpreted in the following manner. Recall that in the context of DRMs, each distortion function
$T_i$
is applied to the survival function
$\mathbb{P}(S\unicode{x003E}x)$
. Hence, the argument of the distortion
$t\in(0,e^{-1})$
represents the probability of a tail event. Proposition 3.7 then implies that agent k is most averse to tail events, provided that the probability of this tail event is no more than
$e^{-1}\approx0.3679$
.
Proposition 3.8. Suppose each
$\rho_i$
is distortion risk measure given by a Prelec-1 distortion function with parameter
$\alpha_i$
, and let
$\alpha_j=\max_{i\in\mathcal{N}}\alpha_i$
and
$\alpha_k=\min_{i\in\mathcal{N}}\alpha_i$
. Then a Pareto-optimal allocation
$(\{Y_i^*\}_{i=1}^n,\{\pi_i^*\}_{i=1}^n)$
is given by

where
$\{c_i\}_{i=1}^n\in\mathbb{R}^n$
is chosen so that the allocation is
$\mathcal{IR}$
, and

where
$d^*=\mathrm{VaR}_{e^{-1}}(S)$
.
Consequently, when all agents use a Prelec-1 distortion function, a possible PO allocation is a translate of the contract where agent k provides full coverage of the aggregate risk up to a limit
$d^*$
, and the excess aggregate loss beyond the level
$d^*$
is covered by agent j. By Proposition 3.7, agent k exhibits the most risk aversion toward tail events. This is reflected by the form of Pareto optima in this case, since agent k does not cover losses beyond the deductible. Instead, the tail risk is assumed by agent j, who exhibits the least risk aversion toward tail events.
Prelec (Reference Prelec1998) also defines a two-parameter family of distortion functions (Prelec-2 distortion functions), by

where
$\alpha_i\in(0,1)$
and
$\beta_i\unicode{x003E}0$
. It can be verified that these distortion functions have an inflection point at
$(e^{-1},e^{-\beta_i})$
. In this case, we have a result similar to that of Proposition 3.7.
Proposition 3.9. If each
$\rho_i$
is a distortion risk measure given by a Prelec-2 distortion function with parameters
$\alpha_i,\beta_i$
, then we have the following:
-
(i) For a fixed
$\alpha_i$ and for all
$t\in(0,1)$ , the probabilistic risk aversion index
$PRA_i(t)$ is larger for smaller values of the parameter
$\beta_i$ .
-
(ii) For a fixed
$\beta_i$ and for all
$t\in(0,\varepsilon)$ where
$\varepsilon\unicode{x003E}0$ , the probabilistic risk aversion index
$PRA_i(t)$ is larger for smaller values of the parameter
$\alpha_i$ .
Hence, if there exists an agent k such that
$\alpha_k=\min_{i\in\mathcal{N}}\alpha_i$
and
$\beta_k=\min_{i\in\mathcal{N}}\beta_i$
, then this agent exhibits the most risk aversion to tail events. A similar result to that of Proposition 3.8 is given below.
Proposition 3.10. Suppose each
$\rho_i$
is distortion risk measure given by a Prelec-2 distortion function with parameters
$\alpha_i$
and
$\beta_i$
.
-
(i) Suppose that
$\beta_1=\beta_2=\ldots=\beta_n\,:\!=\, \beta$ . Let
$\alpha_j=\max_{i\in\mathcal{N}}\alpha_i$ and
$\alpha_k=\min_{i\in\mathcal{N}}\alpha_i$ . Then a Pareto-optimal allocation
$(\{Y_i^*\}_{i=1}^n,\{\pi_i^*\}_{i=1}^n)$ is given by
\[Y_i^*+\pi_i^*=g_i^*(S)+c_i^*, \]
$\{c_i\}_{i=1}^n\in\mathbb{R}^n$ is chosen so that the allocation is
$\mathcal{IR}$ , and
\begin{align*} g_k^*(x)&=\min\{x,d^*\}, \\ g_j^*(x)&=\max\{x-d^*,0\}, \\ g_i^*(x)&=0,\mbox{ for }i\in\mathcal{N}\setminus\{j,k\} . \end{align*}
$d^*=\mathrm{VaR}_{e^{-1}}(S)$ .
-
(ii) Suppose that
$\alpha_1=\alpha_2=\ldots=\alpha_n\,:\!=\, \alpha$ . Let
$\beta_j=\max_{i\in\mathcal{N}}\beta_i$ . Then a Pareto-optimal allocation
$(\{Y_i^*\}_{i=1}^n,\{\pi_i^*\}_{i=1}^n)$ is given by
\[Y_i^*+\pi_i^*=g_i^*(S)+c_i^*, \]
$\{c_i\}_{i=1}^n\in\mathbb{R}^n$ is chosen so that the allocation is
$\mathcal{IR}$ , and
\begin{align*} g_j^*(x)&=x, \\ g_i^*(x)&=0,\mbox{ for }i\in\mathcal{N}\setminus\{j\} . \end{align*}
Part (i) of Proposition 3.10 is a similar result to that of Proposition 3.8 in the case where every agent has the same second parameter. That is, the agent with the largest aversion to tail events provides coverage up to a deductible. Part (ii) states that if every agent uses the same first parameter
$\alpha$
, then the agent with the least aversion to tail events provides full insurance.
Similar results can be obtained for the family of S-shaped distortion functions first introduced by Kahneman and Tversky (Reference Kahneman and Tversky1979), Tversky and Kahneman (Reference Tversky and Kahneman1992):

where
$\gamma_i\in(0.279,1]$
. Let
$\gamma_j=\max_{i\in\mathcal{N}}\gamma_i$
and
$\gamma_k=\min_{i\in\mathcal{N}}\gamma_i$
.
Remark 3.11. It can be numerically verified that for any
$t\in(0,\varepsilon)$
for a sufficiently small
$\varepsilon\unicode{x003E}0$
, the probabilistic risk aversion index
$PRA_i(t)$
is larger for smaller values of the parameter
$\gamma_i$
. The probabilistic risk aversion index is plotted in Figure 1(b). In particular,

Again, since small values of t correspond to tail events, we see that agent k exhibits the most aversion to the risk of tail events, whereas agent j exhibits the least aversion.

Figure 1. Kahneman–Tversky inverse S-shaped distortion functions.
It can be shown using numerical methods that there exists
$t^*\in(0,1)$
for which

Distortion functions for different values of the parameter
$\gamma$
are shown in Figure 1(a). It is clear from the figure that the minimum of the distortion functions is obtained by either the agent with the largest parameter (agent j) or by the agent with the smallest parameter (agent k). Thus, by the same arguments as those of Proposition 3.8, it follows that a PO allocation of a similar form exists. That is, consider the contract such that agent k provides full coverage of the aggregate risk up to a limit
$d^*$
, and the excess aggregate loss beyond the level
$d^*$
is covered by agent j. Then a suitable translate of this contract is PO.
4. Centralized versus decentralized insurance for flood risk
As an application of our explicit characterization of Pareto-optimal risk-sharing contracts, we reexamine the problem of flood risk insurance in the United States. This setting has been examined in detail by Boonen et al. (Reference Boonen, Chong and Ghossoub2024) and Ghossoub and Zhu (Reference Ghossoub and Zhu2024b), who consider a market where each State may insure their flood risk with the federal government. We refer to this structure as a centralized insurance market, namely, one where insurance is only provided by a limited set of entities in the market.
In contrast, the main result of the present paper characterizes Pareto optima in a decentralized insurance market or P2P insurance market. This market structure allows for agents to ensure each other’s risk through participation in a risk-sharing pool, which is not possible in a centralized insurance market. In the context of flood risk insurance, a decentralized insurance market is one where individual agents (communities, municipalities, or state-level agencies) agree to combine their flood risk exposures, and each covers a portion of the aggregate risk exposure. For the purposes of this section, we assume that the individual agents are States that wish to pool state-level flood risk. This is mainly for ease of comparison with the centralized setting. However, the conclusions of our analysis are also applicable to agents at a higher level of granularity (e.g., individual communities, etc.). Feng et al. (Reference Feng, Liu and Taylor2023) also applied their P2P risk-sharing results to flood risk insurance in the United States, but they considered a market where only quota-share risk-sharing rules (i.e., proportional insurance) are available.
The purpose of the numerical illustrations presented in this section is to study the differences between centralized and decentralized insurance markets for flood risk and to assess the economic implications thereof. For this reason, we use DRMs for each agent and the associated characterization result of Corollary 3.4, to facilitate a direct comparison with the previous work, which did not consider the effect of robustness. The effects of introducing robustness of DRMs are therefore not explored in the current study.
4.1. A centralized insurance market
First, we revisit the scenario of a centralized insurance market, in which a single monopolistic insurer is the sole provider of insurance. In the context of flood risk in the United States, this insurer is the National Flood Insurance Program (NFIP)Footnote 2 overseen by the Federal Emergency Management Agency (FEMA). Under the NFIP, the federal government of the United States is ultimately responsible for financial gains and losses.
A comprehensive numerical study of flood risk insurance in this centralized market is performed by Boonen et al. (Reference Boonen, Chong and Ghossoub2024), using a public dataset of claims statistics dating back to the year 1978. They find that by aggregating the flood risk across geographically distant regions, a centralized market is able to realize a significant welfare gain. In the following, we provide a similar but updated analysis of a centralized market. First, the public NFIP dataset has recently been updated to include the amount of financial loss for each flood event, instead of only including the claim amount. As a result, we no longer need to use the claim amount as a proxy for the loss amount. Additionally, Ghossoub and Zhu (Reference Ghossoub and Zhu2024b) are able to characterize Pareto optima without requiring that each agent uses a coherent risk measure. As a result, we are able to identify Pareto-optimal contracts under more general conditions. Finally, in our example, the risk measure of the central agent will be given by the expected shortfall (ES), which is the most popular coherent risk measure in practice. We recall the following standard definitions:
Definition 4.1. The value-at-risk (VaR) at level
$\alpha\in(0,1)$
of a random variable
$Z\in\mathcal{X}$
under the probability measure
$\mathbb{P}$
is

Definition 4.2. The ES at level
$\alpha\in(0,1)$
of a random variable
$Z\in\mathcal{X}$
under the probability measure
$\mathbb{P}$
is

In the context of centralized insurance markets with n agents and a central insurer, an insurance contract is a pair
$\left(\{I_i\}_{i=1}^n,\{\pi_i\}_{i=1}^n\right)\in\mathcal{I}^n\times\mathbb{R}^n$
, where
$\mathcal{I}$
denotes an admissible set of indemnity functions (usually monotone and 1-Lipschitz). Here,
$I_i$
represents the amount of coverage (i.e., the indemnity) that agent i receives from the central insurer, in exchange for a premium of the amount
$\pi_i$
. In other words, agent i cedes the risk
$I_i(X_i)$
to the central insurer. We denote the retained risk of agent i by
$R_i(X_i)\,:\!=\, X_i-I_i(X_i)$
.
The concepts of individual rationality and Pareto optimality for centralized markets are defined similarly to IR and PO in decentralized markets (see Definition 2.4). In particular, IR allocations for centralized markets also incentivize the central insurer to participate. A characterization of Pareto- optimal contracts in a centralized market is provided by the following result, which extends the result of Ghossoub and Zhu (Reference Ghossoub and Zhu2024b, Corollary 3.7) to the case of robust DRMs.
Proposition 4.3 (Pareto Optima in a Centralized Market). Suppose that the insurer’s risk measure is given by an expected shortfall
$\mathrm{ES}_\alpha^{\mathbb{P}}$
, and that each policyholder
$i\in\mathcal{N}$
uses a robust distortion risk measure
$\rho_i$
with respect to the closed convex set
$\mathcal{T}_i$
. Suppose further that
$\mathcal{T}_i$
is compact with respect to the topology of pointwise convergence. Then the following hold:
-
(1) There exists a solution to the problem
(4.1)where\begin{equation} \sup_{\mathbb{Q}\in\mathcal{Q}}\,\, \sup_{\{T_i\}_{i=1}^n\,\in\,\prod_{i=1}^n\mathcal{T}_i}\,\, \left\{ \sum_{i=1}^n\int_0^\infty\min\{T_i(\mathbb{Q}_i(X_i\unicode{x003E}t)),\mathbb{Q}(X_i\unicode{x003E}t)\}\,dt \right\}, \end{equation}
\[ \mathcal{Q}\,:\!=\, \left\{\mathbb{Q}\mbox{ a probability measure}\,:\, \mathbb{Q}\unicode{x003C}\unicode{x003C}\mathbb{P},\,\frac{d\mathbb{Q}}{d\mathbb{P}}\le\frac{1}{\alpha}\right\} . \]
-
(2) A necessary condition for the contract
$\left(\{I_i^*\}_{i=1}^n,\{\pi_i^*\}_{i=1}^n\right)$ to be Pareto optimal is that for every
$\mathbb{Q}^*,T_1^*,\ldots,T_n^*$ solving (4.1), we have
\begin{align*} (I_i^*)'(t) = \begin{cases} 1,&\text{ if}\quad\mathbb{Q}^*(X_i \unicode{x003E} t) \unicode{x003C} T_i^*(\mathbb{Q}_i(X_i \unicode{x003E} t)), \\ h_i(t),&\text{ if}\quad\mathbb{Q}^*(X_i \unicode{x003E} t)=T_i^*(\mathbb{Q}_i(X_i \unicode{x003E} t)), \\ 0, & \text{ if}\quad\mathbb{Q}^*(X_i \unicode{x003E} t) \unicode{x003E} T_i^*(\mathbb{Q}_i(X_i \unicode{x003E} t)), \\ \end{cases} \end{align*}
$h_i(t)$ is a [0,1]-valued measurable function, and
$\{\pi_i^*\}_{i=1}^n$ are chosen so that
$\left(\{I_i^*\}_{i=1}^n,\{\pi_i^*\}_{i=1}^n\right)$ is individually rational.
Remark 4.4. Compared to Theorem 3.3, the statement of Proposition 4.3 requires the additional assumption of compactness of each
$\mathcal{T}_i$
. One scenario where compactness is automatically satisfied is when
$\mathcal{T}_i$
is the convex hull of finitely many distortion functions (e.g., Aliprantis and Border, Reference Aliprantis and Border2006, Corollary 5.30).
For ease of comparison with Boonen et al. (Reference Boonen, Chong and Ghossoub2024) and Ghossoub and Zhu (Reference Ghossoub and Zhu2024b), we will adopt their model parameters for the remainder of this section. Specifically, we will assume that the distribution of the financial losses arising from floods in a given month adheres to the historical data. Specifically, we assume a discrete probability space
$(\Omega,\mathcal{F},\mathbb{P})$
, where
$\Omega$
has 553 states, corresponding to each of the months from January 1978 to January 2024. The measure
$\mathbb{P}$
assigns an equal probability to each of the states in
$\Omega$
. The risk measures used by each agent are DRMs on this probability space. That is, for all
$Z\in\mathcal{X}$
and
$i\in\{1,\ldots,n\}$
,

where each
$T_i$
is a distortion function. Let
$\nu_i \,:\!=\, T_i\circ\mathbb{P}$
.
We now apply the result of Proposition 4.3 to obtain Pareto-optimal contracts in the centralized insurance market. We consider a market with three agents: the States of California (
$i=1$
), New York (
$i=2$
), and Texas (
$i=3$
). The correlation between the monthly losses of these States is given by the matrix

which indicates very little correlation. Some summary statistics for the monthly losses for these States are shown in Table 1, in which all values are dollar amounts. We see that the financial losses are right-skewed and that it is possible to experience losses far larger than the average.
Table 1. Summary statistics for monthly losses due to floods.

We assume that the distortion functions
$T_1$
and
$T_2$
are both inverse S-shaped distortion functions, as in Kahneman and Tversky (Reference Kahneman and Tversky1979), Tversky and Kahneman (Reference Tversky and Kahneman1992). Specifically, for
$i=1,2$
, we take

with
$\gamma_1=0.4$
and
$\gamma_2=0.5$
. A parameter value of approximately
$0.5$
has recently been estimated by Rieger et al. (Reference Rieger, Wang and Hens2017) to be descriptive of actual behavior. We assume a power distortion for the remaining agent. Specifically,

with
$\gamma_3=0.4$
. Finally, we assume that the central insurer uses the Expected Shortfall with parameter
$\alpha=15\%$
. While this parameter may seem large, this is to allow for the central insurer’s risk measure to be display enough risk tolerance that it will accept a significant portion of risk from the agents in the market. If this parameter is too low, then the central insurer would be too conservative and avoid taking on any flood risk, leading to a market where no insurance is possible. Indeed, it can be verified that under these parameters, no insurance is provided if
$\alpha\approx2.5\%$
.
The retained risk of each agent in a Pareto-optimal contract is shown in Figure 2. Note that in this figure and subsequent figures, the amount of the premium is not shown, for visual clarity and to facilitate easier comparison between plots. In other words, each plot has been normalized so that an experienced loss of
$\$0$
corresponds to a retention of
$\$0$
as well. We see that in each State prefers to cede a portion of their tail risk to the central insurer, while choosing to retain some of the smaller financial losses. This suggests that a deductible contract is best in this situation, with losses beyond the deductible fully covered by the insurer. The value of this deductible is approximately
$1.9047\times10^7$
for California,
$4.0645\times10^7$
for New York, and
$5.1287\times10^7$
for Texas.

Figure 2. Centralized Pareto-optimal retention, CA/NY/TX.
As a measure of the effectiveness of this contract, we calculate the aggregate welfare gain that results from this arrangement. The welfare gain for each agent i is the difference between their evaluation of the initial risk and their evaluation of the post-contract position:

On the other hand, the welfare gain for the central insurer is the sum of premia received less their evaluation of their assumed risk:

The aggregate welfare gain is defined to be the sum of the welfare gain of each agent in the market, including the central insurer. Under our assumptions, the aggregate welfare gain is maximized in a Pareto-optimal allocation (see, e.g., Boonen et al. Reference Boonen, Chong and Ghossoub2024, Theorem 2.1). In the example above, the maximum aggregate welfare gain achievable through centralized insurance is
$7.9785\times10^8$
. The average gain per agent is in this example is
$1.9946\times10^8$
. Note that including the centralized insurer, there are four agents in the market in this example.
However, while a significant welfare gain is possible in a centralized market, Ghossoub and Zhu (Reference Ghossoub and Zhu2024b) warn that a monopolistic insurer with a first-mover’s advantage can absorb the entirety of the welfare gain by increasing premium prices. In particular, the central insurer has an incentive to increase prices and ultimately leave each policyholder with no welfare gain at an equilibrium. The amount of premium paid by each State in a Stackelberg equilibrium in our example is shown in Table 2. These amounts are extremely high, even exceeding the value-at-risk at the 5% level. In this case, the entire aggregate welfare gain of
$7.9785\times10^8$
is allocated only to the central insurer.
Table 2. Premium paid in a stackelberg equilibrium, CA/NY/TX.

4.2. Peer-to-peer risk sharing
As a partial solution to the concerns arising from the Stackelberg setting, the present paper proposes a P2P risk-sharing scheme among the agents themselves without the presence of a central agency. In the context of flood risk, this is the scenario where the individual State governments of California, New York, and Texas choose to pool their flood risk and each assume a portion of the aggregate financial loss.
Applying the result of Theorem 3.3 to the present example yields the Pareto-optimal allocation shown in Figure 3. The States of California and New York end up with all of the variability of the risk, since the risk measure of Texas is more risk averse under our chosen parameters. However, this does not mean that Texas is receiving free insurance, since each State must pay a premium for participation in the risk-sharing pool. Both Figures 3 and 2 show that Texas prefers retaining less risk than the other agents.

Figure 3. Decentralized Pareto-optimal distribution, CA/NY/TX.

Figure 4. Retained Loss, CA/NY/TX.
A comparison between the centralized and decentralized insurance schemes is shown in Figure 4, which also includes the scenario of no insurance. The vertical axis represents the retained monthly loss in dollars, and the horizontal axis represents the 553 states of the world
$\omega\in\Omega$
, which are sorted by the value of
$S(\omega)$
, the realized aggregate loss in state
$\omega \in \Omega$
. For ease of comparison, the plots have been normalized so that each agent is indifferent between each retention structure. In the case of centralized insurance, we see that each agent pays a premium to insure their risk beyond a deductible. This results in significantly less variance in retained risk, at a cost of a monthly premium. On the other hand, the structure of the Pareto-optimal risk-sharing arrangement is different for each agent. The plots show that California’s retention is similar to that under centralized insurance, in the sense that variation is reduced and extreme losses are insured. However, extreme losses are transferred to New York, who accepts a payment of a premium as compensation. Since there is no exogenous insurer in the decentralized market, all flood risk must be retained among the three agents in the market. The plot in Figure 4 shows that New York is the agent that accepts the extreme tail risk in this example. Finally, Texas elects to pay a premium to transfer all the flood risk to the other two agents, and so its retention structure is a flat horizontal line. This suggests that Texas is the most risk-averse in this scenario.
By Theorem 3.3, both the aggregate welfare gain and the average welfare gain are maximized by Pareto-optimal allocations. Hence, evaluating the effectiveness of the P2P risk-sharing scheme is possible by using the average welfare gain as a metric. Recall that the welfare gain for agent i is defined as the improvement in the risk-sharing scheme over the status quo:

The average welfare gain for the P2P risk-sharing scheme is the arithmetic average of the welfare gains of the three agents in the market. In this example, the average welfare gain is
$1.9512\times10^8$
. While this is a significant gain, this is not quite as large as the average gain of
$1.9946\times10^8$
that is possible from a centralized insurance market with the monopolistic insurer. Hence, we see the P2P risk-sharing scheme as a compromise compared to the centralized structure. While a larger welfare gain is ultimately possible in the centralized setting, this carries the risk that the welfare gain is absorbed by only the monopolistic insurer through aggressive pricing schemes; again, see Ghossoub and Zhu (Reference Ghossoub and Zhu2024b). On the other hand, the decentralized market admits no such risk, since the welfare gain is necessarily distributed among the agents participating in the risk-sharing pool. However, the trade-off is that the maximum possible average welfare gain is potentially lower in the P2P market.
In Table 3, we show the average welfare gain per agent in both centralized and decentralized markets for varying values of the parameter
$\gamma_3$
in the distortion function of the third agent
$T_3$
. It can be verified that power distortion functions have a constant relative index of probabilistic risk aversion. As
$\gamma_3$
increases, the RPRA of the third agent decreases, and the hence the third agent eventually prefers to retain more of the flood risk in the market. In the extreme case, we see that the P2P market can actually dominate that of the centralized market in terms of possible welfare gain, since the decentralized scheme allows for the third agent to insure others’ risk whereas the centralized scheme does not. Hence, it is possible for the decentralized market’s average welfare gain to exceed that of the centralized market. However, in practical settings, we would expect that the policyholders are more risk averse than the central authority, which would lead to an advantage for the centralized insurance market in general.
Table 3. Average welfare gain with varying parameter
$\gamma_3$
.

5. Conclusion
In this paper, we obtain a novel characterization of Pareto-optimal allocations in risk-sharing markets when agents use robust DRMs. As a special case of our result, we recover the characterization of optima under DRMs, and we provide some general results for risk-sharing using common families of distortion functions. We find that the form of allocations depends primarily on the risk attitude of each agent to tail events, which we quantify through the probabilistic risk aversion index.
As an application of our results, we reexamine the setting of flood risk in the United States from the perspective of a decentralized market. While this market currently operates under a centralized structure with the federal government as the central authority, this comes at a risk associated with Stackelberg equilibria, namely that the central insurer has an incentive to raise prices and eliminate any welfare gain for the policyholders. We therefore argue that the decentralized structure is an alternative that avoids the Stackelberg situation, and we characterize optimal decentralized allocations using historical data. Our results show that there is less welfare gain possible in the decentralized market compared to the centralized one, which we interpret as a necessary compromise.
Acknowledgments
Mario Ghossoub acknowledges financial support from the Natural Sciences and Engineering Research Council of Canada (NSERC Grant No. 2024-03744). Michael B. Zhu acknowledges financial support from the Society of Actuaries through the Hickman Scholars Program.
Appendix A. Proofs of Main Results
A.1. Proof of Lemma 2.5
Let
$(\{Y_i\}_{i=1}^n,\{\pi_i\}_{i=1}^n)$
be a comonotone allocation. By Remark 2.2, for each
$i\in\mathcal{N}$
, there exists an increasing 1-Lipschitz function
$f_i\,:\,\mathbb{R}\to\mathbb{R}$
such that
$f_i(S)=Y_i+\pi_i$
. Define the function
$g_i$
by

Then
$g_i(0)=0$
, and
$g_i$
is non-decreasing and 1-Lipschitz, which implies that
$g_i$
is non-negative when its domain is restricted to
$\mathbb{R}_+$
. Since
$f_i(S)$
is an allocation, we have A

Substituting 0 for S in (A1) gives

which implies that

and so
$\{g_i\}_{i=1}^n\in\mathcal{G}$
. Substituting this into (A1) gives
$\sum_{i=1}^nf_i(0)=0$
. Hence, defining
$c_i\,:\!=\, f_i(0)$
gives the desired result.
A.2. Proof of Proposition 2.6
Note that
$\mathcal{S}\subseteq\mathcal{PO}$
can be easily proved by contradiction. To show the reverse inclusion, assume, by way of contradiction, that there exist
$\left(\left\{Y^*_i\right\}_{i=1}^{n},\left\{\pi^*_i\right\}_{i=1}^{n}\right)\in\mathcal{PO}$
such that
$\left(\left\{Y^*_i\right\}_{i=1}^{n},\left\{\pi^*_i\right\}_{i=1}^{n}\right)\notin\mathcal{S}$
. Then, there exist
$\left(\left\{\widetilde{Y}_i\right\}_{i=1}^{n},\left\{\widetilde{\pi}_i\right\}_{i=1}^{n}\right)\in\mathcal{IR}$
such that

Define
$\mathcal{N}_1,\mathcal{N}_2,\mathcal{N}_3\subseteq\mathcal{N}$
such that,

Note that
$\mathcal{N}_1,\mathcal{N}_2,\mathcal{N}_3$
is a partition of
$\mathcal{N}$
; also, by (A2),
$\mathcal{N}_3\neq\varnothing$
.
By assumption,
$\left(\left\{Y^*_i\right\}_{i=1}^{n},\left\{\pi^*_i\right\}_{i=1}^{n}\right)\in\mathcal{PO}$
, which implies
$\mathcal{N}_1\neq\varnothing$
since
$\mathcal{N}_3\neq\varnothing$
. Define, for
$i\in\mathcal{N}_1$
,

Then, by (A2), there exist
$\left\{\varepsilon_i\right\}_{i\in\mathcal{N}_3}$
such that, (i)
$\varepsilon_i\geq 0$
, for
$i\in\mathcal{N}_3$
, (ii)
$\rho_i\left(\widetilde{Y}_i+\widetilde{\pi}_i+\varepsilon_i\right)\leq\rho_i\left(Y^*_i+\pi^*_i\right)$
, for
$i\in\mathcal{N}_3$
, with at least one strict inequality, and (iii)
$\sum_{i\in\mathcal{N}_3}\varepsilon_i=\sum_{i\in\mathcal{N}_1}\varepsilon_i$
. Define



Note that
$\left(\left\{\widetilde{Y}_i\right\}_{i=1}^{n},\left\{\hat{\pi}_i\right\}_{i=1}^{n}\right)\in\mathcal{A}$
; indeed,

Moreover,
$\left(\left\{\widetilde{Y}_i\right\}_{i=1}^{n},\left\{\hat{\pi}_i\right\}_{i=1}^{n}\right)\in\mathcal{IR}$
, since



in which (A3) has at least one strict inequality, which implies
$\left(\left\{Y^*_i\right\}_{i=1}^{n},\left\{\pi^*_i\right\}_{i=1}^{n}\right)\not\in\mathcal{PO}$
, a contradiction.
A.3. Proof of Lemma 3.2
Since the Choquet integral is linear in T, we have that

where
$\mathrm{co}(\mathcal{T})$
denotes the convex hull of
$\mathcal{T}$
. It suffices to show that the Choquet integral is continuous with respect to pointwise convergence of T. To this end, let
$\{T^{(k)}\}_{k=1}^\infty$
be a sequence of continuous functions converging pointwise to T. Then we have

where we may apply the dominated convergence theorem since

A.4. Proof of Theorem 3.3
(i) Since S is essentially bounded by
$M\,:\!=\, ||S||_{\infty}\unicode{x003C}\infty$
, we have

Let
$\{(T_1^{(k)},\ldots,T_n^{(k)}\}_{k=1}^\infty$
be a sequence such that

Then by repeatedly applying Helly’s compactness theorem (e.g., Doob Reference Doob1994, pp. 165–166), there exists a subsequence
$\{(T_1^{(k_j)},\ldots,T_n^{(k_j)}\}_{j=1}^\infty$
that converges pointwise to a limit
$(T_1^*,\ldots,T_n^*)\in\prod_{i=1}^n\mathcal{T}_i$
. By the proof of Lemma 3.2, the objective of (3.2) is continuous with respect to pointwise convergence of the distortion functions. Hence,

which implies that
$(T_1^*,\ldots,T_n^*)$
solves (3.2).
(ii) Let
$(\{Y_i\}_{i=1}^n,\{\pi_i\}_{i=1}^n)\in\mathcal{A}$
. By Lemma 2.5, this allocation can be written in terms of functions
$\{g_i\}_{i=1}^n\in\mathcal{G}$
and constants
$\{c_i\}_{i=1}^n\in\mathbb{R}^n$
where
$\sum_{i=1}^nc_i=0$
. Conversely, if
$\{g_i\}_{i=1}^n\in\mathcal{G}$
and
$\{c_i\}_{i=1}^n\in\mathbb{R}^n$
, then
$(\{g_i(S)\}_{i=1}^n,\{c_i\}_{i=1}^n)\in\mathcal{A}$
. Therefore,

where we write
$(\{g_i\}_{i=1}^n,\{c_i\}_{i=1}^n)\in\mathcal{IR}$
when the allocation
$(\{g_i(S)\}_{i=1}^n,\{c_i\}_{i=1}^n)\in\mathcal{IR}$
. This problem is solved by
$(\{g_i^*\}_{i=1}^n,\{c_i^*\}_{i=1}^n)\in\mathcal{G}\times\mathbb{R}^n$
if and only if
$\{g_i^*\}_{i=1}^n$
solves

and the constants
$c_i^*$
are chosen such that
$(\{g_i^*(S)\}_{i=1}^n,\{c_i^*\}_{i=1}^n)\in\mathcal{IR}$
. It remains to show that solutions to (A4) are of the given form.
Since each
$\rho_i$
is a robust distortion risk measure, (A4) can be written as

We now verify that the minimax equality holds for this problem. To this end, let
$\mathcal{C}([0,M])$
denote the set of continuous functions on [0, M], which is a Banach space under the supremum norm. Let
$\mathcal{D}\,:\!=\, \mathbb{R}^{[0,1]}$
denote the space of functions from
$[0,1]\to\mathbb{R}$
, which is a topological vector space under the topology of pointwise convergence. With the embeddings
$\mathcal{G}\subseteq\mathcal{C}([0,M])^n$
and
$T_i\subseteq\mathcal{D}$
, the objective function above can be viewed as a function from
$\mathcal{C}([0,M])^n\times\mathcal{D}^n$
to
$\mathbb{R}$
.
Note that the sets
$\mathcal{G}$
and
$\mathcal{T}_i$
are closed and convex. Since
$\mathcal{G}$
is a closed subset of a product of 1-Lipschitz functions on the interval [0,M], it is compact by the Arzela-Ascoli Theorem (Dunford and Schwartz, Reference Dunford and Schwartz1958, IV.6.7). Furthermore, the objective function is linear in both
$g_i$
and
$T_i$
. Finally, the objective is continuous with respect to the topologies above, since the Choquet integral is continuous with respect to the supremum norm,Footnote 3 and it is also sequentially continuous with respect to pointwise convergence of
$T_i$
as shown by Lemma 3.2.
Therefore by Sion’s minimax theorem (Barbu and Precupanu, Reference Barbu and Precupanu2012, Theorem 2.132), the minimax equality holds. Exchanging the order of the infimum and supremum yields the problem

By standard results on minimax problems (e.g., Barbu and Precupanu Reference Barbu and Precupanu2012, Section 2.3), a necessary condition for
$\{g_i^*\}_{i=1}^n$
to solve (A4) is that for every
$(T_1^*,\ldots,T_n^*)$
solving (A5), we have

To complete the proof, it suffices to show that for every fixed choice of distortion functions
$(T_1,\ldots,T_n)$
, the functions
$\{g_i^*\}_{i=1}^n$
solve the problem

if and only if
$\{g_i^*\}_{i=1}^n$
are of the given form, and that the value of problem (A6) is

To this end, we first check that
$\{g_i^*\}_{i=1}^n\in\mathcal{G}$
. For all
$x\in\mathbb{R}_+$
, we have

Furthermore, since the derivative of
$g_i^*$
is nonnegative,
$g_i^*$
is increasing, and so
$\{g_i^*\}_{i=1}^n\in\mathcal{G}$
and
$g_i^*(S)$
is comonotone with S. Now let
$\{\widetilde{g}_i\}_{i=1}^n\in\mathcal{G}$
. We have

Therefore,
$\{g_i^*\}_{i=1}^n$
solves (A4).
To show the converse, let
$(\{Y_i\}_{i=1}^n,\{\pi_i\}_{i=1}^n)\in\mathcal{A}$
. By Lemma 2.5, we may write

for some
$\{\widetilde{g}_i\}_{i=1}^n\in\mathcal{G}$
and
$\{c_i\}_{i=1}^n\in\mathbb{R}^n$
such that
$\sum_{i=1}^nc_i=0$
. Suppose that
$\widetilde{g}_i$
are not of the specified form. Namely,
$\sum_{i\in L_x^C}\widetilde{g}_i'(x)\unicode{x003E}0$
on a set
$\mathcal{A}$
of positive measure. Then for every x in
$\mathcal{A}$
, we have

where the strict inequality follows because
$L_x^C$
is nonempty and
$\widetilde{g}_i'(x)$
are not all zero for
$i\in L_x^C$
. Taking the integral over the set
$\mathcal{A}$
of positive measure gives

Therefore in this case, the inequality (A7) is strict, and so
$\widetilde{g}_i(x)$
does not solve (A4).
A.5. Proof of Proposition 3.7
For a given parameter
$\alpha_i\in(0,1)$
, we have

Differentiating this with respect to
$\alpha_i$
gives

For
$t\in(0,e^{-1})$
, we have
$-\ln(t)\unicode{x003E}1$
. Therefore

Since
$t\ln(t)\unicode{x003C}0$
, this implies that

which yields the desired result.
A.6. Proof of Proposition 3.8
We first show that

Suppose first that
$\mathbb{P}(S\unicode{x003E}x)\le e^{-1}$
. Then we have

It follows that for all
$i\in\mathcal{N}$
,

and

which implies that
$j\in L_x$
.
The second case is similar. Suppose that
$\mathbb{P}(S\unicode{x003E}x)\unicode{x003E}e^{-1}$
. Then we have

from which it is straightforward to conclude that
$k\in L_x$
. Therefore, letting

satisfies the conditions of Theorem 3.3. The result follows by taking
$g_i^*(x)=\int_0^xh_i(z)\,dz$
, for all
$i\in\mathcal{N}$
.
A.7. Proof of Proposition 3.9
Similarly to the calculation in the proof of Proposition 3.7, we have

We see that
$\frac{d}{d\beta_i}\,PRA_i(t)\unicode{x003C}0$
for all
$t\in(0,1)$
, which shows (i). For (ii), if we take

then for
$t\in(0,\varepsilon)$
, we have

Therefore

Since
$t\ln(t)\unicode{x003C}0$
, this implies that

which yields the desired result.
A.8. Proof of Proposition 3.10
Similarly to the proof of Proposition 3.8, we show that

Suppose first that
$\mathbb{P}(S\unicode{x003E}x)\le e^{-1}$
. Then we have

It follows that for all
$i\in\mathcal{N}$
,
$({-}\ln(\mathbb{P}(S\unicode{x003E}x)))^{\alpha_j} \ge ({-}\ln(\mathbb{P}(S\unicode{x003E}x)))^{\alpha_i}$
, and so

thereby yielding

which implies that
$j\in L_x$
.
The second case is similar. Suppose that
$\mathbb{P}(S\unicode{x003E}x)\unicode{x003E}e^{-1}$
. Then we have

from which it is straightforward to conclude that
$k\in L_x$
. Applying Theorem 3.3 gives (i).
To show (ii), note that if
$\beta_j\ge\beta_i$
, then

Therefore
$j\in L_x$
for all
$x\in\mathbb{R}_+$
. The result then follows by applying Theorem 3.3.
A.9. Proof of Proposition 4.3
The proof of the result follows closely the methodology of Ghossoub and Zhu (Reference Ghossoub and Zhu2024b, Theorem 3.5). First, by Ghossoub and Zhu (Reference Ghossoub and Zhu2024b, Remark 2.9), it suffices to show that solutions to the problem

are necessarily of the form given in part (2) of Proposition 4.3. It is well known (e.g., McNeil et al. Reference McNeil, Frey and Embrechts2015, Theorem 8.14) that the expected shortfall satisfies

for all
$Z\in\mathcal{X}$
. Let
$\mathcal{Q}$
be the set of probability measures such that
$\mathbb{Q}\unicode{x003C}\unicode{x003C}\mathbb{P}$
and
$\frac{d\mathbb{Q}}{d\mathbb{P}}\le\frac{1}{\alpha}$
. Then we may rewrite the problem (A8) as

We now rewrite the above in terms of the marginal indemnification functions. Let
$M\unicode{x003C}\infty$
be an upper bound for every
$X_i$
, and define the set
$\mathcal{MI}\,:\!=\, \{h\in L^\infty([0,M])\,:\,||h||_\infty\le1,h\ge0\,a.e.\}$
. Then the above problem simplifies to

Note that the objective function is an affine transformation when viewed as a function of either
$\{h_i\}_{i=1}^n$
,
$\mathbb{Q}$
, or
$\{T_i\}_{i=1}^n$
. The objective is also continuous in
$\{h_i\}_{i=1}^n$
with respect to the norm topology on
$L^\infty([0,M])$
, continuous in
$\mathbb{Q}$
with respect to the weak topology on
$L^1(\Omega,\mathcal{F},\mathbb{P})$
, and continuous in
$\{T_i\}_{i=1}^n$
with respect to the topology of pointwise convergence (as shown in the proof of Lemma 3.2). Since
$\mathcal{Q}$
is a set of measures with bounded Radon-Nikodym derivative, it is weakly compact in
$L^1$
. By compactness of
$\mathcal{Q}$
and each
$\mathcal{T}_i$
, Sion’s minimax theorem may be applied, and the problem becomes:

It follows that the inner infimum is attained if and only if foe each
$i \in \mathcal{N}$
, the function
$h_i$
satisfies

By compactness, both suprema are attained. Hence,
$\{I_i^*\}_{i=1}^n$
solves (A8) only if for every
$\mathbb{Q}^*$
and every
$(T_1^*,\ldots,T_n^*)$
solving (A9), we have

Problem (A9) then simplifies to

which is precisely (4.1). Note that by continuity and compactness, a solution to (4.1), which proves (1). Part (2) follows from standard results on minimax problems using the argument above.