1. Introduction
This article focuses on fairness in binary voting protocols. As early as 1785, Condorcet [Reference Condorcet4] studied the principle of voting from a probabilistic point of view. One of his famous results is Condorcet’s jury theorem, which states the following. Let us suppose there is a large population of voters, and each of them independently votes ‘correctly’ with probability $p > 1/2$ . Then the probability that the outcome of a majority vote is ‘correct’ grows with the sample size and converges to one. In many applications, for instance, distributed computing, it is not feasible for every node to query every other participant, and a centralized entity that collects the votes of all participants and communicates the final result is not desired. Natural decentralized solutions with low message complexity are the so-called voting consensus protocols. Nodes query other nodes (only a sample of the entire population) about their current opinions and adjust their own opinion throughout several rounds based on the proportion of other opinions they have observed.
These protocols may achieve good performance in noiseless and undisturbed networks. However, their performances significantly decreases with noise [Reference Gács, Kurdyumov and Levin6, Reference Kar and Moura7] or errors [Reference Moreira, Mathur, Diermeier and Amaral10] and may completely fail in a Byzantine setting [Reference Capossele, Mueller and Penzkofer2]. Recently, [Reference Popov and Buchanan14] introduced a variant of the standard voting protocol, the so-called fast probabilistic consensus (FPC), that is robust in Byzantine environments. The performance of FPC was then studied using Monte Carlo simulations in [Reference Capossele, Mueller and Penzkofer2]. The above voting protocols are tailored for homogeneous networks where all votes have equal weight. In [Reference Müller, Penzkofer, Camargo and Saa11, Reference Müller12] FPC was generalized to heterogeneous settings. These studies also revealed that how votes are sampled does have a considerable impact on the quality of the protocol.
The aim is to choose a ‘representative’ sample from a given population. If we allow elements to occur multiple times in the sample, there are three different ways of sampling:
-
1. Choose with replacement until one has $m\in \mathbb{N}$ (not necessarily distinct) elements.
-
2. Choose with replacement until one has $k\in \mathbb{N}$ distinct elements.
-
3. Choose without replacement until one has $k=m$ distinct elements.
The first method is usually referred to as sampling with replacement. Although in the 1950s, e.g., in [Reference Raj and Khamis16], the second way was called sampling without replacement, sampling without replacement nowadays usually refers to the third possibility. To avoid any confusion, in this paper we call the second possibility ‘greedy sampling’.
Most voting protocols assume that every participant has the same weight. In heterogeneous situations, this does not reflect possible differences in weight or influence of the participants. An essential way in which weights improve voting protocols is by ensuring that the voting protocol is fair in the sense that the influence of a node on another node’s opinion is proportional to its weight. This fairness is an essential feature of a voting protocol both for technical reasons (e.g., defense against Sybil attacks) and social reasons (e.g., participants may decide to leave the network if the voting protocol is unfair). Moreover, an unfair situation may incentivize participants to split their weight among several participants or increase their weight by pooling with other participants. These incentives may lead to undesired effects such as fragility against Sybil attacks and centralization.
The construction of fair voting consensus protocols with weights was recently discussed in [Reference Müller, Penzkofer, Camargo and Saa11, Reference Müller12]. We consider a network with N nodes (or participants), identified with a finite set $\mathcal N$ . The weights of the nodes are described by $(m_i)_{i \in \mathcal N}$ with $\sum_{i \in \mathcal N} m_i = 1$ , $m_i \ge 0$ being the weight of the node i. Every node i has an initial state or opinion $s_{i}\in \{0, 1\}$ . Then, at each (discrete) time step, each node chooses $k \in \mathbb{N}$ random nodes from the network and queries their opinions. This sampling can be done in one of the three ways described above. For instance, [Reference Müller, Penzkofer, Camargo and Saa11] studied fairness in the case of sampling with replacement. The mathematical treatment of this case is the easiest of the three possibilities. However, simulations in [Reference Müller12] strongly suggest that the performance of some consensus protocols are considerably better in the case of greedy sampling. The third method, sampling without replacement, is not explicitly mentioned in [Reference Müller, Penzkofer, Camargo and Saa11, Reference Müller12]. A possible explanation is that in the case of heterogeneous weight distributions, the consensus protocol is not as robust towards differences in the perception of the weights. Nevertheless, it might be interesting to study this sampling scheme for fairness. The main object of our work is the mathematical analysis of weighted greedy sampling with respect to fairness.
The weights of the node may enter at two points during the voting: in sampling and in weighting the collected votes or opinions. We consider a first weighting function ${f}\,{:}\,{[0, \infty)}\to{[0, \infty)}$ that describes the weight of a node in the sampling. More precisely, a node i is chosen with probability
We call this function f the sampling weight function. A natural weight function is $f\equiv id$ ; a node is chosen proportional to its weight.
As discussed later in the paper, we are interested in how the weights influence the voting if the number of nodes in the network tends to infinity. Therefore, we often consider the situation with an infinite number of nodes. The weights of these nodes are again described by $(m_i)_{i \in \mathcal N}$ with $\sum_{i \in \mathcal N} m_i = 1$ . A network of N nodes is then described by setting $m_i = 0$ for all but N nodes.
Once a node has chosen k distinct elements, by greedy sampling, it calculates a weighted mean opinion of these nodes. Let us denote by $S_{i}$ the multi-set of the sample for a given node i. The mean opinion of the sample is
where ${g}\,{:}\,{[0, \infty)}\to{[0, \infty)}$ is a second weight function that we dub the averaging weight function. The pair (f, g) of the two weight functions is called a voting scheme.
In standard majority voting every node adjusts its opinion as follows: if $\eta_{i}<1/2$ it updates its own opinion $s_{i}$ to 0, and if $\eta_{i}>1/2$ it updates its opinion to 1. The case of a draw, $\eta_{i}=1/2$ , may be solved by randomization or by deterministically choosing one of the options. After the opinion update, every node re-samples; this procedure is continued until some stopping condition is verified. In general, the aim of such a protocol is for all nodes to ultimately agree on one opinion—in other words, to find consensus. As mentioned above, this kind of protocol works well in a non-faulty environment. However, if some nodes do not follow the rules, or even try to hinder the other nodes from reaching consensus, then the protocol may fail to yield consensus. In this case, one speaks of honest nodes, the nodes which follow the protocol, and malicious nodes, the nodes that try to interfere. An additional feature was introduced by [Reference Popov and Buchanan14] that makes this kind of consensus protocol robust to some given proportion of malicious nodes in the network.
Let us briefly explain this crucial feature. As in [Reference Capossele, Mueller and Penzkofer2, Reference Müller, Penzkofer, Camargo and Saa11, Reference Müller12] we consider a basic version of the FPC introduced in [Reference Popov and Buchanan14]. Let $U_{t}$ , $t=1, 2,\ldots$ , be independent and identically distributed random variables with law $\mathrm{Unif}( [\beta, 1-\beta])$ for some parameter $\beta \in [0,1/2]$ . Every node i has an opinion or state. We denote by $s_{i}(t)$ the opinion of the node i at time t. Opinions take values in $\{0,1\}$ . Every node i has an initial opinion $s_{i}(0)$ . The update rules for the opinion of a node i are then given by
for some $\tau \in [0,1]$ . For $t\geq 1$ ,
Note that if $\tau=\beta=0.5$ , FPC reduces to a standard majority consensus. It is important that the above sequence of random variables $U_t$ is the same for all nodes. The randomness of the threshold effectively reduces the capabilities of an attacker to control the opinions of honest nodes, and it also increases the rate of convergence in the case of honest nodes only. Since in this paper we focus our attention mainly on the construction and analysis of the voting schemes (f, g), we refer to [Reference Capossele, Mueller and Penzkofer2, Reference Müller, Penzkofer, Camargo and Saa11, Reference Müller12] for more details on FPC.
We concentrate mostly on the case $f \equiv id$ and $g \equiv 1$ . For the voting scheme with sampling with replacement, it was shown in [Reference Müller, Penzkofer, Camargo and Saa11, Theorem 1] that for $g \equiv 1$ , i.e., when the opinions of different nodes are not additionally weighted after the nodes are sampled, the voting scheme (f, g) is fair (see Definition 3) if and only if $f \equiv id$ . For $f \equiv id$ , the probability of sampling a node j satisfies $p_j = m_j$ , because we assumed that $\sum_{i \in \mathcal N}m_i = 1$ . In many places we use $m_j$ and $p_j$ interchangeably, with both variables referring simultaneously to the weight of the node j and the probability that the node j is sampled.
Our primary goal is to verify whether the voting scheme (id, 1) is fair in the case of greedy sampling. We show in Proposition 4 that the voting scheme (id, 1) is in general not fair. For this reason, we introduce the notion of asymptotic fairness; see Definition 5. Even though the definition of asymptotic fairness is very general, the best example to keep in mind is when the number of nodes grows to infinity. An important question related to the robustness of the protocol against Sybil attacks is whether the gain in influence on the voting obtained by splitting one node into ‘infinitely’ many nodes is limited.
We find a sufficient condition on the sequence of weight distributions $\{(m_i^{(n)})_{i \in \mathcal N}\}_{n \in \mathbb{N}}$ for asymptotic fairness; see Theorem 1. In particular, this ensures robustness against Sybil attacks for wide classes of weight distributions. However, we also note that there are situations that are not asymptotically fair; see Corollary 2 and Remark 5.
A key ingredient of our proof is a preliminary result on greedy sampling. This is a generalization of some of the results of [Reference Raj and Khamis16]. More precisely, we obtain a formula for the joint distribution of the random vector $(A_k^{}(i), v_k^{})$ . Here, the random variable $v_k^{}$ , defined in (2.1), counts the number of samplings needed to sample k different elements, and the random variable $A_k^{}(i)$ , defined in (2.2), counts how many times in those $v_k^{}$ samplings the node i is sampled. The result on asymptotic fairness, Corollary 2, relies on a stochastic coupling that compares the nodes’ influence before and after splitting. We use this coupling again in the simulations in Section 5; it considerably improves the convergence of our simulations by reducing the variance.
Fairness plays a prominent role in many areas of science and applications. It is therefore not astonishing that it plays a part also in distributed ledger technologies. For instance, proof-of-work in Nakamoto consensus ensures that the probability of creating a new block is proportional to the computational power of a node; see [Reference Chen, Papadimitriou and Roughgarden3] for an axiomatic approach to block rewards and further references. In proof-of-stake blockchains, the probability of creating a new block is usually proportional to the node’s balance. However, this does not always have to be the optimal choice; see [Reference Leonardos, Reijsbergen and Piliouras8, Reference Popov13].
Our initial motivation for this paper was to show that the consensus protocol described in [Reference Popov15] is robust against splitting and merging. Both effects are undesirable in a decentralized and permissionless distributed system. We refer to [Reference Müller, Penzkofer, Camargo and Saa11, Reference Müller12] for more details. Besides this, we believe that the study of the different voting schemes is of theoretical interest and that many natural questions are still open; see Section 5.
We organize the article as follows. Section 2 defines the key concepts of this paper: voting power, fairness, and asymptotic fairness. We also recall Zipf’s law, which we use to model the weight distribution of the nodes. Even though our results are obtained in a general setting, we discuss in several places how these results apply to the case of Zipf’s law; see Subsection 2.2 and Figure 1. Section 3 is devoted to studying greedy sampling on its own. We find the joint probability distribution of sample size and occurrences of the nodes, $(A_k^{}(i), v_k^{})$ , and develop several asymptotic results that we use in the rest of the paper. In Section 4 we show that the voting scheme (id, 1) is in general not fair. However, we give a sufficient condition on the sequence of weight distributions that ensures asymptotic fairness. We provide an example where, without this condition, the voting scheme (id, 1) is not asymptotically fair. Section 5 contains a short simulation study. Besides illustrating the theoretical results developed in the paper, we investigate the cases when some of the assumptions we impose in our theoretical results are not met. Last but not least, we present some open problems and conjectures in Section 5. To keep the presentation as clear as possible, we present some technical results in Appendices A and B.
2. Preliminaries
2.1. Main definitions
We now introduce this paper’s key concepts: greedy sampling, voting scheme, voting power, fairness, and asymptotic fairness.
We start with the definition of greedy sampling. We consider a probability distribution $P = (p_i)_{i \in \mathcal N}$ on $\mathcal N$ and an integer $k \in \mathbb{N}$ . We sample with replacement until k different nodes have been chosen. The number of samplings needed to choose k different nodes is given by the random variable
The outcome of a sampling will be denoted by the multi-set
here the $a_{i}$ take values in $\mathcal N$ . Furthermore, for any $i \in \mathcal N$ , let
be the number of occurrences of i in the multi-set $S=\{a_1, a_2, \ldots, a_{v_{k}}\}$ .
Every node i is assigned a weight $m_{i}$ . Together with a function ${f}\,{:}\,{[0, \infty)}\to{[0, \infty)}$ that we call the sampling weight function, the weights define a probability distribution $P = (p_i)_{i \in \mathcal N}$ on $\mathcal N$ by
We consider a second weight function ${g}\,{:}\,{[0, \infty)}\to{[0, \infty)}$ , the averaging weight function, that weights the sample’s opinions; see Equation (1.2). The couple (f, g) is called a voting scheme. We first consider general voting schemes but later focus on the voting scheme (f, g) with $f \equiv id$ and $g \equiv 1$ .
Let us denote by $S_{i}$ the multi-set of the sample for a given node i. To define the voting powers of the nodes, we recall the definition of the mean opinion, Equation (1.2),
where $s_{j}$ is the opinion of node j. The multi-set $S_{i}$ is random, and taking expectation leads to
Hence, the influence of the node j on another node’s mean opinion is measured by the corresponding coefficient in the above series.
Definition 1. (Voting power.) The voting power of a node $i\in \mathcal N$ is defined as
If $g \equiv 1$ , the voting power reduces to
Definition 2. (r-splitting.) Let $(m_i)_{i \in \mathcal N}$ be the weight distribution of the nodes. We fix some node $i\in \mathcal N$ and $r\in\mathbb{N}$ . We say that $m_{i_1^{(r)}}, \ldots, m_{i_r^{(r)}} > 0$ is an r-splitting of node i if $m_i = \sum_{j = 1}^r m_{i_j^{(r)}}$ . With an r-splitting of node i, the probability distribution $P = (p_i)_{i \in \mathcal N}$ given in (1.1) changes to the probability distribution $\widehat{P}_{r,i}$ on $\mathcal N \setminus\{i\} \cup \{ i_1^{(r)}, \ldots, i_r^{(r)}\}$ defined by
Definition 3. (Fairness.) (i) We say that a voting scheme (f, g) is robust to splitting into r nodes if for all nodes $i\in \mathcal N$ and all r-splittings $m_{i_1^{(r)}}, \ldots, m_{i_r^{(r)}}$ we have
(ii) We say that a voting scheme (f, g) is robust to merging of r nodes if for all nodes $i\in \mathcal N$ and all r-splittings $m_{i_1^{(r)}}, \ldots, m_{i_r^{(r)}}$ we have
If the relation (2.3) holds for every $r \in \mathbb{N}$ , we say that the voting scheme (f, g) is robust to splitting, and if the relation (2.4) holds for every $r \in \mathbb{N}$ , we say that the voting scheme (f, g) is robust to merging. If a voting scheme (f, g) is both robust to splitting and robust to merging, that is, if for every node $i\in \mathcal N$ , every $r \in \mathbb{N}$ , and every r-splitting $m_{i_1^{(r)}}, \ldots, m_{i_r^{(r)}} > 0$ it holds that
then we say that the voting scheme (f, g) is fair.
The existence of fair voting schemes for greedy sampling is an open question; see also Question 5. In Section 4 we show that the natural voting scheme (id, 1) is not fair for $k=2$ , but the question for $k>2$ remains open.
To generalize the above definitions to sequences of weights and to define asymptotic fairness, we first define sequences of r-splittings.
Definition 4. (Sequence of r-splittings.) Let $k \in \mathbb{N}$ be a positive integer and let $\{(m_i^{(n)})_{i \in \mathcal N}\}_{n \in \mathbb{N}}$ be a sequence of weight distributions. For a fixed positive integer $r \in \mathbb{N}$ and a fixed node i, we say that $m_{i_1^{(r)}}^{(n)}, \ldots, m_{i_r^{(r)}}^{(n)} > 0$ is a sequence of r-splittings of node r if $m_i^{(n)} = \sum_{j = 1}^r m_{i_j^{(r)}}^{(n)}$ . We define the sequence of probability distributions $(\widehat{P}_{r,i}^{(n)})_{n\in \mathbb{N}}$ on the set $\mathcal N \setminus \{i\} \cup \{ i_1^{(r)}, \ldots, i_r^{(r)}\}$ by
Definition 5. (Asymptotic fairness.) We say that a voting scheme (f, g) is asymptotically fair for the sequence $\{(m_i^{(n)})_{i \in \mathcal N}\}_{n \in \mathbb{N}}$ of weight distributions if, for all $r \in \mathbb{N}$ and all nodes $i\in \mathcal N$ ,
for all sequences of r-splittings of node i.
Remark 1. The canonical class of examples of the sequence $\{(m_i^{(n)})_{i \in \mathcal N}\}_{n \in \mathbb{N}}$ of weight distributions is the one with $\mathcal N = \mathbb{N}$ and $m_i^{(n)} = 0$ for all $i > N_{n}$ for some strictly increasing sequence $(N_{n})_{n\in \mathbb{N}}$ . With weight distributions of this type, we can model the scenario where the number of nodes in the network grows to infinity.
2.2. Zipf’s law
We do not assume any particular weight distribution in our theoretical results. However, for examples and numerical simulations, it is essential to consider specific weight distributions. Probably the most appropriate modelings of weight distributions rely on universality phenomena. The most famous example of this universality phenomenon is the central limit theorem. While the central limit theorem is suited to describing statistics where values are of the same order of magnitude, it is not appropriate for modeling more heterogeneous situations where the values might differ by several orders of magnitude. A heterogeneous weight distribution may instead be described by a Zipf law. Zipf’s law was first observed in quantitative linguistics, stating that any word’s frequency is inversely proportional to its rank in the corresponding frequency table. Nowadays, many fields claim that specific data fit a Zipf law; examples include city populations, internet traffic data, the formation of peer-to-peer communities, company sizes, and scientific citations. We refer to [Reference Li9] for a brief introduction and more references, and to [Reference Adamic and Huberman1] for the appearance of Zipf’s law in the internet and computer networks. We also refer to [Reference Tao17] for a more mathematical introduction to this topic.
There is a rule of thumb for situations when a Zipf law may govern the asymptotic distribution of data or statistics: variables
-
(1) take values as positive numbers,
-
(2) range over many different orders of magnitude,
-
(3) arise from a complicated combination of largely independent factors, and
-
(4) have not been artificially rounded, truncated, or otherwise constrained in size.
We consider a situation with N elements or nodes and set $\mathcal N = \{1,\ldots, N\}$ . Zipf’s law predicts that the (normalized) frequency of the node of rank k is given by
where $s\in [0,\infty)$ is the Zipf parameter. Since the value y(k) in (2.5) only depends on two parameters, s and n, this provides a convenient model for investigating the performance of a voting protocol in a wide range of network situations. For instance, nodes with equal weight can be modeled by choosing $s = 0$ , while more centralized networks can be described with parameters $s > 1$ .
A convenient way to observe a Zipf law is by plotting the data on a log–log graph, with the axes being log(rank order) and log(value). The data conform to a Zipf law to the extent that the plot is linear, and the value of s may be estimated using linear regression. We note that this visual inspection of the log–log plot of the ranked data is not a rigorous procedure. We refer to the literature on how to detect systematic modulation of the basic Zipf law and on how to fit more accurate models. In this work, we deal with distributions that are ‘Zipf-like’ without verifying certain test conditions.
For instance, Figure 1 shows the distribution of IOTA for the top 10,000 richest addresses with a fitted Zipf law. Based on the universality phenomenon, the plausibility of the hypotheses (1)–(4) above, and Figure 1, we assume the weight distribution to follow a Zipf law if we want to specify a weight distribution. To be more precise, we assume that for every $n \in \mathbb{N}$ and some parameter $s > 0$ ,
where $P^{(n)} = (p_j^{(n)})_{j \in \mathbb{N}}$ is the weight distribution among the nodes in the network when the total number of nodes is n. Notice that, for a fixed j, the sequence $(p_j^{(n)})_{n \in \mathbb{N}}$ is decreasing in n. Furthermore, since $\sum_{i = 1}^{\infty} (1/i^s)$ diverges for $s \le 1$ , the sequence $(p_j^{(n)})_{n \in \mathbb{N}}$ converges to 0 in this case (when n goes to infinity). On the other hand, if the parameter s is strictly larger than 1, the sequence $(p_j^{(n)})_{n \in \mathbb{N}}$ converges to a positive number (when n goes to infinity).
3. Greedy weighted sampling
In this section, we study fundamental properties of greedy sampling that are necessary for the rest of the paper. In particular, we identify the joint probability distribution of sample size and occurrences of the nodes and develop several related asymptotic results. Recall that in greedy sampling, we consider sampling with replacement until k different elements have been chosen. The actual size of the sample is described by the random variable $v_{k}$ .
Proposition 1. Let $P = (p_i)_{i \in \mathcal N}$ be a probability distribution on $\mathcal N$ , $k \in \mathbb{N}$ a positive integer, and $v_{k}=v_k^{(P)}$ the random variable defined in (2.1). For every $v \in \{k, k + 1, k + 2, \ldots\}$ we have
where
Proof. We are sampling from the distribution P until we have sampled k different nodes. A first observation is that the last node will be sampled only once. Any of the nodes that appear before the last one can be sampled more than once. We can construct such a sampling in the following way: first we choose a node $i \in \mathcal N$ that will be sampled last, then we choose $k - 1$ different nodes $a_1, a_2, \ldots, a_{k - 1}$ from the set $\mathcal N \setminus \{i\}$ that will appear in the sequence before the last node, and we choose positive integers $x_1, x_2, \ldots, x_{k - 1} \in \mathbb{N}$ that represent how many times each of the $k - 1$ nodes from the set $\{a_1, a_2, \ldots, a_{k - 1}\}$ will appear in the sampled sequence. Notice that $\sum_{i = 1}^{k - 1} x_i$ has to be equal to $v - 1$ , because the total length of the sequence, including the last node i, has to be v. The last thing we need to choose is the permutation of the first $v - 1$ elements in the sequence, which can be done in $\binom{v - 1}{x_1, x_2, \ldots, x_{k - 1}}$ ways. Summarizing, the probability of sampling a sequence where the last node is i, the first $k - 1$ nodes are $a_1, a_2, \ldots, a_{k - 1}$ , and they appear $x_1, x_2, \ldots, x_{k - 1}$ times is
Now we need to sum this up with respect to all the possible values of the element i, all the possible sequences of $k - 1$ positive integers $x_1, x_2, \ldots, x_{k - 1}$ that sum up to $v - 1$ (i.e., all the partitions of the integer $v - 1$ into $k - 1$ parts), and all the subsets of $\mathcal N \setminus \{i\}$ of cardinality $k - 1$ . This gives us exactly the expression from Equation (3.1). □
Remark 2. The random variable $v_k^{(P)}$ was studied in [Reference Raj and Khamis16] in the case where the population is finite and elements have equal weight. Therefore, (3.1) is a generalization of [Reference Raj and Khamis16, Formula (16)].
Another random variable studied in [Reference Raj and Khamis16] is the number of different elements in a sample with replacement of a fixed size. To be precise, let $k \in \mathbb{N}$ be a positive integer and let $P = (p_i)_{i \in \mathcal N}$ be a probability distribution on $\mathcal N$ . Let
The authors of [Reference Raj and Khamis16] calculated the distribution of the random variable $u_k^{(P)}$ , but again under the assumptions that the set from which the elements are sampled is finite and that all the elements are sampled with the same probability. Using analogous reasoning as in the proof of Proposition 1, for $u \in \{1, 2, \ldots, k\}$ we get
This formula generalizes [Reference Raj and Khamis16, Formula (8)].
Using Proposition 1, we now find the distribution of the random vector $(A_k^{}(i), v_k)$ for all $i \in \mathbb{N}$ .
Proposition 2. Let $P = (p_i)_{i \in \mathcal N}$ be a probability distribution on $\mathcal N$ , $k \in \mathbb{N}$ a positive integer, $v_{k}=v_k^{(P)}$ the random variable defined in (2.1), and $A_k^{}(i)=A_k^{(P)}(i)$ the random variable defined in (2.2). The support of the random vector $(A_k^{}(i), v_k^{})$ is
For every node $i \in \mathcal N$ and every $(\ell, v)$ in the support of $(A_k^{}(i), v_k^{})$ we have
Proof. Notice first that (0, v) is in the support of $(A_k^{}(i), v_k^{})$ . Now, if $\ell \ge 1$ , then for all $v < \ell + k - 1$ we have that $\mathbb{P}(A_k^{}(i) = l, v_k^{} = v) = 0$ , since we need at least $\ell + k - 1$ samplings to sample node i $\ell$ times and to sample the other $k - 1$ different nodes at least once.
Let us consider separately different values of the non-negative integer $\ell \in \mathbb{N} \cup \{0\}$ .
$\ell = 0$ : This case is an immediate consequence of Proposition 1. We just need to restrict the set of all nodes that can be sampled to $\mathcal N \setminus \{i\}$ .
$\ell= 1$ : Here we need to distinguish two disjoint scenarios. The first is when the node i is not sampled as the last node (i.e., the node i is not the kth different node that has been sampled). This means that the node i was sampled during the first $v - 1$ samplings. Hence, we first choose a node $j \in \mathcal N \setminus \{i\}$ that will be sampled last. Then we choose $k - 2$ different nodes $a_1, a_2, \ldots, a_{k - 2}$ from the set $\mathcal N \setminus \{i, j\}$ that will appear (together with the node i) in the sampled sequence before the last node, and we choose positive integers $x_1, x_2, \ldots, x_{k - 2} \in \mathbb{N}$ to represent how many times each of the $k - 2$ nodes in the set $\{a_1, a_2, \ldots, a_{k - 2}\}$ will appear in the sampled sequence. Notice that $\sum_{i = 1}^{k - 2} x_i$ has to be equal to $v - 2$ , because the total length of the sequence, including the one appearance of node j (in the last place) and one appearance of node i (somewhere in the first $v - 1$ samplings), has to be v. The last thing we need to choose is the permutation of the first $v - 1$ nodes in the sequence, which can be done in $\binom{v - 1}{x_1, \ldots, x_{k - 2}, 1}$ ways (taking into consideration that node i appears only once). Summarizing, the probability of sampling a sequence where the last node is $j \neq i$ , the node i appears exactly once in the first $v - 1$ sampled nodes, and the remaining $k - 2$ nodes that appear together with the node i before the last node j are $a_1, a_2, \ldots, a_{k - 2}$ and appear $x_1, x_2, \ldots, x_{k - 2}$ times is
As in Proposition 1, we now sum this up with respect to all the possible values of the node j, all the possible sequences of $k - 2$ positive integers $x_1, x_2, \ldots, x_{k - 2}$ that sum up to $v - 2$ , and all the subsets of $ \mathcal N \setminus \{i, j\}$ of cardinality $k - 2$ . In this way we obtain the first term in the expression for $\mathbb{P}(A_k^{}(i) = 1, v_k^{} = v)$ .
The second scenario is the one where the node i is sampled last. Here the situation is much simpler. The last node is fixed to be $i \in \mathcal N$ ; we then choose the $k - 1$ nodes that appear before the node i, and the number of times they appear, analogously as in Proposition 1. We immediately get the second term in the expression for $\mathbb{P}(A_k^{}(i) = 1, v_k^{} = v)$ .
$\ell \ge 2$ : Notice that in this case we do not have two different scenarios, because it is impossible for the node i to be the last node sampled. As we explained in Proposition 1, the last node can be sampled only once, since we terminate sampling when we have reached k different nodes. Now we reason analogously as in the first scenario of the case $\ell = 1$ . The only difference is that here the node i appears $\ell$ times (in the first $v - 1$ samplings), so the integers $x_1, x_2, \ldots, x_{k - 2}$ have to satisfy $\sum_{i = 1}^{k - 2} x_i = v - \ell - 1$ . Together with $\ell$ appearances of the node i and one appearance of the last node, this gives v sampled nodes in total. □
Let $b = (b_i)_{i \in \mathcal N} \subset \mathbb{R}$ be a sequence of real numbers. Denote by $\|{b}\|_{\infty} = \sup_{i \in \mathcal N} \vert b_i \vert$ the supremum norm of the sequence b. The next result shows that if the probabilities of sampling each of the nodes converge uniformly to zero, then the number of samplings needed to sample k different elements converges to k.
Lemma 1. Let $(P^{(n)})_{n \in \mathbb{N}}$ , $P^{(n)} = (p_i^{(n)})_{i \in \mathcal N}$ , be a sequence of probability distributions on $\mathcal N$ , and let $(k_n)_{n \in \mathbb{N}}$ be a sequence of positive integers such that $k_n^2 \|{P^{(n)}}\|_{\infty} \xrightarrow[n \rightarrow \infty]{} 0$ . Then
In particular, if for some fixed positive $k \in \mathbb{N}$ we have $k_n = k$ for all $n \in \mathbb{N}$ , and $\|{P^{(n)}}\|_{\infty} \xrightarrow[n \rightarrow \infty]{} 0$ , then we have that $v_k^{(P^{(n)})} \xrightarrow[n \rightarrow \infty]{\mathbb{P}} k$ .
Proof. For simplicity, we define $v_{k_{n}}^{(n)} \,{:\!=}\, v_{k_{n}}^{(P^{(n)})}$ . Since $v_{k_n}^{(n)}$ is larger than or equal to $k_n$ , it is sufficient to show that $\mathbb{P}(v_{k_n}^{(n)} > k_n) \xrightarrow[n \rightarrow \infty]{} 0$ . Denote by $X_i^{(n)}$ the random variable representing the node sampled in the ith sampling. Since the event $\{v_{k_n}^{(n)} > k_n \}$ happens if and only if some of the nodes sampled in the first $k_n$ samplings appear more than once, we have
By the assumption, the last term converges to zero when n goes to infinity, which is exactly what we wanted to prove. □
Remark 3. Let us investigate what happens when the sequence $(P^{(n)})_{n \in \mathbb{N}}$ is defined by a Zipf law (see (2.6)) with parameter $s > 0$ . Since each of the sequences $(p_i^{(n)})_{i \in \mathbb{N}}$ is decreasing in i, we have
Notice that for all $s \le 1$ we have $\|{P^{(n)}}\|_{\infty} \xrightarrow[n \rightarrow \infty]{} 0$ , because the series $\sum_{i = 1}^{\infty} \frac{1}{i^s}$ diverges for those values of the parameter s. Hence, for a fixed integer $k \in \mathbb{N}$ , we have that $v_k^{(P^{(n)})} \xrightarrow[n \rightarrow \infty]{\mathbb{P}} k$ whenever $s \le 1$ . Another important example is when the sequence $(k_n)_{n \in \mathbb{N}}$ is given by
where, for $x \in \mathbb{R}$ , $\lfloor x \rfloor$ is the largest integer less than or equal to x. Using
we get $k_n^2 \|{P^{(n)}}\|_{\infty} \xrightarrow[n \rightarrow \infty]{} 0$ for $s < 1$ , so we can apply Lemma 1 for this particular choice of sequences $(k_n)_{n \in \mathbb{N}}$ and $(P^{(n)})_{n \in \mathbb{N}}$ .
In Lemma 1 we dealt with the behavior of the sequence of random variables $(v_{k_{n}}^{P^{(n)}})_{n \in \mathbb{N}}$ if the sequence $(P^{(n)})_{n \in \mathbb{N}}$ satisfies $\,\,\|{P^{(n)}}\|_{\infty} \xrightarrow[n \rightarrow \infty]{} 0$ . Next, we study the case when the sequence $(P^{(n)})_{n \in \mathbb{N}}$ converges in the supremum norm to another probability distribution $P^{(\infty)}$ on $\mathcal N$ . As before, for $b = (b_i)_{i \in \mathcal N} \subset \mathbb{R}$ , we use the notation $\|{b}\|_{\infty} = \sup_{i \in \mathbb{N}} \vert b_i \vert$ and we write $\|{b}\|_1 = \sum_{i \in \mathcal N} \vert b_i \vert$ .
Proposition 3. Let $(P^{(n)})_{n \in \mathbb{N}}$ , $P^{(n)} = (p_i^{(n)})_{i \in \mathcal N}$ , be a sequence of probability distributions on $\mathcal N$ , and let $P^{(\infty)} = (p_i^{(\infty)})_{i \in \mathcal N}$ be a probability distribution on $\mathcal N$ . If
then, for all $i\in\mathcal N$ and all fixed $k \in \mathbb{N}$ ,
where $\xrightarrow[]{(d)}$ denotes convergence in distribution.
Proof. For simplicity, we write $v^{(n)}_{k} \,{:\!=}\, v_k^{(P^{(n)})}$ , $v^{(\infty)}_{k} \,{:\!=}\, v_k^{(P^{(\infty)})}$ , $A_{k}^{(n)}(i) \,{:\!=}\, A_k^{(P^{(n)})}(i)$ , and $A_k^{(\infty)}(i) \,{:\!=}\, A_k^{(P^{(\infty)})}(i)$ . Since we consider discrete random variables, the statement
is equivalent to
for all $\ell \in \mathbb{N} \cup \{0\}$ and all $v \in \mathbb{N}$ . As in the proof of Proposition 2, we consider separately different values of the non-negative integer $\ell \in \mathbb{N} \cup \{0\}$ .
$\ell= 0$ : Using Proposition 2, we have
with
It remains to prove that $\vert I^{(n)}(x_1, \ldots x_{k - 1}) - I^{(\infty)}(x_1, \ldots x_{k - 1}) \vert \xrightarrow[n \rightarrow \infty]{} 0$ , uniformly for all possible values of positive integers $x_1, x_2, \ldots x_{k - 1}$ . This is sufficient since the number of partitions of the integer $v - 1$ into $k - 1$ parts is finite and independent of n. Notice that for every $i, j \in \mathcal N$ ,
Clearly the same is true when, instead of the distribution $P^{(\infty)}$ , we consider the distribution $P^{(n)}$ . Thanks to the convergence of these series, we can rewrite
For simplicity, we write $T_1^{(n)} = T_1^{(n)}(x_1, \ldots, x_{k - 1})$ and $T_2^{(n)} = T_2^{(n)}(x_1, \ldots, x_{k - 1})$ . It remains to prove that $T_1^{(n)}$ and $T_2^{(n)}$ converge to 0 when n goes to infinity. Using the inequality (3.4) and Proposition 5 we have that
To treat the term $T_2^{(n)}$ , we use Lemma 2 in the second line and Proposition 5 in the last line to obtain
$\ell \ge 2$ : Again using Proposition 2, we have
To show that the above expression converges to zero as n tends to infinity, it remains to verify that
To obtain this, we can use the same arguments as in the previous case. Again, introducing a middle term leads to
Applying Proposition 5 again, we get the desired result.
$\ell= 1$ : The difference of the first terms in the expressions for $\mathbb{P}(A_k^{(n)}(i) = 1, v^{(n)}_{k} = v)$ and $\mathbb{P}(A_k^{(\infty)}(i) = 1, v^{(\infty)}_{k} = v)$ (see (3.3)) goes to zero. The difference of the second terms can be handled similarly as in the case $\ell = 0$ ; the situation is even simpler owing to the absence of the initial sum. This concludes the proof of this proposition. □
Corollary 1. Let $(P^{(n)})_{n \in \mathbb{N}}$ , $P^{(n)} = (p_i^{(n)})_{i \in \mathcal N}$ , be a sequence of probability distributions on $\mathcal N$ , and let $P^{(\infty)} = (p_i^{(\infty)})_{i \in \mathcal N}$ be a probability distribution on $\mathcal N$ . We assume that $g\equiv 1$ . If
then for all $i \in \mathcal N$ and all fixed $k \in \mathbb{N}$ ,
where $V_k^{(P^{(n)})}(i) = \mathbb{E}[A_k^{(P^{(n)})}(i) / v_k^{(P^{(n)})}]$ , $n \in \mathbb{N} \cup \{\infty\}$ , is the voting power of the node i in the case $g\equiv 1$ .
Proof. Convergence in (3.5) and (3.6) follows directly from Proposition 3 using the continuous mapping theorem (see [Reference Durrett5, Theorem 3.2.4]) applied to the projections ${\Pi_1, \Pi_2}\,{:}\,{\mathbb{R}^2}\to{\mathbb{R}}$ , $\Pi_i(x_1, x_2) = x_i$ , $i = 1, 2$ . To prove the convergence in (3.7), let us first define the bounded and continuous function
Then, combining Proposition 3 with [Reference Durrett5, Theorem 3.2.3], we get
Notice that we always have $A_k^{(P)}(i) \le v_k^{(P)}$ , since the random variable $A_k^{(P)}(i)$ counts the number of times the node i was sampled until k different nodes had been sampled, and the random variable $v_k^{(P)}$ counts the total number of samplings until k distinct elements had been sampled. Hence,
Combining the latter with (3.8) and using $V_k^{(P^{(n)})}(i) = \mathbb{E}[A_k^{(P^{(n)})}(i) / v_k^{(P^{(n)})}]$ , $n \in \mathbb{N} \cup \{\infty\}$ , we obtain (3.7). □
4. Asymptotic fairness
We start this section with the case $k=2$ , i.e., we sample until we get two different nodes. This small choice of k allows us to perform analytical calculations and prove some facts rigorously. We prove that the voting scheme (id, 1) is robust to merging but not fair. We also show that the more the node splits, the more voting power it can gain. However, with this procedure, the voting power does not grow to 1, but has a limit strictly less than 1.
Proposition 4. We consider the voting scheme (id, 1) and let $(m_i)_{i \in \mathcal N}$ be the weight distribution of the nodes. Let $P = (p_i)_{i \in \mathcal N}$ be the corresponding probability distribution on $\mathcal N$ , let $r \in \mathbb{N},$ let $i\in \mathcal N$ be a node, and let $k = 2$ . Then, for every r-splitting $m_{i_1^{(r)}}, \ldots, m_{i_r^{(r)}}>0$ of the node i, we have that
In other words, the voting scheme (id, 1) is robust to merging, but not robust to splitting. The difference of the voting power after splitting and before splitting reaches its maximum for
Furthermore, for this particular r-splitting, we have that the sequence
is strictly increasing and has a limit strictly less than 1.
Proof. Denote by $Y_i = A_2^{(P)}(i)$ the number of times that the node i was sampled from the distribution P until we had sampled 2 different nodes, and by $Y_{i_j^{}} = A_2^{(\widehat{P}_{r,i})}({i_j^{(r)}})$ , $j \in \{1, 2, \ldots, r\}$ , the number of times the node $i_j^{(r)}$ was sampled from the distribution $\widehat{P}_{r,i}$ until we had sampled 2 different nodes. We also write $V(i)\,{:\!=}\, V_k^{(P)}(i)$ and $V^{(r)}({i_j^{(r)}})\,{:\!=}\,V_k^{(\widehat{P}_{r,i})}({i_j^{(r)}}).$ Using this notation, we have
Similarly, for $j \in \{1, 2, \ldots, r\}$ we have
Combining the above calculations, we obtain
We take $x_1, x_2, \ldots x_r \in (0, 1)$ such that $\sum_{j = 1}^r x_j = 1$ and set
This gives us
Define
First we need to show that $\phi(x_1, x_2, \ldots, x_r) > 0$ for all $x_1, x_2, \ldots, x_r \in (0, 1)$ such that $\sum_{j = 1}^r x_j = 1$ . Using Proposition 6 repeatedly ( $r - 1$ times), we get
The second claim of this proposition is that the expression
reaches its maximum for $\widehat{p}_{i_j^{(r)}} = \frac{p_i}{r}, j \in \{1, 2, \ldots, r\}$ . This follows directly from Lemma 3, where we show that $\phi$ attains its unique maximum for $(x_1, \ldots, x_r) = (\frac{1}{r}, \ldots, \frac{1}{r})$ . Define
By Proposition 7 we have that the sequence $(\tau_r(p_i))_{r \in \mathbb{N}}$ is strictly increasing and
□
Remark 4. We consider the function ${\tau}\,{:}\,{(0, 1)}\to{\mathbb{R}}$ defined by
This function describes the gain in voting power that a node with initial weight m can achieve by splitting up into infinitely many nodes. As Figure 2 shows, this maximal gain in voting power is bounded. The function $\tau$ attains its maximum at $m^* \approx 0.82$ , and the maximum is $\tau(m^*) \approx 0.12$ . This means that a node that initially has around 82% of the total amount of weight can obtain the biggest gain in voting power (by theoretically splitting into an infinite number of nodes), and this gain is approximately $0.12$ . Loosely speaking, if a voting power of a node increases by $0.12$ , this means that during the querying, the proportion of queries that are addressed to this particular node increases by around 12%.
Corollary 2. Let $(m^{(n)})_{n\in \mathbb{N}}$ be a sequence of weight distributions with corresponding probability distributions $(P^{(n)})_{n \in \mathbb{N}}$ , $P^{(n)} = (p_i^{(n)})_{i \in \mathcal N}$ , on $\mathcal N$ . Let $m^{(\infty)}$ be a weight distribution such that for its corresponding probability distribution $P^{(\infty)} = (p_i^{(\infty)})_{i \in \mathcal N}$ we have that
Furthermore, we consider a sequence of r-splittings $m_{i_1^{(r)}}^{(n)}, \ldots, m_{i_r^{(r)}}^{(n)} > 0$ of a node i such that $ m_{i_j^{(r)}}^{(n)} \xrightarrow[n \rightarrow \infty]{} m_{i_j^{(r)}}^{(\infty)}$ , $j \in \{1, 2, \ldots, r\}$ , for some r-splitting $m^{(\infty)}$ . Then for $k = 2$ we have
Proof. The convergence follows directly from Corollary 1, and the strict positivity of the limit follows from Proposition 4. □
Remark 5. Corollary 2 implies that if $k=2$ and if the sequence of weight distributions $(P^{(n)})_{n \in \mathbb{N}}$ converges to a non-trivial probability distribution on $\mathcal N$ , the voting scheme (id, 1) is not asymptotically fair. Applying this result to the sequence of Zipf distributions defined in (2.6), we see that for $s > 1$ and $k = 2$ , the voting scheme is not asymptotically fair. Simulations suggest (see Figures 4 and 8) that for higher values of k, the difference in voting power of the node i before and after the splitting does not converge to zero as the number of nodes in the network grows to infinity.
In the following proposition we give a condition on the sequence of weight distributions $(P^{(n)})_{n \in \mathbb{N}}$ under which the voting scheme (id, 1) is asymptotically fair for any choice of the parameter k.
Theorem 1. Let k be a positive integer, and let $(m^{(n)})_{n\in \mathbb{N}}$ be a sequence of weight distributions with corresponding probability distributions $(P^{(n)})_{n \in \mathbb{N}}$ , $P^{(n)} = (p_i^{(n)})_{i \in \mathcal N}$ , on $\mathcal N$ . We assume that $\|{P^{(n)}}\|_{\infty} \xrightarrow[n \rightarrow \infty]{} 0$ . Furthermore, we consider a sequence of r-splittings $m_{i_1^{(r)}}^{(n)}, \ldots, m_{i_r^{(r)}}^{(n)} > 0$ of a given node i such that
for some r-splitting $m^{(\infty)}$ . Then
i.e., the voting scheme (id, 1) is asymptotically fair if the sequence of weight distributions converges in the supremum norm to 0.
Proof. For simplicity, we write $P_r^{(n)} \,{:\!=}\, \widehat P_{r,i}^{(n)}$ , $v_k^{(n)} \,{:\!=}\, v_k^{(P^{(n)})}$ , and $v_{k, r}^{(n)} \,{:\!=}\, v_k^{(P_r^{(n)})}$ ; recall that the random variable $v_k^{(P)}$ counts the number of samplings with replacement from the distribution P until k different elements have been sampled. The main idea of the proof is to couple the random variables $v_k^{(n)}$ and $v_{k, r}^{(n)}$ . We sample simultaneously from the probability distributions $P^{(n)}$ and $P_r^{(n)}$ and construct two different sequences of elements that both terminate once they contain k different elements. We do that in the following way: we sample an element from the distribution $P^{(n)}$ . If the sampled element is not i, we just add this element to both sequences that we are constructing and then sample the next element. If the element i is sampled, then we add i to the first sequence, but to the second sequence we add one of the elements $i_1^{(r)}, \ldots, i_r^{(r)}$ according to the probability distribution $\left(p_{i_1^{(r)}}^{(n)} / p_i^{(n)}, \ldots, p_{i_r^{(r)}}^{(n)} / p_i^{(n)}\right)$ . Now, the second sequence will terminate no later than the first one, since the second sequence always has at least as many different elements as the first sequence. This is a consequence of the fact that, each time the element i is sampled, we add one of the r elements $i_1^{(r)}, i_2^{(r)}, \ldots, i_r^{(r)}$ to the second sequence, while we just add i to the first sequence; see Figure 3.
Define
Since $v_k^{(n)} \ge v_{k, r}^{(n)}$ , we have $K_k^{(n)} \ge 0$ . We also introduce the random variable
where $A_k^{(P)}(i)$ is defined as in (2.2). The random variable $L_k^{(n)}$ measures the difference between the number of times the node i appears in the first sequence and the number of times the nodes $i_1^{(r)}, i_2^{(r)}, \ldots, i_r^{(r)}$ appear in the second sequence. At the time when the second sequence terminates, the number of times the node i has appeared in the first sequence is the same as the number of times that the nodes $i_1^{(r)}, i_2^{(r)}, \ldots, i_r^{(r)}$ have appeared in the second sequence; see Figure 3. Since the length of the first sequence is always larger than or equal to the length of the second sequence, it can happen that the element i is sampled again before the kth different element appears in the first sequence. Therefore, $L_k^{(n)} \ge 0$ . Clearly, $L_k^{(n)} \le K_k^{(n)}$ , because $K_k^{(n)}$ counts all the extra samplings we need to sample k different elements in the first sequence, while $L_k^{(n)}$ counts only those extra samplings in which the element i was sampled. Notice that if the element i is not sampled before the kth different element appears, or if i itself is the kth different element, then $K_k^{(n)} = L_k^{(n)} = 0$ .
Let
Then
Combining this with $v_k^{(n)} = v_{k, r}^{(n)} + K_k^{(n)}$ , we have
Define
It remains to prove that $\mathbb{E}[Z_n] \xrightarrow[n \rightarrow \infty]{} 0$ . Since $Y_{k, r}^{(n)} \le v_{k, r}^{(n)}$ , $k \le v_{k, r}^{(n)},$ and $L_k^{(n)} \le K_k^{(n)}$ , we have
By Lemma 1 we have that $v_k^{(n)} \xrightarrow[n \rightarrow \infty]{\mathbb{P}} k$ and $v_{k, r}^{(n)} \xrightarrow[n \rightarrow \infty]{\mathbb{P}} k$ (notice that $\|{P_r^{(n)}}\|_{\infty} \le \|{P^{(n)}}\|_{\infty}$ ). Therefore,
This implies that $Z_n \xrightarrow[n \rightarrow \infty]{\mathbb{P}} 0$ . Since $Z_{n} \leq 2$ we have that $\lim_{n \to \infty} \mathbb{E}[Z_n] = 0$ . □
Remark 6. The above proposition shows that $\|{P^{(n)}}\|_{\infty} \xrightarrow[n \rightarrow \infty]{} 0$ is a sufficient condition to ensure asymptotic fairness, regardless of the value of the parameter $k \in \mathbb{N}$ . Applying this result to the sequence of probability distributions $(P^{(n)})_{n \in \mathbb{N}}$ defined by the Zipf law (see (2.6)), we see that for $s \le 1$ the voting scheme (id, 1) is asymptotically fair.
5. Simulations, conjectures, and questions
In this section, we present some numerical simulations to complement our theoretical results. We are interested in the rate of convergence in the case of asymptotic fairness, Theorem 1, and we also want to support some conjectures for the situation where our theoretical results do not apply.
In the simulations, we always consider the voting scheme (id, 1) and a Zipf law for the nodes’ weight distribution; see the relation (2.6). The reasons for this assumption are presented in Subsection 2.2.
Figure 4 presents the results of a Monte Carlo simulation for a Zipf distribution with parameter $s\in\{0.8, 1.1\}$ and different network sizes on the x-axis. For real-world applications we expect values of k to be at least 20 (see also [Reference Müller12]) and therefore set the sample size to $k = 20$ . The y-axis shows the gain in voting power when the heaviest node splits into two nodes of equal weight. For each choice of network size, we performed $1\,000\,000$ simulations; we use the empirical average as an estimator for the gain in voting power. The gray zone corresponds to the confidence interval of level $0.95$ . Let us note that to decrease the variance of the estimation, we couple, as in the proof of Theorem 1, the sampling in the original network with the sampling in the network after splitting.
Theorem 1 and Remark 6 state that if the Zipf parameter $s\le 1$ , then the voting scheme is asymptotically fair, i.e., the difference in the voting power before and after the splitting of a node $i \in \mathbb{N}$ goes to zero as the number of nodes in the network increases. The left panel of Figure 4 indicates the speed of convergence for $s=0.8.$ The right panel of Figure 4 indicates that for $s=1.1$ the voting scheme is not asymptotically fair. Corollary 2 states that for $k=2$ , if the sequence of weight distributions $(P^{(n)})_{n \in \mathbb{N}}$ converges to a non-trivial probability distribution on $\mathbb{N}$ , then the voting scheme (id, 1) is not asymptotically fair.
Conjecture 1. Let $(m^{(n)})_{n\in \mathbb{N}}$ be a sequence of weight distributions with corresponding probability distributions $(P^{(n)})_{n \in \mathbb{N}}$ , $P^{(n)} = (p_i^{(n)})_{i \in \mathcal N}$ , on $\mathcal N$ . Let $m^{(\infty)}$ be a weight distribution such that for its corresponding probability distribution $P^{(\infty)} = (p_i^{(\infty)})_{i \in \mathcal N}$ we have that
Furthermore, we consider a sequence of r-splittings $m_{i_1^{(r)}}^{(n)}, \ldots, m_{i_r^{(r)}}^{(n)} > 0$ of a node i such that
for some r-splitting $m^{(\infty)}$ . Then, for any choice of $k\in \mathbb{N}$ ,
We take a closer look at the distribution of the increase in voting power in the above setting. Figures 5 and 6 present density estimations, with a Gaussian kernel, of the density of the increase in voting power. Again we simulated each data point $1\,000\,000$ times. The density’s multimodality can be explained by the different possibilities for whether the sample includes the heaviest node before and after splitting. For instance, in the case of two nodes and $k=2$ , both nodes will be present in every sample before the splitting. After the heaviest node splits into two nodes, we have two cases: the sample contains either both splittings of the heaviest node or only one splitting. In the first case the heaviest node has gained voting influence, while in the second case it has lost influence.
Figure 6 clearly shows the asymptotic fairness; the probability of having only a small change in voting power converges to 0 as the number of participants grows to infinity. Figure 7 compares the densities for different choices of s in a network of 1000 nodes.
These figures also show that even in the case where a splitting leads to an increase on average of the voting power, the splitting can also lead to less influence in a single voting round.
In the previous simulations, we kept the sample size $k=20$ . Increasing the sample size increases the quality of the voting, but at the cost of a higher message complexity. Figure 8 compares the increase in voting power for different values of k and s. We can see that increasing k increases the fairness of the voting scheme and that for some values of k the increase in voting power may even be negative.
Figure 9 presents density estimations of the increase in voting power. We can see the different behaviors in the more decentralized setting, $s<1$ , and the centralized setting, $s>1$ . In the first case, it seems that the density converges to a point mass in 0, whereas in the second case the limit may be described by a Gaussian density. Figure 10 provides Q–Q plots supporting this first visual impression.
While the study of the actual distribution of the increase in voting power is outside the scope of this paper, we think that the following questions may be of interest.
Question 1. How can the distribution of the increase in voting power be described?
Question 2. What characteristics of the distribution of the increase in voting power are important for the voting scheme and its applications?
Recall that, so far, we have only considered the change in voting power of the heaviest node when it splits into two nodes of equal weight. The goal of the next two simulations (see Figures 11 and 12) is to inspect what happens to the voting power of a node when it splits into more than two nodes.
For the simulation shown in Figure 11, we fixed the value of the parameter k and varied the value of the parameter s. In Proposition 4 we showed that for $k=2$ , a node always gains voting power with splitting. This result holds without any additional assumptions on the weight distribution of the nodes in the network. We ran simulations with $k = 20$ in which we split the heaviest node into r nodes, r ranging from 2 to 200. We kept the network size equal to 1000 and varied the parameter s in the set $\{0.8, 1, 1.1, 1.5, 2\}$ . For each value of s we ran $100\,000$ simulations of the voting scheme (id, 1). Several conjectures can be made from Figure 11. It seems that if the parameter k is equal to 20, the voting power may even drop for small values of the parameter r. This drop appears to be more significant the bigger the parameter s is. But if we split into more nodes (i.e., we set r to be sufficiently large), it seems that splitting gives us more voting power, and the gain is bigger for values of s larger than 1. This suggests that it is possible to have robustness to splitting into r nodes for r smaller than some threshold $\delta$ , and robustness to merging of r nodes for $r > \delta$ .
The simulations presented in Figure 12 show the change in the voting power of a node after it splits into multiple nodes for different values of the parameters k and s. As in the previous simulation, we consider a network size of 1000 and assume that the first node splits into r different nodes (where r again ranges from 2 to 200). For each combination of values of parameters k and s, we ran $100\,000$ simulations. Our results suggest that for $s \le 1$ , we always gain voting power with additional splittings. On the other hand, if $s > 1$ , then the behavior of the voting power depends even more on the precise value of k. It seems that for small k, we still cannot lose voting power by splitting, but that for k sufficiently large there is a region where the increase in voting power is negative.
Question 3. How does the increase in voting power of the heaviest node depend on k, s, and N? For which choices of these parameters is the increase in voting power negative?
The above simulation study is far from complete, but we believe that our results already show the model’s richness.
Question 4. How does the increase in voting power of a node of rank M depends on M, k, s, and N?
Note that in the simulations above, we only split the heaviest node. In a more realistic model, not only one but all nodes may simultaneously optimize their voting power. This is particularly interesting in situations that are not robust to splitting. We believe that in such a situation, it is reasonable to suppose that nodes may adapt their strategy from time to time to optimize their voting power. This simultaneous splitting or merging of nodes may lead to periodic behavior or to convergence to a stable situation, where none of the nodes has an incentive to split or merge.
Question 5. Construct a multi-player game where the aim is to maximize voting power. Do the corresponding weights always converge to a situation in which the voting scheme is fair?
As mentioned after the definition of a fair voting scheme, Definition 3, the existence of fair schemes is not well understood. While we focused on the particular scheme (id, 1) in this paper, it is unclear whether other voting schemes could be fair or have other advantages over the (id, 1) scheme.
Question 6. Identify all fair voting schemes.
Appendix A. Auxiliary results for Section 3
The first result is proved by induction.
Lemma 2. Let $w \in \mathbb{N}$ , and let $a_1, a_2, \ldots, a_w, b_1, b_2, \ldots, b_w \in \mathbb{R}$ ; then
Proposition 5. Let $(P^{(n)})_{n \in \mathbb{N}}$ , $P^{(n)} = (p_i^{(n)})_{i \in \mathbb{N}}$ , be a sequence of probability distributions on $\mathbb{N}$ , and let $P^{(\infty)} = (p_i^{(\infty)})_{i \in \mathbb{N}}$ be a probability distribution on $\mathbb{N}$ . Then the following statements are equivalent:
-
(a) $\|{P^{(n)} - P^{(\infty)}}\|_{\infty} = \sup_{i \in \mathbb{N}} \vert p_i^{(n)} - p_i^{(\infty)} \vert \xrightarrow[n \rightarrow \infty]{} 0$ .
-
(b) $\|{P^{(n)} - P^{(\infty)}}\|_1 = \sum_{i = 1}^{\infty} \vert p_i^{(n)} - p_i^{(\infty)} \vert \xrightarrow[n \rightarrow \infty]{} 0$ .
Proof. (b) $\Rightarrow$ (a): This follows immediately from the fact that
-
(a) $\Rightarrow$ (b): Let $\varepsilon > 0$ . Choose $n_0 = n_0(\varepsilon)$ such that
(A.1) \begin{equation} \sum_{i = 1}^{n_0} p_i^{(\infty)} > 1 - \varepsilon. \end{equation}
This can be done because $P^{(\infty)}$ is a probability distribution on $\mathbb{N}$ . Furthermore, let $n_1 = n_1(\varepsilon, n_0) \in \mathbb{N}$ be such that for every $n \ge n_1$ we have
Using this, for all $n \ge n_1$ we get
On the other hand, we have that for all $n \ge n_1$ ,
where in the first inequality we used (A.1) together with the fact that $\sum_{i = 1}^{\infty} p_i^{(n)} = 1$ , and in the last inequality we used (A.3). Combining (A.4) and (A.1) we get
for all $n \ge n_1$ . Finally, we have
where we used (A.3) and (A.5). This proves that $\|{P^{(n)} - P^{(\infty)}}\|_1 \xrightarrow[n \rightarrow \infty]{} 0$ assuming that $\|{P^{(n)} - P^{(\infty)}}\|_{\infty} \xrightarrow[n \rightarrow \infty]{} 0$ , which is exactly what we wanted to prove. □
Appendix B. Auxiliary results for Section 4
Proposition 6. Let $x, y > 0$ be such that $x + y < 1$ . Then
Proof. Define
We now show that $g(x, y) > 0$ for all $x, y > 0$ such that $x + y < 1$ . Let $y \in (0, 1)$ be arbitrary but fixed. Notice that
Hence, to prove that $g(x, y) > 0$ for all $x \in (0, 1 - y)$ (for fixed y), it is sufficient to show that $x \mapsto g(x, y)$ is strictly increasing on $(0, 1 - y)$ . We have
for
Therefore, it is enough to show that h(x) is a strictly increasing function on (0, 1), since then (for $y \in (0, 1)$ and $x \in (0, 1 - y)$ ) we would have $\frac{\partial g}{\partial x}(x, y) = h(x + y) - h(x) > 0$ . We verify that h(x) is strictly increasing on (0, 1) by showing that $h'(x) > 0$ on (0, 1). We have that
Hence, it remains to prove that
One way to see this is to prove that
As this is basic analysis we omit the details. □
Lemma 3. Let $p \in (0, 1)$ and let $D = \{x_1, x_2, \ldots, x_r \in (0, 1)\,{ :}\, \sum_{j = 1}^r x_{j} = 1\}$ . The function ${g}\,{:}\,{D}\to{\mathbb{R}}$ defined by
has a unique maximum on the set D at the point $(x_1, x_2, \ldots, x_r) = (\frac{1}{r}, \frac{1}{r}, \ldots, \frac{1}{r})$ .
The proof of Lemma 3 is a standard application of the method of Lagrange multipliers and is therefore omitted.
Proposition 7. Let $p \in (0, 1)$ and let
Then the sequence $(\tau_r(p))_{r \in \mathbb{N}}$ is an increasing sequence for all $p \in (0, 1)$ , and it holds that
Proof. Let us first show that the sequence $(\tau_r(p))_{r \in \mathbb{N}}$ is strictly increasing. For this, it is sufficient to show that for $p\in (0, 1)$ the function
is strictly increasing on $[1, \infty)$ , because the sequence $(\tau_r(p))_{r \in \mathbb{N}}$ satisfies $\tau_r(p) = (1 - p) (\varphi(r) - \varphi(1))$ . We will show that $\varphi'(x) > 0$ for all $x \in [1, \infty)$ . We have
Since $\lim_{x \to \infty} \varphi'(x) = 0$ and $\varphi'(x)$ is a continuous function on $[1, \infty)$ , it is now enough to show that $\varphi'(x)$ is strictly decreasing on $[1, \infty)$ to be able to conclude that $\varphi'(x) > 0$ for all $x \in [1, \infty)$ . Observe that
We can now check that $\varphi''(x) < 0$ on $[1, \infty)$ :
Since $p \in (0, 1)$ and $x \in [1, \infty)$ , we have $p/x \in (0, 1)$ , so the desired inequality follows from (B.2). Hence $\varphi'(x)$ is decreasing and we have that $\varphi'(x) > 0$ on $[1, \infty)$ , which is exactly what we wanted to prove.
Let us now calculate the limit of the sequence $(\tau_r(p))_{r \in \mathbb{N}}$ . Notice that
Applying L’Hospital’s rule twice, we obtain
Plugging this into Equation (B.4), we obtain
which concludes the proof. □
Acknowledgements
We wish to thank Serguei Popov for suggesting the name ‘greedy sampling’ and the whole IOTA research team for stimulating discussions. We also want to thank an anonymous referee for various helpful comments.
Funding information
A. Gutierrez was supported by the Austrian Science Fund (FWF) under Project P29355-N35. S. Šebek was supported by the Austrian Science Fund (FWF) under Project P31889-N35 and the Croatian Science Foundation under Project 4197. This financial support is gratefully acknowledged.
Competing interests
There were no competing interests to declare which arose during the preparation or publication process of this article.