Hostname: page-component-78c5997874-dh8gc Total loading time: 0 Render date: 2024-11-10T16:19:31.390Z Has data issue: false hasContentIssue false

Consistent ordered sampling distributions: characterization and convergence

Published online by Cambridge University Press:  01 July 2016

Peter Donnelly*
Affiliation:
Queen Mary and Westfield College, London
Paul Joyce*
Affiliation:
University of Southern California
*
Postal address: School of Mathematical Sciences, Queen Mary and Westfield College, University of London, Mile End Road, London E1 4NS, UK.
∗∗Postal address: Department of Mathematics, University of Southern California, Los Angeles, CA 90089-1113, USA.

Abstract

This paper is concerned with models for sampling from populations in which there exists a total order on the collection of types, but only the relative ordering of types which actually appear in the sample is known. The need for consistency between different sample sizes limits the possible models to what are here called ‘consistent ordered sampling distributions'. We give conditions under which weak convergence of population distributions implies convergence of sampling distributions and conversely those under which population convergence may be inferred from convergence of sampling distributions. A central result exhibits a collection of ‘ordered sampling functions', none of which is continuous, which separates measures in a certain class. More generally, we characterize all consistent ordered sampling distributions, proving an analogue of de Finetti's theorem in this context. These results are applied to an unsolved problem in genetics where it is shown that equilibrium age-ordered population allele frequencies for a wide class of exchangeable reproductive models converge weakly, as the population size becomes large, to the so-called GEM distribution. This provides an alternative characterization which is more informative and often more convenient than Kingman's (1977) characterization in terms of the Poisson–Dirichlet distribution.

Type
Research Article
Copyright
Copyright © Applied Probability Trust 1991 

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Footnotes

Research partially supported by NSF grant DMS 86-08857.

References

Aldous, D. J. (1985) Exchangeability and related topics. In Ecole d' Été de Probabilités de Saint-Flour XIII, 1983 , ed. Hennequin, P. L., Lecture Notes in Mathematics 1117, Springer-Verlag, Berlin, pp. 1198.CrossRefGoogle Scholar
Cannings, C. (1974) The latent roots of certain Markov chains arising in genetics: a new approach. 1. Haploid models. Adv. Appl. Prob. 6, 260290.CrossRefGoogle Scholar
Davies, E. B. and Vincent-Smith, G. F. (1968) Tensor products, infinite products and projective limits of Choquet simplexes. Math. Scand. 22, 145164.CrossRefGoogle Scholar
Donnelly, P. (1986) Partition structures, Pólya urns, the Ewens sampling formula, and the ages of alleles, Theoret. Popn Biol. 30, 271288.CrossRefGoogle ScholarPubMed
Donnelly, P. (1991) Weak convergence to a Markov chain with an entrance boundary: ancestral processes in population genetics. Ann. Prob. CrossRefGoogle Scholar
Donnelly, P. and Tavaré, S. (1986) The ages of alleles and a coalescent. Adv. Appl. Prob. 18, 119.CrossRefGoogle Scholar
Donnelly, P. and Tavaré, S. (1987) The population genealogy of the infinitely-many neutral alleles model. J. Math. Biol. 25, 381391.CrossRefGoogle ScholarPubMed
Ethier, S. N. and Kurtz, T. G. (1986) Markov Processes—Characterization and Convergence . Wiley, New York.CrossRefGoogle Scholar
Ewens, W. J. (1972) The sampling theory of selectively neutral alleles. Theoret. Popn Biol. 3, 87112.CrossRefGoogle ScholarPubMed
Ewens, W. J. (1990) Population genetics theory—the past and the future. In Mathematical and Statistical Developments in Evolutionary Theory , ed. Lessard, S., Kluwer Academic Publishers, Dordrecht, pp. 177228.CrossRefGoogle Scholar
Kingman, J. F. C. (1977) The population structure associated with the Ewens sampling formula. Theoret. Popn Biol. 11, 274283.CrossRefGoogle ScholarPubMed
Kingman, J. F. C. (1978a) Random partitions in population genetics. Proc. R. Soc. London A 361, 120.Google Scholar
Kingman, J. F. C. (1978b) The representation of partition structures. J. London Math. Soc. (2) 18, 374380.CrossRefGoogle Scholar
Kingman, J. F. C. (1982a) On the genealogy of large populations. J. Appl. Prob. 19A, 2743.CrossRefGoogle Scholar
Kingman, J. F. C. (1982b) The coalescent. Stoch. Proc. Appl. 13, 235248.CrossRefGoogle Scholar
Kingman, J. F. C. (1982c) Exchangeability and the evolution of large populations. In Exchangeability in Probability and Statistics , ed. Koch, G. and Spizzichino, F., North-Holland, Amsterdam, pp. 97112.Google Scholar
Lauritzen, S. L. (1974) Sufficiency, prediction, and extreme models. Scand. J. Statist. 1, 128134.Google Scholar
Tavaré, S. (1984) Line-of-descent and genealogical processes and their applications in population genetics models. Theoret. Popn Biol. 26, 119164.CrossRefGoogle ScholarPubMed
Watterson, G. A. (1984) Lines of descent and the coalescent. Theoret. Popn Biol. 26, 7792.CrossRefGoogle Scholar