Hostname: page-component-78c5997874-xbtfd Total loading time: 0 Render date: 2024-11-13T03:59:13.538Z Has data issue: false hasContentIssue false

Asymptotics of the overflow in urn models

Published online by Cambridge University Press:  11 July 2022

Raul Gouet*
Affiliation:
Universidad de Chile
Paweł Hitczenko*
Affiliation:
National Science Foundation and Drexel University
Jacek Wesołowski*
Affiliation:
Warsaw University of Technology
*
*Postal address: Departamento de Ingenieria Matemática and CMM (IRL 2807, CNRS), Universidad de Chile, Beauchef 851, 8370456 Santiago, Chile. Email: rgouet@dim.uchile.cl
**Postal address: Division of Mathematical Sciences, National Science Foundation, 2415 Eisenhower Avenue, Alexandria, VA 22314, USA.
****Postal address: Faculty of Mathematics and Information Science, Warsaw University of Technology, Koszykowa 75, Warsaw, Poland. Email: j.wesolowski@mini.pw.edu.pl

Abstract

Consider a finite or infinite collection of urns, each with capacity r, and balls randomly distributed among them. An overflow is the number of balls that are assigned to urns that already contain r balls. When $r=1$ , this is the number of balls landing in non-empty urns, which has been studied in the past. Our aim here is to use martingale methods to study the asymptotics of the overflow in the general situation, i.e. for arbitrary r. In particular, we provide sufficient conditions for both Poissonian and normal asymptotics.

Type
Original Article
Copyright
© The Author(s), 2022. Published by Cambridge University Press on behalf of Applied Probability Trust

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Arratia, R., Garibaldi, S. and Kilian, J. (2016). Asymptotic distribution for the birthday problem with multiple coincidences, via an embedding of the collision process. Random Structures Algorithms 48, 480502.10.1002/rsa.20591CrossRefGoogle Scholar
Barbour, A. Gnedin, A. (2009). Small counts in the infinite occupancy scheme. Electron. J. Prob. 14, 13, 365384.10.1214/EJP.v14-608CrossRefGoogle Scholar
Beśka, M., Kłopotowski, A. and Słomiński, L. (1982). Limit theorems for random sums of dependent d-dimensional random vectors. Z. Wahrscheinlichkeitsth. 61, 4357.10.1007/BF00537224CrossRefGoogle Scholar
Bobecka, K., Hitczenko, P., López-Blázquez, F., Rempała, G. and Wesołowski, J. (2013). Asymptotic normality through factorial cumulants and partition identities. Combinatorics Prob. Comput. 22, 213240.10.1017/S0963548312000545CrossRefGoogle ScholarPubMed
Chao, A. and Chiu, C.-H. (2016). Species richness: Estimation and comparison. In Wiley StatsRef: Statistics Reference Online (eds N. Balakrishnan, T. Colton, B. Everitt, W. Piegorsch, F. Ruggeri and J. L. Teugels), John Wiley, London.Google Scholar
Dupuis, P., Nuzman, C. and Whiting, P. (2004). Large deviation asymptotics for occupancy problems. Ann. Prob. 32, 27652818.10.1214/009117904000000135CrossRefGoogle Scholar
Gnedin, A., Hansen, B. and Pitman, J. (2007). Notes on the occupancy problem with infinitely many boxes: General asymptotics and power laws. Prob. Surv. 4, 146171.10.1214/07-PS092CrossRefGoogle Scholar
Helland, I. S. (1982). Central limit theorems for martingales with discrete or continuous time. Scand. J. Statist. 9, 7994.Google Scholar
Hwang, H. K. and Janson, S. (2008). Local limit theorems for finite and infinite urn models. Ann. Prob. 36, 9921022.10.1214/07-AOP350CrossRefGoogle Scholar
Joag-Dev, K. and Proschan, F. (1983). Negative association of random variables with applications. Ann. Statist. 11, 286295.10.1214/aos/1176346079CrossRefGoogle Scholar
Jogdeo, K and Patil, G. P. (1975). Probability inequalities for certain multivariate discrete distribution. Sankhyā B 37, 158164.Google Scholar
Johnson, N. L. and Kotz, S. (1977). Urn Models and Their Application . Wiley Series in Probability and Mathematical Statistics. John Wiley, New York.Google Scholar
Karlin, S. (1967). Central limit theorems for certain infinite urn schemes. J. Math. Mech. 17, 373401.Google Scholar
Knuth, D. E. (1998). The Art of Computer Programming, Vol. 2, 3rd edn. Addison-Wesley, Reading, MA.Google Scholar
Knuth, D. E. (1998). The Art of Computer Programming, Vol. 3, 2nd edn. Addison-Wesley, Reading, MA.Google Scholar
Kolchin, V. F., Sevastyanov, B. A. and Chistyakov, V. P. (1978). Random Allocations. Winston & Sons, Washington.Google Scholar
Kotz, S. and Balakrishnan, N. (1977). Advances in urn models during the past two decades. In Advances in Combinatorial Methods and Applications to Probability and Statistics, ed. N. Balkrishnan. Birkhäuser, Boston, MA, pp. 203257.Google Scholar
Mahmoud, H. M. (2009). Pólya Urn Models. CRC Press, Boca Raton, FL.Google Scholar
Mallows, C. L. (1968). An inequality involving multinomial probabilities. Biometrika 55, 422424.10.1093/biomet/55.2.422CrossRefGoogle Scholar
Mikhailov, V. G. (1981). The central limit theorem for the scheme of independent placements of particles among cells. Trudy Steklov Math. Inst. (MIAN) 157, 138152.Google Scholar
Monahan, J. F. (1987). An alternative method for computing the overflow probabilities. Commun. Statist. Theory Meth. 16, 33553357.10.1080/03610928708829575CrossRefGoogle Scholar
National Institute of Standards and Technology (2020). NIST Digital Library of Mathematical Functions, Release 1.0.28. (2020). Available at: https://dlmf.nist.gov.Google Scholar
Ramakrishna, M. V. (1987). Computing the probability of hash table/urn overflow. Commun. Statist. Theory Meth. 16, 33433353.10.1080/03610928708829574CrossRefGoogle Scholar