Approximations to the birthday problem with unequal occurrence probabilities and their application to the surname problem in Japan |
| |
Authors: | Shigeru Mase |
| |
Institution: | (1) Faculty of Integrated Arts and Sciences, Hiroshima University, Naka-ku, 730 Hiroshima, Japan |
| |
Abstract: | Let X
1, X
2,..., X
n be iid random variables with a discrete distribution {p
i
}
i
=1
m
. We will discuss the coincidence probability R
n
, i.e., the probability that there are members of {X
i
} having the same value. If m=365 and p
i
1/365, this is the famous birthday problem. Also we will give two kinds of approximation to this probability. Finally we will give two applications. The first is the estimation of the coincidence probability of surnames in Japan. For this purpose, we will fit a generalized zeta distribution to a frequency data of surnames in Japan. The second is the true birthday problem, that is, we will evaluate the birthday probability in Japan using the actual (non-uniform) distribution of birthdays in Japan.This research is supported in part by Grant-in-Aid for Scientific Research of the Ministry of Education, Science and Culture under the contact number 01540141 and 02640057. |
| |
Keywords: | Birthday problem coincidence probability non-uniformness Bell polynomial approximation surname |
本文献已被 SpringerLink 等数据库收录! |
|