zoukankan      html  css  js  c++  java
  • 基本概率分布Basic Concept of Probability Distributions 2: Poisson Distribution

    PDF version

    PMF

    A discrete random variable $X$ is said to have a Poisson distribution with parameter $lambda > 0$, if the probability mass function of $X$ is given by $$f(x; lambda) = Pr(X=x) = e^{-lambda}{lambda^xover x!}$$ for $x=0, 1, 2, cdots$.

    Proof:

    $$ egin{align*} sum_{x=0}^{infty}f(x; lambda) &= sum_{x=0}^{infty} e^{-lambda}{lambda^xover x!}\ & = e^{-lambda}sum_{x=0}^{infty}{lambda^xover x!}\ &= e^{-lambda}left(1 + lambda + {lambda^2 over 2!}+ {lambda^3over 3!}+ cdots ight)\ & = e^{-lambda} cdot e^{lambda}\ & = 1 end{align*} $$

    Mean

    The expected value is $$mu = E[X] = lambda$$

    Proof:

    $$ egin{align*} E[X] &= sum_{x=0}^{infty}xe^{-lambda}{lambda^xover x!}\ & = sum_{x=1}^{infty}e^{-lambda}{lambda^xover (x-1)!}\ & =lambda e^{-lambda}sum_{x=1}^{infty}{lambda^{x-1}over (x-1)!}\ & = lambda e^{-lambda}left(1+lambda + {lambda^2over 2!} + {lambda^3over 3!}+cdots ight)\ & = lambda e^{-lambda} e^{lambda}\ & = lambda end{align*} $$

    Variance

    The variance is $$sigma^2 = mbox{Var}(X) = lambda$$

    Proof:

    $$ egin{align*} Eleft[X^2 ight] &= sum_{x=0}^{infty}x^2e^{-lambda}{lambda^xover x!}\ &= sum_{x=1}^{infty}xe^{-lambda}{lambda^xover (x-1)!}\ &= lambdasum_{x=1}^{infty}xe^{-lambda}{lambda^{x-1}over (x-1)!}\ & = lambdasum_{x=1}^{infty}(x-1+1)e^{-lambda}{lambda^{x-1}over (x-1)!}\ &= lambdaleft(sum_{x=1}^{infty}(x-1)e^{-lambda}{lambda^{x-1}over (x-1)!} + sum_{x=1}^{infty} e^{-lambda}{lambda^{x-1}over (x-1)!} ight)\ &= lambdaleft(lambdasum_{x=2}^{infty}e^{-lambda}{lambda^{x-2}over (x-2)!} + sum_{x=1}^{infty} e^{-lambda}{lambda^{x-1}over (x-1)!} ight)\ & = lambda(lambda+1) end{align*} $$ Hence the variance is $$ egin{align*} mbox{Var}(X)& = Eleft[X^2 ight] - E[X]^2\ & = lambda(lambda + 1) - lambda^2\ & = lambda end{align*} $$

    Examples

    1. Let $X$ be Poisson distributed with intensity $lambda=10$. Determine the expected value $mu$, the standard deviation $sigma$, and the probability $Pleft(|X-mu| geq 2sigma ight)$. Compare with Chebyshev's Inequality.

    Solution:

    The Poisson distribution mass function is $$f(x) = e^{-lambda}{lambda^xover x!}, x=0, 1, 2, cdots$$ The expected value is $$mu= lambda=10$$ Then the standard deviation is $$sigma = sqrt{lambda} = 3.162278$$ The probability that $X$ takes a value more than two standard deviations from $mu$ is $$ egin{align*} Pleft(|X-lambda| geq 2sqrt{lambda} ight) &= Pleft(X leq lambda-2sqrt{lambda} ight) + Pleft(X geq lambda + 2sqrt{lambda} ight)\ & = P(X leq 3) + P(X geq 17)\ & = 0.03737766 end{align*} $$ R code:

    sum(dpois(c(0:3), 10)) + 1 - sum(dpois(c(0:16), 10))
    # [1] 0.03737766 

    Chebyshev's Inequality gives the weaker estimation $$Pleft(|X - mu| geq 2sigma ight) leq {1over2^2} = 0.25$$

    2. In a certain shop, an average of ten customers enter per hour. What is the probability $P$ that at most eight customers enter during a given hour.

    Solution:

    Recall that the Poisson distribution mass function is $$P(X=x) = e^{-lambda}{lambda^xover x!}$$ and $lambda=10$. So we have $$ egin{align*} P(X leq 8) &= sum_{x=0}^{8}e^{-10}{10^{x}over x!}\ &= 0.3328197 end{align*} $$ R code:

    sum(dpois(c(0:8), 10))
    # [1] 0.3328197
    ppois(8, 10)
    # [1] 0.3328197 

    3. What is the probability $Q$ that at most 80 customers enter the shop from the previous problem during a day of 10 hours?

    Solution:

    The number $Y$ of customers during an entire day is the sum of ten independent Poisson distribution with parameter $lambda=10$. $$Y = X_1 + cdots + X_{10}$$ Thus $Y$ is also a Poisson distribution with parameter $lambda = 100$. Thus we have $$ egin{align*} P(Y leq 80) &= sum_{y=0}^{80}e^{-100}{100^{y}over y!}\ &= 0.02264918 end{align*} $$ R code:

    sum(dpois(c(0:80), 100))
    # [1] 0.02264918
    ppois(80, 100)
    # [1] 0.02264918 

    Alternatively, we can use normal approximation (generally when $lambda > 9$) with $mu = lambda = 100$ and $sigma = sqrt{lambda}=10$. $$ egin{align*} P(Y leq 80) &= Phileft({80.5-100over 10 } ight)\ &= Phileft({-19.5over10} ight)\ &=0.02558806 end{align*} $$ R code:

    pnorm(-19.5/10)
    # [1] 0.02558806 

    4. At the 2006 FIFA World Championship, a total of 64 games were played. The number of goals per game was distributed as follows: 8 games with 0 goals 13 games with 1 goal 18 games with 2 goals 11 games with 3 goals 10 games with 4 goals 2 games with 5 goals 2 games with 6 goals Determine whether the number of goals per game may be assumed to be Poisson distributed.

    Solution:

    We can use Chi-squared test. The observations are in Table 1.

    On the other hand, if this is a Poisson distribution then the parameter should be $$ egin{align*} lambda &= mu\ & = {0 imes8 + 1 imes13 +cdots + 6 imes2 over 8+13+cdots+2}\ & = {144over 64}\ &=2.25 end{align*} $$ And the Poisson point probabilities are listed in Table 2.

    And hence the expected numbers are listed in Table 3.

    Note that we have merged some categories in order to get $E_i geq 3$. The statistic is $$ egin{align*} chi^2 &= sum{(O-E)^2over E}\ &= {(8-6.720)^2 over 6.720} + cdots + {(4-4.992)^2 over 4.992}\ &= 2.112048 end{align*} $$ There are six categories and thus the degree of freedom is $6-1 = 5$. The significance probability is 0.8334339. R code:

    prob = c(round(dpois(c(0:6), 2.25), 3), 
    +          1 - round(sum(dpois(c(0:6), 2.25)), 3))
    expect = prob * 64
    prob; expect
    # [1] 0.105 0.237 0.267 0.200 0.113 0.051 0.019 0.008
    # [1]  6.720 15.168 17.088 12.800  7.232  3.264  1.216  0.512
    O = c(8, 13, 18, 11, 10, 4)
    E = c(expect[1:5], sum(expect[6:8]))
    O; E
    # [1]  8 13 18 11 10  4
    # [1]  6.720 15.168 17.088 12.800  7.232  4.992
    chisq = sum((O - E) ^ 2 / E)
    1 - pchisq(chisq, 5)
    # [1] 0.8334339 

    The hypothesis is $$H_0: mbox{Poisson distribution}, H_1: mbox{Not Poisson distribution}$$ Since $p = 0.8334339 > 0.05$, so we accept $H_0$. That is, it is reasonable to claim that the number of goals per game is Poisson distributed.

    Reference

    1. Ross, S. (2010). A First Course in Probability (8th Edition). Chapter 4. Pearson. ISBN: 978-0-13-603313-4.
    2. Brink, D. (2010). Essentials of Statistics: Exercises. Chapter 5 & 9. ISBN: 978-87-7681-409-0.


    作者:赵胤
    出处:http://www.cnblogs.com/zhaoyin/
    本文版权归作者和博客园共有,欢迎转载,但未经作者同意必须保留此段声明,且在文章页面明显位置给出原文连接,否则保留追究法律责任的权利。

  • 相关阅读:
    angular设置反向代理
    typescript设置全屏
    解决TS报错Property 'style' does not exist on type 'Element'
    安全错误使用CORS在IE10与Node和Express及XMLHttpRequest: 网络错误 0x4c7, 操作已被用户取消
    vue axios 请求 https 的特殊处理
    SQL Server 查询基础及分组(每三个小时一组)
    SQL Server 表结构操作
    SQL Server 存储过程T-SQL基础语法
    .Net Core 使用Http请求及基于 Polly 的处理故障
    .Net Core 通用主机(Core 在控制台应用程序中的应用)
  • 原文地址:https://www.cnblogs.com/zhaoyin/p/4199022.html
Copyright © 2011-2022 走看看