Lecture 11 — Discrete & Poisson Probability Distributions (第11讲——离散与泊松概率分布)


1. Overview (概览)

  • Random Variable(随机变量)
  • Discrete Probability Distribution(离散概率分布)
  • Binomial Distribution(二项分布)
  • Poisson Distribution(泊松分布)
  • Expected Value & Variance(期望值与方差)
  • Binomial vs Poisson(二项分布与泊松分布对比)
  • Cumulative Probability(累积概率)

2. Random Variable & Discrete Probability (随机变量与离散概率分布)

Random Variable(随机变量)

  • Assigns numerical values to random outcomes.
  • 为随机试验结果分配数值,量化不确定性。

Discrete Probability Distribution(离散概率分布)

  • Lists all possible values and their probabilities.
  • 展示离散型随机变量的所有可能取值及其概率。

Expected Value & Variance(期望与方差)

  • Expected Value = Long-run average (E[X])
  • 方差衡量数据波动 (Var[X])
  • Business use: budgeting & risk assessment(预算与风险分析)

3. Binomial Distribution (二项分布)

Definition & Conditions(定义与条件)

  • Fixed number of independent trials (n).
  • Each trial has two outcomes: success/failure.
  • Constant success probability (p).
  • 固定次数、独立性、恒定成功率。

Formula(公式)

  • f(x) = n! / [x!(n–x)!] × p^x × (1–p)^(n–x)
  • 组合项 × 概率乘积 = 成功次数的概率。

Mean & Variance(期望与方差)

  • E[X] = np
  • Var[X] = np(1–p)

Graph Shape(图形特征)

  • For p < 0.5 → Right-skewed(右偏)
  • For p = 0.5 → Symmetrical(对称)

4. Example — Evans Electronics(二项分布案例)

Case Background(案例背景)

  • Turnover rate = 10% per employee (p=0.1)
  • n=3 employees → Probability one will leave?
  • 三名员工中恰有一人离职的概率。

Calculation(计算过程)

  • P(one leaves) = 3×0.1×0.9² = 0.243
  • f(1) = 0.243 → 24.3% chance of one leaving.

Business Meaning(商业意义)

  • HR can forecast turnover and plan replacements.
  • 人力资源可预测离职人数与培训需求。

Excel Function(Excel函数)

  • =BINOM.DIST(x,n,p,FALSE) → individual probability
  • =BINOM.DIST(x,n,p,TRUE) → cumulative probability

5. Expected Value & Variance (二项分布的期望与方差)

Expected Value(期望值)

  • E[X] = np = 3×0.1 = 0.3
  • 平均每组三人中约有0.3人离职。

Variance(方差)

  • Var[X] = np(1–p) = 0.27
  • 标准差 σ = √Var = 0.52 → 波动性。

Application(应用)

  • Forecasting staffing needs(预测人力需求)
  • Estimating risk of turnover(评估离职风险)

6. Poisson Probability Distribution (泊松分布)

Concept(概念)

  • Models number of occurrences in fixed interval/time.
  • 描述固定时间或空间区间内事件发生次数的分布。

Formula(公式)

  • f(x) = (μ^x * e^–μ) / x!
  • μ = average rate of occurrence(平均发生率)
  • e = 2.71828(自然常数)

Properties(性质)

  • μ = σ² → Mean equals variance.
  • Events occur independently & at constant rate.
  • 事件独立,平均速率恒定。

Excel Function(Excel函数)

  • =POISSON.DIST(x,μ,FALSE) → non-cumulative
  • =POISSON.DIST(x,μ,TRUE) → cumulative

7. Poisson Distribution (Shape & Range) (泊松分布的形态与范围)

Range(取值范围)

  • x ∈ {0,1,2,…,∞}
  • 理论上无限,实际仅有限取值显著。

Shape(分布形态)

  • Small μ → Right-skewed(偏右)
  • Large μ → Nearly symmetric(近似正态)

Graph Analysis(图形分析)

  • μ=3 peaks at x=2,3 → f(x)=0.224 each.
  • 概率峰值出现在2与3处,之后迅速递减。

Business Application(商业应用)

  • Customer arrivals(顾客到达)
  • Equipment failures(设备故障)
  • Service demand forecasting(服务需求预测)

8. Example — Mercy Hospital(案例:仁慈医院)

Case Context(案例背景)

  • Avg. 6 patients/hour → in 30 min, μ=3.
  • 求30分钟内恰有4人到达的概率。

Calculation(计算)

  • f(4) = (3⁴ * e⁻³) / 4! = 0.168
  • 表示半小时内恰有4人到达概率为16.8%。

Practical Use(应用)

  • Predict staffing needs and emergency flow.
  • 医院利用该模型优化急诊人员配置。

9. Example — Baby Heart Disease(案例:新生儿心脏病)

Case Context(案例背景)

  • 8/1000 babies have disease → p=0.008
  • Among 120 infants: μ = 0.008×120 = 0.96。

Calculation(计算)

  • P(x=4) = (0.96⁴ * e⁻⁰․⁹⁶)/4! = 0.014
  • P(x=1) = (0.96¹ * e⁻⁰․⁹⁶)/1! = 0.3676。

Application(应用)

  • Disease prediction in epidemiology.
  • 用于流行病学中疾病概率预测与公共健康规划。

10. Binomial vs. Poisson Comparison(二项分布与泊松分布对比)

Relationship(关系)

  • Poisson approximates Binomial when n is large, p is small.
  • 当 n 大、p 小时,泊松分布可近似二项分布。

Key Differences(主要区别)

Binomial(二项分布)

  • Mean (期望): E(X) = np
  • Variance (方差): Var(X) = np(1–p)
  • Parameters (参数): n, p

Poisson(泊松分布)

  • Mean (期望): E(X) = μ
  • Variance (方差): Var(X) = μ
  • Relationship (关系): μ = np

Graph Interpretation(图形分析)

  • For n=10, p=0.3 → μ=3, shapes nearly identical.
  • 两者曲线几乎重合,泊松分布更平滑。

Business Implication(商业意义)

  • Use Binomial for fixed-trial experiments.
  • Use Poisson for continuous-time rare events.
  • 二项分布适合“实验次数固定”,泊松分布适合“连续时间随机事件”。

11. Cumulative Probability (累积概率)

Concept(概念)

  • P(X ≤ x) = sum of probabilities up to x.
  • 累积概率表示随机变量不超过 x 的总概率。

Graph Interpretation(图形解读)

  • Bars = single probability, line = cumulative total.
  • 柱形代表单点概率,折线代表累计概率逐渐逼近1。

Excel Function(Excel函数)

  • =BINOM.DIST(x, n, p, TRUE)
  • 快速求出累计概率。

Application(应用)

  • Quality control limits(质量控制阈值)
  • Risk thresholds(风险阈值设定)
  • 决策分析与容错范围评估。

12. Chapter Summary (章节总结)

Core Formulas(核心公式)

  • Binomial: f(x)=n!/[x!(n–x)!]·p^x·(1–p)^(n–x)
  • Poisson: f(x)=(μ^x·e^–μ)/x!

Key Relations(关键关系)

  • When n→∞, p→0, and np=μ → Binomial ≈ Poisson
  • 当 n 很大、p 很小且 np=μ 时,二项分布趋近泊松分布。

Applications(应用领域)

  • Binomial: Product testing, marketing success rate, HR turnover.
  • Poisson: Customer arrivals, medical emergencies, rare event modeling.
  • 二项分布适用于有限重复试验;泊松分布适用于随机到达与稀有事件。