Lecture 11 — Discrete & Poisson Probability Distributions (第11讲——离散与泊松概率分布)
1. Overview (概览)
- Random Variable(随机变量)
- Discrete Probability Distribution(离散概率分布)
- Binomial Distribution(二项分布)
- Poisson Distribution(泊松分布)
- Expected Value & Variance(期望值与方差)
- Binomial vs Poisson(二项分布与泊松分布对比)
- Cumulative Probability(累积概率)
2. Random Variable & Discrete Probability (随机变量与离散概率分布)
Random Variable(随机变量)
- Assigns numerical values to random outcomes.
- 为随机试验结果分配数值,量化不确定性。
Discrete Probability Distribution(离散概率分布)
- Lists all possible values and their probabilities.
- 展示离散型随机变量的所有可能取值及其概率。
Expected Value & Variance(期望与方差)
- Expected Value = Long-run average (E[X])
- 方差衡量数据波动 (Var[X])
- Business use: budgeting & risk assessment(预算与风险分析)
3. Binomial Distribution (二项分布)
Definition & Conditions(定义与条件)
- Fixed number of independent trials (n).
- Each trial has two outcomes: success/failure.
- Constant success probability (p).
- 固定次数、独立性、恒定成功率。
- f(x) = n! / [x!(n–x)!] × p^x × (1–p)^(n–x)
- 组合项 × 概率乘积 = 成功次数的概率。
Mean & Variance(期望与方差)
- E[X] = np
- Var[X] = np(1–p)
Graph Shape(图形特征)
- For p < 0.5 → Right-skewed(右偏)
- For p = 0.5 → Symmetrical(对称)
4. Example — Evans Electronics(二项分布案例)
Case Background(案例背景)
- Turnover rate = 10% per employee (p=0.1)
- n=3 employees → Probability one will leave?
- 三名员工中恰有一人离职的概率。
Calculation(计算过程)
- P(one leaves) = 3×0.1×0.9² = 0.243
- f(1) = 0.243 → 24.3% chance of one leaving.
Business Meaning(商业意义)
- HR can forecast turnover and plan replacements.
- 人力资源可预测离职人数与培训需求。
Excel Function(Excel函数)
- =BINOM.DIST(x,n,p,FALSE) → individual probability
- =BINOM.DIST(x,n,p,TRUE) → cumulative probability
5. Expected Value & Variance (二项分布的期望与方差)
Expected Value(期望值)
- E[X] = np = 3×0.1 = 0.3
- 平均每组三人中约有0.3人离职。
Variance(方差)
- Var[X] = np(1–p) = 0.27
- 标准差 σ = √Var = 0.52 → 波动性。
Application(应用)
- Forecasting staffing needs(预测人力需求)
- Estimating risk of turnover(评估离职风险)
6. Poisson Probability Distribution (泊松分布)
Concept(概念)
- Models number of occurrences in fixed interval/time.
- 描述固定时间或空间区间内事件发生次数的分布。
- f(x) = (μ^x * e^–μ) / x!
- μ = average rate of occurrence(平均发生率)
- e = 2.71828(自然常数)
Properties(性质)
- μ = σ² → Mean equals variance.
- Events occur independently & at constant rate.
- 事件独立,平均速率恒定。
Excel Function(Excel函数)
- =POISSON.DIST(x,μ,FALSE) → non-cumulative
- =POISSON.DIST(x,μ,TRUE) → cumulative
7. Poisson Distribution (Shape & Range) (泊松分布的形态与范围)
Range(取值范围)
- x ∈ {0,1,2,…,∞}
- 理论上无限,实际仅有限取值显著。
Shape(分布形态)
- Small μ → Right-skewed(偏右)
- Large μ → Nearly symmetric(近似正态)
Graph Analysis(图形分析)
- μ=3 peaks at x=2,3 → f(x)=0.224 each.
- 概率峰值出现在2与3处,之后迅速递减。
Business Application(商业应用)
- Customer arrivals(顾客到达)
- Equipment failures(设备故障)
- Service demand forecasting(服务需求预测)
8. Example — Mercy Hospital(案例:仁慈医院)
Case Context(案例背景)
- Avg. 6 patients/hour → in 30 min, μ=3.
- 求30分钟内恰有4人到达的概率。
Calculation(计算)
- f(4) = (3⁴ * e⁻³) / 4! = 0.168
- 表示半小时内恰有4人到达概率为16.8%。
Practical Use(应用)
- Predict staffing needs and emergency flow.
- 医院利用该模型优化急诊人员配置。
9. Example — Baby Heart Disease(案例:新生儿心脏病)
Case Context(案例背景)
- 8/1000 babies have disease → p=0.008
- Among 120 infants: μ = 0.008×120 = 0.96。
Calculation(计算)
- P(x=4) = (0.96⁴ * e⁻⁰․⁹⁶)/4! = 0.014
- P(x=1) = (0.96¹ * e⁻⁰․⁹⁶)/1! = 0.3676。
Application(应用)
- Disease prediction in epidemiology.
- 用于流行病学中疾病概率预测与公共健康规划。
10. Binomial vs. Poisson Comparison(二项分布与泊松分布对比)
Relationship(关系)
- Poisson approximates Binomial when n is large, p is small.
- 当 n 大、p 小时,泊松分布可近似二项分布。
Key Differences(主要区别)
Binomial(二项分布)
- Mean (期望): E(X) = np
- Variance (方差): Var(X) = np(1–p)
- Parameters (参数): n, p
Poisson(泊松分布)
- Mean (期望): E(X) = μ
- Variance (方差): Var(X) = μ
- Relationship (关系): μ = np
Graph Interpretation(图形分析)
- For n=10, p=0.3 → μ=3, shapes nearly identical.
- 两者曲线几乎重合,泊松分布更平滑。
Business Implication(商业意义)
- Use Binomial for fixed-trial experiments.
- Use Poisson for continuous-time rare events.
- 二项分布适合“实验次数固定”,泊松分布适合“连续时间随机事件”。
11. Cumulative Probability (累积概率)
Concept(概念)
- P(X ≤ x) = sum of probabilities up to x.
- 累积概率表示随机变量不超过 x 的总概率。
Graph Interpretation(图形解读)
- Bars = single probability, line = cumulative total.
- 柱形代表单点概率,折线代表累计概率逐渐逼近1。
Excel Function(Excel函数)
- =BINOM.DIST(x, n, p, TRUE)
- 快速求出累计概率。
Application(应用)
- Quality control limits(质量控制阈值)
- Risk thresholds(风险阈值设定)
- 决策分析与容错范围评估。
12. Chapter Summary (章节总结)
- Binomial: f(x)=n!/[x!(n–x)!]·p^x·(1–p)^(n–x)
- Poisson: f(x)=(μ^x·e^–μ)/x!
Key Relations(关键关系)
- When n→∞, p→0, and np=μ → Binomial ≈ Poisson
- 当 n 很大、p 很小且 np=μ 时,二项分布趋近泊松分布。
Applications(应用领域)
- Binomial: Product testing, marketing success rate, HR turnover.
- Poisson: Customer arrivals, medical emergencies, rare event modeling.
- 二项分布适用于有限重复试验;泊松分布适用于随机到达与稀有事件。