Group A — MGS 2150 Lecture 2 · Chap 1-1(10 题 · 数据类型、抽样、可视化与数据质量)

Question A1 — Scale Choice for Satisfaction / 满意度量表与度量选择

A manager wants to compare “average satisfaction” across 4 stores. Ratings are recorded as Very bad/Bad/Neutral/Good/Very good. What summary and chart are appropriate? 某经理想比较 4 家门店的“平均满意度”。评分为“非常差/差/中/好/非常好”。应采用什么汇总指标与图表?


Question A2 — Cross-Section vs Time Series / 横截面 vs. 时间序列

You have April 2025 “same-day” delivery times for all stores (cross-section) and monthly series from 2023–2025 (time series). What charts for each and why? 你有 2025 年 4 月所有门店“当日达”时长(横截面)和 2023–2025 的月度序列(时间序列)。各用何图?为什么?


Question A3 — Sampling Frame / 抽样框设计

To estimate average app waiting time in a day with strong peaks, how to sample? 要估计一天内 App 等待时长的平均值(高峰明显),如何抽样?


Question A4 — Data Quality & Outliers / 数据质量与极端值

A few tickets show “> 200 hours” delivery. Delete or not? 有少量工单显示配送时长“>200 小时”。是否删除?


Question A5 — Variable Types / 变量类型识别

Classify: (a) coupon amount, (b) member or not, (c) store tier, (d) reorder rate. 分类:(a) 优惠券面额;(b) 是否会员;(c) 门店等级;(d) 复购率。


Question A6 — Dashboard Ethics / 仪表盘伦理

A chart starts y-axis at 90% to exaggerate improvement. What to do? 图表把纵轴从 90% 起点,夸大改进。怎么办?


Question A7 — Simpson’s Paradox Lite / 轻量辛普森悖论

Overall conversion is lower for Channel A than B, but within “new vs returning” segments A is higher. What to report? 总体上 A 转化低于 B,但在“新客/老客”分组内 A 更高。怎么汇报?


Question A8 — Survey Weighting / 调查加权

Satisfaction survey over-samples heavy users. Without re-surveying, how to adjust? 满意度样本重度偏向高频用户,如何在不重采的前提下矫正?


Question A9 — Frequency Table & Bin Alignment / 频数分组对齐

When building a histogram for call duration, how to choose bin width to align with an SLA threshold at 120s? 制作通话时长直方图,如何让组距与 120 秒 SLA 对齐?


Question A10 — Metric Definition / 指标口径修订

“Store-visit conversion = store visits / app opens” is distorted by remote opens. Fix it. “到店转化率=到店人数/APP 打开人数”被大量异地打开扭曲。如何修订?


Group B — MGS 2150 Lecture 3 · Chap 1-2 v.1(10 题 · 表格/图表设计、帕累托、折线/注释、热力图)

Question B1 — Pareto for Returns / 退货帕累托优先级

After plotting a Pareto chart of return reasons, how do you pick the first fixes for the next 30 days? 画出退货原因帕累托图后,如何选定未来 30 天的优先修复项?


Question B2 — Small Multiples / 小倍图对比

You must compare distribution of turnover days for 12 stores in one page. Best approach? 一页内比较 12 家门店周转天数分布,最佳做法?


Question B3 — Dual-Axis Caution / 双轴图注意

Why can a sales vs. ad-spend dual-axis line mislead and what to use instead? 销售额与广告费双轴折线为何易误导?替代方案?


Question B4 — Heatmap Standardization / 热力图标准化

A region×category heatmap highlights big cities. How to avoid population bias? 地区×品类热力图总被一线城市“亮瞎”。如何避免人口尺度偏差?


Question B5 — Annotated Lines / 注释折线

How to show the effect of a pricing policy change in a time series without clutter? 如何在时间序列中提示“价格政策变化”的影响且不拥挤?


Question B6 — Table for Decisions / 决策型表格

List three must-have columns for an executive action table. 高层决策表必须包含的三列?


Question B7 — Category Granularity / 分类粒度

Combining “stock-out” and “supply break” caused diagnosis failure. Fix? 把“缺货/断供”合并成“无货”导致根因分析失败,如何修正?


Question B8 — Stem-and-Leaf Use / 茎叶图使用

When is a stem-and-leaf better than a box plot for managers? 在什么情况下茎叶图优于箱线图?


Question B9 — Real-time vs. Accuracy / 时效与准确

Daily dashboard is T+1. How to add timely insight without “passing off estimates as truth”? 日报 T+1,如何在不“以估作真”前提下提升时效?


Question B10 — Consistency Checks / 一致性校验

POS sales vs. WMS shipments differ due to returns timing. What documentation and checks? POS 销售与 WMS 出库因退货时点不同而不一致。需要哪些文档与校验?


Group C — MGS 2150 Lecture 4 · Chap 2-1(10 题 · 集中趋势、离散度、稳健度量)

Question C1 — Mean vs Median / 均值 vs. 中位数

Delivery times are right-skewed. Which center measure for “typical user experience”? 配送时长右偏,哪种中心更能代表“典型体验”?


Question C2 — Weighted Mean / 加权平均

Why use weighted mean for average discounted price? 折后均价为何要用加权平均


Question C3 — Geometric Mean / 几何平均

For 3 yearly growth rates 10%, −5%, 15%, why prefer geometric mean for multi-year growth? 三年增长 10%、−5%、15%,为何多期增长用几何平均?


Question C4 — Range, IQR, SD / 极差、IQR、标准差

Suggest a simple stability KPI less sensitive to outliers. 给一个对异常值不敏感的“稳定性 KPI”。


Question C5 — Coefficient of Variation / 变异系数

Two lines: μ₁=100, σ₁=8; μ₂=60, σ₂=7. Which is more stable? 两条产线:μ₁=100, σ₁=8;μ₂=60, σ₂=7。哪条更稳定?


Question C6 — Empirical Rule (Appropriate Use) / 经验法则的适用

When can you use the 68-95-99.7 rule to flag unusual observations? 何时可用 68-95-99.7 经验法则标记异常?


Question C7 — Chebyshev (Distribution-Free) / 切比雪夫不等式

Provide a conservative 2-SD bound coverage for any distribution. 给出任意分布下“均值±2σ”的保守覆盖率。


Question C8 — z-Scores & Standardization / z 分数与标准化

Why standardize metrics before comparing across categories with different scales? 跨品类、不同尺度的指标为何要标准化后比较?


Question C9 — Outlier Policy / 异常值政策

Give two principles for an outlier policy in reporting. 报表中的异常值政策给出两条原则。


Question C10 — Percentiles for SLA / 用分位数管理 SLA

SLA: “95% orders delivered within 24h”. What should the weekly report include? SLA:“95% 24 小时内送达”。周报应包含哪些统计?


  • Group D — Lecture 5 · Chap 2-2(四分位、箱线图、偏度峰度、稳健统计、展示规范)
  • Group E — Lecture 6 · Chap 3-1(加法/乘法/条件概率、独立性、计数)
  • Group F — Lecture 7 · Chap 3-2(贝叶斯、先验更新、阈值与代价)
  • Group G — Lecture 8 · Chap 4-1(离散型/泊松近似、几何/负二项、混合/稀疏化)
  • Group H — Lecture 9 · Chap 4-2(正态/指数/均匀、CLT、区间估计与样本量)
  • Group I — MGS 2150 6th(综合运用:指标栈、治理、A/B 合规、情景分析)

Group D — MGS 2150 Lecture 5 · Chap 2-2(10 题 · 分位数/箱线图/偏度峰度/稳健统计/展示规范)

Question D1 — Percentiles for SLA / 用分位数管理 SLA

A courier service promises “95% orders ≤ 24h”. Which percentiles should the weekly report show? 某快递承诺“95% 订单 24 小时内送达”。周报应展示哪些分位数?


Question D2 — Boxplot for Carriers / 用箱线图选择承运商

Three carriers’ delivery times are summarized by boxplots. Which features guide a contract decision? 三家承运商箱线图已给出。签约时应关注哪些要素?


Question D3 — Special vs. Common Cause / 特殊原因 vs 常见原因

A single-day spike of late orders is observed. How to decide whether to exclude from KPI? 某天超时订单激增,是否应从 KPI 考核剔除?


Question D4 — IQR Stability KPI / IQR 稳定性指标

Design a simple stability KPI using IQR. 用 IQR 设计一个“稳定性”KPI。


Question D5 — Skewness Interpretation / 偏度的运营含义

Customer spending shows positive skewness. What promotion idea fits this pattern? 客单价正偏(右长尾)。哪种促销更合适?


Question D6 — Kurtosis & Tail Risk / 峰度与尾部风险

High kurtosis is observed in delivery times. Operational risk? 配送时长峰度偏高,风险在哪里?


Question D7 — Combine Plots / 组合图避免信息过载

How to show distribution and threshold without clutter? 如何同时展示分布与阈值而不过载?


Question D8 — Robust Center / 稳健中心

Why prefer median over mean when outliers exist? 有离群值时,为什么首选中位数?


Question D9 — Winsorization Policy / 温莎化政策

When is winsorization appropriate and how to document it? 何时适合温莎化?如何记录以便审计?


Question D10 — Dashboard Guardrails / 图表护栏

List three guardrails to reduce misinterpretation. 列出三条图表“护栏”。


Group E — MGS 2150 Lecture 6 · Chap 3-1(10 题 · 概率加法/乘法/条件/独立/计数)

Question E1 — Addition Rule / 加法法则

70% members, 40% used coupons, 25% both. What is P(at least one)? 会员占 70%,用券 40%,交集 25%。至少使用一项概率?


Question E2 — Only One of Two / 仅其一

Using the same data, probability of “only one of the two”? 在同一数据下,“仅其一”的概率?


Question E3 — Complement / 补事件

System success 99.3%. Failure probability? 系统成功率 99.3%,失败概率?


Question E4 — Independence Check / 独立性直觉

If P(coupon | new)=0.6 but P(coupon)=0.4, independent? 若 P(券|新客)=0.6,而 P(券)=0.4,是否独立?


Question E5 — Conditional Probability / 条件概率

Click-to-purchase = 12%, overall purchase = 3%. What is click rate? 点击后购买率 12%,总体购买 3%。点击率?


Question E6 — Law of Total Probability / 全概率

Overall churn 8%; new users 30% with 15% churn. Old-user churn? 总体流失 8%,新客占 30% 且其流失 15%。老客流失率?


Question E7 — Basic Counting: Combinations / 组合

“12 choose 2” bundles and “at least 1 of 3 new SKUs”? 12 选 2 组合数;且“至少含 1 个 3 款新品”?


Question E8 — Basic Counting: Permutations / 排列

Three hosts for Mon/Tue/Wed, one per day, no repeat. Arrangements? 三位主播排三天,每天一人不重复。排法数?


Question E9 — Multiplication Rule / 乘法法则

P(A)=0.4, P(B)=0.5, A and B independent. P(A∩B)? 独立事件 A、B:P(A)=0.4,P(B)=0.5,求交集。


Question E10 — Bayes Lite (Interpretation) / 简易贝叶斯直觉

High-score leads 20%; P(sale|high)=15%, P(sale|low)=2%. If a random sale occurs, is it more likely from high-score? 高分线索 20%,其转化 15%,低分 80% 转化 2%。已知出现一笔成交,更可能来自高分吗?


Group F — MGS 2150 Lecture 7 · Chap 3-2(10 题 · 贝叶斯/先验更新/阈值与代价)

Question F1 — Bayes Posterior / 后验概率计算

High 20% with 15% conv; low 80% with 2%. Given a sale, P(high)? 高分 20%(转化 15%),低分 80%(2%)。已成交,来自高分的概率?


Question F2 — Base Rate Fallacy / 基准率陷阱

Disease rate 1%, sensitivity 95%, false positive 5%. P(disease | positive)? 患病率 1%,敏感度 95%,假阳性 5%。阳性后患病概率?


Question F3 — Precision Under Imbalance / 类别极不平衡的精确率

Alert fire rate 5%, true threat 0.2%, TPR 90%. Approx precision? 告警触发 5%,真实威胁 0.2%,召回 90%。精确率约多少?


Question F4 — Decision Threshold by Cost / 代价驱动阈值

When to block a transaction by posterior probability and costs? 如何用后验概率与代价设定拦截阈值?


Question F5 — Cascaded Screening / 级联筛查

Low-precision first screen + high-precision review: why two thresholds? 低精度初筛 + 高精度复核,为什么要两段阈值?


Question F6 — Sequential Updating / 顺序更新

How to update lead probability after each new signal? 每次新证据到来如何更新线索成交概率?


Question F7 — Prior Setting / 先验设定

Entering a new city with few observations, how to set priors? 新城市数据少,先验如何设?


Question F8 — Posterior to Action / 后验到动作

A coupon costs 5,预期毛利提升 ΔG。如何基于后验做决策?


Question F9 — Value of Information / 信息价值

When to stop testing and ship the better variant? 何时停止试验并上线更优版本?


Question F10 — Communicating Posterior / 沟通后验

How to present posterior to non-technical managers? 如何向非技术经理解释后验?


Group G — MGS 2150 Lecture 8 · Chap 4-1(10 题 · 二项/泊松/几何/负二项/混合/稀疏化)

Question G1 — Binomial Basics / 二项分布基础

Email delivery p=0.98, n=1000. Approx P(X=980) idea? 到达率 0.98,群发 1000,近似 P(X=980) 的思路?


Question G2 — Hypergeometric QA / 超几何抽检

200 items, 10 defects, draw 5 without replacement. P(at least one defect)? 200 件,10 次品,不放回抽 5。至少 1 次品概率?


Question G3 — Poisson Tail / 泊松尾部概率

λ=0.8/hr. P(≥2 next hour)? λ=0.8/小时。下一小时至少 2 起的概率?


Question G4 — Geometric Expectation / 几何分布期望

First-call resolution p=0.7. Expected number of calls? 首次解决率 0.7。期望通话次数?


Question G5 — Negative Binomial / 负二项期望

Need 3 successes, success p=0.6. Expected trials? 需要 3 次成功,单次成功率 0.6,期望尝试数?


Question G6 — Poisson Approximation / 泊松近似二项

When can Bin(n,p) ≈ Pois(λ=np)? Give an example. 何时二项可近似泊松?举例。


Question G7 — Thinning Property / 稀疏化

Events ~ Pois(λ=10/hr). Keep 20% alerts. New process? 告警流泊松(10/h),仅保留 20%。新过程?


Question G8 — Mixture Mean & Var / 混合分布的均值方差

30% high λ=12, 70% low λ=4. Find mean and variance. 高峰(30%) λ=12、平峰(70%) λ=4 的混合。求期望与方差。


Question G9 — Compound Poisson Sum / 复合泊松

Orders N~Pois(λ), amount i.i.d. with mean μ. E(total sales)? 订单数泊松,单笔金额独立同分布均值 μ。总销售额期望?


Question G10 — Net Difference: Skellam / 泊松差分

Two independent Poissons λ_A, λ_B. Distribution of (A−B)? 两独立泊松流 A、B 的差 A−B 的分布?


Group H — MGS 2150 Lecture 9 · Chap 4-2(10 题 · 正态/指数/均匀/CLT/区间/样本量)

Question H1 — Normal Tail Fee / 正态尾部加费

Weight ~ N(1.00, 0.06²) kg. Fee if >1.10 kg. What proportion pays fee? 重量正态(1.00, 0.06²),>1.10kg 加费。比例?


Question H2 — Exponential Waiting / 指数等待

λ=6/min (mean 10s). P(wait > 30s)? λ=6/分(均值 10 秒)。等待超过 30 秒概率?


Question H3 — Uniform Tolerance / 均匀公差

X~U[−0.5,0.5] mm. P(|X|>0.3)? 均匀分布在 [−0.5,0.5],超过 |0.3| 的概率?


Question H4 — CLT Sample Mean / 中心极限定理

σ=0.12, n=36, μ=1.00. P( X̄ >1.03 )? 总体 σ=0.12,n=36,均值 1.00。样本均值 >1.03 的概率?


Question H5 — z-Interval for Mean / 已知 σ 的均值区间

σ=0.12, n=100, X̄=1.02 kg. 95% CI? 已知 σ=0.12,样本 100,样本均值 1.02。95% 区间?


Question H6 — CI for Proportion / 比例区间

n=200, success=182. 95% CI? 样本 200,成功 182。95% 比例区间?


Question H7 — t vs z / 何时用 t

When to use t-interval for mean instead of z? 均值区间何时用 t 而不是 z?


Question H8 — One- vs Two-Tailed / 单尾与双尾

You only want to show “time decreased”. Which tail and why pre-register? 只想证明“时间缩短”,该用单尾;为何要预注册方向?


Question H9 — Practical vs Statistical / 实质 vs 统计显著

p=0.02 but mean time reduces only 0.2 minutes. How to communicate? p=0.02,但均值仅缩短 0.2 分钟。如何沟通?


Question H10 — Sample Size for Proportion / 比例样本量

Target p≈0.05, margin ±0.5pct, 95% CI. Rough n? 预估 p=0.05,误差 ±0.5 个百分点,95% 置信。样本量级?


Group I — MGS 2150 6th.pdf(10 题 · 综合:指标栈/治理/A-B 合规/情景分析)

Question I1 — KPI Stack / 指标全栈

Design a traceable KPI stack from raw logs to decisions. 设计一套“可追溯”的 KPI 栈(原始日志→指标→决策)。


Question I2 — Mixed Methods / 混合方法

Sales drop has both data and process changes. How to combine quant & qual? 销量下滑既有数据波动也有流程变化,如何混合方法诊断?


Question I3 — Metric Hierarchy / 指标层级

Build North Star → drivers → operational metrics to avoid local optima. 构建“北极星→驱动→操作”指标,避免局部最优。


Question I4 — Experiment Under Constraint / 样本受限的实验

Three variants but limited traffic. Design efficiently. 三方案但流量有限,如何高效设计?


Question I5 — Robust Weekly Pack / 稳健周报

Name five standard elements in a robust weekly analytics pack. 稳健的周报应固定展示的五项内容?


Question I6 — Scenario Planning / 情景规划

Demand is uncertain. How to plan inventory with scenarios? 需求不确定,如何做情景库存规划?


Question I7 — A/B Governance / A/B 合规

List key guardrails to prevent p-hacking. 防止 p-hacking 的关键机制?


Question I8 — Metric Drift / 指标漂移

Define and monitor metric drift. 如何定义并监控“指标漂移”?


Question I9 — From Correlation to Causation / 从相关到因果

Propose a roadmap from correlation to causal inference. 从相关到因果的路线图?


Question I10 — Three-Region Threshold / 三段阈值

Turn posterior into “block / review / pass” three regions with costs. 将后验概率转化为“拦截/复核/放行”三段阈值。