Slide 1 — Statistical inference about the difference of population means(第1页——总体均值差的统计推断)
Knowledge Points (知识点)
- Inference for the difference of two population means(两个总体均值差的统计推断)
- Case 1: population standard deviations σ₁ and σ₂ known(情形一:已知总体标准差 σ₁、σ₂)
- Case 2: population standard deviations σ₁ and σ₂ unknown(情形二:未知总体标准差 σ₁、σ₂)
Explanation(解释)
- We study how to use two independent samples to make inferences about the difference in population means .
- There are two main scenarios:
- When both population standard deviations and are known, we use the z-distribution.
- When at least one of is unknown, we estimate them using sample standard deviations and use t-based methods.
- The goal is to construct confidence intervals and perform hypothesis tests about .
📖 点击查看中文解释
- 本章讨论如何利用两个独立样本来推断两个总体均值之差 。
- 主要分为两种情形:
- 当总体标准差 与 已知 时,采用 z 分布 进行推断;
- 当 或 至少有一个 未知 时,用样本标准差估计,并使用 t 分布 方法。
- 我们的目标是对 构造置信区间并进行假设检验。
Example(例子)
- Suppose we compare average spending of Group 1 (students who mainly shop online) and Group 2 (students who mainly shop offline).
- We want to know whether the population means differ, that is, whether .
- Depending on whether are known or unknown, we choose different formulas and distributions, but the target parameter is always .
📖 点击查看中文解释
- 例如比较两组学生的平均消费水平:第一组主要线上消费(总体 1),第二组主要线下消费(总体 2)。
- 我们关心总体均值是否不同,即 。
- 根据 是否已知,选择不同的公式和分布,但关注的参数始终是 。
Extension(拓展)
- This framework applies to many business problems: comparing two products, two branches, or two marketing strategies.
- Later sections will also discuss assumptions such as independence, normality, and large-sample approximations that justify using z or t procedures.
📖 点击查看中文解释
- 该框架可用于大量商业问题:比较两种产品、两家门店或两种营销策略的平均效果。
- 后续内容还会讨论使用 z 或 t 方法所需的前提假设,如样本独立性、总体近似正态以及大样本近似等。
Summary(小结)
- This chapter introduces two-sample inference for the difference in means, with separate methods for known and unknown population standard deviations.
📖 点击查看中文解释
- 本章介绍针对总体均值差的两样本推断,并根据总体标准差已知或未知采用不同的方法。
Slide 2 — Notation for two-population mean comparison(第2页——两个总体均值比较的符号约定)
Knowledge Points (知识点)
- Population parameters (总体参数:均值与标准差)
- Sample statistics (样本统计量:均值与样本量)
- Difference of population means and difference of sample means (总体均值差与样本均值差)
Explanation(解释)
- Population 1 has mean and standard deviation ; Population 2 has mean and standard deviation .
- We draw independent samples:
- Sample 1: size , sample mean from Population 1.
- Sample 2: size , sample mean from Population 2.
- The parameter of interest is the difference in population means .
- The point estimator of is the difference in sample means .
📖 点击查看中文解释
- 总体 1 的均值为 ,标准差为 ;总体 2 的均值为 ,标准差为 。
- 我们各自从两个总体中抽取独立样本:
- 样本 1:容量 ,样本均值 ,来自总体 1;
- 样本 2:容量 ,样本均值 ,来自总体 2。
- 我们关注的总体参数是总体均值差 。
- 其点估计量是样本均值差 。
Example(例子)
- Population 1: monthly online spending of all business students; Population 2: monthly online spending of all non-business students.
- We sample business students and non-business students and compute their sample means .
- We will use to estimate .
📖 点击查看中文解释
- 总体 1:所有商科学生的每月线上消费;总体 2:所有非商科学生的每月线上消费。
- 抽取 个商科学生和 个非商科学生,计算各自的样本均值 。
- 用样本均值差 来估计总体均值差 。
Extension(拓展)
- The notation extends naturally to paired-sample designs, but here we focus on independent samples.
- Clear notation helps when we later derive the sampling distribution of and construct confidence intervals.
📖 点击查看中文解释
- 这些符号也可以扩展到配对样本的情形,但本节主要讨论独立样本。
- 统一的符号约定有助于后续推导 的抽样分布并构造置信区间。
Summary(小结)
- We distinguish clearly between population parameters and sample statistics, and recognize as the point estimator of .
📖 点击查看中文解释
- 本页明确区分了总体参数与样本统计量,并认识到样本均值差 是总体均值差 的点估计量。
Slide 3 — Distribution of the difference of sample means(第3页——样本均值差的分布)
Knowledge Points (知识点)
- Expected value of (样本均值差的期望)
- Standard deviation / standard error of when are known(已知总体标准差时的标准差/标准误)
- Role of sample sizes (样本量对变异性的影响)
Explanation(解释)
- Under the usual assumptions (independent samples, each from a population with mean and variance ), the expected value of the difference in sample means is
- When the population standard deviations are known, the standard deviation (also called standard error) of is
- Larger sample sizes make the standard error smaller, leading to more precise estimates.
📖 点击查看中文解释
- 在常见假设下(两个样本相互独立,各自来自均值为 、方差为 的总体),样本均值差的期望值为
- 当总体标准差已知时,样本均值差 的标准差/标准误为
- 样本量 越大,标准误越小,估计就越精确。
Example(例子)
- Let .
- Then
- This tells us the typical sampling variation of the difference in sample means.
📖 点击查看中文解释
- 设 。
- 则
- 该值表示样本均值差在重复抽样中的典型波动大小。
Extension(拓展)
- When both populations are normal or when sample sizes are large, the distribution of is approximately normal with mean and standard deviation .
- This normality justifies using z-based confidence intervals and hypothesis tests in the “σ known” case.
📖 点击查看中文解释
- 当两个总体为正态分布,或样本量足够大时, 近似服从均值为 、标准差为 的正态分布。
- 这种正态性为“已知 σ” 情形下使用 z 置信区间与 z 检验提供了理论依据。
Summary(小结)
- The difference in sample means is an unbiased estimator of the difference in population means, and its standard error depends on both population variances and sample sizes.
📖 点击查看中文解释
- 样本均值差是总体均值差的无偏估计量,其标准误由两总体的方差和样本量共同决定。
Slide 4 — Confidence interval for when σ₁, σ₂ are known(第4页——已知 σ₁、σ₂ 时的均值差置信区间)
Knowledge Points (知识点)
- Point estimate (总体均值差的点估计)
- confidence interval formula with known σ₁, σ₂(已知总体标准差时的置信区间公式)
- Meaning of significance level in a two-tailed interval(双侧置信区间中的显著性水平)
Explanation(解释)
- The point estimate of the difference in population means is
- When are known and the sampling distribution of is normal, a confidence interval for is
- Here is the critical value from the standard normal distribution such that the two tails together have area .
📖 点击查看中文解释
- 总体均值差的点估计为
- 当 已知且 近似正态时, 的置信区间为
- 其中 是标准正态分布的临界值,使得两侧尾部的总面积为 。
Example(例子)
- Suppose .
- For a 95% confidence interval, and .
- The standard error is
- The interval is
- We are 95% confident that lies between 3.46 and 16.54.
📖 点击查看中文解释
- 设 。
- 对于 95% 置信区间,,。
- 标准误为
- 置信区间为
- 我们有 95% 的把握认为,总体均值差 介于 3.46 与 16.54 之间。
Extension(拓展)
- The same formula can be adapted for one-sided intervals by using instead of .
- The structure parallels the one-sample z-interval, but now the standard error combines the variability from both populations.
📖 点击查看中文解释
- 若构造单侧置信区间,只需将临界值改为 而非 。
- 该公式的结构与单总体 z 置信区间类似,只是标准误中合并了两个总体的变异性。
Summary(小结)
- With known population standard deviations, the confidence interval for is built from the point estimate plus/minus a z-critical value times the combined standard error.
📖 点击查看中文解释
- 当总体标准差已知时,均值差的置信区间由“样本均值差 ± z 临界值 × 合并标准误”构成,是两总体均值比较的基本工具。
Slide 5 — Confidence interval for μ₁ − μ₂(第5页——均值差置信区间)
Knowledge Points (知识点)
- Point estimate for (总体均值差的点估计)
- Confidence interval formula with known (已知总体标准差时的置信区间公式)
- Significance level and two-tailed intervals(显著性水平与双侧区间)
Explanation(解释)
- The point estimate of the difference in population means is
- When population standard deviations are known and samples are independent, a confidence interval for is
- Here is the critical value from the standard normal distribution such that each tail has probability .
- is the significance level for a two-tailed interval: it is the total probability outside the interval.
📖 点击查看中文解释
- 总体均值差 的点估计为
- 当总体标准差 已知且样本相互独立时, 的置信区间为
- 其中 是标准正态分布的临界值,使得每个尾部的概率为 。
- 是双侧置信区间的显著性水平,表示区间之外的总概率。
Example(例子)
- For a 95% confidence interval, we set .
- The corresponding critical value is .
- Any confidence interval using 95% confidence will have the form
📖 点击查看中文解释
- 若需要 95% 置信区间,则取 。
- 此时临界值为 。
- 任意 95% 置信区间都可以写成
Extension(拓展)
- If the hypothesized difference (often ) lies outside the confidence interval, we will later reject the null hypothesis in a two-tailed test.
- Thus confidence intervals and hypothesis tests are closely connected.
📖 点击查看中文解释
- 若某个假设的均值差(通常为 )落在置信区间之外,那么在后面进行的双侧假设检验中,我们会拒绝原假设。
- 因此,置信区间与假设检验之间有紧密联系。
Summary(小结)
- With known , the confidence interval for is constructed by taking the point estimate plus/minus a z-critical value times the combined standard error.
📖 点击查看中文解释
- 当总体标准差已知时,均值差的置信区间由“样本均值差 ± z 临界值 × 合并标准误”构成,是比较两个总体均值的基本工具。
Slide 6 — Example setup: ABC company vs competitor(第6页——示例设定:ABC 公司与竞争对手)
Knowledge Points (知识点)
- Comparing two population means using sample information(用样本信息比较两个总体均值)
- Identifying from a table(从表格中识别参数)
- Significance level in practice(实际问题中的显著性水平)
Explanation(解释)
- The ABC company compares the average life of its own product (Sample 1) with that of a competitor (Sample 2).
- Both samples are independent, and population standard deviations are assumed known.
📖 点击查看中文解释
- ABC 公司想比较自己产品(样本 1)与竞争对手产品(样本 2)的平均寿命。
- 两个样本相互独立,并且假设总体标准差已知。
Example(例子)
Data table(数据表)
| Sample 1 (ABC) | Sample 2 (Competitor) | |
|---|---|---|
| Sample size | 120 units | 80 units |
| Sample mean | 275 min | 258 min |
| Standard deviation | 15 min | 20 min |
- Significance level: .
📖 点击查看中文解释
- 样本 1(ABC 产品):
- 样本量 ,样本均值 分钟,标准差 分钟;
- 样本 2(竞争对手):
- 样本量 ,样本均值 分钟,标准差 分钟;
- 显著性水平:。
Extension(拓展)
- The question “Is there any difference?” corresponds to testing whether
or estimating a confidence interval to see if is included.
📖 点击查看中文解释
- “是否有差异?”这一问题在统计上对应于检验
或者构造置信区间,看 是否落在区间之内。
Summary(小结)
- This example provides realistic sample data and a chosen significance level, allowing us to compute a confidence interval for and judge whether ABC’s product differs from its competitor’s.
📖 点击查看中文解释
- 本例给出了具体的样本数据和显著性水平,为我们计算 的置信区间、判断 ABC 产品是否优于竞争对手提供了基础。
Slide 7 — Example calculation: confidence interval and conclusion(第7页——示例计算:置信区间与结论)
Knowledge Points (知识点)
- Computing the standard error of (计算样本均值差的标准误)
- Finding the confidence interval numerically(数值上求出置信区间)
- Using the interval to judge significance(利用置信区间判断显著性)
Explanation(解释)
- From the ABC example, the point estimate is
- The standard error of is
- With , the critical value is .
- The confidence interval is
📖 点击查看中文解释
- 对于 ABC 例子,均值差的点估计为
- 样本均值差的标准误为
- 显著性水平 时,临界值 。
- 置信区间为
Example(例子):Interpretation(区间解释)
- Because the entire interval is above 0, we can say:
- At the 5% significance level, there is significant evidence that ABC’s product has a larger mean life than its competitor’s.
- The estimated difference in mean life is between about 12 and 22 minutes.
📖 点击查看中文解释
- 因为区间 整体大于 0,所以在 5% 的显著性水平下:
- 有显著证据表明 ABC 产品的平均寿命高于竞争对手;
- 平均寿命的差异估计在约 12 到 22 分钟之间。
Extension(拓展)
- If 0 had been contained in the interval, we would conclude that the data are consistent with no difference in mean lifetimes.
- This logic is equivalent to performing a two-tailed hypothesis test with null hypothesis .
📖 点击查看中文解释
- 如果 0 落在置信区间内,我们会认为数据与“平均寿命无差异”的假设一致。
- 这种判断与对原假设 进行双侧假设检验的结论是等价的。
Summary(小结)
- For the ABC example, the 95% confidence interval shows a positive difference far from 0, indicating that ABC’s product performs significantly better than the competitor’s in terms of mean life.
📖 点击查看中文解释
- 在 ABC 示例中,95% 置信区间完全为正且远离 0,说明 ABC 产品在平均寿命上显著优于竞争对手。
Slide 8 — Hypothesis tests about μ₁ − μ₂ with known σ₁, σ₂(第8页——已知 σ₁、σ₂ 时的均值差假设检验)
Knowledge Points (知识点)
- Null and alternative hypotheses for comparing two means(比较两个均值的原假设与备择假设)
- Three types of tests: left-tailed, right-tailed, two-tailed(三种检验形式:左尾、右尾、双尾)
- z test statistic for when σ’s are known(已知总体标准差时的 z 检验统计量)
Explanation(解释)
Hypotheses(假设形式)
- Left-tailed test (testing if population 1 mean is smaller):
- Right-tailed test (testing if population 1 mean is larger):
- Two-tailed test (testing if there is any difference):
- Test statistic (known ):
📖 点击查看中文解释
- 左尾检验(检验总体 1 均值是否更小):
- 右尾检验(检验总体 1 均值是否更大):
- 双尾检验(检验是否存在差异):
- 当 已知时,检验统计量为
Example(例子)
- For the ABC company, to test whether there is any difference, we set and use the two-tailed hypotheses:
- The test statistic becomes
which is far into the rejection region for .
📖 点击查看中文解释
- 对 ABC 公司例子,若要检验“是否存在差异”,取 ,建立双尾假设:
- 检验统计量为
远大于 时的临界值 ,因此拒绝原假设。
Extension(拓展)
- Decision rules:
- Left-tailed: reject if .
- Right-tailed: reject if .
- Two-tailed: reject if .
- p-value methods lead to equivalent conclusions.
📖 点击查看中文解释
- 决策规则:
- 左尾检验:若 ,则拒绝 ;
- 右尾检验:若 ,则拒绝 ;
- 双尾检验:若 ,则拒绝 。
- 使用 p 值方法也会得到等价的结论。
Summary(小结)
- Hypothesis tests about specify a null value , choose the appropriate tail form, and use the z statistic based on the combined standard error when are known.
📖 点击查看中文解释
- 针对 的假设检验,需要给定假设差值 ,确定是左尾、右尾还是双尾检验,并在已知标准差时使用基于合并标准误的 z 统计量进行决策。
Slide 9 — One-sided test: ABC product vs competitor (p-value)(第9页——单侧检验:ABC 产品与竞争对手(p 值法))
Knowledge Points (知识点)
- Right-tailed test for difference of means(均值差的右尾检验)
- p-value vs. significance level (p 值与显著性水平的比较)
- Interpretation: “significantly higher” vs “not higher”(“显著更高”的解释)
Explanation(解释)
- Question: Is ABC’s mean product life higher than the competitor’s?
- We use a right-tailed test for the difference of two population means.
Step 1: Hypotheses
- : mean life of ABC’s product
- : mean life of competitor’s product
Step 2: Significance level
Step 3: Test statistic and p-value
- For , the p-value < 0.001 (very close to 0).
Decision rule (right-tailed):
If , reject .
Since , we reject and support .
📖 点击查看中文解释
- 问题:ABC 产品的平均寿命是否高于竞争对手?
- 使用右尾检验比较两个总体均值。
步骤 1:提出假设
- 其中 为 ABC 产品的总体平均寿命, 为竞争对手产品的总体平均寿命。
步骤 2:显著性水平
步骤 3:计算检验统计量与 p 值
- 对应的 p 值 < 0.001,远小于 0.01。
- 决策规则(右尾):若 ,则拒绝 。
- 因为 ,所以拒绝 ,支持 。
Example(例子)结论
- We conclude that ABC’s mean life is significantly higher than the competitor’s at the 1% significance level.
📖 点击查看中文解释
- 结论:在 1% 的显著性水平下,ABC 产品的平均寿命显著高于竞争对手产品。
Extension(拓展)
- The p-value approach does not require computing the critical value.
- It directly tells how extreme the observed is under .
📖 点击查看中文解释
- p 值方法不需要先求临界值,而是直接衡量在原假设成立时观测到当前 值“有多极端”。
- p 值越小,反对 的证据越强。
Summary(小结)
- For a one-sided right-tailed test, if the computed p-value is smaller than , we conclude that population 1’s mean is significantly larger than population 2’s mean.
📖 点击查看中文解释
- 在单侧右尾检验中,若 p 值小于显著性水平 ,则说明总体 1 的均值显著大于总体 2 的均值。
Slide 10 — One-sided test: ABC vs competitor (critical value)(第10页——单侧检验:ABC 与竞争对手(临界值法))
Knowledge Points (知识点)
- z-critical value for right-tailed test(右尾检验的临界值 )
- Compare test statistic with (用 与 比较做决策)
- Connection to p-value approach(与 p 值法的一致性)
Explanation(解释)
Same hypotheses and data as Slide 9:
- Test statistic is still .
Critical value for right-tailed test
Decision rule (right-tailed):
Reject if .
Since
we reject and conclude that ABC’s mean is significantly higher.
📖 点击查看中文解释
- 与第 9 页相同的假设与数据:
- 检验统计量仍为 。
右尾检验的临界值
- 决策规则:若 ,则拒绝 。
- 因为
所以拒绝 ,认为 ABC 产品的平均寿命显著更高。
Example(例子)比较两种方法
- p-value approach (Slide 9): compare p-value with .
- Critical value approach (this slide): compare with .
- Both give the same conclusion.
📖 点击查看中文解释
- p 值法(第 9 页):比较 p 值与 。
- 临界值法(本页):比较 与 。
- 两种方法得到的结论完全一致。
Extension(拓展)
- For a left-tailed test, we would use .
- For a two-tailed test, we use as critical values.
📖 点击查看中文解释
- 左尾检验中,临界值为 。
- 双尾检验中,临界值为 。
Summary(小结)
- In the critical-value approach, if the standardized test statistic lies in the rejection region (beyond the critical value), we reject ; otherwise, we fail to reject .
📖 点击查看中文解释
- 在临界值法中,若标准化统计量落入拒绝域(超过临界值),就拒绝 ;否则就“不拒绝” 。
Slide 11 — Practice: TOEFL scores of two universities(第11页——练习:两所大学托福成绩比较)
Knowledge Points (知识点)
- Two-sample z test for difference in mean scores(两总体均值差的 z 检验)
- Setting up hypotheses for “significant difference”(“是否有显著差异”的假设设定)
- Interpreting test results in context(在情境中解释统计结论)
Explanation(解释)
We compare TOEFL scores of Newland University and ABC University.
Data table(数据表)
| Group | Score ( ) | Standard deviation ( ) | Sample size ( ) |
|---|---|---|---|
| Newland University | 103 | 15 | 50 |
| ABC University | 96 | 10 | 50 |
- Significance level: .
Step 1: Hypotheses
“Is there significant difference?” → two-tailed test
where
- : mean TOEFL score of Newland students,
- : mean TOEFL score of ABC students.
Step 2: Test statistic
Point estimate:
Standard error:
z-score:
Step 3: Decision
- For a two-tailed test with , critical values:
- Since , we reject .
📖 点击查看中文解释
- 比较 Newland University 与 ABC University 学生的托福成绩。
- 数据见上表,显著性水平 。
步骤 1:假设(是否有差异 → 双尾检验)
- :Newland 学生托福平均分;:ABC 学生托福平均分。
步骤 2:检验统计量
步骤 3:决策
- 双尾检验、 时临界值为 。
- 因为 ,所以拒绝 。
Example(例子)结论
- There is a significant difference in mean TOEFL scores between Newland and ABC at the 5% level.
- Since , Newland students have higher average TOEFL scores.
📖 点击查看中文解释
- 在 5% 的显著性水平下,两所大学的托福平均分存在显著差异。
- 且 Newland 的样本均值更高,说明 Newland 学生的托福成绩平均更高。
Extension(拓展)
- We could also construct a 95% confidence interval for :
- Because 0 is not in this interval, the confidence-interval approach gives the same conclusion as the hypothesis test.
📖 点击查看中文解释
- 亦可构造 95% 置信区间:
- 由于区间不包含 0,与假设检验得到的“有显著差异”结论一致。
Summary(小结)
- For the TOEFL example, we used a two-sample z test (and an equivalent confidence interval) to show that Newland University’s mean TOEFL score is significantly higher than ABC University’s at .
📖 点击查看中文解释
- 托福示例通过两样本 z 检验(以及等价的置信区间)表明,在显著性水平 下,Newland University 学生的平均托福分数显著高于 ABC University。