相关例题

Q1 — CLT and Probability for Sample Mean（中心极限定理与样本均值概率）

Question (EN): A logistics company records the daily shipping cost per truck. For a very long period, the population mean cost is known to be $μ = 520$ dollars with population standard deviation $σ = 80$ dollars. Assume the daily costs are not exactly normal but moderately skewed.

A manager takes a simple random sample of $n = 64$ days.

(a) Find the mean and standard error of the sampling distribution of $\overset{x}{ˉ}$ .
(b) Use the Central Limit Theorem to approximate $P (500 \leq \overset{x}{ˉ} \leq 540)$ .
(c) Explain, in words, why it is reasonable to use the normal approximation here.

📖 点击查看翻译

题目（中文）：
某物流公司长期记录每辆卡车的单日运输成本。已知在很长时间内，总体平均成本为 $μ = 520$ 美元，总体标准差为 $σ = 80$ 美元。假设单日成本分布并非完全正态，而是有适度偏态。
现随机抽取 $n = 64$ 天的成本数据作为简单随机样本。

(a) 求样本均值 $\overset{x}{ˉ}$ 抽样分布的均值和标准误差。

(b) 利用中心极限定理近似计算 $P (500 \leq \overset{x}{ˉ} \leq 540)$ 。

(c) 用文字解释：为什么在本例中使用正态近似是合理的？

📖 点击查看答案

(a)
抽样分布的均值：
$E (\overset{x}{ˉ}) = μ = 520$
标准误差：
$σ_{\overset{x}{ˉ}} = \frac{σ}{n} = \frac{80}{64} = \frac{80}{8} = 10$
(b)
我们要计算：
$P (500 \leq \overset{x}{ˉ} \leq 540)$
标准化：
$z_{1} = \frac{500 - 520}{10} = - 2, z_{2} = \frac{540 - 520}{10} = 2$ $P (500 \leq \overset{x}{ˉ} \leq 540) \approx P (- 2 \leq z \leq 2)$
由标准正态表：
$P (- 2 \leq z \leq 2) \approx 0.9545$
(c)
虽然单日成本分布略有偏态，但样本量 $n = 64$ 足够大，根据中心极限定理， $\overset{x}{ˉ}$ 的抽样分布会接近正态，因此可以使用正态近似来计算概率。

Quantity Formula Value
$E (\overset{x}{ˉ})$ $μ$ $520$
$σ_{\overset{x}{ˉ}}$ $\frac{σ}{n}$ $10$
$z_{1}$ $\frac{500 - 520}{10}$ $- 2$
$z_{2}$ $\frac{540 - 520}{10}$ $2$
$P (500 \leq \overset{x}{ˉ} \leq 540)$ $P (- 2 \leq z \leq 2)$ $\approx 0.9545$

结论： 在样本量较大的情况下，样本均值的抽样分布近似正态，本例中约有 $95.45%$ 的样本均值会落在 $500$ 到 $540$ 之间。

Quantity	Formula	Value
$E (\overset{x}{ˉ})$	$μ$	$520$
$σ_{\overset{x}{ˉ}}$	$\frac{σ}{n}$	$10$
$z_{1}$	$\frac{500 - 520}{10}$	$- 2$
$z_{2}$	$\frac{540 - 520}{10}$	$2$
$P (500 \leq \overset{x}{ˉ} \leq 540)$	$P (- 2 \leq z \leq 2)$	$\approx 0.9545$

📝 点击查看解析

思路 / 解析：

第一步：识别本题考察的是样本均值的抽样分布与中心极限定理（CLT），已知 $μ$ 与 $σ$ 。

第二步：根据公式 $σ_{\overset{x}{ˉ}} = \frac{σ}{n}$ 计算标准误差，用于后续标准化。

第三步：将区间端点转换为 $z$ 值： $z = \frac{x ˉ - μ}{σ _{\overset{x}{ˉ}}}$ ，再用标准正态分布查表或计算。

第四步：从 CLT 角度解释：虽然总体不完全正态，但 $n = 64$ 较大，故 $\overset{x}{ˉ}$ 近似服从正态分布，可以安全使用正态近似。

Q2 — Finite Population Correction in Employee Survey（员工调查中的有限总体修正）

Question (EN): A company has a finite population of $N = 400$ employees. The monthly overtime hours (in hours) in this population have a known standard deviation of $σ = 12$ hours.

A HR analyst selects a simple random sample:

Case A: $n = 20$ employees
Case B: $n = 80$ employees

For each case:

(a) Compute the standard error of $\overset{x}{ˉ}$ ignoring the finite population correction (FPC).
(b) Compute the standard error of $\overset{x}{ˉ}$ with the FPC, using
$σ_{\overset{x}{ˉ}} = \frac{N - n}{N - 1} \cdot \frac{σ}{n}$ .
(c) Comment on when the FPC is important and when it can be ignored.

📖 点击查看翻译

题目（中文）：
某公司共有 $N = 400$ 名员工（有限总体）。已知该总体中，员工每月加班时数的标准差为 $σ = 12$ 小时。
人力资源部门进行简单随机抽样：

情形 A： $n = 20$ 名员工；

情形 B： $n = 80$ 名员工。
对每种情形：

(a) 在**忽略有限总体修正系数（FPC）**的情况下，计算样本均值 $\overset{x}{ˉ}$ 的标准误差。

(b) 在考虑有限总体修正系数的情况下，用
$σ_{\overset{x}{ˉ}} = \frac{N - n}{N - 1} \cdot \frac{σ}{n}$
计算样本均值的标准误差。

(c) 说明在什么情况下 FPC 很重要，什么时候可以忽略。

📖 点击查看答案

(a) 忽略 FPC 的标准误差：

通用公式：
$σ_{\overset{x}{ˉ}} = \frac{σ}{n}$
情形 A（ $n = 20$ ）：
$σ_{\overset{x}{ˉ}, A} = \frac{12}{20} \approx 2.683$
情形 B（ $n = 80$ ）：
$σ_{\overset{x}{ˉ}, B} = \frac{12}{80} \approx 1.342$
(b) 考虑 FPC 的标准误差：

FPC 公式：
$σ_{\overset{x}{ˉ}} = \frac{N - n}{N - 1} \cdot \frac{σ}{n}$

情形 A： $N = 400, n = 20$
$FPC_{A} = \frac{400 - 20}{400 - 1} = \frac{380}{399} \approx 0.974$ $σ_{\overset{x}{ˉ}, A}^{(FPC)} = 0.974 \times 2.683 \approx 2.615$

情形 B： $N = 400, n = 80$
$FPC_{B} = \frac{400 - 80}{400 - 1} = \frac{320}{399} \approx 0.896$ $σ_{\overset{x}{ˉ}, B}^{(FPC)} = 0.896 \times 1.342 \approx 1.203$

Case $n$ Ignore FPC $σ_{\overset{x}{ˉ}}$ With FPC $σ_{\overset{x}{ˉ}}$ FPC Value
A $20$ $\approx 2.683$ $\approx 2.615$ $\approx 0.974$
B $80$ $\approx 1.342$ $\approx 1.203$ $\approx 0.896$

(c) 结论与评论：

情形 A 中， $n / N = 20/400 = 0.05$ ，FPC 略有影响，但不算特别大。

情形 B 中， $n / N = 80/400 = 0.20$ ，抽样比例更大，FPC 明显减小了标准误差。

一般当 $n / N \leq 0.05$ 时，FPC 影响很小，可以忽略；当 $n / N$ 较大时，应考虑 FPC。

结论： 在有限总体且抽样比例较大时，必须使用 FPC 来获得更准确的标准误差；抽样比例较小时，可以近似视为无限总体而忽略 FPC。

Case	$n$	Ignore FPC $σ_{\overset{x}{ˉ}}$	With FPC $σ_{\overset{x}{ˉ}}$	FPC Value
A	$20$	$\approx 2.683$	$\approx 2.615$	$\approx 0.974$
B	$80$	$\approx 1.342$	$\approx 1.203$	$\approx 0.896$

📝 点击查看解析

思路 / 解析：

本题考察有限总体修正系数（FPC）的应用以及何时需要修正。

第一步：先按无限总体公式 $σ_{\overset{x}{ˉ}} = \frac{σ}{n}$ 计算基准标准误差。

第二步：再用 FPC 公式 $\frac{N - n}{N - 1}$ 对标准误差进行修正，可观察修正前后的差异。

第三步：通过比较 $n / N$ 的大小来判断 FPC 的重要性：抽样比例越大，FPC 越重要。

核心理解：有限总体无放回抽样会降低样本之间的独立性，从而降低变异性，因此标准误差需要乘以 FPC 进行调整。

Q3 — Comparing Precision at Different Sample Sizes（不同样本量下估计精度比较）

Question (EN): The daily sales (in units) for a popular product have population mean $μ = 300$ and population standard deviation $σ = 50$ . Assume the population is very large and can be treated as infinite.

Two analysts independently estimate the mean daily sales:

Analyst A uses a simple random sample of $n_{A} = 25$ days.
Analyst B uses a simple random sample of $n_{B} = 100$ days.

For each analyst:

(a) Compute the standard error of $\overset{x}{ˉ}$ .
(b) Approximate $P (∣ \overset{x}{ˉ} - μ ∣ \leq 10)$ using the normal distribution.
(c) Comment on which estimate is more precise and why.

📖 点击查看翻译

题目（中文）：
某热门产品的每日销量（单位：件）总体均值为 $μ = 300$ ，总体标准差为 $σ = 50$ 。假设总体很大，可视为无限总体。
两位分析师分别估计平均每日销量：

分析师 A：使用 $n_{A} = 25$ 天的简单随机样本；

分析师 B：使用 $n_{B} = 100$ 天的简单随机样本。
对每位分析师：

(a) 计算样本均值 $\overset{x}{ˉ}$ 的标准误差；

(b) 用正态近似计算 $P (∣ \overset{x}{ˉ} - μ ∣ \leq 10)$ ；

(c) 比较哪个估计更精确，并说明原因。

📖 点击查看答案

(a) 标准误差：

通用公式（无限总体）：
$σ_{\overset{x}{ˉ}} = \frac{σ}{n}$

分析师 A（ $n_{A} = 25$ ）： $σ_{\overset{x}{ˉ}, A} = \frac{50}{25} = \frac{50}{5} = 10$

分析师 B（ $n_{B} = 100$ ）： $σ_{\overset{x}{ˉ}, B} = \frac{50}{100} = \frac{50}{10} = 5$

(b) 计算 $P (∣ \overset{x}{ˉ} - μ ∣ \leq 10)$ ：

我们要求：
$P (∣ \overset{x}{ˉ} - 300∣ \leq 10) = P (290 \leq \overset{x}{ˉ} \leq 310)$

分析师 A：
$σ_{\overset{x}{ˉ}, A} = 10, z_{1} = \frac{290 - 300}{10} = - 1, z_{2} = \frac{310 - 300}{10} = 1$ $P (290 \leq \overset{x}{ˉ} \leq 310) \approx P (- 1 \leq z \leq 1) \approx 0.6826$

分析师 B：
$σ_{\overset{x}{ˉ}, B} = 5, z_{1} = \frac{290 - 300}{5} = - 2, z_{2} = \frac{310 - 300}{5} = 2$ $P (290 \leq \overset{x}{ˉ} \leq 310) \approx P (- 2 \leq z \leq 2) \approx 0.9545$

| Analyst | $n$ | $σ_{\overset{x}{ˉ}}$ | $P (∣ \overset{x}{ˉ} - μ ∣ \leq 10)$ | |:—:|:—:|:—:|:—:| | A | $25$ | $10$ | $\approx 0.6826$ | | B | $100$ | $5$ | $\approx 0.9545$ |

(c) 评论：

分析师 B 的标准误差更小（ $5 < 10$ ），说明其样本均值波动更小。

对于 $\pm 10$ 的误差范围，分析师 B 的样本均值更可能落在该区间内（ $95.45%$ 对比 $68.26%$ ）。

因此，分析师 B 的估计更加精确。

结论： 增大样本量可以显著降低 $σ_{\overset{x}{ˉ}}$ ，提升样本均值对总体均值的估计精度。

📝 点击查看解析

思路 / 解析：

本题核心是比较不同样本量下的标准误差与估计精度，并用概率进行量化。

先用 $σ_{\overset{x}{ˉ}} = \frac{σ}{n}$ 计算两种样本量对应的标准误差，观察大小差异。

再将区间 $[290, 310]$ 转换为 $z$ 区间，用标准正态分布求概率。

看概率谁更大，就说明哪一个样本均值更有可能接近总体均值。

这体现了：样本量越大 → 标准误差越小 → 抽样分布越集中 → 估计越精确。

Q4 — Sampling Distribution of Sample Proportion（样本比例的抽样分布）

Question (EN): In a large online store, the long-run proportion of orders with express shipping is $p = 0.40$ . A data analyst takes a simple random sample of $n = 200$ orders.

(a) Compute the mean and standard error of the sampling distribution of $\overset{p}{^}$ .
(b) Use the normal approximation to estimate $P (0.35 \leq \overset{p}{^} \leq 0.45)$ .
(c) Explain the conditions under which the normal approximation for $\overset{p}{^}$ is valid.

📖 点击查看翻译

题目（中文）：
在某大型网店中，长期来看，选择加急配送的订单比例为 $p = 0.40$ 。数据分析员随机抽取 $n = 200$ 笔订单作为简单随机样本。

(a) 计算样本比例 $\overset{p}{^}$ 抽样分布的均值和标准误差；

(b) 使用正态近似估计 $P (0.35 \leq \overset{p}{^} \leq 0.45)$ ；

(c) 说明在什么条件下，对 $\overset{p}{^}$ 使用正态近似是合理的。

📖 点击查看答案

(a) 对样本比例 $\overset{p}{^}$ ，抽样分布的性质为：
$E (\overset{p}{^}) = p = 0.40$ $σ_{\overset{p}{^}} = \frac{p ( 1 - p )}{n} = \frac{0.4 \times 0.6}{200} = \frac{0.24}{200} = 0.0012 \approx 0.0346$
(b) 计算 $P (0.35 \leq \overset{p}{^} \leq 0.45)$ ：
$z_{1} = \frac{0.35 - 0.40}{0.0346} \approx - 1.44, z_{2} = \frac{0.45 - 0.40}{0.0346} \approx 1.44$ $P (0.35 \leq \overset{p}{^} \leq 0.45) \approx P (- 1.44 \leq z \leq 1.44)$
由标准正态表：
$P (- 1.44 \leq z \leq 1.44) \approx 0.851$
Measure Formula Result
$E (\overset{p}{^})$ $p$ $0.40$
$σ_{\overset{p}{^}}$ $\frac{p ( 1 - p )}{n}$ $\approx 0.0346$
$z_{1}$ $\frac{0.35 - 0.40}{0.0346}$ $\approx - 1.44$
$z_{2}$ $\frac{0.45 - 0.40}{0.0346}$ $\approx 1.44$
$P (0.35 \leq \overset{p}{^} \leq 0.45)$ $P (- 1.44 \leq z \leq 1.44)$ $\approx 0.851$

(c) 正态近似对样本比例有效的条件通常是：

$n p \geq 10$ 且 $n (1 - p) \geq 10$ 。
在本例中：

$n p = 200 \times 0.4 = 80 \geq 10, n (1 - p) = 200 \times 0.6 = 120 \geq 10$
条件明显满足，因此使用正态近似是合理的。

结论： 样本比例的抽样分布近似正态，约有 $85.1%$ 的样本比例会落在 $[0.35, 0.45]$ 之间。

Measure	Formula	Result
$E (\overset{p}{^})$	$p$	$0.40$
$σ_{\overset{p}{^}}$	$\frac{p ( 1 - p )}{n}$	$\approx 0.0346$
$z_{1}$	$\frac{0.35 - 0.40}{0.0346}$	$\approx - 1.44$
$z_{2}$	$\frac{0.45 - 0.40}{0.0346}$	$\approx 1.44$
$P (0.35 \leq \overset{p}{^} \leq 0.45)$	$P (- 1.44 \leq z \leq 1.44)$	$\approx 0.851$

📝 点击查看解析

思路 / 解析：

本题考察样本比例 $\overset{p}{^}$ 的抽样分布与正态近似条件。

第一步：用 $σ_{\overset{p}{^}} = \frac{p ( 1 - p )}{n}$ 计算标准误差，并给出 $E (\overset{p}{^}) = p$ 的无偏性结论。

第二步：将 $\overset{p}{^}$ 区间转换为 $z$ 区间，通过标准正态分布求概率。

第三步：检查 $n p$ 与 $n (1 - p)$ 是否都大于或等于 10，以判断是否可以使用正态近似。

这一题同时体现了无偏估计量（ $E (\overset{p}{^}) = p$ ）与样本量对近似精度的影响。

Q5 — Designing Sample Size for Target Precision（根据精度目标设计样本量）

Question (EN): A university wants to estimate the mean weekly study time (in hours) of its business students. From past data, the population standard deviation is approximately $σ = 30$ hours.

The dean wants the sample mean $\overset{x}{ˉ}$ to be within $\pm 5$ hours of the true mean $μ$ with $95%$ confidence, assuming the sampling distribution of $\overset{x}{ˉ}$ is approximately normal.

(a) Find the minimum required sample size $n$ to meet this precision requirement (use $z_{0.025} \approx 1.96$ ).
(b) Explain the relationship between sample size, standard error, and confidence interval width.
(c) Briefly comment on why $\overset{x}{ˉ}$ is an unbiased but not “perfect” estimator of $μ$ .

📖 点击查看翻译

题目（中文）：
某大学希望估计商学院学生每周学习时间（单位：小时）的总体均值。根据历史数据，已知总体标准差约为 $σ = 30$ 小时。
院长希望在 $95%$ 置信水平下，使样本均值 $\overset{x}{ˉ}$ 与真实均值 $μ$ 的误差不超过 $\pm 5$ 小时（假设 $\overset{x}{ˉ}$ 的抽样分布近似正态）。

(a) 计算满足上述精度要求所需的最小样本量 $n$ （取 $z_{0.025} \approx 1.96$ ）；

(b) 解释样本量、标准误差与置信区间宽度之间的关系；

(c) 简要说明为什么 $\overset{x}{ˉ}$ 虽然是 $μ$ 的无偏估计量，但仍不是“完美”的估计。

📖 点击查看答案

(a) 目标是：
$P (∣ \overset{x}{ˉ} - μ ∣ \leq 5) \approx 0.95$
对正态置信区间，有：
$\overset{x}{ˉ} \pm z_{α /2} \cdot σ_{\overset{x}{ˉ}}$
其中 $σ_{\overset{x}{ˉ}} = \frac{σ}{n}$ ，误差界（半宽度）为：
$E = z_{α /2} \cdot \frac{σ}{n}$
要求 $E = 5$ ，代入 $z_{α /2} = 1.96, σ = 30$ ：
$5 = 1.96 \cdot \frac{30}{n}$ $n = \frac{1.96 \cdot 30}{5} = 1.96 \cdot 6 = 11.76$ $n = (11.76)^{2} \approx 138.30$
因样本量必须为整数且要“向上取整”，
$n_{m i n} = 139$
(b) 样本量、标准误差与置信区间宽度的关系：

标准误差： $σ_{\overset{x}{ˉ}} = \frac{σ}{n}$ ，样本量 $n$ 越大， $σ_{\overset{x}{ˉ}}$ 越小；

置信区间半宽度： $E = z_{α /2} \cdot σ_{\overset{x}{ˉ}}$ ，当 $σ_{\overset{x}{ˉ}}$ 减小时，区间变窄；

因此，增大 $n$ 可以减小标准误差，从而缩短置信区间宽度，提高估计精度。

(c) 关于无偏但不“完美”：

无偏性： $E (\overset{x}{ˉ}) = μ$ ，说明在重复抽样的长期平均中， $\overset{x}{ˉ}$ 不会系统性高估或低估 $μ$ ；

但每一次具体样本仍受随机变异影响，可能偏离 $μ$ ，且方差 $σ_{\overset{x}{ˉ}}^{2}$ 不为零；

因此， $\overset{x}{ˉ}$ 是无偏但有变异的估计量，只能在概率意义上“逼近”真实均值。

结论： 至少需要 $n = 139$ 个样本才能在 $95%$ 置信水平下，把误差控制在 $\pm 5$ 小时以内；增大样本量可以缩小置信区间，但无法消除所有随机误差。

📝 点击查看解析

思路 / 解析：

本题综合考察：标准误差、置信区间、样本量设计、无偏估计量。

第一步：写出误差界公式 $E = z_{α /2} \cdot \frac{σ}{n}$ ，将 $E$ 、 $z_{α /2}$ 、 $σ$ 代入求解 $n$ 。

第二步：通过 $σ_{\overset{x}{ˉ}} = \frac{σ}{n}$ 解释为什么样本量越大，区间越窄，估计越精确。

第三步：讨论无偏性的含义—— $\overset{x}{ˉ}$ 的长期平均等于 $μ$ ，但单次样本仍会随机偏离，因此需要用标准误差和置信区间来刻画这种不确定性。

本题很好地将“点估计 + 抽样分布 + 标准误差 + 置信区间 + 无偏性”综合在一起。

Quartz 4

Explorer

相关例题

Q1 — CLT and Probability for Sample Mean（中心极限定理与样本均值概率）

Q2 — Finite Population Correction in Employee Survey（员工调查中的有限总体修正）

Q3 — Comparing Precision at Different Sample Sizes（不同样本量下估计精度比较）

Q4 — Sampling Distribution of Sample Proportion（样本比例的抽样分布）

Q5 — Designing Sample Size for Target Precision（根据精度目标设计样本量）

Graph View

Table of Contents

Backlinks