Q1 — Setting Hypotheses for Delivery Time(设定送达时间的假设)

Question (EN): An e-commerce company promises that its average delivery time is no more than 3 days. A researcher wants to test whether customers are actually waiting longer than promised.

  1. Define the null hypothesis and alternative hypothesis using the population mean delivery time .
  2. State whether this is a one-tailed or two-tailed test, and which tail.
  3. In this context, describe in words what a Type I error and a Type II error would mean.

Q2 — Upper-Tail z-Test for Assembly Time(装配时间的右尾 检验)

Question (EN): A factory sets a goal that the average assembly time per unit should be 12 minutes or less. The population standard deviation is known to be minutes. A sample of units has a mean assembly time of minutes.

Using , test whether the goal has not been achieved (i.e., the mean exceeds 12 minutes). Assume normality.

  1. State and .
  2. Compute the -value.
  3. Using the critical-value approach (upper-tail), decide whether to reject with .
  4. Interpret the decision in context.

Q3 — Left-Tail p-Value Test for Defect Rate(次品率的左尾 值检验)

Question (EN): A quality manager claims that a new process reduces the defect rate below the historical level of . Let be the true defect rate (proportion). A sample of items from the new process shows a sample defect rate of with known standard deviation approximately .

Using , perform a left-tailed -test using the p-value approach.

  1. State and .
  2. Compute the -value.
  3. Find the p-value (left-tail).
  4. Decide whether to reject and state your conclusion.

Q4 — Two-Tailed Test with z and p(双尾 检验与 值)

Question (EN): A bank monitors the average waiting time at a branch. Historically, the mean waiting time is minutes with known minutes. After a layout change, a sample of customers yields a mean waiting time of minutes.

At , test whether the average waiting time has changed. Use both the critical-value approach and the p-value approach for a two-tailed test.


Q5 — Interpreting Type I and II Errors(理解第一类与第二类错误)

Question (EN): A university tests a new teaching method for an introductory statistics course. Let be the true mean exam score with the new method, and the traditional method has a historical mean of . The university tests

with .

Describe in words:

  1. What is a Type I error in this context?
  2. What is a Type II error?
  3. Which error is directly controlled by , and what does mean here?

Q6 — Confidence Interval vs Hypothesis Test(置信区间与假设检验)

Question (EN): For a certain production line, the target mean processing time is minutes. The population standard deviation is known to be minutes. A random sample of items gives a sample mean of minutes.

  1. Construct a confidence interval for the true mean .
  2. Using this interval, test vs at .
  3. Explain whether your conclusion matches what you would get from a two-tailed -test.

Q7 — Choosing One- or Two-Tailed Test(选择单尾或双尾检验)

Question (EN): For each situation below, decide whether you should use a one-tailed or two-tailed hypothesis test, and write appropriate and using the population mean .

  1. A marketing team claims a new advertisement increases the average daily sales above the current level of $50,000.
  2. A regulator wants to know if the average pollutant level of a factory is different from the legal limit of 30 ppm (parts per million), in either direction.

Q8 — Rejection Region and Observed z(拒绝域与观测 值)

Question (EN): Suppose you perform a two-tailed -test at significance level with

  1. Find the critical values and .

  2. For each observed test statistic, state whether you reject or fail to reject :

    • (a)
    • (b)
    • (c)

Q9 — Comparing Two p-Values(比较两个 值)

Question (EN): Two different studies test the same null hypothesis against the same alternative using independent samples. Both use significance level .

  • Study A reports p-value .
  • Study B reports p-value .
  1. For each study, state whether is rejected at .
  2. Which study provides stronger evidence against ? Explain.
  3. If a journal only accepts results with p-value below , would either study qualify?

Q10 — Integrated Hypothesis Test Scenario(综合检验与置信区间)

Question (EN): A call center claims its average call handling time is 6 minutes or less. Historical data suggest the population standard deviation is minutes. A supervisor takes a random sample of calls and finds a sample mean of minutes. Assume call times are approximately normal.

Using significance level :

  1. Set up and to test whether the center is failing to meet its claim.
  2. Compute the test statistic .
  3. Using the p-value approach, decide whether to reject .
  4. Construct the corresponding confidence interval for and check whether lies inside it.
  5. Explain how the CI result supports your test decision.