Slide 2–3 — Chapter 1: Introduction to Data in Business Statistics
第2–3页——第1章:商业统计中的数据介绍
Knowledge Points (知识点)
- Data (数据)
- Elements (要素)
- Variables (变量)
- Observations (观测值)
Data (数据)
-
Explanation (解释):
Data are fact-based information (numbers, figures, tables) collected, analyzed, and summarized for interpretation.
数据是基于事实的信息(如数字、图表、表格),需要被收集、分析并总结以便解释。 -
Example (例子):
Sales revenue of a company in January is $500,000.
一家公司一月份的销售收入是 500,000 美元。 -
Extension (拓展):
Data provide the foundation for decision-making in business and economics.
数据为商业与经济中的决策提供基础。
Elements (要素)
-
Explanation (解释):
Elements are the objects or entities on which data are collected.
要素是数据收集的对象或实体。 -
Example (例子):
Each student in a class is an element in a survey about study habits.
在一个关于学习习惯的调查中,每个学生就是一个要素。 -
Extension (拓展):
Correctly identifying elements ensures data are representative.
正确识别要素能保证数据具有代表性。
Variables (变量)
-
Explanation (解释):
A variable is a characteristic, feature, or aspect of elements.
变量是要素的特征、属性或方面。 -
Example (例子):
Age, gender, and GPA are variables of a student.
年龄、性别和绩点是学生的变量。 -
Extension (拓展):
Variables can be qualitative (categorical) or quantitative (numerical).
变量可以是定性(类别)或定量(数值)。
Observations (观测值)
-
Explanation (解释):
An observation is a set of measurements collected for each variable of an element.
观测值是针对某个要素在各个变量上的一组测量数据。 -
Example (例子):
For one student: {Age = 20, Gender = Female, GPA = 3.5}.
对于一个学生:{年龄 = 20,性别 = 女,绩点 = 3.5}。 -
Extension (拓展):
Observations are usually recorded in rows of a dataset.
观测值通常记录在数据集的行中。
Slide 5–6 — Chapter 1: Measurement Scales and Data Types
第5–6页——第1章:测量尺度与数据类型
Knowledge Points (知识点)
- Scales of Measurement (测量尺度)
- Categorical Data (类别数据)
- Quantitative Data (定量数据)
Scales of Measurement (测量尺度)
-
Explanation (解释):
Variables can be measured using four scales: nominal, ordinal, interval, ratio.
变量可通过四种尺度来测量:名义尺度、顺序尺度、区间尺度、比率尺度。 -
Example (例子):
- Nominal: Gender (Male/Female).
- Ordinal: Ratings (Very good=5, Very bad=1).
- Interval: Temperature in °C.
- Ratio: Distance in km, or stock market loss probability.
- 名义:性别(男/女)
- 顺序:评分(非常好=5,非常差=1)
- 区间:摄氏温度
- 比率:距离(公里)、股市亏损概率
-
Extension (拓展):
Correct scale selection affects valid statistical methods.
正确选择尺度影响统计方法的有效性。
Categorical Data (类别数据)
-
Explanation (解释):
Categorical data represent categories or labels (nominal or ordinal).
类别数据表示类别或标签(名义或顺序)。 -
Example (例子):
- Nominal: Male=1, Female=2
- Ordinal: Very good=5, Very bad=1
- 名义:男=1,女=2
- 顺序:非常好=5,非常差=1
-
Extension (拓展):
Categorical data are often analyzed using counts, percentages, or frequency tables.
类别数据通常通过计数、百分比或频率表来分析。
Quantitative Data (定量数据)
-
Explanation (解释):
Quantitative data are numerical and measurable (interval or ratio).
定量数据是数值型、可测量的数据(区间或比率)。 -
Example (例子):
Distance = 10 km vs. 3 km; Stock loss chance = 0.5 vs. 0.3.
距离 = 10 公里 vs. 3 公里;股市亏损概率 = 0.5 vs. 0.3。 -
Extension (拓展):
Quantitative data allow advanced statistical analysis, such as mean, variance, and regression.
定量数据能进行高级统计分析,如均值、方差、回归。
Slide 8–9 — Chapter 1: Data Collection and Sources
第8–9页——第1章:数据收集与来源
Knowledge Points (知识点)
- Cross-sectional Data (横截面数据)
- Time Series Data (时间序列数据)
- Data Sources (数据来源)
- Data Acquisition Errors (数据获取误差)
Cross-sectional Data (横截面数据)
-
Explanation (解释):
Collected at a single point in time.
在同一时间点收集的数据。 -
Example (例子):
A survey of customer satisfaction conducted in May 2025.
2025年5月进行的一次顾客满意度调查。 -
Extension (拓展):
Useful for comparing groups but not for analyzing changes over time.
横截面数据适合比较群体,但不适合分析时间变化。
Time Series Data (时间序列数据)
-
Explanation (解释):
Collected over several time periods.
在多个时间段内收集的数据。 -
Example (例子):
Monthly sales revenue from 2020 to 2025.
2020年至2025年的月度销售收入。 -
Extension (拓展):
Time series is essential for trend, seasonality, and forecasting analysis.
时间序列在趋势、季节性与预测分析中非常重要。
Data Sources (数据来源)
-
Explanation (解释):
Data can come from existing sources (internal/external) or from statistical studies (experimental/observational).
数据可以来自现有来源(内部/外部)或统计研究(实验/观察)。 -
Example (例子):
Internal: company sales records; External: purchased industry reports.
内部:公司销售记录;外部:购买的行业报告。 -
Extension (拓展):
Different sources vary in cost, reliability, and accessibility.
不同数据来源在成本、可靠性和可获得性上有所不同。
Data Acquisition Errors (数据获取误差)
-
Explanation (解释):
Errors may occur in data collection and can be minimized by consistency checks and common sense.
数据收集过程中可能出现误差,可以通过一致性检查和常识来减少。 -
Example (例子):
Rejecting outliers like a reported age of 200 years.
拒绝异常值,比如报告的年龄为200岁。 -
Extension (拓展):
Data cleaning is an essential step before analysis.
数据清理是分析前的重要步骤。
Summary (总结)
本章的核心内容:
- 数据与要素、变量和观测值的关系
- 测量尺度(名义、顺序、区间、比率)及数据类型(类别 vs 定量)
- 数据收集方式(横截面与时间序列)和数据来源
- 数据获取中的误差与清理的重要性