当前课程知识点:Learn Statistics with Ease > Chapter 8: Hypothesis Tests > 8.3Type I and Type II errors > 8.3.1 Hypothesis testing: basic steps 假设检验:基本步骤
返回《Learn Statistics with Ease》慕课在线视频课程列表
返回《Learn Statistics with Ease》慕课在线视频列表
爸爸
Dad
你怎么关注起
How do you pay attention to
女生喝茶的问题了
the problem of lady tasting tea
这是一本非常经典的
This is a very classic
统计学书籍.
book of statistics
里面有很多有趣的故事
There are many interesting stories about it
想不想听听女士品茶的故事
Do you want to listen to a story about The Lady Tasting Tea
好的
Yes
那是二十世纪二十年代后期
It was in Cambridge, the United Kingdom
在英国剑桥
in the late 1920s
一个夏日的午后
After the noon of a summer day
一群大学的绅士
a group of university gentlemen
和他们的夫人们
were drinking the afternoon tea
还有来访者
with their ladies
享用着下午茶
and visitors
在品茶过程中
While tasting the tea
一位女士坚称
a lady asserted that
把奶加进茶里
the tea would taste
或把茶加进奶里
different
不同的做法会使茶的味道
if treated in different ways, such as pouring milk into the tea
品起来不同
or pouring the tea into milk
在场的一帮科学精英们
The scientific elites present
对这位女士的胡言乱语
turned up their nose at
嗤之以鼻
babble of the lady
这怎么可能呢
How can this be
他们不能想象
They could not imagine
仅仅因为加茶
the mere difference in the order
加奶的先后顺序不同
or adding tea or milk
茶就会发生不同的化学反应
would cause different chemical reactions in the tea
然而在座的一个身材矮小
However, a scrub
带着厚眼镜的先生
gentleman wearing a pair of thick glasses there
也就是著名的费希尔先生
namely the well-known figure Mr. Fisher
却不这么看
did not think so
他对这个问题很感兴趣
He was very interested in this problem
他兴奋地说到
excitedly saying
让我们来检验这个命题
“Let’s verify this proposition
并开始策划一个实验
and start to hatch an experiment”
在实验中
In the experiment
坚持茶有不同味道的那位女士
the lady who had insisted the tea would taste different
被奉上一连串的
was offered a series of
已经调制好的茶
well mixed teas
其中有的是先加奶
some of which were made
后加茶制成的
in the tea-after-milk order
有的则是先加茶
whereas some were made
后加奶制成的
in the milk-after-tea order
费希尔先生调配出了
Mr. Fisher mixed
八杯其他条件一模一样
eight cups of tea with the same conditions
而仅仅是倒茶
only except that the orders of pouring tea
倒奶顺序相反的茶
and milk were opposite
其中两类各有四杯
There were four cups in each of the two types
然后他让女士品尝之后
Then he let the lady have a taste before
告诉他哪四杯
telling him which four cups
是先加奶后加茶制成的
of tea were made in the tea-after-milk order
当然
Of course
剩下的就是
the rest of them
先加茶后加奶制成的了
were made in the milk-after-tea order
在分析实验结果的时候
While analyzing the experiment result
他运用了这样的逻辑
he used such a logic
他首先假设
He began by hypothesizing
女士没有这个能力区分
the lady could not make a distinction
这个假设被称为原假设
This hypothesis is called the original hypothesis
然后如果女士很好地
If the lady identified
鉴别了这八杯茶
those eight cups of tea inerrantly
那就说明
it shows that
在原假设成立的情况下
something eccentric happened
发生了非常反常的现象
in the case that the original hypothesis holds
以至于说明
so much so that
原假设是令人怀疑的
the original hypothesis is doubtful
从统计上来说
Statistically
如果在原假设成立的前提下
if a very small probability event happens
发生了非常小概率的事件
under the premise that the original hypothesis holds
那我们就有理由怀疑
we have the reason to doubt
原假设的真实性
the veracity of the original hypothesis
费希尔先生的原假设是
The original hypothesis made by Mr. Fisher is that
女士没有这样的能力
the lady had no such ability
但是实际这位女士猜对了
While the lady guessed
所有的八杯茶
all the eight cups of tea right
如果在原假设成立的前提下
If the lady was guessing blindly without any ground
女士是毫无根据的瞎猜
under the premise that the original hypothesis holds
她猜对的概率是1/70
and the probability she guessed right was 1/70
这个概率很小
a very small probability
只能说明
it only shows
女士是具有这个能力的
the lady had such an ability
如果女士没有这个能力
If the lady had no such ability
但是她侥幸靠瞎猜
but she Passed the test
通过了测试
by guessing blindly
使我们错误地认为
so that we misbelieve
她有这个能力
she had such ability
这个概率只有1/70
the probability is only 1/70
这种错误
and such errors
被称为第一类错误
are called Type I errors
一般来说
Generally
不希望这个错误
it is hoped that the probability
发生的概率超过5%
such errors occur does not exceed 5%
刚才的故事告诉我们
The story just now tells us
假设检验
hypothesis testing
是先对总体的参数
is a statistical procedure that starts with
提出某种假设
the proposal of some hypothesis about the parameter(s) of the population
然后我们利用样本的信息
before we utilize the information on a sample
来判断假设是否成立的统计方法
to judge whether the hypothesis is true
它在逻辑上
Logically
运用的是反证法
it employs the proof by contradiction
那么统计上
Statistically
依据的是小概率原理
it relies on the small probability principle
那么总结刚才故事当中
To summarize the basic steps
体现出来的假设检验的
of hypothesis testing manifested
基本步骤
in the story just now
那么我们可以
we can
把这个假设检验的步骤
generalize these steps of hypothesis testing
概括为以下的四点
as the following four points
第一点
First
大家首先要建立
everyone shall set up
正确的原假设和备择假设
the knowledge point on
这个知识点
correct original hypothesis and alternative hypothesis
我们在开始的时候
At the very beginning
给大家详细的讲解过
we interpreted in details to everyone
大家注意关键点
The key points to note
就是我们先确定备择假设
are that we shall decide on the alternative hypothesis
再确定原假设
before the original hypothesis
等号总在原假设上
and that the equal sign is always attached to the original hypothesis
那么备择假设
So the alternative hypothesis
是我们想收集证据
is the hypothesis we want to corroborate
予以支持的假设
by collecting evidence
这些关键点
Hope everyone pays attention to
希望大家重视起来
these key points
第二步
At the second step
我们要确定
we shall determine
我们假设检验的检验统计量
the test statistics in our hypothesis test
那么依据检验统计量的取值
According to the value of the test statistics
我们最终对原假设
we make the final judgment on the original hypothesis
和备择假设作出判断
and the alternative hypothesis
那么如何确定
So how to determine
检验统计量的内容呢
the content of test statistics
我们在后续的学习当中
In the subsequent learning
会给大家介绍
we will make an introduction to everyone
第三步
At the third step
就是根据事先确定的显著性水平
we determine the rejection region
确定拒绝域
according to the predetermined level of significance
第四步 就是根据我们
At the fourth step, we compare
第二步确定的
the value of the test statistics determined in the second step
检验统计量的这个取值
那么跟我们的
with
拒绝域的临界值做比较
the critical value of the rejection region
最终作出判断
thereby making the final judgment
下面我们给大家介绍一下
Below we introduce to everyone
检验统计量
something about test statistics
那么检验统计量
Test statistics
就是在假设检验当中
are the sample statistics for making a judgment on the original hypothesis
我们根据样本数据计算得到的
we work out according to the sample data
对原假设进行判断的样本统计量
during the hypothesis test
也就是说
In other words
我们怎么认为小概率事件
whether we believe a small probability event
有没有发生
has happened
事实上就是依据
relies in fact on
检验统计量的取值来确定的
the value of test statistics
那么在假设检验当中
During the hypothesis test
我们需要根据所检验的问题
We need to
样本容量以及总体的分布
determine the test statistics in a scientific way
和总体的分布
based on the problem under test
是否已知等各种条件
whether the sample size
要科学地确定检验统计量
and the distribution of population is known, and other conditions
那么在后续的学习当中
In the subsequent learning
我们会根据具体的情况
we will introduce the corresponding test statistics
给大家介绍相应的检验统计量
to everyone, contingent on the specific situation
下面我们看几张图
Next, let’s look at several graphs
那么第一张图
The first graph
是我们双侧检验的
is a schematic diagram of hypothesis test
假设检验的一个示意图
of our two-sided test
那么在这个图形当中
In this graph
大家看
everyone notices
因为是双侧检验
since it is a two-sided test
我们说不等于
we say “unequal to”
有大于和小于两种情况
includes both greater-than and smaller-than cases
所以在双侧检验当中
So during the two-sided test
大家看
everyone notices
我们两侧的
the shaded areas
这个阴影部分的面积
on both sides
各为α/2
are α/2 each
加起来就是α
summing up to α
也就是我们所说的显著性水平
the so-called level of significance
而显著性水平
While the level of significance
就是我们人为规定的
is the artificially specified
小概率事件的这个概率
probability of a small probability event
也就是说检验统计量的取值
In other words, once the value of test statistics
一旦落在了
falls
这两个阴影部分的
inside either
这个区域的话
of the shaded areas
那么就意味着
it means that
小概率事件发生了
the small probability even happens
那么根据假设检验的反证法
Based on the proof by contradiction for hypothesis testing
和小概率原理的这样一个思想
and on the thought of small probability principle
在原假设成立的情况下
in the case that the original hypothesis holds
我们认为小概率事件
we do not believe the small probability event
是不会发生的
would happen
如果小概率事件发生了
If the small probability event should happen
那么我们就有理由
we have the reason
拒绝原假设了
to reject the original hypothesis
那么这个就是双侧检验的
This is such a rejection region
这样一个拒绝域
of two-sided test
就是在两侧的这个阴影部分
It is the shaded area on either side
就是我们的这个拒绝域
It is the rejection region
接下来的这个图
The next graph
它是一个左侧检验的拒绝域
is the rejection region of left-sided test
大家要注意一点
Everyone shall pay attention that
我们左侧检验的拒绝域在左边
the rejection region of left-sided test is on the left
因为左侧检验的
since the original hypothesis
它的原假设
of left-sided test
是μ大于等于μ0
is μ is greater than or equal to μ0
而备择假设
while the alternative hypothesis
是μ小于μ0
is that μ is smaller than μ0
那么我们在原假设
In the case that
成立的情况下
the original hypothesis holds
不可能发生的这个事件
we say the event unlikely to happen
我们说肯定是在小于这一侧
must fall on the smaller-than side
所以我们假设检验的
This is why the rejection region
这个拒绝域在左侧
of the hypothesis test is on the left
就是在左侧检验的时候
That is, the rejection region of the hypothesis is on the left
我们假设检验的拒绝域在左边
during the left-sided test
那么大家看
Everyone notices
左边这块阴影部分的面积
the shaded area on the left
就是α
is α
这就是我们刚刚说的
This is the level of significance
这个显著性水平
we have just mentioned
就是在这一侧
On this side
我们认为
we consider
如果检验统计量的值
if the value of a test statistic
取到了这个区间
falls within this interval
那么就意味着
it means
小概率事件发生了
the small probability event happens
那么我们同样
We have the reason to reject the original hypothesis
有理由拒绝原假设
as well
同样的道理
By the same token
我们可以看右侧检验
we can examine the right-sided test
在右侧检验当中
In the right-sided test
我们可以看到
we can notice that
右侧检验的拒绝域
its rejection region
就在右边
is on the right side
因为右侧检验的原假设
This is because the original hypothesis of the right-sided test
是μ小于等于μ0
is μ is smaller than or equal to μ0
当然它不可能发生的
which is definitely impossible
这个事件的方向
We say the direction of the event
我们说应该是在
should be
μ大于μ0的这个方向
the one in which μ is greater than μ0
这样的话
This way
我们同样
we determine
同样我们在右边
such a shaded area
确定了这样一个阴影部分
on the right as well
那么它这个阴影部分的面积就是α
The shaded area is α
就是我们人为规定的
the artificially specified
这个小概率事件的概率
probability of the small probability event
也就是我们的显著性水平
also the level of significance
根据刚才的三个图形
According to the three graphs just now
我们就可以确定
we can determine
我们统计量决策的这个规则
the rule for statistic decision-making
当然在给出
Of course
显著性水平α的这个前提下
Given the level of significance, α
那么我们要找出临界值
we shall find the critical value
也就是说
namely
阴影部分的那个界限
the limit to the shaded area
那么可以根据
We can
你是双侧检验或者单侧检验
determine the corresponding critical value
去确定相应的临界值
depending on whether it is a two-sided test or one-sided test
接下来
Next
我们就根据
we compare
检验统计量的取值
the value of the test statistic
跟我们确定的临界值进行比较
with the critical value we have determined
在双侧检验的情况下
In the case of two-sided test
如果我们统计量的取值
if the absolute value of the statistic
它的绝对值大于临界值
is greater than the critical value
因为双侧检验
namely
我们既要考虑左侧
the value of the statistic
又要考虑右侧
falls in the rejection region
那么这样的话
then
如果这个检验统计量的
we shall reject the original hypothesis
取值的绝对值大于临界值了
since in two-sided test
那么也就是说
we should
这个统计量的值
consider both the left side
我们说它取在了拒绝域里
and
那么我们要拒绝原假设
the right side
在左侧检验的时候
During the left-sided test
我们的拒绝域在左边
the rejection region is on the left
所以只要统计量的取值
so as long as the value of the statistic
小于我们的临界值
is smaller than the critical value
那么我们就要拒绝原假设
we shall reject the original hypothesis
那么当然
Of course
在右侧检验的时候
during the right-side test
如果我们统计量的这个取值
if the value of the statistic
大于这个临界值的话
is greater than the critical value
那么同样
we shall also reject the original hypothesis
我们也要拒绝原假设
the same way
因为这意味着
After all, this means
小概率事件发生了
given the small probability event has happened
我们就有理由拒绝原假设
we have the reason to reject the original hypothesis
那么这一讲
So much for
我们就讲到这里
this lecture
谢谢大家
Thank you everyone
-1.1 Applications in Business and Economics
--1.1.1 Statistics application: everywhere 统计应用:无处不在
-1.2 Data、Data Sources
--1.2.1 History of Statistical Practice: A Long Road 统计实践史:漫漫长路
-1.3 Descriptive Statistics
--1.3.1 History of Statistics: Learn from others 统计学科史:博采众长
--1.3.2 Homework 课后习题
-1.4 Statistical Inference
--1.4.1 Basic research methods: statistical tools 基本研究方法:统计的利器
--1.4.2 Homework课后习题
--1.4.3 Basic concepts: the cornerstone of statistics 基本概念:统计的基石
--1.4.4 Homework 课后习题
-1.5 Unit test 第一单元测试题
-2.1Summarizing Qualitative Data
--2.1.1 Statistical investigation: the sharp edge of mining raw ore 统计调查:挖掘原矿的利刃
-2.2Frequency Distribution
--2.2.1 Scheme design: a prelude to statistical survey 方案设计:统计调查的前奏
-2.3Relative Frequency Distribution
--2.3.1 Homework 课后习题
-2.4Bar Graph
--2.4.1 Homework 课后习题
-2.6 Unit 2 test 第二单元测试题
-Descriptive Statistics: Numerical Methods
-3.1Measures of Location
--3.1.1 Statistics grouping: from original ecology to systematization 统计分组:从原生态到系统化
--3.1.2 Homework 课后习题
-3.2Mean、Median、Mode
--3.2.2 Homework 课后习题
-3.3Percentiles
--3.3 .1 Statistics chart: show the best partner for data 统计图表:展现数据最佳拍档
--3.3.2 Homework 课后习题
-3.4Quartiles
--3.4.1 Calculating the average (1): Full expression of central tendency 计算平均数(一):集中趋势之充分表达
--3.4.2 Homework 课后习题
-3.5Measures of Variability
--3.5.1 Calculating the average (2): Full expression of central tendency 计算平均数(二):集中趋势之充分表达
--3.5.2 Homework 课后习题
-3.6Range、Interquartile Range、A.D、Variance
--3.6.1 Position average: a robust expression of central tendency 1 位置平均数:集中趋势之稳健表达1
--3.6.2 Homework 课后习题
-3.7Standard Deviation
--3.7.1 Position average: a robust expression of central tendency 2 位置平均数:集中趋势之稳健表达2
-3.8Coefficient of Variation
-3.9 unit 3 test 第三单元测试题
-4.1 The horizontal of time series
--4.1.1 Time series (1): The past, present and future of the indicator 时间序列 (一) :指标的过去现在未来
--4.1.2 Homework 课后习题
--4.1.3 Time series (2): The past, present and future of indicators 时间序列 (二) :指标的过去现在未来
--4.1.4 Homework 课后习题
--4.1.5 Level analysis: the basis of time series analysis 水平分析:时间数列分析的基础
--4.1.6Homework 课后习题
-4.2 The speed analysis of time series
--4.2.1 Speed analysis: relative changes in time series 速度分析:时间数列的相对变动
--4.2.2 Homework 课后习题
-4.3 The calculation of the chronological average
--4.3.1 Average development speed: horizontal method and cumulative method 平均发展速度:水平法和累积法
--4.3.2 Homework 课后习题
-4.4 The calculation of average rate of development and increase
--4.4.1 Analysis of Component Factors: Finding the Truth 构成因素分析:抽丝剥茧寻真相
--4.4.2 Homework 课后习题
-4.5 The secular trend analysis of time series
--4.5.1 Long-term trend determination, smoothing method 长期趋势测定,修匀法
--4.5.2 Homework 课后习题
--4.5.3 Long-term trend determination: equation method 长期趋势测定:方程法
--4.5.4 Homework 课后习题
-4.6 The season fluctuation analysis of time series
--4.6.1 Seasonal change analysis: the same period average method 季节变动分析:同期平均法
-4.7 Unit 4 test 第四单元测试题
-5.1 The Conception and Type of Statistical Index
--5.1.1 Index overview: definition and classification 指数概览:定义与分类
-5.2 Aggregate Index
--5.2.1 Comprehensive index: first comprehensive and then compare 综合指数:先综合后对比
-5.4 Aggregate Index System
--5.4.1 Comprehensive Index System 综合指数体系
-5.5 Transformative Aggregate Index (Mean value index)
--5.5.1 Average index: compare first and then comprehensive (1) 平均数指数:先对比后综合(一)
--5.5.2 Average index: compare first and then comprehensive (2) 平均数指数:先对比后综合(二)
-5.6 Average target index
--5.6.1 Average index index: first average and then compare 平均指标指数:先平均后对比
-5.7 Multi-factor Index System
--5.7.1 CPI Past and Present CPI 前世今生
-5.8 Economic Index in Reality
--5.8.1 Stock Price Index: Big Family 股票价格指数:大家庭
-5.9 Unit 5 test 第五单元测试题
-Sampling and sampling distribution
-6.1The binomial distribution
--6.1.1 Sampling survey: definition and several groups of concepts 抽样调查:定义与几组概念
-6.2The geometric distribution
--6.2.1 Probability sampling: common organizational forms 概率抽样:常用组织形式
-6.3The t-distribution
--6.3.1 Non-probability sampling: commonly used sampling methods 非概率抽样:常用抽取方法
-6.4The normal distribution
--6.4.1 Common probability distributions: basic characterization of random variables 常见概率分布:随机变量的基本刻画
-6.5Using the normal table
--6.5.1 Sampling distribution: the cornerstone of sampling inference theory 抽样分布:抽样推断理论的基石
-6.9 Unit 6 test 第六单元测试题
-7.1Properties of point estimates: bias and variability
--7.1.1 Point estimation: methods and applications 点估计:方法与应用
-7.2Logic of confidence intervals
--7.2.1 Estimation: Selection and Evaluation 估计量:选择与评价
-7.3Meaning of confidence level
--7.3.1 Interval estimation: basic principles (1) 区间估计:基本原理(一)
--7.3.2 Interval estimation: basic principles (2) 区间估计:基本原理(二)
-7.4Confidence interval for a population proportion
--7.4.1 Interval estimation of the mean: large sample case 均值的区间估计:大样本情形
--7.4.2 Interval estimation of the mean: small sample case 均值的区间估计:小样本情形
-7.5Confidence interval for a population mean
--7.5.1 Interval estimation of the mean: small sample case 区间估计:总体比例和方差
-7.6Finding sample size
--7.6.1 Determination of sample size: a prelude to sampling (1) 样本容量的确定:抽样的前奏(一)
--7.6.2 Determination of sample size: a prelude to sampling (2) 样本容量的确定:抽样的前奏(二)
-7.7 Unit 7 Test 第七单元测试题
-8.1Forming hypotheses
--8.1.1 Hypothesis testing: proposing hypotheses 假设检验:提出假设
-8.2Logic of hypothesis testing
--8.2.1 Hypothesis testing: basic ideas 假设检验:基本思想
-8.3Type I and Type II errors
--8.3.1 Hypothesis testing: basic steps 假设检验:基本步骤
-8.4Test statistics and p-values 、Two-sided tests
--8.4.1 Example analysis: single population mean test 例题解析:单个总体均值检验
-8.5Hypothesis test for a population mean
--8.5.1 Analysis of examples of individual population proportion and variance test 例题分析 单个总体比例及方差检验
-8.6Hypothesis test for a population proportion
--8.6.1 P value: another test criterion P值:另一个检验准则
-8.7 Unit 8 test 第八单元测试题
-Correlation and regression analysis
-9.1Correlative relations
--9.1.1 Correlation analysis: exploring the connection of things 相关分析:初探事物联系
--9.1.2 Correlation coefficient: quantify the degree of correlation 相关系数:量化相关程度
-9.2The description of regression equation
--9.2.1 Regression Analysis: Application at a Glance 回归分析:应用一瞥
-9.3Fit the regression equation
--9.3.1 Regression analysis: equation establishment 回归分析:方程建立
-9.4Correlative relations of determination
--9.4.1 Regression analysis: basic ideas
--9.4.2 Regression analysis: coefficient estimation 回归分析:系数估计
-9.5The application of regression equation