3.1.1 Statistics grouping: from original ecology to systematization 统计分组:从原生态到系统化

傻二妞 在干吗呢
Hey, silly girl, what are you up to?

啊... 姐 这次我们班期中考试的成绩不理想
Oh... Sis, our class did not do well on the mid-term exam this time.

老师今天交给我一个任务
The teacher gave me an assignment today

我一点头绪都没有
but I have no clue.

谁叫你是学习委员
Well, you are the class study monitor, after all.

啥任务呀
What is the assignment?

哎 要我想想有没有办法
Well, the teacher asked me to think about ways

能把班上的学习成绩提高上去啊
to improve the academic performance of our class.

愁死我了
I am so worried.

别担心 有姐在呢
Don’t worry. Maybe I can help.

我觉得呀
I think

我们首先应该找到
we should first figure out

你们班成绩不理想的原因
why your class did poorly on the mid-term test.

然后才能对症下药
Only then can we prescribe the right remedy.

可是 姐
But, sister,

你看看 这么多门课
you see, there are many courses

这么多同学
and many students,

我都不知道该从哪里找起啊
I don’t know where to start.

那么 试试用这些
Then let's try letting

搜集来的数据告诉我们答案吧
the data we have collected give us the answer.

现在我们讲第三章
Now let's turn to Chapter 3.

第三章 就是我们前一章
In this chapter, we are going to deal with

第二章搜集来的资料
the statistical data collected in ways we learned from the second chapter.

它是没有条理性
These data are not yet organized.

我们要让这些资料
We want these data

其实是要反映总体的情况
to reflect the overall situation.

那要反映总体的情况
In order to reflect the overall situation

就要让那些杂乱无章的资料
we need to put these disorganized data

条理化 系统化
into an organized, systematic form.

这个条理化 系统化的过程
The process to systematically organize data

就属于统计整理
is statistical arrangement.

在这一章里呢
In this chapter

我们要讲完统计整理
we will learn about statistical arrangement

还要讲一些数值的计算
and numerical computation.

统计整理 它是包括三项内容
Statistical arrangement includes three parts:

第一 资料的审核
data auditing,

二 统计分组
statistical grouping,

三 统计表
and statistical tables.

第一 资料的审核
Data auditing

就是要把搜集来的
means examining

那些原始资料
the raw data

要进行审核
that have been collected

看它的正确与否
to check whether they are correct.

当然审核的方法有
One way to audit the data is

逻辑推理 逻辑方法
the logical check, which relies on logical reasoning.

比如说
For example,

这个人年龄与他的学历对比
when we compare one’s age with one’s education,

你不可能说十岁就大学毕业
it makes no sense for someone to have graduated from college at age 10.

这是逻辑上可以审的出来的
A logical check can catch this kind of error.

但还有一些大部分是
Most auditing, however, relies on

数量审核 数量关系
checks of quantitative relationships.

比如工厂里面的用电量
For instance, a factory's electricity consumption,

用水量 用工量 就劳动工资量
water consumption, and labor input (its wage bill)

跟你的产值 税收
and its output value and taxes paid

它是肯定有一定的关系
are bound to be related in some way.

这是资料的审核
And this is what data auditing refers to.

资料的审核是保证我们资料的
Data auditing is a key step to

真实性的关键的一步
ensure data validity.
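
The logical check described above (age versus education) can be sketched in a few lines of Python. This is only an illustration: the `Record` type, the `MIN_AGE` thresholds, and the sample records are assumptions made up for the example, not values from the lecture.

```python
from dataclasses import dataclass


@dataclass
class Record:
    name: str
    age: int
    education: str


# Assumed minimum plausible ages for each education level (illustrative only).
MIN_AGE = {"primary": 6, "secondary": 12, "college": 18}


def logical_audit(records):
    """Return the records whose age is implausibly low for the stated education."""
    return [r for r in records if r.age < MIN_AGE.get(r.education, 0)]


# Hypothetical survey records; "A" claims a college degree at age 10.
records = [
    Record("A", 10, "college"),
    Record("B", 22, "college"),
    Record("C", 13, "secondary"),
]

suspicious = logical_audit(records)
print([r.name for r in suspicious])
```

A flagged record is not necessarily wrong; it is a candidate for checking against the original source.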

第二步 统计分组
The second step is statistical grouping.

在第一章我们讲了
In the first chapter, we learned that

统计分组是统计的一种
statistical grouping is one of the

最基本的分析方法
most basic methods of analysis in statistics.

统计分组 它的定义是
Statistical grouping is defined as

对总体按照某一个标志
dividing a population, according to a certain characteristic,

把它区分为若干个组成部分
into several component parts.

首先涉及到的内容是
The first thing that should be taken into consideration

第一个要按某一个研究目的
is research purpose.

目的确定了
Once the purpose is clear,

你才能选择分组的标准
you can choose the criterion for grouping,

或者叫标志
also called the grouping characteristic.

那这样分出来的结果
The groups formed in this way

就有组内性质相同
are homogeneous within each group

组间差异很大
and differ greatly from one another.

所以我们通常讲
So we usually sum this up as

组内同质 组间差异
homogeneity within groups and heterogeneity between groups.

统计分组还有一个关键问题
There is another key issue in statistical grouping:

它要遵循一些最基本的原则
it must follow some basic principles.

第一 穷尽性原则
The first is the principle of exhaustiveness.

用几何的概念来说
In the language of set theory,

就是所有的子集
the union of all the subsets

它的并集为全集
is the universal set.

也就是说我们总体单位
In other words, every unit of the population,

分组以后
after grouping,

都能找到自己的归属
finds a place where it belongs

都能进入到相应的组中去
and falls into a corresponding group.

第二个原则就是它的互斥性
The second principle is mutual exclusivity.

互斥性指的就是说
Being mutually exclusive means

组与组之间
there is no intersection

不存在着交集
between the groups.

也就是我们集合与集合之间的
That is to say, the intersection between sets

交集为空集
is the empty set.

比如我们会计里面
Take accounting, for example.

会计的科目
The accounts used in accounting

其实就是统计分组的结果
are actually the result of statistical grouping.

它是把企业的资金
The enterprise's funds are treated

作为一个总体
as the population

把它区分为各个科目
and divided into individual accounts.

那么你每一笔经济业务发生
So each business transaction

只能记录到一个科目里
can be recorded in only one account,

某一个特定科目中去
one specific account.

不能用既可以记A科目
It cannot be a case where the entry could go under account A

又可以记B科目
or just as well under account B.

这就讲的统计分组的原则里
This is the principle of mutual exclusivity

互斥性
in statistical grouping.
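
The two principles can be checked mechanically with Python sets. The sketch below is a minimal illustration; the function name `check_grouping` and the sample population are hypothetical, not part of the course.

```python
def check_grouping(population, groups):
    """Check a grouping against the two principles.

    population: iterable of units; groups: dict mapping group name -> set of units.
    Returns (exhaustive, mutually_exclusive).
    """
    union = set().union(*groups.values())
    # Exhaustive: the union of all groups is exactly the population.
    exhaustive = union == set(population)
    # Mutually exclusive: no unit is counted in more than one group,
    # so the group sizes add up to the size of the union.
    mutually_exclusive = sum(len(g) for g in groups.values()) == len(union)
    return exhaustive, mutually_exclusive


population = {1, 2, 3, 4, 5, 6}

# Every unit belongs to exactly one group.
good = {"low": {1, 2}, "mid": {3, 4}, "high": {5, 6}}

# Unit 3 is in two groups, and units 5 and 6 have no group.
bad = {"low": {1, 2, 3}, "mid": {3, 4}}

print(check_grouping(population, good))
print(check_grouping(population, bad))
```

The first grouping satisfies both principles; the second violates both.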

统计分组
Statistical grouping

它分出来的作用非常大
serves several important purposes.

第一 它可以划分事物的类型
First, it can classify things into types.

我们前面讲的定类变量
The nominal variables we discussed earlier

就是统计分组的体现
are an application of statistical grouping.

像我们 人按性别分
For example, we group people by gender

企业按属性分等等等等
and enterprises by ownership type, and so on.

都属于划分事物的类型
These are all ways of classifying things into types.

第二 统计分组还可以
Secondly, statistical grouping can also

把事物的内部结构和发展规律
reflect the internal structure and patterns of development

反映出来
of things.

大家知道的恩格尔系数
The well-known Engel coefficient

就是通过统计分组
is obtained through statistical grouping.

把我们的消费总量区分为食品
Total consumption expenditure is divided into food,

就是食 衣 住 行 烧
that is, food, clothing, housing, transportation, and fuel,

文教娱乐及其它几大类
culture, education and entertainment, and other major categories.

那你食品支出占总支出的比重
The share of food spending in total spending

就是恩格尔系数
is the Engel coefficient.

并且发现随着收入的提高
It has also been found that as income rises,

恩格尔系数会不断地下降
the Engel coefficient keeps falling.

那就是恩格尔定律
This is known as Engel's Law.
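
As a small worked example, the Engel coefficient is the share of food spending in total consumption spending. The sketch below follows the categories named in the lecture; the function name `engel_coefficient` and the spending figures are invented for illustration.

```python
def engel_coefficient(spending):
    """Share of food spending in total consumption spending."""
    total = sum(spending.values())
    if total <= 0:
        raise ValueError("total spending must be positive")
    return spending["food"] / total


# Hypothetical household budget (amounts are made up for the example).
household = {
    "food": 3000,
    "clothing": 800,
    "housing": 2500,
    "transportation": 1200,
    "fuel": 500,
    "culture_education_entertainment": 1500,
    "other": 500,
}

# Food is 3000 out of a total of 10000.
print(round(engel_coefficient(household), 2))
```

Note that the denominator is total spending across the grouped categories, not income.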

统计分组还能够反映
Statistical grouping also reflects

事物与事物之间的依存关系
the dependence of one thing on another.

在农业里面也可以发现例子
Agriculture offers an example,

比如说施肥量与粮食产量
namely the amount of fertilizer applied and grain yield.

在一定的范围内
Within a certain range,

它也存在着正相关关系
the two are positively correlated.

这个都是统计分组
These are all purposes

能够达到的目的
that statistical grouping can serve.

统计分组的第二个问题
The second issue in statistical grouping

就是统计分组的标志的选择
is the choice of the grouping characteristic.

它是统计分组的关键
This is the key to statistical grouping.

统计分组标志一经确定
Once the grouping characteristic is determined,

那么这个标志下的差异
the differences with respect to that characteristic

将被体现出来
are brought out.

比如说 我们在人口普查里
Take the population census, for example.

把年龄作为分组标志
If age is used as the grouping characteristic,

那我们就发现
we discover

我们国家就是婴儿幼儿特别少
that our country has very few infants and young children.

跟以前比
Compared with the past,

我们现在老年人特别多
we now have a great many elderly people.

所以大家现在一直在呼吁
That is why people keep pointing out

现在我们中国进入老年人社会
that China has become an aging society.

这就是在分组标志选择了年龄这个标志
Here, age was chosen as the grouping characteristic,

那么它底下的差异就体现出来了
so the differences along it came to light.

那分组标志有几类呢 有两类
How many kinds of grouping characteristics are there? Two.

一类就是我们讲的品质标志
One is the qualitative characteristic.

一类就是数量标志
The other is the quantitative characteristic.

品质标志分组和数量标志分组
Grouping by a qualitative characteristic and grouping by a quantitative one

它各有自己的难度或者是重点
each have their own difficulties or points of emphasis.

品质标志大家都觉得挺好分
Everyone thinks grouping by a qualitative characteristic is easy.

你看 人按性别分
Look, if we group people by gender,

那就分成男 女
we get male and female.

企业按属性分
If we group enterprises by ownership,

分成国有 集体 外资等等
we get state-owned, collective, foreign-funded, and so on.

那其实有些品质标志也挺难分
But some qualitative characteristics are actually hard to apply,

比如说农产品和工业品
for example, telling agricultural products from industrial products.

有的时候就很难分
Sometimes that is very hard.

比如 我们讲一下
For example, suppose

在山上有一棵树
there is a tree growing on a mountain.

这时候它不属于社会产品
It is not a social product yet.

它属于自然资源
It belongs to natural resources.

但是 如果把它砍伐下来
Now, if the tree is cut down,

砍伐下来 它属于什么产品呢
what kind of product will it belong to?

我们一般认为呢
Generally speaking,

现在有些人可能会认为是工业品
some people may think of it as an industrial product.

其实不是 它是林业产品
Actually, it is not. It is a forestry product,

是广义的农产品
which is an agricultural product in the broad sense.

如果这个农民把这棵树伐回来以后
Suppose a farmer cuts down the tree and brings it home

拿到家里做椅子 做凳子 做床
to build chairs, stools, and beds.

而这时候形成的社会产品
then the social products made in this way

它就属于工业品了
count as industrial products.

就像我们城乡结合部
The rural-urban fringe is a similar case.

那怎么划分 是城市还是农村
How should it be classified, as urban or as rural?

那可能就按别的标准来分
That may have to be decided by some other criterion.

按年龄分
Grouping by age

那就是按数量标志分
is grouping by a quantitative characteristic.

数量标志分 相对来讲
With a quantitative characteristic, relatively speaking,

它就是确定组限比较麻烦
the troublesome part is setting the class limits.

比如说 育龄妇女生育率
Take, for example, the fertility rate of women of childbearing age.

这是联合国要的一个指标
This is an indicator that the United Nations asks for.

育龄妇女的生育率
For the fertility rate of women of childbearing age,

你说按什么年龄
which age range should be used?

它各个国家都不相同
It differs from country to country.

像我们国家法定年龄
In our country, for instance, the legal age

是二十一岁或者多少岁结婚
for marriage is 21 or thereabouts,

就属于才可以生育
and only then may a woman have children.

但是阿拉伯国家可能十岁
But in Arab countries a girl might marry at 10

或者十一岁她就结婚
or 11.

那怎么办
So what can be done?

联合国统一规定一个标准
The UN sets one unified standard:

十四岁
age 14.

就作为育龄妇女
Women are counted as being of childbearing age

十四岁以上 四十九岁以下
from age 14 up to age 49.

作为育龄妇女年龄段的人
Defining this age range for women of childbearing age

这是按数量标志分组
is grouping by a quantitative characteristic.
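
Grouping by a quantitative characteristic comes down to fixing the class limits and then assigning each unit to a class. The sketch below uses the 14 to 49 range quoted in the lecture and assumes both ends are inclusive; the names `LIMITS`, `LABELS`, and `age_group` are hypothetical.

```python
from bisect import bisect_right

# Lower limits of the second and third classes. Ages 14 through 49 inclusive
# form the middle class (the inclusive reading is an assumption).
LIMITS = [14, 50]
LABELS = ["under 14", "14-49", "50 and over"]


def age_group(age):
    """Assign an age in whole years to exactly one class."""
    return LABELS[bisect_right(LIMITS, age)]


for age in (13, 14, 49, 50):
    print(age, age_group(age))
```

Because every age maps to exactly one label, the classes are both exhaustive and mutually exclusive.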
