Math: population and random sample
Goal: understand the properties of random sample.
Reading: Appendix B.1, B.2, B.3, B.4 of the textbook
1. Statistically speaking, a population is an (unknown) distribution of certain variable.
2. For example, the population for our purpose may be the distribution of the Miami
econ-major students’ ratings of their eco201 instructors.
3. The population distribution can be characterized by parameters such as population
mean (expected value) denoted by Ey or µ
y
, population variance denoted by var(y)
or σ
2
y
, etc. Usually those parameters are unknown, and we want to estimate them.
4. A sample is a part (portion or subset) of the population.
5. For example, one sample is the ratings provided by the students in this eco311 class
(section). Another class may provide a different sample.
6. Obtaining a sample is much easier than obtaining a population.
7. Statistics is about using the sample to estimate (make inference) the unknown pop-
ulation distribution and its parameters.
8. Intuitively, the estimate is “good” if the sample is “good.” Random sample is such a
good sample.
9. A random sample {y
1
, y
2
, . . . , y
n
}, or {y
i
}
n
i=1
, is a special sample with nice properties
(a) E(y
i
) = µ
y
, (i = 1, 2, . . . , n). In words, all observations have identical mean.
(b) var(y
i
) = σ
2
y
, (i = 1, 2, . . . , n). In words, all observations have identical variance.
(c) cov(x
i
, x
j
) = 0, ∀i ̸= j. In words, all observations are independent, so they have
zero covariance with each other.
10. Put differently, a random sample is i.i.d sample . i.i.d stands for identically and
independently distributed.
11. A biased (non-random) sample arises typically because people choose (or select) to be
the sample.
3