Lesson 77 — Central Limit Theorem
The mean of n iid random variables converges to a normal distribution regardless of the original distribution — the most important law in statistics. Proof via characteristic function, Berry-Esseen bound, and inference applications.
Used in: 2.º ano do EM (16-17 anos) · Math B japonês §4.4 · Stochastik LK alemão · H2 Math singapurense cap. 21
Rigorous notation, full derivation, hypotheses
Formal statement and proof
Lindeberg-Lévy version
"The central limit theorem is the unofficial sovereign of probability theory." — Grinstead & Snell, Introduction to Probability, §9.1
Version for sums
If , then for large .
Convergence speed: Berry-Esseen inequality
Proof sketch via characteristic function
Let (zero mean, unit variance). Taylor expansion of :
For :
But is the characteristic function of . The Lévy's Continuity Theorem concludes .
When the CLT does not hold
Essential hypotheses
- Independence (minimum sufficient; relaxable to -mixing).
- Finite variance .
- n sufficiently large — rule of thumb: for not-too-skewed distributions; for high skewness.
Solved examples
Exercise list
37 exercises · 9 with worked solution (25%)
- Ex. 77.1Application
exponential with and . Write the approximate distribution of and calculate .
- Ex. 77.2Application
uniform on . Determine and , and write the approximate distribution of by the CLT.
- Ex. 77.3ApplicationAnswer key
Roll 100 fair dice. Determine the approximate distribution of the sum , stating and .
- Ex. 77.4Application
. Write the approximate distribution of by the CLT and calculate the standard deviation of the sample proportion.
- Ex. 77.5Application
A population has and . For , calculate the standard deviation of (as an integer).
- Ex. 77.6ApplicationAnswer key
Using the data from 77.5 (, , ), calculate .
- Ex. 77.7Application
With the same parameters (, , ), calculate .
- Ex. 77.8Application
with , , . Calculate .
- Ex. 77.9Application
Sum of 50 iid r.v. with , . Calculate .
- Ex. 77.10Application
with , . How many observations for a 95% CI with margin of error ?
- Ex. 77.11Understanding
When sample size is multiplied by 4, the standard deviation of ():
- Ex. 77.12Understanding
has a very skewed distribution (skewness = 3). For what size of is the CLT reasonable?
- Ex. 77.13Application
Grades with , . Sample . Calculate .
- Ex. 77.14Application
With the same parameters as 77.13 (, , ), calculate .
- Ex. 77.15Application
With , , and , construct a 95% CI for .
- Ex. 77.16ApplicationAnswer key
Package weight: g, g. Sample . Calculate .
- Ex. 77.17ApplicationAnswer key
With the parameters from 77.16 ( g, g, ), calculate .
- Ex. 77.18Application
Response time: ms, ms. Mean of 100 measurements. What is the 95% SLA limit?
- Ex. 77.19Application
Roll a die 1,000 times. Calculate .
- Ex. 77.20Application
Using the distribution of the sum of 1,000 die rolls, calculate .
- Ex. 77.21Application
(, ). Calculate .
- Ex. 77.22ApplicationAnswer key
Election poll: , . Calculate .
- Ex. 77.23ModelingAnswer key
You hold 50 independent stocks; daily return of each: , . What is the distribution of the average daily return of the portfolio?
- Ex. 77.24ModelingAnswer key
ML model: individual error . Calculate the standard deviation of the mean error over 1,000 predictions.
- Ex. 77.25Modeling
Determine the sample size to detect a 5% difference in proportions with and 80% power.
- Ex. 77.26Modeling
Monte Carlo estimate of : random points in the square , count those falling in the quarter-disk. What is the standard deviation of the estimate of as a function of ?
- Ex. 77.27Modeling
Batch of 500 parts: g, g. Determine the distribution of the total mass .
- Ex. 77.28Modeling
Bus wait time: min. Calculate for the average wait of 50 passengers.
- Ex. 77.29Modeling
X-bar control chart with . The control limits are . Calculate the interval width in terms of process .
- Ex. 77.30Modeling
Satisfaction survey: margin of error at 95% confidence, unknown. What is the minimum ?
- Ex. 77.31ModelingAnswer key
Call time: min, min. 100 calls per hour. Determine the distribution of total time and calculate .
- Ex. 77.32ChallengeAnswer key
A/B test: 10,000 visitors per variant; conversion rate A = 5%, B = 6%. Is the 1 percentage point lift statistically significant? Calculate the -value and -value.
- Ex. 77.33Understanding
Which of the following correctly describes the Central Limit Theorem?
- Ex. 77.34Understanding
Why does the classical Lindeberg-Lévy CLT not apply to the Cauchy distribution?
- Ex. 77.35Challenge
Simulate the CLT in Python for an exponential distribution with . Generate histograms of 10,000 sample means for and compare visually with the theoretical normal curve.
- Ex. 77.36Proof
Sketch the proof of the CLT via characteristic function, indicating where each hypothesis (finite variance, iid) is used.
- Ex. 77.37Proof
Show that the CLT implies the Weak Law of Large Numbers: if , then .
Sources
- OpenIntro Statistics (4th ed) — Diez, Çetinkaya-Rundel, Barr · 2019 · CC-BY-SA. Primary source for exercises 77.2, 77.4, 77.8, 77.11, 77.14–17, 77.22–23, 77.25–26, 77.28, 77.30, 77.33–34.
- OpenStax Statistics — Illowsky, Dean · 2022 · CC-BY. Source for exercises 77.1, 77.3, 77.5–7, 77.9–10, 77.12–13, 77.18–19, 77.21, 77.24, 77.27, 77.29, 77.31, 77.35 and examples 1–3.
- Introduction to Probability (Grinstead-Snell) — Grinstead, Snell · Dartmouth · GNU FDL. Source for exercises 77.19–20, 77.26, 77.36–37 and example 5.