Math ClubMath Club
v1 · padrão canônico

Lesson 72 — Variance and standard deviation

Statistical dispersion: how much data deviates from the mean. Population and sample variance, standard deviation, computational formula, properties of linearity and independence.

Used in: 2.º ano do EM (16-17 anos) · Equiv. Stochastik LK alemão · Equiv. Math B japonês · Equiv. H2 Statistics singapurense

σ2=1ni=1n(xixˉ)2\sigma^2 = \frac{1}{n}\sum_{i=1}^{n}(x_i - \bar{x})^2
Choose your door

Rigorous notation, full derivation, hypotheses

Rigorous definition

Variance and standard deviation — population and sample

"Variance is more or less the mean squared distance of each data point from the mean. The unit associated with variance is in squared units. To ensure the dispersion measure has the same units as the data, we take the square root of the variance, called the standard deviation." — OpenIntro Statistics §2.1, Diez et al., CC-BY-SA.

"In statistics problems, we usually do not have access to the entire population, so we use sample data to estimate population parameters. For this, we divide by the sample degrees of freedom, n1n-1, instead of nn." — OpenStax Statistics §2.7, Illowsky & Dean, CC-BY.

Algebraic properties

Geometric representation — scatter plot

High dispersion (large σ)μLow dispersion (small σ)μ

Two sets with the same mean but distinct dispersions. Points far from the dashed line (mean) generate high variance; grouped points generate low variance.

Solved examples

Exercise list

40 exercises · 10 with worked solution (25%)

Application 24Understanding 3Modeling 9Proof 4
  1. Ex. 72.1Application

    Calculate the population variance and standard deviation of {4,6,8}\{4, 6, 8\}.

  2. Ex. 72.2Application

    Calculate the sample variance s2s^2 and sample standard deviation ss for {2,4,4,4,5,5,7,9}\{2, 4, 4, 4, 5, 5, 7, 9\}.

  3. Ex. 72.3Application

    Calculate the population standard deviation of {5,6,7,8,9}\{5, 6, 7, 8, 9\}.

  4. Ex. 72.4ApplicationAnswer key

    What is the variance of {10,10,10,10}\{10, 10, 10, 10\}? Explain geometrically.

  5. Ex. 72.5ApplicationAnswer key

    Calculate the population variance of {0,100}\{0, 100\}.

  6. Ex. 72.6Application

    Salaries (thousand R):): 3, 3, 4, 4, 5, 20$. Calculate the mean and sample standard deviation. Comment on the effect of the outlier.

  7. Ex. 72.7Application

    Use the computational formula x2xˉ2\overline{x^2} - \bar{x}^2 to calculate the variance of {1,2,3,4,5}\{1, 2, 3, 4, 5\}.

  8. Ex. 72.8Application

    Waiting time (min) in 8 service calls: 5,7,6,8,4,5,6,75, 7, 6, 8, 4, 5, 6, 7. Calculate the sample standard deviation.

  9. Ex. 72.9ApplicationAnswer key

    Weights (kg) of 6 watermelons: 8,9,9,10,11,138, 9, 9, 10, 11, 13. Calculate s2s^2 and ss.

  10. Ex. 72.10Application

    XX assumes values 1,2,31, 2, 3 with probabilities 12,14,14\frac{1}{2}, \frac{1}{4}, \frac{1}{4}. Calculate Var(X)\text{Var}(X).

  11. Ex. 72.11Application

    Fair 6-sided die. Calculate Var(X)\text{Var}(X).

  12. Ex. 72.12ApplicationAnswer key

    Sum of two independent fair dice. Calculate Var(S)\text{Var}(S) using the independence property.

  13. Ex. 72.13Application

    Maximum temperature (°C) over 7 days: 10,7,4,9,8,11,510, 7, 4, 9, 8, 11, 5. Calculate the sample variance.

  14. Ex. 72.14Application

    Use the computational formula E[X2](E[X])2E[X^2] - (E[X])^2 to calculate the variance of {1,3,5,7,9}\{1, 3, 5, 7, 9\}.

  15. Ex. 72.15ApplicationAnswer key

    If Var(X)=9\text{Var}(X) = 9, calculate Var(2X+5)\text{Var}(2X + 5).

  16. Ex. 72.16Application

    If σX=4\sigma_X = 4, what is the standard deviation of 3X3X?

  17. Ex. 72.17ApplicationAnswer key

    Var(X)=4\text{Var}(X) = 4, Var(Y)=9\text{Var}(Y) = 9, XX and YY independent. Calculate Var(X+Y)\text{Var}(X+Y) and Var(XY)\text{Var}(X-Y).

  18. Ex. 72.18Application

    Standardize X=80X = 80 if μ=70\mu = 70, σ=5\sigma = 5. Calculate the z-score.

  19. Ex. 72.19Application

    F=1.8C+32F = 1.8C + 32 (Celsius to Fahrenheit conversion). If σC=5\sigma_C = 5°C, what is σF\sigma_F?

  20. Ex. 72.20Application

    Calculate the coefficient of variation CV=σ/μCV = \sigma/\mu for heights (μ=170\mu = 170 cm, σ=8\sigma = 8 cm) and weights (μ=70\mu = 70 kg, σ=12\sigma = 12 kg). Which set is relatively more variable?

  21. Ex. 72.21Application

    Standardize {60,70,80}\{60, 70, 80\} using μ=70,σ=10\mu = 70, \sigma = 10. What are the mean and standard deviation of the z-scores?

  22. Ex. 72.22Application

    Var(X)=16\text{Var}(X) = 16. What is Var(X)\text{Var}(-X)?

  23. Ex. 72.23ApplicationAnswer key

    Sample mean of n=25n = 25 independent observations with σ=10\sigma = 10. What is the standard deviation of the mean?

  24. Ex. 72.24Application

    Sum of 100 iid random variables with σ=1\sigma = 1. What is the standard deviation of the sum?

  25. Ex. 72.25Understanding

    Why does sample variance use the divisor n1n-1 instead of nn?

  26. Ex. 72.26Understanding

    To compare dispersion between salaries ()andheights(cm),is) and heights (cm), is \sigmaororCV$ preferred? Why?

  27. Ex. 72.27Understanding

    Can variance be negative?

  28. Ex. 72.28Modeling

    Production line: mean mass 500 g, σ=5\sigma = 5 g. Tolerance ±15\pm 15 g. How many σ\sigma does the tolerance represent?

  29. Ex. 72.29ModelingAnswer key

    Two funds with 8% expected return, but σA=5%\sigma_A = 5\% and σB=15%\sigma_B = 15\%. Which one to choose as risk-averse? Why?

  30. Ex. 72.30Modeling

    You measure a resistance 10 times: Rˉ=100Ω\bar{R} = 100\,\Omega, s=0.5Ωs = 0.5\,\Omega. Estimate the standard deviation of the mean.

  31. Ex. 72.31Modeling

    Home-to-work commute time: μ=30\mu = 30 min, σ=5\sigma = 5 min. Using Chebyshev's inequality as a conservative bound, how many minutes early should you leave to have at least a 95% chance of arriving on time?

  32. Ex. 72.32Modeling

    Six Sigma process: μ=10.00\mu = 10.00 mm, tolerance 9.949.94 to 10.0610.06 mm. What is the largest σ\sigma that still satisfies the Six Sigma requirement?

  33. Ex. 72.33ModelingAnswer key

    Stocks A: σA=1%\sigma_A = 1\%; Stocks B: σB=2%\sigma_B = 2\%. 50-50 portfolio, zero correlation. Portfolio variance.

  34. Ex. 72.34Modeling

    Same portfolio as the previous exercise, but with 0.5-0.5 correlation between stocks. Variance. Compare with the zero correlation case.

  35. Ex. 72.35Modeling

    In machine learning, why should features with different scales be standardized before training gradient-based models?

  36. Ex. 72.36Modeling

    ENEM Math grades: μ520\mu \approx 520, σ110\sigma \approx 110 points. A student got 740. Calculate the z-score and interpret (how many standard deviations above the mean are they?).

  37. Ex. 72.37Proof

    Prove that Var(X)=E[X2](E[X])2\text{Var}(X) = E[X^2] - (E[X])^2 from the definition Var(X)=E[(Xμ)2]\text{Var}(X) = E[(X-\mu)^2].

  38. Ex. 72.38Proof

    Prove that Var(aX+b)=a2Var(X)\text{Var}(aX + b) = a^2\,\text{Var}(X) for any constants a,ba, b.

  39. Ex. 72.39ProofAnswer key

    Prove that Var(X+Y)=Var(X)+Var(Y)\text{Var}(X + Y) = \text{Var}(X) + \text{Var}(Y) when XX and YY are independent.

  40. Ex. 72.40Proof

    Prove Chebyshev's inequality: P(Xμkσ)1k2P(|X - \mu| \geq k\sigma) \leq \dfrac{1}{k^2} for k>0k > 0.

Sources

  • OpenIntro Statistics (4th ed.) — Diez, Çetinkaya-Rundel, Barr · CC-BY-SA. Primary source for this lesson. §2.1–§2.2 cover sample variance, standard deviation, boxplots, and applied examples.

  • Statistics (OpenStax) — Illowsky, Dean · CC-BY. §2.7 covers dispersion measures, computational formula, calculator exercises, and education/health data.

  • Introduction to Probability — Grinstead & Snell (Dartmouth) — GNU FDL. Ch. 6 covers variance of discrete random variables, algebraic properties, Chebyshev's inequality, and connection to the law of large numbers.

Updated on 2025-05-14 · Author(s): Clube da Matemática

Found an error? Open an issue on GitHub or submit a PR — open source forever.