Standard deviation lets you understand how things are distributed within the range.
The formula to calculate a sample’s variance is
We have to take the sum of the squares of the difference because the underlying numbers may be negative, which the formula can’t handle.
You can then turn that sample variance into the sample’s “standard deviation” by taking the sqrt of it.
Finding stddev from a probability distribution
Formula:
Given this distribution of cat weights: 6.8, 8.2, 7.5, 9.4, 8.2
Mean = 8.02
would yield:
x | x-barx | (x-barx)^2 |
---|---|---|
6.8 | 6.8-8.02 = -1.22 | 1.4884 |
7.5 | -0.52 | .2704 |
8.2 | .18 | .0324 |
8.2 | .18 | .0324 |
9.4 | 1.38 | 1.9044 |
If we then divide that last column by n-1 (4), we get 0.932 lbs^2 as the variance. sqrt = 0.9654 lb as the standard deviation.
The n-1 above is “degrees of freedom”, but I don’t know what that means yet.
Population variance and standard deviation are rare to see, but they are represented by lowercase sigma.
—
A population’s variance is similar to sample variance, but it devides by , rather than .
Population Variance
where is the population mean, N is the total number of items in the population.
Sample Variance
where is the mean of the sample and is the number of items sampled.
Standard deviation in a normal curve
In a normal curve,
50% of the curve is on either side of the mean. 34% lives between -1 and 0 standard deviations. 34% between 0 and +1 13% between -2 and -1, 1 and 2 2% between -3 and -2, 2 and 3 and .1% further than -3 or +3
Estimated standard deviation
Estimated standard deviation () is
And so the estimated population variance () is