statistics

Standard deviation lets you understand how things are distributed within the range.

The formula to calculate a sample’s variance is

We have to take the sum of the squares of the difference because the underlying numbers may be negative, which the formula can’t handle.

You can then turn that sample variance into the sample’s “standard deviation” by taking the sqrt of it.

Finding stddev from a probability distribution

Formula:

Given this distribution of cat weights: 6.8, 8.2, 7.5, 9.4, 8.2

Mean = 8.02

would yield:

xx-barx(x-barx)^2
6.86.8-8.02 = -1.221.4884
7.5-0.52.2704
8.2.18.0324
8.2.18.0324
9.41.381.9044

If we then divide that last column by n-1 (4), we get 0.932 lbs^2 as the variance. sqrt = 0.9654 lb as the standard deviation.

The n-1 above is “degrees of freedom”, but I don’t know what that means yet.

Population variance and standard deviation are rare to see, but they are represented by lowercase sigma.

A population’s variance is similar to sample variance, but it devides by , rather than .

Population Variance

where is the population mean, N is the total number of items in the population.

Sample Variance

where is the mean of the sample and is the number of items sampled.

Standard deviation in a normal curve

In a normal curve,

50% of the curve is on either side of the mean. 34% lives between -1 and 0 standard deviations. 34% between 0 and +1 13% between -2 and -1, 1 and 2 2% between -3 and -2, 2 and 3 and .1% further than -3 or +3

Estimated standard deviation

Estimated standard deviation () is

And so the estimated population variance () is