Standard Deviation
Standard deviation is a way of measuring how spread out a set of data happens to be. More specifically, its a measure of much the data deviates from the mean. Data that is tightly clumped will have a smaller standard deviation, compared to a larger one for data that is more spread out.
There are a number of steps involved in calculating standard deviation, so you won’t be asked to do it by hand too often. And also, these steps are for finding standard deviation of a sample, not a population. There’s only one minor difference, but
- Find the mean $\bar{x}$.
- Determine how far each data point is from the mean by subtracting. This is also know as its deviation.
- Square each deviation. This might seem like a strange step, but there are a number of reasons for doing this, one of which is it makes every value positive.
- Add up all the squares.
- Divide that sum by one less than the number of entries. This will also seem like a strange step. Why not just divide by how many, like with mean? It turns out that because this is a sample, our resulting standard deviation will be too small compared to the true standard deviation of the population. Using one less corrects this
- Take the number from the previous step, which is also called the variance, and square root it.
With math symbols, the above looks like this
\[s = \sqrt{\frac{\Sigma (x-\bar{x})^2}{n-1}}\]The square root of the sum of all the squares of the deviations divided by one less than the number of data points.
Example
\[4, 3, 5, 7, 2, 9, 11, 7\]-
Find the mean first (often given to you in the problems).
\[\bar{x}=6\] -
I’m going to combine steps 2 through 4 here, mostly since this is how it will be presented to you in the problems. We’ll be finding all the differences, squaring them, and adding them up.
\[\begin{align*} \Sigma(x-\bar{x})^2 &= (4-6)^2 + (3-6)^2 + (5-6)^2 + (7-6)^2 + (2-6)^2 + (9-6)^2 + (11-6)^2 + (7-6)^2\\ &= (-2)^2 + (-3)^2 + (-1)^2 + 1^2 + (-4)^2 + 3^2 + 5^2 + 1^2 \\ &= 4 + 9 + 1 + 1 + 16 + 9 + 25 + 1 \\ &= 66 \end{align*}\] -
Variance, or step 5, is next. This is where we take our sum and divide is by one less than the number of entries.
\[s^2 =7 frac{\Sigma(x-\bar{x})^2}{n-1} = \frac{66}{7} \approx 9.43\] -
Lastly, we square root that result to get our standard deviation.
\[s = \sqrt{\frac{\Sigma (x-\bar{x})^2}{n-1}} = \sqrt{9.43} \approx 3.07\]