Last Time | Next Time |
There's a cool one by the Delta sigma zeta gamma sorority/fraternity, that I saw in the Student Union.
"Why didn't anyone ask about it?" the professor asks himself. What is the answer?
Our author says that, while it's true that "you'll find later in life that it's handy to know what a standard deviation is", it is "even handier to know what a distribution is."
So our objective in this unit is to understand three things, actually:
How is our mathematician doing at modeling the data?
What has she missed?
A deviation is the difference between a data value and the mean value. So if the mean height is 5'10", and you're 5'8" tall, then your deviation is -2. The "standard deviation" thus is a measure of the "typical deviation".
The larger is,
the more spread out the normal curve becomes. It's still bell-shaped --
it's just broader and flatter.
Notice that it's a little flatter, too. That probably means that some people are honest, and others exaggerate their height even more than two inches. This means that there will be a larger spread (standard deviation) in the heights than one would expect.
Self-reported data is notoriously dangerous, because of these sorts of biases: everyone wants to be a little taller, so they "make themselves taller"....
Unfortunately people who know a little stats may get infatuated with the normal distribution, and apply it even where it's not appropriate.
Do you believe that this data is normally distributed?
It and the statistic about Lakeside School are in the same category (Bill Gates and Paul Allen, founders of Microsoft, graduated from Lakeside, which tends to push up the mean), while the median remains unchanged.
In an unfavorable light, Bush's use of the mean could be construed as one of the three lies: "lies, damned lies, and statistics."
Better in both cases to use a median as the measure of central tendancy: