| Marcos |
My knowledge of statistics is fairly limited but I'm curious as to why when we replace N with (N-1) in the denominator of the variance formula we get a better estimate of the variance of the population. Thanks, Marcos |
||
| Kerwin
Hui |
The idea is that we have used up one degree of freedom in computing the sample mean and using it to estimate the population variance. One rarely gets sample mean=population mean, and so we have to make allowance for that. We can compute the expected value of SXX , the sum of squares of deviation. We have ![]() Now the first term has expected value N*variance of the population, and each summand of second term has expected value=population variance/N, so we get the expected value of SXX is (N-1) times the population variance, and so (1/(N-1))*SXX is an unbiased estimate of the population variance. Kerwin |
||
| Marcos |
Hrm... Basically I don't get why the expected value of Thanks, Marcos |
||
| Kerwin
Hui |
Are you happy with the basic rules for manipulating mean and variances? i.e.
|
||
| Marcos |
Thanks, I get it now (at least enough to satisfy myself) Marcos |