The t Tests Lecture 7
|
The t Distribution
|
-
The t-tests are similar to the z-tests
-
However, you assumed you knew the population variance
-
In reality, the variance has to be estimated too!
-
Switch to the t distribution
-
The t-distribution is shorter and fatter because you estimated two parameters, the mean and the variance
-
The Rule of Thumb
-
If the observations are less than 30, then use the t-distribution
-
If the number of observations are equal to or greater than 31, then use the z-distribution as an approximation
-
Example
-
You survey 30 people in Almaty. The average income, = $600 per month and variance, = 10,000
-
Find the 95% Confidence Interval
-
Use an a = 0.05 and df = 30 – 1 = 29
-
Using Excel, = tinv( a, df)
-
t c = 2.04523
-
If this was a normal distribution, then z c = 1.96
-
The standard error (SE) is
-
-
The 95% Confidence Interval is
-
-
There is a 95% chance that the true population mean lies between [562.5, 637.5]
|
Testing the Means between Two Samples
|
-
Testing the Difference of the means of two samples
-
This is more complicated because you are estimating the variance
-
Two methods
-
If the variances are equal, then pool the variances
-
If the variances are unequal, then use a different method to pool the variance
-
Assume the variances are equal
-
Example
-
You survey 80 people at Mega Center
-
The average income is = $800 per month
-
The estimated variance is = 10,000
-
You survey 60 people at Thieves’ Market
-
The average income is = $500 per month
-
The estimated variance is = 2,500
-
Note – you should test data to determine if data is normally distributed
-
Variance is calculated at
Variance in Sample 1
|
|
|
(n 1 – 1)
|
(80 -1)(10,000)
|
790,000
|
Variance in Sample 2
|
|
|
(n 2 – 1)
|
(60 – 1)(2,500) |
147,500
|
|
Total Variance |
937,500 |
-
Total degrees of freedom = n 1 – 1 + n 2 – 1 = 80 + 60 – 2 = 138
-
The pooled variance is
-
The standard error is
-
The t-statistic is
-
If a = 0.05, then the t c = 1.977304
-
The p-value is 5.01 X 10 -15
-
The hypothesis test is
-
Reject the H 0 and conclude the population means are different
-
Can use a Confidence Interval for hypothesis test
-
Assume variances are unequal
-
Same example
-
The =10,000, n 1 = 80, =2,500, and n 2 = 60
-
However, we have to adjust the degrees of freedom
-
Round the degrees of freedom to 122
-
The t-statistic is
-
Reject the H 0 and conclude the population means are different
|
Difference of Means of Paired Observations
|
-
We have observations that are paired
-
Two treatments, A and B
-
Example: Patients are given two types of blood pressure medicine
Observations |
Treatment A |
Treatment B |
Difference |
1 |
64 |
84 |
-20 |
2 |
67 |
51 |
16 |
3 |
49 |
61 |
-12 |
. |
. |
. |
. |
23 |
72 |
70 |
2 |
-
Calculate the average for the differences,
-
Calculate the standard deviation of the differences
-
The standard error is
-
The hypothesis test is
-
The t-statistic is
-
The a = 0.05, df = 23 – 1 = 22, and t c = tdist( a, df) = 2.034
-
Fail to reject the H 0 and conclude both treatments are similar
-
The paired test is a more powerful test than the other two
-
Contains more information, because you took the extra step of pairing the observations
|