Let X represent the tire pressure of a Formula 1 car at its peak temperature, measured in PSI. With the current tires, it is known that X∼N(32,4). A new brand of tire is being considered, hoping that the PSI at peak temperature will be higher than 32, such that X∼N(μ>32,4). Suppose we collect a sample of size n=36 about the peak temperature PSI of the second brand of tires, and we observe x=35. We are interested in testing the following hypotheses:
H0H1:μ=32:μ>32.
Derive the p-value for this test.
Given α=0.05, should we reject or not reject H0?
Solution.
The test statistic for the scenario is
Z=σ/nX−μ∼N(0,1),
since X is normal and the variance is known. The sample test statistic (under H0) is then
z=2/3635−32=9.
The p-value is the probability of observing x=35 or something more extreme (i.e., numbers that give more evidence that μ>32). In this problem, this is
P(Z≥9)≈0.
We reject H0 at α=0.05.
Problem 2
Among the data collected for the WHO air quality monitoring project is a measure of suspended particles, in μg/m3. Let X and Y equal the concentration of suspended particles in μg/m3 in the city centers of Melbourne and Huston, respectively. Using nX=13 observations of X and nY=16 observations of Y, we shall test H0:μX=μY against H1:μX<μY.
Define the test statistic and critical region, assuming that the variances are equal. Let α=0.05.
If x=72.9, sX=25.6, y=81.7, and sY=28.3, calculate the value of the test statistic and state your conclusion.
Solution.
Since the sample size is small and the variances are unknown but equal, the test statistic is
T=SpnX1+nY1X−Y−(μX−μY)∼t(nX+nY−2),
where Sp is the pooled sample standard deviation. We have a one-sided test, so we reject if
t<−tα(27)=−1.703.
Under H0, μX−μY=0, so the sample test statistic is
t=sp131+16172.9−81.7=−0.8686.
This is not in the critical region, so we do not reject H0 at α=0.05.
Problem 3
Some nurses in county public health conducted a survey of women who had received inadequate prenatal care. They used information from birth certificates to select mothers for the survey. The mothers selected were divided into two groups:
Group A: mothers who said they had ≤5 prenatal visit.
Group B: mothers who said they had ≥6 prenatal visits.
Let X be the birth weight of babies from mothers in Group A and Y the birth weight of babies from mothers in Group B. Assume that X∼N(μX,σX2) and Y∼N(μY,σY2), where σX and σY are unknown but equal. Suppose that the result of the survey is that
Birth weight of babies from mothers in Group A: 49, 108, 110, 82, 93, 114, 134, 114, 96, 52, 101, 114, 120, 116
Birth weight of babies from mothers in Group B: 133, 108, 93, 119, 119, 98, 106, 131, 87, 153, 116, 129, 97, 110
Give a 95% confidence interval for μX−μY.
Test H0:μX=μY against H1:μX=μY at significance level 0.05.
Compute the associated p-value.
Redo part (1)-(3), but do not assume σX=σY, by pairing up the mothers of the two groups in the order they appear.
Solution.
For parts (1)-(3), since the variances are assumed to be equal, the test statistic is
T=SpnX1+nY1X−Y−(μX−μY)∼t(nX+nY−2).
A 95% confidence interval is
x−y±t2α(26)spnX1+nY1=[−30.7909,2.7909]
Since this is a two-sided test, the critical region is
t<−t2α(26)=−2.056ort>t2α(26)=2.056.
In our case, the sample test statistic is
t=−1.7143,
which is not in the critical region. Thus, we do not reject H0 at α=0.05.
Since this is a two-sided test, the p-value is
P(T≤−∣t∣ or T≥∣t∣)=2P(T≥∣t∣)=2(1−P(T≤∣t∣))=2(1−P(T≤1.7143)).
From the t-table in the book,
0.95<P(T≤1.7143)<0.975,
so the p-value satisfies
0.05<p-value<0.10.
What the problem asks you to do is to treat the samples as differences, e.g., d1=x1−y1=49−133, etc. This gives us the following sample:
d−84017−37−261628−179−101−15−15236
We need to test the hypotheses
H0H1:μD=0:μD=0.
In this case, we have n=14 differences and the test statistic is now
T=SD/nD−(μX−μY)∼t(13).
A 95% confidence interval is
d±t2α(13)nsD=[−36.2270,8.2270].
The critical region is
t<−t2α(13)=−2.160ort>t2α(13)=2.160.
The test statistic is
t=−1.3605,
which does not fall into the critical region, so we do not reject H0 at α=0.05. Lastly, the p-value is computed like before:
p-value=2(1−P(T≤1.3605)).
From the textbook's t-table, 0.90<P(T≤1.3605)<0.95, so the p-value satisfies
0.10<p-value<0.20.
Problem 4
A stream was studied on 26 days, half during dry weather (X), and the other half immediately after a significant rainfall (Y). Assume that the distributions of X and Y are N(μX,σ2) and N(μY,σ2), respectively. The following turbidities were recorded in units of NTUs (nephelometric turbidity units):
xy2.97.814.94.21.02.412.612.99.417.3
Test the null hypothesis H0:μX=μY against H1:μX>μY at α=0.01. Give bounds for the p-value and state your conclusion.
Solution.
Since the sample size is small and the variances are assumed to be equal, our test statistic is
T=SpnX1+nY1X−Y−(μX−μY)∼t(8).
Our observed test statistic is
t=−0.1970,
so the p-value is
P(T≥−0.1970)=1−P(T≤0.1970).
From the t-table, P(T≤0.1970)<0.60, so
p-value>0.40.
Thus, we do not reject H0 at α=0.01.
Problem 5
Let Y∼Bin(100,p). To test H0:p=0.08 against H1:p<0.08, we reject H0 and accept H1 if and only if Y≤6. Determine the significance level α of the test.
Solution.
The significance level α is the probability that we reject H0 given that H0 is true. In other words,
α=P(Y≤6).
We can use the normal approximation to a binomial distribution since n=100 is large: