Section 11.4 Hypothesis Test for one mean
In this section, you will consider the following options for null hypothesis and corresponding alternate hypothesis with respect to the unknown value \(\mu\text{:}\)
\begin{equation*}
H_0 : \mu = \mu_0 \\ H_a: \mu \neq \mu_0
\end{equation*}
or
\begin{equation*}
H_0 : \mu \le \mu_0 \\ H_a: \mu \gt \mu_0
\end{equation*}
or
\begin{equation*}
H_0 : \mu \ge \mu_0 \\ H_a:\mu \lt \mu_0
\end{equation*}
Again, we choose only one of these three options and as before the first is called a "two-tailed" test and the last two are called "one-tailed" tests. Note that some people will write all of these null hypothesis options using equality but the alternate hypothesis determines the number of tails.
Once again, if the test sample size is sufficiently large and the standard deviation \(\sigma\) of the underlying distribution is known, one can use the normal distribution to compute probabilities. If the test sample size is relatively small or if \(\sigma\) is not known (and therefore is approximated by the sample standard deviation s), then the t-distribution can be utilized to compute probabilities. We will only consider using the t-distribution to compute p-values.
The Standard Error \(\sigma_e\) is given by
\begin{equation*}
\sigma_e = \frac{s}{\sqrt{n}}
\end{equation*}
where s is the standard deviation of the sample. For this presentation, we will assume that the actual population is relatively large relative to the sample size. In cases where this is not true, an adjustment (not presented here) will need to be made when computing \(\sigma_e\text{.}\)
To determine the p-value for a given sample with sample mean \(\overline{x}\text{,}\)
\begin{equation*}
t = \frac{\overline{x}-\mu}{\frac{s}{\sqrt{n}}}
\end{equation*}
is a t-variable with n-1 degrees of freedom. Therefore, probabilities on t can be computed using the t-distribution.
Consider a Two-tailed Hypothesis test for \(\mu\) using
\begin{equation*}
H_0 : \mu = 200 \\ H_a: \mu \neq 200
\end{equation*}
using a sample of size n = 49 and with a resulting mean of \(\overline{x} = 206\text{,}\) a sample standard deviation of \(s = 15\text{,}\) and a significance level of \(\alpha = 0.01\text{.}\)
The standard error for this test is
\begin{equation*}
\sigma_e = \frac{15}{\sqrt{49}} = \frac{15}{7}
\end{equation*}
and so using the t-distribution with degrees of freedom n-1 = 48 yields a t-statistic of
\begin{equation*}
t = \frac{\overline{x} - 200}{\sigma_e} = \frac{206-200}{\frac{15}{7}} = \frac{14}{5} = 2.80.
\end{equation*}
To compute the p-value,
\begin{equation*}
P(t \gt 2.80) + P(t \lt 2.80) \approx 0.0037 + 0.0037 = 0.0074.
\end{equation*}
Since this p-value is less than our significance level \(\alpha = 0.01\) then you can reject the null hypothesis and accept the alternate.
A random sample of 10 observations was drawn from a large normally distributed population. The data is below.
22 20 22 20 25 18 22 23 19 18
To determine if you can infer at the 4% significance level that the population mean is not equal to 22, complete the steps below:
The value of the standardized test statistic:
Using interval notation with \(\infty\) written as “infty” and \(\cup\) written as “U” (capital u), determine each of the following:
The rejection region for the standardized test statistic:
The p-value is
Your decision for the hypothesis test therefore would be to not reject the null hypothesis \(H_0\text{.}\)
Checkpoint 11.4.2. WebWorK - Another Two Tailed Test.
A random sample of 10 observations was drawn from a large normally distributed population. The data is below.
\begin{equation*}
\begin{array}{ccccccccccc}
24 \amp 24 \amp 15 \amp 15 \amp 21 \amp 19 \amp 14 \amp 19 \amp 19 \amp 16
\end{array}
\end{equation*}
Test to determine if we can infer at the 10% significance level that the population mean is not equal to 19, filling in the requested information below.
A. The value of the standardized test statistic:
Note: For the next part, your answer should use interval notation. An answer of the form \((-\infty, a)\) is expressed (-infty, a), an answer of the form \((b, \infty)\) is expressed (b, infty), and an answer of the form \((-\infty, a) \cup (b, \infty)\) is expressed (-infty, a)U(b, infty).
B. The rejection region for the standardized test statistic:
C. The p-value is
D. Your decision for the hypothesis test:
- Reject \(H_1\text{.}\)
- Reject \(H_0\text{.}\)
- Do Not Reject \(H_1\text{.}\)
- Do Not Reject \(H_0\text{.}\)
Consider One-tailed Hypothesis test for \(\mu\) using an interesting application:
Suppose that a manufacturer bottling a delicious beverage and the label indicates that the bottle contains 16 fluid ounces. Since providing the customer too little product might cause a significant negative reaction relative to the modest additional cost of providing a little too much, consider the following hypothesis test:
\begin{equation*}
H_0 : \mu = 16 \\ H_a: \mu \gt 200.
\end{equation*}
To test this hypothesis at significance level \(\alpha = 0.05\text{,}\) you randomly pull out 20 bottles from the production line and accurately measure the amount of produce in each bottle. If the resulting average of these measurements is \(\overline{x} = 16.05\) ounces with a standard deviation of \(s = 0.08\) ounces, determine if you can safely make it known that more product is actually delivered in general to each consumer.
The standard error for this test is
\begin{equation*}
\sigma_e = \frac{0.08}{\sqrt{20}} \approx 0.01789
\end{equation*}
and using the t-distribution with n-1 = 19 degrees of freedom yields a t-statistic of
\begin{equation*}
t = \frac{\overline{x} - 16}{\sigma_e} = \frac{16.05-16}{0.01789} \approx 2.795.
\end{equation*}
To compute the p-value,
\begin{equation*}
P(t \gt 2.795) \approx 0.0058.
\end{equation*}
Since this p-value is less than our significance level \(\alpha = 0.05\) (by a lot) then you can reject the null hypothesis and accept the alternate. It is safe therefore to say that customers can expect at least 16 ounces! However, note that some folks will still be stiffed since the standard deviation of 0.08 certainly means that some bottles have less than 16.05-0.08 ounces of beverage.
Checkpoint 11.4.3. WebWorK - one tailed test.
Andrew thinks that people living in a rural environment have a healthier lifestyle than other people. He believes the average lifespan in the USA is 77 years. A random sample of 18 obituaries from newspapers from rural towns in Idaho give \(\bar{x} = 78.92\) and \(s = 2\text{.}\) Does this sample provide evidence that people living in rural Idaho communities live longer than 77 years?
(a) State the null and alternative hypotheses: (Type "mu" for the symbol \(\mu\) , e.g. mu \(>\)1 for the mean is greater than 1, mu \(\lt \) 1 for the mean is less than 1, mu not = 1 for the mean is not equal to 1)
\(H_0\) :
\(H_a\) :
(b) Find the test statistic, t =
(c) Answer the question: Does this sample provide evidence that people living in rural Idaho communities live longer than 77 years? (Use a 10% level of significance)
(Type: Yes or No)