Unit 4 Hypothesis Testing

7.4 Hypothesis Testing for Proportions


Requirements for Testing Claims about a Population Proportion p

  1. The sample observations are a simple random sample.
  2. The conditions for a binomial distribution are satisfied.
  3. The conditions \(n p \geq 5\) and \(n q \geq 5\) are both satisfied, so the binomial distribution of sample proportions can be approximated by a normal distribution with \(\mu=n p\) and \(\sigma=\sqrt{n p q}\). (Note: \(p\) is the assumed proportion not the sample proportion.)

Calculate the Test Statistic for a Claim about the Proportion

  1. Test Statistic for Proportion Claim:
    • n = sample size
    • p = population proportion q = 1 – p
    • \(\hat{p}=\frac{x}{n}=\) sample proportion
    • \(z=\frac{\hat{p}-p}{\sqrt{\frac{p q}{n}}}\)
  2. Use P-value to Evaluate the Null Hypothesis:

    (Will use NORMAL Calculator to get Critical Values)

    • Definition of P-value: The probability of being as extreme, or more extreme, than your data, assuming H0 is correct.
    • Making a decision about the null hypothesis based on the P-value
      1. If P-value \(\leq \alpha\), (Or TS is in the rejection area) reject \(H_0\) .
      2. If P-value \(> \alpha\), (Or TS is NOT in the rejection area) fail to reject \(H_0\) .
  3. Formulate a Conclusion About the Claim

    Use your decision about the null to make a statement about the claim:

    Null is the Claim Alternate is the Claim
    Reject the Null “There is sufficient sample evidence to reject the claim that…” “There is sufficient sample evidence to support the claim that…”
    Fail to Reject the Null “There is not sufficient sample evidence to reject the claim that…” “There is not sufficient sample evidence to support the claim that…”

Problems

  1. Driving and Texting: In a survey, 1864 out of 2246 randomly selected adults in the U.S. said that texting while driving should be illegal (based on data from Zogby International). Consider a hypothesis test that uses a 0.05 significance level to test the claim that more than 80% of adults believe that texting while driving should be illegal.

    Sample proportion: \(\hat{p}=\frac{1864}{2246}=.8299\)

    1. Design the Study
      1. The original Claim: \(p>0.8\)
      2. \(H_0\): \(p \leq 0.8\)     assume \(H_0\): \(p = 0.8\)
      3. \(H_A\): \(p > 0.8\)
      4. Type of Test: Right-tailed test
      5. Level of Significance: \(\alpha =.05\)
      6. Critical Value(s): \(z=1.645\) (Determines shaded area(s))

      Graph of a normal curve.  The x-axis is labeled from -3 to 3 in intervals of 1.  The top of the curve is at x=0.  The y-axis is labeled from 0 to 0.4 in intervals of 0.1.  The area under the curve is shaded for all x values greater than 1.645.  Under the graph, there are three boxes: First box: Mean:0, Second Box: Std. Dev.:1, Third box P(x>=1.6448536)=.05

    2. Results
      1. Test Statistic: A standardized value based on the sample data, assuming the null is true.

        \(z=\frac{\hat{p}-p}{\sqrt{\frac{p q}{n}}} \quad=\frac{8299-.8}{\sqrt{\frac{(.8)(.2)}{2246}}}=3.5426\)

        \(z=3.5449 \quad \text{from StatCrunch}\)

      2. p-value: 0.0002
      3. Rejection Criteria: Reject \(H_0\) if p value \(< \alpha\) and the test statistic (z) is in the rejection region.
      4. Decision about the null: Reject \(H_0\)

        Reason 1: 0.0002 < .05 or Reason 2: 3.5426 is in the rejection region

      5. Concluding statement about the claim: There is sufficient evidence to support the claim more than 80% of adults believe texting while driving should be illegal.

      One sample proportion summary hypothesis test: P: proportion of successes, H_O: p=0.8, H_A: p>0.8.
              Hypothesis Test results: 
              Proportion: p
              Count: 1864
              Total: 2246
              Sample Prop.: 0.82991986
              Std. Err.: 0.0084402462
              z-Stat: 3.5449034
              p-value: 0.0002

      Graph of Standard Normal Distribution with z-stat=3.54449, and p-value: 0.0002.  The x-axis is labeled from -4 to 4 in intervals of 2.  The top of the curve is at x=0.  The y-axis is labeled from 0 to 0.4 in intervals of 0.1.  The z-stat of 3.5449 is marked with a vertical line on the graph and the graphs is shaded to the end of the graph on the right.

  2. Cell phone and Cancer: In a study of 420,095 Danish cell phone users, 135 subjects developed cancer of the brain or nervous system (based on data from the Journal of the National Cancer Institute as reported in USA Today). Test the claim of a once popular belief that such cancers are affected by cell phone use. That is, test the claim that cell phone users develop cancer of the brain or nervous system at a rate that is different from the rate of 0.0340% for people who do not use cell phones. Because this issue has such great importance, use a 0.01 significance level.

    Sample proportion: \(\hat{p}=\frac{135}{420095} \approx 0.000321\)

    1. Design the Study
      1. The original Claim: \(p \neq 0.00034\)
      2. \(H_0\): \(p = 0.00034\)< assume \(H_0\): \(p = 0.00034\)
      3. \(H_A\): \(p \neq 0.00034\)
      4. Type of Test: Two-tailed test
      5. Level of Significance: \(\alpha =0.1\)
      6. Critical Value(s): \(z=\pm 2.576\) (Determines shaded area(s))

      Graph of a normal curve.  The x-axis is labeled from -3 to 3 in intervals of 1.  The top of the curve is at x=0.  The y-axis is labeled from 0 to 0.4 in intervals of 0.1.  The area under the curve is shaded for all x values between -2.576 and +2.576.  Under the graph, there are three boxes: First box: Mean:0, Second Box: Std. Dev.:1, Third box P(-2.5758<=x<=2.5758)=.99

    2. Results
      1. Test Statistic: A standardized value based on the sample data, assuming the null is true.

        \(z=\frac{\hat{p}-p}{\sqrt{\frac{p q}{n}}}=\frac{.000321-.00034}{\sqrt{\frac{(.00034)(.99966)}{420095}}}=0.6680\)

        \(z=-0.6555 \quad \text{from StatCrunch}\)

      2. p-value: 0.5122
      3. Rejection Criteria: Reject \(H_0\) if p value \(< \alpha\) and the test statistic (z) is in the rejection region.
      4. Decision about the null: Fail to reject \(H_0\)

        Reason 1: 0.5122 > .01 or Reason 2: -0.6680 (StatCrunch -0.6555) is not in the rejection region

      5. Concluding statement about the claim: There is not sufficient evidence to support the claim phone users develop cancer at a rate different than .00034 which is the rate for non cell phone users.

      One sample proportion summary hypothesis test: P: proportion of successes, H_O: p=0.00034, H_A: p is not equal to 0.00034.
                      Hypothesis Test results: 
                      Proportion: p
                      Count: 135
                      Total: 420095
                      Sample Prop.: 0.00032135588
                      Std. Err.: 0.000028444078
                      z-Stat: -0.65546566
                      p-value: 0.5122

      Graph of Standard Normal Distribution with z-stat=-0.6555, and p-value: 0.5122.  The x-axis is labeled from -4 to 4 in intervals of 2.  The top of the curve is at x=0.  The y-axis is labeled from 0 to 0.4 in intervals of 0.1.  The graph is shaded for all values of x less than -0.6555 and greater than 0.6555.

  3. Flu: In a clinical trial, 28 out of 800 patients taking a prescription drug complained of flulike symptoms. Suppose that it is known that 2.2% of patients taking competing drugs complain of flulike symptoms. Test the claim that more than 2.2% of this drug’s users experience flulike symptoms as a side effect at the 0.1 significance level.
    1. Design the Study
      1. The original Claim: \(p > 0.022\)
      2. \(H_0\): \(p \leq 0.022\)<     assume \(H_0\): \(p = 0.022\)
      3. \(H_A\): \(p > 0.022\)
      4. Type of Test: Right-tailed test
      5. Level of Significance: \(\alpha =0.01\)
      6. Critical Value(s): \(z=1.282\) (Determines shaded area(s))

      Graph of a normal curve.  The x-axis is labeled from -3 to 3 in intervals of 1.  The top of the curve is at x=0.  The y-axis is labeled from 0 to 0.4 in intervals of 0.1.  The area under the curve is shaded for all x values greater than 1.282.  Under the graph, there are three boxes: First box: Mean:0, Second Box: Std. Dev.:1, Third box P(x>=1.2815516)=0.1

    2. Results
      1. Test Statistic: A standardized value based on the sample data, assuming the null is true.

        \(z=\frac{\hat{p}-p}{\sqrt{\frac{p q}{n}}} = \frac{.035-.022}{\sqrt{\frac{(.022)(.978)}{800}}}=2.5067\)

      2. p-value: 0.0061
      3. Rejection Criteria: Reject \(H_0\) if p value \(< \alpha\) and the test statistic (z) is in the rejection region.
      4. Decision about the null: Reject \(H_0\)

        Reason 1: 0.0061 < .01 or Reason 2: 2.5067 is in the rejection region

      5. Concluding statement about the claim: There is sufficient evidence to support the claim more than 2.2% of the drug’s users experience flu like symptoms.

      One sample proportion summary hypothesis test: P: proportion of successes, H_O: p=0.022, H_A: p>0.022.
                      Hypothesis Test results: 
                      Proportion: p
                      Count: 28
                      Total: 800
                      Sample Prop.: 0.035
                      Std. Err.: 0.005186039
                      z-Stat: 2.5067301
                      p-value: 0.0061

      Graph of Standard Normal Distribution with z-stat=2.5067, and p-value: 0.0061.  The x-axis is labeled from -4 to 4 in intervals of 2.  The top of the curve is at x=0.  The y-axis is labeled from 0 to 0.4 in intervals of 0.1.  The z-stat of 2.5067 is marked with a vertical line on the graph and the graphs is shaded to the end of the graph on the right.

  4. Determine the minimum sample size needed if you wish to estimate the proportion of community members who favor a tax increase for more local school funding. You want the margin of error to be ± 2 % at the 95% confidence level.

    The minimum sample size needed is 2401 participants.

    Graph of Standard Confidence Interval for a proportion. The x-axis is labeled from 0 to 3500 in intervals of 500.  The y-axis is labeled from 0 to 0.06 in intervals of 0.01.  The minimum sample size is marked on the graphed at 2401 when the width is 0.04. The confidence interval that is entered is 0.95, the entered width is 0.04, and the target population is 0.5.

  5. Suppose that it is known in the scenario in #4 that 35% of the community's population will favor a tax increase. What is the minimum sample size needed if the margin of error is ±2% at the 95% confidence level.

    The minimum sample size needed is 2185 participants.

    Graph of Standard Confidence Interval for a proportion. The x-axis is labeled from 0 to 3000 in intervals of 500.  The y-axis is labeled from 0 to 0.06 in intervals of 0.01.  The minimum sample size is marked on the graphed at 2185 when the width is 0.04. The confidence interval that is entered is 0.95, the entered width is 0.04, and the target population is 0.35.