3.3 Central Limit Theorem


  1. Use a random number generator or go to random.org and use their true random number generator to generate 9 random numbers between 1-5000. List your random numbers here:





  2. At a cola bottling plant, a machine fills 2-liter soda bottles. The machine is calibrated to dispense 2.02 liters in each bottle. Actual quantities dispensed vary and the amounts are normally distributed with a standard deviation of 0.05 liters.


  3. Extra room is left at the top of each bottle for a maximum of 2.25 liters of soda before the bottle overflows and causes a sticky mess on the assembly line. What percent of the bottles will be overfilled by a correctly calibrated machine?

    \(P(x>2.25)=\) 0.00000211

    Write as a percentage: 0.000211%


  4. If the plant produces 1,500,000 bottles per year, how many times per year do you expect to clean up a spill from a correctly calibrated machine? 0.00000211(1,500,000)=3.315 We expect to clean up a spill 3 times per year.

  5. The company prefers that their 2-liter bottles contain at least 1.95 liters of soda. You are in charge of quality control. You take sample bottles off the assembly line to measure the amount of soda and decide if the machine needs to be recalibrated. The assembly line must be shut down for several hours in order to recalibrate the machine. If you select one bottle from the assembly line with a correctly calibrated machine, what is the probability it will contain less than 1.95 liters?

    \(P(x<1.95)=\) 0.0808

  6. Load the data set “Cola Bottling Company” from My Group in StatCrunch. Look at the row number that matches your first random number. Record the data value in that row for the amount of cola in liters from column 1. __________

  7. The plant foreman says you must recalibrate the machine if the sample bottle contains less than 1.95 liters. Based on your randomly selected “bottle,” do you need to recalibrate the dispensing machine? Yes or No? ______________________

  8. Add your data value for the amount of cola in one randomly selected bottle to the class dot plot.

  9. If the bottling plant is open 300 days per year and you randomly select 1 bottle each day to decide whether to recalibrate that day, how many times per year would you expect to have to recalibrate a correctly calibrated machine? _______________________________

  10. You decide it might be better to take a larger sample of bottles off the assembly line to decide whether to recalibrate. Look at the rows corresponding to your first four random numbers in column 1 of the “Cola Bottling Company” data set. Find the average amount of cola in liters for those 4 “bottles.” Round to two decimal places.

    Mean amount in 4 random bottles: __________________________


  11. Add your sample mean for the amount of cola in 4 randomly selected bottles to the class dot plot.

  12. What is the mean and standard deviation for the Distribution of Sample Means when the sample size (n) is 4 bottles? Mean: _______________ Standard deviation: _____________

  13. If you select four bottles from the assembly line with a correctly calibrated machine, what is the probability the mean amount of cola will be less than 1.95 liters? _________________________

  14. You decide to randomly select a sample of 9 bottles off the assembly line and test if their mean is less than 1.95 liters. Look at the rows corresponding to your nine random numbers in column 1 of the “Cola Bottling Company” data set. Find the average amount of cola in liters for those 9 “bottles.” Round to two decimal places.

    Mean amount in 9 random bottles: _________________________


  15. What is the mean and standard deviation for the Distribution of Sample Means when the sample size (n) is 9 bottles? Mean: _______________ Standard deviation: __________________

  16. If you select nine bottles from the assembly line with a correctly calibrated machine, what is the probability the mean amount of cola will be less than 1.95 liters? ________________________

  17. If the bottling plant operates 300 days per year, and you sample the 2-liter bottles once a day, how often would you have to unnecessarily recalibrate a correctly calibrated dispensing machine if you randomly select

    1 bottle per day?

    4 bottles per day?

    9 bottles per day?


  18. If you select a sample of 4 bottles and the mean amount of cola is 1.95 liters, do you think the machine needs recalibrating? Yes or No? ____________

    Explain:


  19. What do you think is the best sampling method for making a decision about machine recalibrating? Why?
  20. ACT scores for high school seniors at a large school district are normally distributed with a mean of 22 and a standard deviation of 4. Compare the two distribution curves given below.

    Description of Curve A: Every student is represented on the curve as an individual and the entire population is represented.

    Description of Curve B: Groups of 25 students are randomly selected from the population and the mean ACT of each group is represented on the curve.

    Curve A and Curve B are both normal curves. The horizontal axis represents the ACT scores and is numbered from 0 to 40, counting by 10. The vertical axis represents the number of students who earned each of the possible ACT scores. Curve A represents the graph of every student in the population and it is wide and short. It goeas from 10 to 36 with a mean of 22. Curve B represents the mean ACT score for each of the groups of 25 students. It is tal and narrow. It goes from about 18 to about 24, with a mean of 22.


  21. Identify the mean of Curve A and the mean of Curve B. What do you notice?

    \(\mu_{A}=22\)

    \(\mu_{B}=22\)

    Both curves have the same mean.

  22. Compare the standard deviation of Curve A and Curve B. What do you notice?

    The standard deviation of Curve A is greater than the standard deviation of Curve B.

    The standard deviation for the distribution of sample means is less than the standard deviation of the population the samples were selected from.

  23. Give an explanation for your observations in the previous two questions.

    The means are the same in the two curves because the mean of each sample will be close to the mean of the population. The standard deviation for the distribution of sample means is less than the distribution of the population because an extreme value randomly selected from one of the tails of the population curve is averaged together with 24 other values to produce the mean of the sample. Each sample mean will be close to the population mean and so the spread of sample means is less than the spread of individuals.

  24. The Central Limit Theorem:

    • A distribution of sample means approaches a normal distribution.
    • Samples must be from a normally distributed population, or the sample size must be greater than 30 (n > 30).
    • The distribution of sample means has the same mean as the population.
    • The standard deviation for the distribution of sample means is always less than the standard deviation population.
    • The standard deviation for the distribution of sample means is \(\frac{\sigma}{\sqrt{n}}\).

    Two normal curves on a graph. The horizontal axis is labeled Distributions of Individuals versus Means.  One curve is taller and skinner, and the other is shorter and wider. The short, wide curve is labeled distribution of individuals and includes the sigma symbol.  The tall, skinny curve is labeled normal distribution of sample means and includes the formula sigma sub x bar equals sigma divided by the square root of n.

  25. Birth weights in the United States have a distribution that is approximately normal with a mean of 3369g and a standard deviation of 567g (American Journal of Epidemiology).
    1. If one baby is randomly selected, find the probability that its birth weight is between 3360 grams and 3400 grams.

      \(\mu = \) 3369     \(\sigma = \) \(567\)

      \(P(3360\leq x\leq3400)=0.0281\) or 2.81%

    2. If 25 babies are randomly selected, find the probability that their mean birth weight is between 3360g and 3400g.

      \(\mu = \) 3369      \(\sigma = \) \(\frac{567}{\sqrt{25}}=113.4\)      \(z_{3360}=\frac{3360-3369}{113.4} \approx-0.0794\)      \(z_{3400}=\frac{3400-3369}{113.4} \approx 0.2734\)

      \(P(3360 \lt \overset-x \lt 3400)=P(-0.08 \lt z \lt 0.27)=0.1393\)

      A normal curve labeled with the mean and 3 standard deviations to the left and 2 standard deviations to the right of the mean on the horizontal axis. They are: 3028.8, 3142.2, 3255.6, 3369, 3482.4, and 3595.8. The area under the curve is shaded between 3360 and 3400.

      A standard normal curve labeled with the mean and 3 standard deviations to the left and to the right of the mean on the horizontal axis. They are: -3, -2, -1, 0, 1, 2, and 3. The area under the curve is shaded between -.08 and 0.27

      If 25 babies are randomly selected, the probability their mean birth weight is between 3360g and 3400g is approximately equal to 0.1393 or 13.93%.

  26. Pulse Rates of adult men are normally distributed with a mean of 73 bpm and a standard deviation of 8 bpm.
    1. If one man is randomly selected, find the probability that his pulse rate is less than 72.5 beats per minute.

      \(\mu = \) 73 bpm      \(\sigma = \) \(8\)

      \(P( x < 72.5)=0.4751\) or 47.51%

    2. If 10 adult males are randomly selected, what is the probability their mean pulse rate will be less than 72.5 bpm.

      \(\mu = \) 73 bpm      \(\sigma = \) \(\frac{8}{\sqrt{10}} \approx 2.53 \text{ bpm}\)      \(z_{72.5}=\frac{72.5-73}{\left(\frac{8}{\sqrt{10}}\right)} \approx-0.1976\)

      \(P(\overline{x} \lt 72.5)=P(z \lt -0.1976)=0.4217\)

      A normal curve labeled with the mean and 3 standard deviations to the left and to the right of the mean on the horizontal axis. They are: 65.41, 67.94, 70.47, 73, 75.53, 78.06, and 80.59. The area under the curve is shaded to the left of 72.5.

      A standard normal curve labeled with the mean and 3 standard deviations to the left and to the right of the mean on the horizontal axis. They are: -3, -2, -1, 0, 1, 2, and 3. The area under the curve is shaded to the left of -0.1976.

      If 10 adult males are randomly selected, the probability their mean pulse rate is less than 72.5 bpm is approximately equal to 0.4217 or 42.17%.

  27. Probability Decisions


    Circle the little words in each problem that have big meaning!



  28. Problem

    The mean amount of cash students at PSCC carry in their pocket is $15 and the standard deviation is $9. If 40 students are randomly selected, what is the probability the mean amount of cash in their pocket is greater than $18?

    Type of Probability Problem

    Normal distribution: use Normal Calculator

    • One individual: use given standard deviation
    • Sample mean: adjust standard deviation using \(\frac{\sigma}{\sqrt n}\)

    Binomial distribution: use Binomial Calculator

    Sample Mean

    Evidence and Result

    How did you know the type of probability problem?





    Calculations:

    \(\mu=15\\\sigma=9/\sqrt{40}\\P(x>18)=0.0175\)





  29. Problem

    People in the U.S. between the ages of 25 and 34 years old spend an average of $1630 per year on eating out. Money spent on food away from home is normally distributed with a standard deviation of $425. What is the probability that a randomly selected 25-34-year-old spent more than $1800 on eating out last year?

    Type of Probability Problem

    Normal distribution: use Normal Calculator

    • One individual: use given standard deviation
    • Sample mean: adjust standard deviation using \(\frac{\sigma}{\sqrt n}\)

    Binomial distribution: use Binomial Calculator

    One Individual

    Evidence and Result

    How did you know the type of probability problem?





    Calculations:

    \(\mu=1630\\\sigma=425\\P(x>1800)=0.3446\)





  30. Problem

    The mean salary for a Tennessee public school teacher is $54000 with a standard deviation of $5250. If you randomly select the salaries of 49 Tennessee public school teachers, what is the probability the sample mean is less than $50,000?

    Type of Probability Problem

    Normal distribution: use Normal Calculator

    • One individual: use given standard deviation
    • Sample mean: adjust standard deviation using \(\frac{\sigma}{\sqrt n}\)

    Binomial distribution: use Binomial Calculator

    Sample Mean

    Evidence and Result

    How did you know the type of probability problem?





    Calculations:

    \(\mu=54,000\\\sigma=5250/\sqrt{49}\\P(x<50,000)=0.00000005\)





  31. Problem

    Delta flights are on time 84% of the time. Suppose 10 flights are randomly selected. What is the probability that more than 8 flights are on time?

    Type of Probability Problem

    Normal distribution: use Normal Calculator

    • One individual: use given standard deviation
    • Sample mean: adjust standard deviation using \(\frac{\sigma}{\sqrt n}\)

    Binomial distribution: use Binomial Calculator

    Binomial Calculator

    Evidence and Result

    How did you know the type of probability problem?





    Calculations:

    \(p=0.84\\n=10\\P(x>8)=0.5080\)





  32. Problem

    LED light bulbs last an average of 32,500 hours with a standard deviation of 8750 hours. If you randomly purchase an LED light bulb, what is the probability it will last at least 50,000 hours?

    Type of Probability Problem

    Normal distribution: use Normal Calculator

    • One individual: use given standard deviation
    • Sample mean: adjust standard deviation using \(\frac{\sigma}{\sqrt n}\)

    Binomial distribution: use Binomial Calculator

    One Individual

    Evidence and Result

    How did you know the type of probability problem?





    Calculations:

    \(\mu=32,500\\\sigma=8750\\P(x\geq 50,000)=0.0228\)





  33. Problem

    In a recent study, the mean age of tablet users is 34 years old. Suppose the standard deviation is 15 years. Take a sample of size n = 100. Find the probability that the sample mean age is less than 30 years.

    Type of Probability Problem

    Normal distribution: use Normal Calculator

    • One individual: use given standard deviation
    • Sample mean: adjust standard deviation using \(\frac{\sigma}{\sqrt n}\)

    Binomial distribution: use Binomial Calculator

    Sample Mean

    Evidence and Result

    How did you know the type of probability problem?





    Calculations:

    \(\mu=34\\\sigma=15/\sqrt{100}\\P(x<30)=0.0038\)