Unit 1 Describing Data

2.3 Measures of Central Tendency

Mean, Median, Mode, Midrange


Notation:

  \(\sum\) denotes the sum of a set of values

 \(x\) is the variable usually used to represent the individual data values

 \(n\) lower case n represents the number of data values in a sample

 \(N\) upper case N represents the number of data values in a population

Arithmetic Mean:

  Mean of a Sample (this is a statistic) \(\overline{x}=\frac{\sum x}{n}\)

  Mean of a Population (this is a parameter) \(\mu=\frac{\sum x}{N}\)

Median:
Mode:
MidRange:
Round off Rule for Measures of Center:

 Carry one more decimal place than is present in the original set of values.



  1. Calculate the measures of central tendency for the life expectancies from the given data. Verify using technology.

    Country Health $ per capita Obesity % Life expectancy Universal Healthcare
    Canada 5292 30.1 82 yes
    China 420 7.3 76 no
    Germany 5411 22.7 81 yes
    Italy 3258 23.7 84 yes
    Japan 3703 3.5 84 yes
    Mexico 677 27.6 77 no
    Norway 9522 24.8 82 yes
    Switzerland 9674 21 83 yes
    United Kingdom 3377 29.8 82 yes
    United States 9403 35 79 no

    1. Mean: 81
    2. Median: 82
    3. Position of the Median: \(\frac{n+1}{2}=\frac{10+1}{2}=5.5\)
    4. Mode: 82
    5. Midrange: \(\frac{76+84}{2}=80\)
  2. Using the table in the problem above, calculate the measures of central tendency for the obesity percentages from the given data. Verify using technology.
    1. Mean: 22.55
    2. Median: 24.25
    3. Position of the Median: \(\frac{n+1}{2}=\frac{10+1}{2}=5.5\)
    4. Mode: none
    5. Midrange: \(\frac{3.5+35}{2}=19.25\)
  3. What do you notice about the measures for the 2 sets of data?


  4. Shanita made a note of the weights of 8 baskets of fruit, but she spilled coffee on the fruit and the last number got smudged. If the average weight of the baskets is 12.5 pounds, how much was the smudged number? In other words, how much did the 8th basket of fruit weigh?
    A chart showing the weights of all of the baskets of fruit. Basket 1 is 10.8 pounds, Basket 2 is 9.3 pounds, Basket 3 is 14.8 pounds, Basket 4 is 12.9 pounds, Basket 5 is 13.1 pounds, Basket 6 is 10.6 pounds and Basket 7 is 15.3 pounds. Basket 8 is covered in a coffee spill and we cannot read the weight of the basket.

    Sum of basket weights is \(8*12.5 = 100\), so basket 8 must be \(100 – 86.8 = 13.2\)


  5. The mean weight of three dogs is 60 pounds. Which of the following is possible? (Answer yes or no.)
    • One of the dogs weighs 100 pounds. yes
    • None of the dogs weighs more than 60 pounds. yes
    • Each dog weighs less than 60 pounds. no
    • The dogs weigh 25, 55, and 110 pounds. no
    • The dogs weigh 24, 52, and 104 pounds. yes
    • Altogether the dogs weigh 180 pounds. yes
  6. The mean can alternately be defined as the balancing point. Total distance below the mean = total distance above the mean. The mean of 2, 3, 6, and 9 is 5. Note that 3 + 2 = 1 + 4. A number line segment numbered 1 to 9, counting by 1. There is a dot above the 2,3,6 and 9. There is a dotted line segment drawn vertically at the 5. 'The mean' is written above the dotted line. There is one ray drawn above the number line from the dotted line to the left that ends at the 2 and has the number 3 written above it. There is another ray drawn from the dottend line to the left that ends at the 3 and has a 2 written above it. There is a ray drawn above the number line from the dotted line to the right that ends at the 9 and has the number 4 written above it. There is another ray drawn above the number line from the dotted line to the right that ends at the 6 and has the number 1 written above it.


    1. Which of the lines represents the mean of the data points shown below. A line segement divided into 8 equal parts by 9 hash marks. There is a dot above the second, sixth and seventh hash mark. Line A is at the third has mark, line B is at the fourth hash mark, line C is at the fifth hash mark and line D is at the sixth hash mark.

      C


    2. Which of the lines represents the mean of the data points shown below. A line segement divided into 8 equal parts by 9 hash marks. There are three dots above the second hash mark. There is one dot above the fifth, sixth and seventh hash mark. Line A is at the thirs has mark, line B is at the fourth hash mark, line C is at the fifth hash mark and line D is at the sixth hash mark.

      B

  7. Shapes of Distributions

    Symmetric (Normal):

     Mean and Median are the same as Mode

    A picture of a histogram and a picture of a curve drawn around the histogram with the bars removed. The histogram is labeled 'Histogram (n=21)'.The horizontal axis represents the x values and goes from 0 to 6, counting by 1. The vertical axis represents the frequency and goes from 0 to 10, counting by 2. The bar at 1 has a frequency of 2. The bar at 2 has a frequency of 5. The bar at 3 has a frequency of 7. The bar at 4 has a frequency of 5. The bar at 5 has a frequency of 2. The horizontal axis for the curve represents the x values and goes from 1.5 to 7.5, counting by 1. There is no vertical axis and the curve is a normal bell shaped curve.



    Skewed To The Left (negatively skewed):

     Mean and Median are to the LEFT of the Mode (Left Tail)

    A histogram labeled 'Histogram (n=29)'. The horizontal axis represents the x values and goes from 0 to 8, counting by 2. The vertical axis represents the frequency and goes from 0 to 12, counting by 2. The bar at 1 has a frequency of 1. The bar at 2 has a frequency of 3. The bar at 3 has a frequency of 5. The bar at 4 has a frequency of 7. The bar at 5 has a frequency of 9. The bar at 6 has a frequency of 4.
    Skewed To The Right (positively skewed):

     Mean and Median are to the RIGHT of the Mode (Right Tail)

    A histogram labeled 'Histogram (n=28)'. The horizontal axis represents the x values and goes from 0 to 8, counting by 2. The vertical axis represents the frequency and goes from 0 to 12, counting by 2. The bar at 1 has a frequency of 3. The bar at 2 has a frequency of 9. The bar at 3 has a frequency of 7. The bar at 4 has a frequency of 5. The bar at 5 has a frequency of 3. The bar at 6 has a frequency of 1.




    Bimodal:

     Two Modes

    A histogram labeled 'Histogram (n=35)'. The horizontal axis represents the x values and goes from 0 to 6, counting by 1. The vertical axis represents the frequency and goes from 0 to 12, counting by 2. The bar at 1 has a frequency of 5. The bar at 2 has a frequency of 10. The bar at 3 has a frequency of 5. The bar at 4 has a frequency of 10. The bar at 5 has a frequency of 5.
    Uniform:

     All frequencies are the same

    A histogram labeled 'Histogram (n=25)'. The horizontal axis represents the x values and goes from 0 to 6, counting by 1. The vertical axis represents the frequency and goes from 0 to 6, counting by 2. The bar at 1 has a frequency of 5. The bar at 2 has a frequency of 5. The bar at 3 has a frequency of 5. The bar at 4 has a frequency of 5. The bar at 5 has a frequency of 5.

    Weighted Means and Means of Grouped Data

  8. Calculate the semester grade point average (GPA) for the student with the following grades:
    Course Credit Hours Grade Point Value for Grade Quality Points (Points x hours)
    ENGL 1020 3 C

    2

    6

    MATH 1530 3 A

    4

    12

    COMM 1010 3 B

    3

    9

    BIOL 1110 4 C

    2

    8

    COLL 1000 1 B+

    3.5

    3.5

    Total

    14

    38.5

    \(\frac{38.5}{14}=2.75\)

  9. Calculating by Formula:

    \(\overline{x}=\frac{\sum(f \cdot x)}{\sum f} \quad\quad \text{Use class midpoints for x}\)


  10. Find the estimated mean of the following data:

    SECONDS FREQUENCY MIDPOINTS
    5.0 – 5.9 5

    5.45

    6.0 – 6.9 8

    6.45

    7.0 – 7.9 10

    7.45

    8.0 – 8.9 7

    8.45

    9.0 – 9.9 3

    9.45


    1. What is the estimated mean of the data? 7.30

      By Formula: \(\overline{x}=\frac{\sum(f \cdot x)}{\sum f}=\frac{5(5.45)+8(6.45)+10(7.45)+7(8.45)+3(9.45)}{33}=\frac{240.85}{33} \approx 7.30\)

    2. What is the upper limit of the 4th class? 8.9
    3. What is the class width? 1.0
    4. What is the shape of the distribution? Normal
    5. What class holds the median number of seconds? 7.0-7.9
  11. Below is a frequency distribution of the ages of riders for one ride on the Dahlonega Mine Train at Six Flags. Find the estimated mean of the following data:

    Age of Riders, in years Number of Riders
    10 - 14 1
    15 - 19 2
    20 - 24 7
    25 - 29 15
    30 - 34 4

    1. What is the estimated mean of the data? 25.3
    2. What is the upper limit of the 1st class? 14
    3. What is the class width? 5
      • Using LCL \(15-10=5\)
      • Using UCL \(29-24=5\)
      • Using midpt \(32-27=5\)
    4. What is the shape of the distribution? Left tail
    5. What class holds the median number of seconds? Class 25-29

      Position of the median: \(\frac{n+1}{2}=\frac{29+1}{2}=\frac{30}{2}=15\). Add frequencies until you reach 15.