Median as another central measure: 3.75

Supporting Materials

 

Indicator of Progress

Success depends on students finding the median of an odd-numbered set of numerical data and appreciating qualitatively how the median is affected by the inclusion of new data.

This is the second ‘central measure’ that students will experience; they have already used ‘mode’ (Level 3.25) and they are yet to meet ‘mean’ (Level 4.0).

At this level, they find the median from an even-numbered set by estimating or intuitively identifying the average of the two middle values, or simply saying it is between them. Later, they will find the median through the (weighted) mean of the middle two data values. The distinctions that are made at later levels between discrete numerical data (like shoe sizes) where no intermediate values are possible and continuous numerical data (like height) are not made at this level.

For more information, related indicators are:

 

Illustration 1: Moving from categorical to numerical data

outdoor thermometer

Students first become familiar with categorical data (such as football teams, favourite pets and colours) and understand that the most popular team, colour or pet can be found by voting. The category with the most votes is the ‘mode’. The mode is the only statistic suitable for categorical data.

The next step is to apply the concept of mode to numerical data. For example, if we read a thermometer outside the classroom every day at 1:00 p.m. and record the temperatures for a month then we might find the mode 1 pm temperature for February is 30 degrees and the mode for May is 22 degrees. These modes are the most common temperatures for those months and might be used for month-to-month comparisons.

 

Illustration 2: Moving from mode to median

While it is possible to use the mode for describing both categorical and numerical data, the median is useful only for data that can be ordered from smallest to largest (usually numerical).

If the 1 pm temperatures for every day of May were listed in order from the minimum to the maximum, the median is then the middle score. Sometimes the median is the same as the mode, but not always. For example, if the 1pm temperatures every day for a week were 22°, 22°, 22°, 20°, 26°, 25°, 24° then the mode and the median are both 22°. If instead the 20° day had a temperature of 29°, then the mode remains 22°, whilst the median is 24°.

 

Illustration 3: Moving from median to mean

Both the median and the mean are useful for summarising numerical data. The advantage of the median is that, as it is based on the simple process of ordering, it is easier to calculate than the mean. The disadvantage of the median is that it is not reflective of all the data set in the way that the mean is. The mean also has stronger mathematical properties, so it is used in advanced statistics.

 

Teaching Strategies

These activities are based around a set of 15 data cards which students arrange in various ways as they focus on the different data provided on the cards - Data cards small (PDF - 351Kb).

Larger versions of these cards are also provided for teacher’s use - Data cards large (PDF - 827Kb).

Students can use class data for later work. Each of the activities proceeds in four steps:

Step 1: Recognise variation in the data set by noting individual values and looking at the range
Step 2: Group the data by looking for patterns, calculating frequencies, column graphs etc
Step 3: Represent the data with mode, median and range
Step 4: Explore properties of the mode, median and range.

Activity 1: T-shirts reviews students’ knowledge of the mode for categorical data, using the four steps above.
Activity 2: Heights introduces the concept of the median, which applies to numerical data, using the four steps above.
Activity 3: Pets, shoe sizes and travel times establishes the work on median, mode and range in three different contexts, each using the four steps above.
Activity 4: Comparing groups splits the data set into 2 groups and students describe the differences between them, using mode, median and range. At this level, students do not deal with relationships between two numerical variables (e.g. shoe size and height).

 

Activity 1: T-shirts

Ask students questions about the data cards (see the suggestions below) focussing on the categorical variable of T-shirt design to review their knowledge of grouping, mode etc.

Step 1: Recognise variation in the data set, by noting individual values and looking at the range.

Students should group their cards according to the T-shirt design to answer the questions. Note that as this variable does not have any inherent order, the students may arrange their groups of cards in many different (acceptable) ways. This is in contrast to the numerical data such as number of pets in later activities.

  • What types of T-shirts are there? (dotty, grey, white, two sorts of stripes)
  • Who is wearing a dotty T-shirt? (Fay)
  • Who is wearing a white T-shirt? (any of Bill, Sue & Ruth)
  • Is there another child who is wearing the same T-shirt as Jeff? (Yes, Amy and Mark)

Step 2 : Group the data by looking for patterns, calculating frequencies, column graphs etc

These questions focus on one category at a time and then compare categories:

  • How many children are wearing a dotty T-shirt? (1)
  • How many children are wearing a grey T-shirt? (6)
  • How many children are wearing a T-shirt with horizontal stripes? (2)
  • How many children are wearing a T-shirt with vertical stripes? (3)
  • How many children are wearing a white T-shirt? (3)
  • Which is the least popular T-shirt? (dotty)
  • How many more children are wearing grey T-shirts than horizontal striped T-shirts? (4)
  • Which is the most popular T-shirt? (grey)

Arranging the cards in rows or columns based on T-shirt design will lead to a pictograph (using cards as objects on the graph) and then to a column graph similar to that shown below.

Bar graph: Number of children by tshirt design
T-shirt Number of children
dots
1
grey
6
horiz. stripes
2
vertical stripes
3
white
3

Step 3: Represent the data with mode (no median yet)

  • Which is the most popular T-shirt? (grey)

This is called the ‘mode’: it occurs more often than any other T-shirt.
If you owned a T-shirt shop, you would probably keep more grey T-shirts than any other colour.

Step 4: Explore properties of the mode

A new student, Tara, is about to join the other 15 students already on the data cards.

  • What will the mode be if Tara is wearing a dotty/white/grey/… T-shirt? (Mode is still grey).

Five new students are put onto the data cards.

  • Can adding these 5 students change the mode? How? (Yes – e.g. all wear white)
  • Can adding these 5 students keep the mode the same? How? (Yes – e.g. one of each colour).

 

Activity 2: Heights

This activity introduces the concept of the median as the middle value, seen when the data cards are arranged in order. This is numerical data, so whilst the cards can be grouped as in Activity 1 above, there is an inherent order between the groups. Students should be able to see the different heights of the children on the cards, as well as using the data on the cards.

Step 1: Recognise variation in the data set by noting individual values and looking at the range

Students ask and answer questions focusing on the individual values. For example:

  • How tall is Gail? (145 cm)
  • Is there another child who is the same height as Gail? (yes, Bill)
  • Who is shorter than Fay? (only Amy)
  • Who is exactly 155 cm tall? (Jeff and Nora)
  • Who is the tallest of the children? (Ruth)
  • Who is the shortest? (Amy)

Step 2: Group the data by looking for patterns, calculating frequencies, column graphs etc

Students look at the data in groups, focusing on one category at a time.

  • How many children are 120 cm tall? (1)
  • How many children are 125 cm tall? (0)
  • How many children are 130 cm tall? (2), etc
  • How many children are 150 cm or taller? (5)
  • How many children are taller than Amy (14)
  • How many children are taller than Dave and shorter than Jeff (3)
  • Compare the number of children who are shorter than Dave with the number of children who are taller than him.

Summarise these investigations with tables and charts.

Bar graph - number of children by height
Height Number of children
120
1
130
2
135
2
140
3
145
2
150
1
155
2
160
1
165
0
170
0
175
1

Step 3: Represent the data with mode, median and range

Arrange the data cards in order of height.

Here is one possible arrangement of the cards based on heights, from shortest to tallest. Note that students of the same height can be rearranged. Once the cards are ordered, a new statistic (the median) can be introduced. It is another ‘central measure’ like the mode. Note that the mode is 140 even if the cards are rearranged and Sue is the middle card instead of Dave.

120

130

130

135

135

140

140

140

145

145

150

155

155

160

175

Amy

Mark

Fay

Ian

Pete

Sue

Cath

Dave

Bill

Gail

Tom

Nora

Jeff

Luke

Ruth

The image below shows the middle seven cards from the above arrangement. There is a strong visual image of the median height being the height of the middle child.

Sketchs of kids of varying height

Report the 3 statistics: range, mode and median.

  • What is the range for the heights? (55cm, from 120 cm to 175 cm)
  • Which height is the mode? (140cm). More students are this height than any other height.
  • If we put the cards in order, what is the height written on the middle card? (140 cm)

The median height is 140 cm; the median is the middle value when the heights are arranged in order. Note: in this example, the median is the same as the mode, both 140 cm. This is not always the case, they can be different numbers (see Illustration 2).

Step 4: Explore the properties of the mode, median and range

To explore how the median, mode and range work, set students the task of designing cards for 2 new students Tara and Wong, so that:

  • the range stays the same and the mode changes to 155
  • the median changes
  • the median changes, but the mode stays the same
  • the mode changes, but the median stays the same.

Students can arrange themselves in order of height to find the median height for their own group.

 

Activity 3: Pets, shoe sizes and travel times

Activity 2 above, illustrated with heights, is repeated for three different variables:

  • number of pets
  • shoe sizes and
  • travel times.

For each variable, follow the four steps as illustrated in Activity 2, gradually reducing the support offered for finding the median.

Step 1: Recognise variation in the data set, by noting individual values and looking at the range
Step 2: Group the data, by looking for patterns, calculating frequencies, column graphs etc
Step 3: Represent the data with mode, median and range
Step 4: Explore the properties of the mode, median and range.

When the cards are put in order of these other variables, the median is not obvious from the visual arrangement, but it is still the middle card.

Number of pets

Bar graph - no. of children vs no. of pets

Number of pets

Number of children

0

6

1

4

2

2

3

0

4

1

5

1

6

0

7

0

8

0

9

0

10

1

 

0

0

0

0

0

0

1

1

1

1

2

2

4

5

10

Amy

Mark

Ian

Dave

Tom

Jeff

Pete

Cath

Bill

Luke

Gail

Nora

Sue

Ruth

Fay

Note that the mode and the median remain the same (0 and 1 respectively) even if the cards of people with the same number of pets are rearranged (e.g. if Pete and Cath are swapped).

The image below shows the middle seven cards from the above arrangement. Note the contrast with the image of heights; the attribute of height is no longer the basis for this activity.

Sketchs of kids

For Step 4, ask questions such as:

  • Two students are removed from the data set, so that the mode number of pets stays the same and the median stays the same. Who could they be? (Example answer: both have 1 pet e.g. Pete & Cath or Bill & Luke)
  • Four students are removed from the data set, so that the mode number of pets stays the same and the median changes. Who could they be? (Example answer: everyone with 1 pet)
  • Four students are removed from the data set, so that the mode number of pets changes and the median stays the same. Who could they be? (Example answer: 3 people with 0 pets, and 1 person with 2 or more pets e.g. Fay)

Shoe Size

Here is one correct ordering of cards from smallest shoe to largest. The median shoe size is 6.

3

4

4

5

5

5

5

6

7

7

7

7

8

9

11

Mark

Amy

Cath

Fay

Ian

Pete

Sue

Gail

Bill

Tom

Nora

Ruth

Dave

Luke

Jeff

 

Bar graph: number of children by shoe size

Shoe Size

Number of children

3

1

4

2

5

4

6

1

7

4

8

1

9

1

10

0

11

1

 

Activity 4: Comparing groups

Students can divide the data cards into groups, such as boys and girls. They can compare height, shoe size, number of pets and travel time for these two gender groups. They can also compare the categorical data (T-shirt colour, shoe colour, etc).

In each case, students will:

  • draw a column graph for each gender
  • find the mode and median for each gender*
  • compare the two data sets. If students think there is a difference in this data set, they should consider whether there might be a difference for boys and girls in general.

*Note: Finding the median for the 8 boys presents a new problem. Whereas there is always a middle person in the set of 7 girls, there is never a middle person in the set of 8 boys. To overcome this, students can either say the median is 'between 140 cm and 145 cm' for height, or they can take the number halfway between. Many students will know this is 142.5 cm, which is the average/mean of the two middle values.

Further Resources

The following resource contains sections that may be useful when designing learning experiences:

Digilearn object *

Rice crisp machine – students check whether a machine is packing most bags with an acceptable number of rice crisps (20–35 rice crisps per packet). Students take a sample of 100 packets and make a boxplot to analyse the results. Students sort the sample data into four equal groups. Students identify the range, median, first and third quartile values.
(https://www.eduweb.vic.gov.au/dlr/_layouts/dlr/Details.aspx?ID=4431)

* Note that Digilearn is a secure site; DEECD login required.