5.1.3 Box and Whisker Plots
In this topic we will learn how to:
- draw and interpret box and whisker plots
A box and whisker plot is used to represent discrete data. However, it does not show all the data points. It consists of the box and the whiskers. It helps to give a general understanding of how the data is spread.
To draw a box and whisker plot you need the following information:
- minimum value
- lower quartile
- median
- upper quartile
- maximum value
To calculate the median, we use the formula,
The formula above, gives us the position of the median, which is denoted by and represents the sample size.
To calculate the lower quartile, we use the formula,
Where represents the lower quartile.
To calculate the upper quartile, we use the formula,
Where represents the upper quartile.
To calculate the interquartile range, we use the formula,
Where represents the interquartile range, represents the upper quartile, represents the lower quartile.
Let’s look at some past paper questions.
1. The times in minutes taken to run a marathon were recorded for a group of marathon runners and were found to be as follows:
(9709/62/M/J/19 number 6)
190 | 203 | 215 | 246 | 249 | 253 | 255 | 254 | 258 | 260 | 261 |
263 | 267 | 269 | 274 | 276 | 280 | 288 | 283 | 287 | 294 | 300 |
307 | 318 | 327 | 331 | 336 | 345 | 351 | 353 | 360 | 368 | 375 |
(a) Draw a box and whisker plot to illustrate the times for these people.
To be able to draw a box and whisker plot. We need the minimum value, lower quartile, median, upper quartile and the maximum value. The minimum value is,
The maximum value is,
Let’s find the lower quartile,
Notice how, there is no data point in position , so we get the average of data points on position and ,
190 | 203 | 215 | 246 | 249 | 253 | 255 | 254 | 258 | 260 | 261 |
263 | 267 | 269 | 274 | 276 | 280 | 288 | 283 | 287 | 294 | 300 |
307 | 318 | 327 | 331 | 336 | 345 | 351 | 353 | 360 | 368 | 375 |
Let’s find the upper quartile,
Notice how, there is no data point in position , so we get the average of the data points on position and ,
190 | 203 | 215 | 246 | 249 | 253 | 255 | 254 | 258 | 260 | 261 |
263 | 267 | 269 | 274 | 276 | 280 | 288 | 283 | 287 | 294 | 300 |
307 | 318 | 327 | 331 | 336 | 345 | 351 | 353 | 360 | 368 | 375 |
Let’s find the median,
Find the data point at position ,
190 | 203 | 215 | 246 | 249 | 253 | 255 | 254 | 258 | 260 | 261 |
263 | 267 | 269 | 274 | 276 | 280 | 288 | 283 | 287 | 294 | 300 |
307 | 318 | 327 | 331 | 336 | 345 | 351 | 353 | 360 | 368 | 375 |
Now that we have the necessary requirements, we can now draw the box and whisker plot. Mark the minimum and maximum values with a line each, these will represent the whiskers. To draw the box, mark the lower quartile and the upper quartile, these will be the edges of the box. Inside the box, label the median with a line. Finally, join the whiskers to the box. Don’t forget to label the -axis, with the appropriate label. The box represents the interquartile range, the whiskers represent the extreme values.
(b) Find the interquartile range of these times.
The formula for interquartile range is,
Substitute the lower and upper quartiles,
Therefore, the final answer is,
2. Two machines and , produce metal rods of a certain type. The lengths, in metres, of rods produced by machine and rods produced by machine are measured. It is given that, the minimum value of is , maximum value is , lower quartile is , upper quartile is , median is . It is given that, the minimum value of is , maximum value is , lower quartile is , upper quartile is , median is .(9709/52/M/J/20 number 3)
(a) Draw box and whisker plots for and .
Sketch the box and whisker plots, with above , and label them. Ensure that the extreme values are the whiskers, and the box is marked by the lower and upper quartiles. Insert the lines for the median inside their respective boxes. Don’t forget to label the -axis.
(b) Hence make two comparisons between the lengths of the rods produced by machine and those produced by machine .
Since has a larger interquartile range (box) than , the length of rods it produces is more spread.
Since the median of is smaller than that of , the lengths of are, on average smaller than those of .