5.1.3 Box and Whisker Plots

In this topic we will learn how to:

  • draw and interpret box and whisker plots

A box and whisker plot is used to represent discrete data. However, it does not show all the data points. It consists of the box and the whiskers. It helps to give a general understanding of how the data is spread.

To draw a box and whisker plot you need the following information:

  • minimum value
  • lower quartile
  • median
  • upper quartile
  • maximum value

To calculate the median, we use the formula,
q2=n+12q_{2} = \frac{n + 1}{2}The formula above, gives us the position of the median, which is denoted by q2q_{2} and nn represents the sample size.

To calculate the lower quartile, we use the formula,
q1=14(n+1)q_{1} = \frac{1}{4}(n + 1)Where q1q_{1} represents the lower quartile.

To calculate the upper quartile, we use the formula,
q3=34(n+1)q_{3} = \frac{3}{4}(n + 1)Where q3q_{3} represents the upper quartile.

To calculate the interquartile range, we use the formula,
IQR=q3q1IQR = q_{3} - q_{1}Where IQRIQR represents the interquartile range, q3q_{3} represents the upper quartile, q1q_{1} represents the lower quartile.

Let’s look at some past paper questions.

1. The times in minutes taken to run a marathon were recorded for a group of 3333 marathon runners and were found to be as follows:

(9709/62/M/J/19 number 6)

190203215246249253255254258260261
263267269274276280288283287294300
307318327331336345351353360368375

(a) Draw a box and whisker plot to illustrate the times for these 3333 people.

To be able to draw a box and whisker plot. We need the minimum value, lower quartile, median, upper quartile and the maximum value. The minimum value is,
190190The maximum value is,
375375Let’s find the lower quartile,
q1=14(n+1)q_{1} = \frac{1}{4}(n + 1)q1=14(33+1)q_{1} = \frac{1}{4}(33 + 1)q1=8.5q_{1} = 8.5Notice how, there is no data point in position 8.58.5, so we get the average of data points on position 8\textcolor{#2192ff}{8} and 9\textcolor{#2192ff}{9},

190203215246249253255254258260261
263267269274276280288283287294300
307318327331336345351353360368375

q1=254+2582q_{1} = \frac{\textcolor{#2192ff}{254} + \textcolor{#2192ff}{258}}{2}q1=256q_{1} = 256Let’s find the upper quartile,
q3=34(n+1)q_{3} = \frac{3}{4}(n + 1)q3=34(33+1)q_{3} = \frac{3}{4}(33 + 1)q3=25.5q_{3} = 25.5Notice how, there is no data point in position 25.525.5, so we get the average of the data points on position 25\textcolor{#2192ff}{25} and 26\textcolor{#2192ff}{26},

190203215246249253255254258260261
263267269274276280288283287294300
307318327331336345351353360368375

q3=327+3312q_{3} = \frac{\textcolor{#2192ff}{327} + \textcolor{#2192ff}{331}}{2}q3=329q_{3} = 329Let’s find the median,
q2=12(n+1)q_{2} = \frac{1}{2}(n + 1)q2=12(33+1)q_{2} = \frac{1}{2}(33 + 1)q2=17q_{2} = 17Find the data point at position 17\textcolor{#2192ff}{17},

190203215246249253255254258260261
263267269274276280288283287294300
307318327331336345351353360368375

q2=280q_{2} = \textcolor{#2192ff}{280}Now that we have the necessary requirements, we can now draw the box and whisker plot. Mark the minimum and maximum values with a line each, these will represent the whiskers. To draw the box, mark the lower quartile and the upper quartile, these will be the edges of the box. Inside the box, label the median with a line. Finally, join the whiskers to the box. Don’t forget to label the xx-axis, with the appropriate label. The box represents the interquartile range, the whiskers represent the extreme values.

Rendered by QuickLaTeX.com

(b) Find the interquartile range of these times.

The formula for interquartile range is,
IQR=q3q1IQR = q_{3} - q_{1}Substitute the lower and upper quartiles,
IQR=329256IQR = 329 - 256IQR=73IQR = 73Therefore, the final answer is,
IQR=73IQR = 732. Two machines AA and BB, produce metal rods of a certain type. The lengths, in metres, of 1919 rods produced by machine AA and 1919 rods produced by machine BB are measured. It is given that, the minimum value of AA is 0.2200.220, maximum value is 0.2540.254, lower quartile is 0.2310.231, upper quartile is 0.2450.245, median is 0.2380.238. It is given that, the minimum value of BB is 0.2110.211, maximum value is 0.2560.256, lower quartile is 0.2240.224, upper quartile is 0.2430.243, median is 0.2320.232.(9709/52/M/J/20 number 3)

(a) Draw box and whisker plots for AA and BB.

Sketch the box and whisker plots, with AA above BB, and label them. Ensure that the extreme values are the whiskers, and the box is marked by the lower and upper quartiles. Insert the lines for the median inside their respective boxes. Don’t forget to label the xx-axis.

Rendered by QuickLaTeX.com

(b) Hence make two comparisons between the lengths of the rods produced by machine AA and those produced by machine BB.

Since BB has a larger interquartile range (box) than AA, the length of rods it produces is more spread.
Since the median of BB is smaller than that of AA, the lengths of BB are, on average smaller than those of AA.