What Are Boxplots
Through the visual representation of a dataset, boxplots are very informative and very useful when comparing distributions between two or more processes. Essentially, boxplots summarize the following statistics:
The Five-Number Summary
The Five-Number Summary is what is displayed graphically in a boxplot. The five numbers are:
Practical Example
A call centre manager wishes to know how long it is taking for three distinct branches (A, B, and C) to respond to client calls. The following comparative boxplots suggest that: 1) branch A has the greatest amount of variation (size of the box and extent of whiskers) however a lower median response time, 2) branches B and C have less variation however slower median response times, and 3) branches B and C showcase outliers in their processes; branch C in this case on both ends of the upper and lower whiskers.
Boxplot Limitations (Words of Caution)
Boxplots do not always display the number of data points being graphed. For example, one might be looking at two comparative boxplots, the first one with only 5 distinct numbers (the Five-Number Summary that is displayed in a boxplot), and another one with 500 data points, making it a stronger representation of that process’s central tendency and spread. So, use caution when interpreting boxplots!
The Boxplotly Playground
Check out this web app built by the author of this post to learn more about and interactively play with Boxplots online.