Statistics Basics: Mean, Median, Mode, and Measures of Spread
Statistics is the branch of mathematics that deals with collecting, organising, analysing, and interpreting data. Whether you are a student working through a textbook, a data analyst examining survey results, or a researcher testing hypotheses, a solid grasp of the basics is essential. This guide covers the fundamental measures of central tendency and spread that form the backbone of statistical analysis.
Measures of Central Tendency
Central tendency measures tell you where the "centre" of a dataset lies. The three most common are the mean, median, and mode.
The Mean (Average)
The arithmetic mean is the sum of all values divided by the number of values:
Example: For the dataset :
The mean is sensitive to extreme values (outliers). A single very large or very small value can pull the mean significantly in its direction.
The Median
The median is the middle value when the data is sorted in order. If there is an even number of values, the median is the mean of the two middle values.
Odd count example: For , the median is (the third value out of five).
Even count example: For :
The median is robust against outliers. For income data or house prices, the median is often more representative than the mean.
The Mode
The mode is the value that appears most frequently in a dataset. A dataset can have no mode (all values unique), one mode (unimodal), or multiple modes (bimodal, multimodal).
Example: For , the mode is (appears three times).
The mode is the only measure of central tendency that works with categorical (non-numeric) data. For example, the most popular colour chosen in a survey is the mode.
Measures of Spread
While central tendency tells you where the data is centred, measures of spread (or dispersion) tell you how spread out the values are.
Range
The simplest measure of spread:
For , the range is . The range is easy to calculate but is heavily influenced by outliers.
Variance
Variance measures the average squared deviation from the mean. There are two versions:
Population variance (when you have the entire population):
Sample variance (when you have a sample from a larger population):
The in the sample formula is called Bessel's correction. It provides an unbiased estimate of the population variance.
Standard Deviation
Standard deviation is the square root of variance and is expressed in the same units as the original data:
Full Worked Example
Dataset:
Step 1: Calculate the mean.
Step 2: Find each deviation from the mean.
Step 3: Square each deviation.
Step 4: Calculate the variance.
Step 5: Calculate the standard deviation.
Interquartile Range (IQR)
The IQR measures the spread of the middle 50% of the data:
Where is the first quartile (25th percentile) and is the third quartile (75th percentile). The IQR is useful for identifying outliers: any value below or above is typically considered an outlier.
When to Use Which Measure
- Symmetric data with no outliers: Use the mean and standard deviation.
- Skewed data or data with outliers: Use the median and IQR, as they are more robust.
- Categorical data: Use the mode.
- Comparing datasets with different scales: Use the coefficient of variation: .
Z-Scores: Standardising Data
A z-score tells you how many standard deviations a value is from the mean:
A z-score of means the value is two standard deviations above the mean. Z-scores are essential for comparing values from different distributions and for working with the normal distribution. Read more in our z-scores guide.
Common Mistakes
- Using the mean for skewed data. Income distributions are typically right-skewed. The mean salary will be higher than what most people earn. Use the median instead.
- Forgetting Bessel's correction. When working with a sample, divide by , not .
- Confusing variance and standard deviation. Variance is in squared units. Standard deviation is in the original units and is generally easier to interpret.
- Ignoring context. A standard deviation of 5 means very different things for exam scores (out of 100) versus heights (in centimetres).
Try It Yourself
Use our free Standard Deviation Calculator to compute the mean, variance, and standard deviation for any dataset. For more advanced analysis, try the Mean, Median, Mode Calculator or explore z-scores with our Z-Score Calculator.
Related Articles
Understanding Z-Scores and Normal Distribution
Learn what z-scores mean, how the bell curve works, and how to use z-score tables. Includes worked examples and links to our standard deviation calculator.
Trigonometry Basics: Sin, Cos, Tan, the Unit Circle, and Key Identities
A complete introduction to trigonometry covering sine, cosine, tangent, the unit circle, radians, Pythagorean identity, angle addition formulas, and the laws of sines and cosines.
P-Value Explained: What It Means and How to Calculate It
Understand what p-values really mean, how to calculate them from z-scores and t-scores, significance levels, and the most common misconceptions researchers make.