Overview

Descriptive statistics is the branch of statistics that organizes, summarizes and presents collections of data. Its purpose is to reduce a potentially large set of observations to a few informative numbers and graphics that reveal central tendency, variability and basic structure of the observed values without making probabilistic inferences beyond the sample.

Data types and scales

Summaries and visualizations depend on the type of measurement: nominal (categories), ordinal (ranked categories), interval and ratio (quantitative). Appropriate descriptive methods differ: counts and proportions suit categorical data, while means, medians and spreads apply to numeric data. A dataset should be examined for missing values, measurement scale and any grouping before computing summaries.

Common measures

  • Measures of location: mean, median, mode — describe a typical or central value.
  • Measures of spread: range, interquartile range (IQR), variance and standard deviation — quantify dispersion.
  • Shape descriptors: skewness and kurtosis provide summary information about asymmetry and tail weight, though they may be sensitive to outliers.
  • Percentiles and quantiles: divide data into ranked segments and help describe position within the distribution.

Tabulation and grouping

For large or continuous measurements it is common to group observations into classes and present frequency tables or relative frequencies. Grouped summaries can be displayed as a histogram or bar chart for categorical breakdowns. Careful choice of bin width and class boundaries affects interpretability.

Robust measures and transformations

Robust statistics (for example the median or trimmed means and the IQR) reduce the influence of extreme values. Transformations such as logarithms may stabilize variance or make skewed distributions more symmetric; transformed summaries should be interpreted on the transformed scale or converted back carefully.

Visual displays

  • Box plots: show median, quartiles and potential outliers.
  • Scatter plots: illustrate relationships between two quantitative variables.
  • Bar charts and pie charts: compare counts or proportions for categorical data.

Applications and limitations

Descriptive statistics are widely used in research, business, education and public communication to summarize observed data from a sample or a full population. They do not by themselves establish causation or generalize beyond the observations; for that, inferential methods are required. Common pitfalls include overreliance on single summary measures (which can hide subgroup differences), misuse of averages with skewed data, and misleading visual choices. Good practice combines multiple measures and clear graphics to present a balanced view of the data.

For introductory tutorials, examples and reproducible demonstrations see related resources: overview, example datasets, data types, sampling concepts and plotting guides such as histogram construction.