Frequency distribution is a fundamental descriptive tool in statistics used to summarize how often different values occur in a dataset. At its simplest, it lists each observed value together with a count of occurrences. For discrete variables this can be a straightforward tally; for continuous measurements it is usually presented in grouped intervals. A frequency distribution reveals patterns such as common values, gaps, and the overall shape of the data before formal modeling or inference.

Basic elements and types

A typical frequency distribution contains at least two columns: the value (or class interval) and the frequency (the number of observations in that value or interval). Variants and related concepts include:

  • Absolute frequency: raw counts of observations.
  • Relative frequency: proportions or percentages derived from absolute frequencies, useful for comparing samples of different sizes.
  • Cumulative frequency: running totals that show how many observations fall below or at each threshold.
  • Grouped frequency distribution: used for continuous or many-valued variables by aggregating values into classes or bins.

Constructing a distribution

For discrete variables, construction involves listing each possible value and counting its occurrences. For continuous variables, one chooses a sensible number of classes and interval widths, then counts observations falling into each bin. Several heuristic rules exist to guide the choice of class width and number of bins; the goal is to balance detail with readability so that the shape of the data is visible without excessive noise.

Presentation and visualization

Frequency distributions are presented as tables but are frequently visualized because graphics communicate shape more effectively. Common visual forms include histograms (for grouped continuous data), bar charts (for discrete or categorical data), frequency polygons, and cumulative frequency plots. Alternative text-based summaries include stem-and-leaf displays that preserve individual values while showing distributional form.

Uses, interpretation, and cautions

Frequency distributions are used to explore central tendency, dispersion, modality, skewness, and the presence of outliers before applying statistical models. They support quick comparisons between samples when converted to relative frequencies. Users should be cautious: the appearance of grouped distributions depends on chosen bin widths and endpoints, which can distort interpretation if selected poorly. When reporting results, it is often helpful to accompany a frequency table with relative and cumulative frequencies to give multiple perspectives.

The notion of a frequency distribution connects directly to probability distributions, which describe expected frequencies in large or theoretical populations. In applied work, a sample frequency distribution serves as the empirical counterpart to a parametric or nonparametric model. For definitions of related terms such as variable and more on exploratory data analysis, see entries or resources on variables and descriptive statistics.

Example note: a simple survey using a five-point agreement scale generates a discrete frequency distribution showing how many respondents selected each response; height measurements in a class are better summarized in grouped classes and visualized as a histogram to show overall spread.