Overview

The normal distribution, often called the Gaussian distribution, is a continuous probability distribution widely used to model measurements, errors, and many natural and social phenomena. Its graph is a symmetric, bell-shaped curve centered on a location parameter (the mean) and spread according to a scale parameter (the standard deviation). For general background see probability distribution.

Key characteristics

Members of the normal family are described by two parameters: the mean μ, which locates the center of the distribution, and the standard deviation σ, which controls its width. The probability density function for a value x is given by the familiar formula f(x) = (1/(σ√(2π))) exp(−(x−μ)²/(2σ²)), which determines relative likelihoods of different outcomes and is an example of a probability density. Important properties include symmetry about μ, unimodality (a single peak), and the equality mean = median = mode. The standard normal distribution has μ = 0 and σ = 1 and serves as a reference for computing z-scores.

Parameters and standardization

  • Mean (μ): central location; see mean.
  • Standard deviation (σ): typical distance from the mean; see standard deviation.
  • Scale and location: families of normals differ only by these two parameters; see location and scale.
  • Standardization: any normal variable X can be converted to the standard normal Z = (X−μ)/σ for tables or computation.

Historical development and theory

Ideas leading to the normal curve arose in the 18th and 19th centuries. Abraham de Moivre used a curve to approximate binomial probabilities, and Carl Friedrich Gauss applied the distribution to astronomical measurement errors and popularized methods that make use of it; for a short biography see Gauss. The prevalence of the normal form is explained in part by the central limit theorem, which states that sums (or averages) of many independent, small random effects tend toward a normal distribution under broad conditions; see central limit theorem. Mathematical results about moments, cumulants and generating functions make the normal distribution a convenient analytic tool.

Applications and limitations

The normal distribution is used to model heights, measurement errors, standardized test scores, and many sampling distributions that underpin confidence intervals and hypothesis tests. Practical examples include modeling adult human heights, IQ scores (commonly standardized to a normal with mean 100 and SD 15 in many test norms), and errors in physical experiments. It also appears in introductory models in finance and signal processing. However, not all data are well described by a normal: heavy-tailed phenomena, skewed distributions, and multimodal mixtures require different models or transformations; see bell curve for common visualizations and variance considerations.

Distinctions and practical notes

  1. Because the normal is determined entirely by two parameters, inference often reduces to estimating μ and σ and assessing fit.
  2. Goodness-of-fit tests, quantile–quantile plots, and robust statistics help detect departures from normality.
  3. Many statistical procedures assume normality for exact small-sample results but remain approximately valid under mild departures due to the central limit theorem; see a primer on means and scales.

For further reading on definitions, tables and computational methods consult introductory texts and reference entries dedicated to probability distributions and the normal law: distribution overview, technical references on density functions, and historical accounts of Gauss and early contributors such as de Moivre and Laplace (central limit context).