DNA — formally known as deoxyribonucleic acid — is the chemical molecule that stores and transmits genetic information in most living systems. It carries the instructions that direct development, physiology and reproduction across generations, and it is present in animals, plants, protists, archaea and bacteria. The basic chemical units of DNA are linked to form long chains; two such polynucleotide chains pair and coil to form the familiar double helix structure.

Structure and components

Each DNA strand is composed of a sugar‑phosphate backbone and four types of nitrogenous bases. The order of these bases encodes information that cells use to build molecules. Within a cell, DNA is the template for the production of proteins and functional RNAs. DNA sequences that make proteins are called genes, while other regions have regulatory or structural roles. The physical location of DNA varies with organism type: eukaryotic species store most DNA in a membrane‑bound nucleus and package it with proteins, whereas many prokaryotes keep DNA in the cytoplasm in a circular form.

Organization in cells

In eukaryotic cells DNA is organized into linear chromosomes. Those chromosomes are compacted by chromatin proteins, including histones, and are duplicated ahead of cell division through a process of replication. In contrast, bacteria and archaea typically have compact genomes with relatively little noncoding sequence and often possess circular chromosomes localized in the cytoplasm. Nuclear DNA in eukaryotes resides in the nucleus, although small genomes also exist in mitochondria and chloroplasts.

Not all DNA directly specifies proteins. Large amounts of genomic sequence can be non-coding DNA, some of which is transcribed into non-coding RNA molecules (for example transfer and ribosomal RNAs or regulatory RNAs) and some of which appears to have structural or regulatory roles only. The proportion of noncoding DNA varies widely among species; multicellular eukaryotes typically have more noncoding sequence than many microbes.

Information flow and replication

Genetic information flows from DNA to RNA to protein in processes called transcription and translation. Cells copy DNA by replication to pass information to daughter cells; replication is highly controlled to preserve sequence integrity but can also introduce variation. Viruses may carry genomes made of DNA or of RNA. DNA viruses often rely on cellular nuclear machinery to infect and replicate within host cells, while many RNA viruses replicate in the cytoplasm.

DNA is inherited when reproductive cells combine; children receive half their DNA from each parent, which explains familial resemblances such as certain physical traits. Mutations and recombination create genetic diversity that underlies evolution. Because DNA sequences differ between individuals and species, they are useful in forensics, ancestry studies, biodiversity surveys and breeding.

Historically, the discovery of DNA’s role in heredity and the determination of its double helical form transformed biology and medicine. Modern molecular biology and biotechnology apply DNA sequencing, editing and synthesis to diagnose disease, produce therapeutics and engineer organisms for agriculture and industry. For accessible introductions and resources, see general summaries about the molecule, the concept of the genetic code, and community resources on organisms (organisms, animals, plants, protists, archaea, bacteria). For focused topics see entries on cells, inheritance, genome structure (genomes), and technical discussions of viruses and infection dynamics. Further reading is widely available through educational sites and textbooks (deoxyribonucleic acid overview).