Overview

A genome is the full set of hereditary information that an organism carries. In most life forms this information is encoded in deoxyribonucleic acid (DNA), though some viruses store genetic material as ribonucleic acid (RNA). The genome encompasses both the sequences that encode genes and the non-coding regions that influence regulation, structure, and stability. For basic introductory material see accessible summaries of genome concepts.

Structure and components

Genomes are organized at multiple scales. Chromosomes are large DNA molecules that package sequence into discrete units; technical terms such as haploid and chromosome describe key aspects of that organization. In many species the nuclear genome—the complete set of nuclear DNA—is complemented by genomes of cellular organelles, such as the mitochondrial genome and the chloroplast genome in plants. Organelle genomes are typically smaller and often circular, and they contribute to cellular function and inheritance.

Variation, population concepts, and pangenomes

Within a species there is extensive genetic variation. Different versions of the same locus, called alleles, along with structural changes and mobile elements, make each individual distinct. Populations harbor many alleles and other variants, so the genome of a single individual represents only part of a species' total genetic diversity; population-level approaches consider many genomes together to describe that diversity and, more recently, to build a pangenome that captures shared and variable sequences across individuals or breeds. Population genetics and comparative genomics use samples from many genomes to infer evolutionary history and demographic processes in a population.

History and study

The term was coined in the early 20th century to refer to chromosome sets and has been broadened as molecular biology revealed the chemical basis of heredity. Early cytogenetics described karyotypes and chromosome behavior, while the later rise of sequencing transformed the field. Methods include Sanger sequencing, high-throughput short-read technologies, and newer long-read platforms; downstream steps such as genome assembly and annotation turn raw sequence data into maps of genes and features. Reference genomes provide representative sequences for many species, but they do not capture all individual variation.

Functional layers beyond sequence

Not all heritable or long-term regulatory effects are encoded solely in base sequence. Epigenetic marks, chromatin structure, and interactions among genes and regulatory elements modulate how the instructions in a genome are executed. These layers are studied alongside the primary sequence to understand development, cell differentiation, and responses to environment.

Applications and significance

Genome knowledge underpins diverse applications. In medicine it enables the identification of variants associated with disease risk, informs diagnosis and personalized treatment, and supports development of gene therapies. In agriculture, genomic tools assist breeding for yield, resilience, and quality. Conservation genetics uses genomic data to assess diversity, guide management of endangered species, and detect inbreeding. Other areas include forensics, ancestry testing, synthetic biology, and studies of microbial communities.

Ethics, data and challenges

Genomic data raise ethical and social issues such as privacy, informed consent, data sharing, and potential misuse. Technical challenges include assembling complex or repetitive regions, representing structural variation, and interpreting the effects of variants. Advances are expanding our ability to read and edit genomes, but scientific, ethical, and regulatory frameworks must keep pace.

Further reading

For general background and definitions consult introductory texts and authoritative reviews. Additional resources include historical discussions of the term and modern reviews of sequencing technology, interpretation, and applications. Relevant entry points: historical context, species concepts, and practical guides on genome analysis and annotation such as chromosome studies and methodological overviews at genome resources. For discussions of inheritance and cellular context see summaries on nuclear genomes, organellar genomes, and mitochondrial DNA. To explore population-level ideas and allele variation see entries on alleles and population genetics. For concise primers on sequencing and interpretation consult basic guides and technology reviews at DNA resources.