A DNA Sequencing History

Major landmarks in DNA sequencing and molecular biology

Strukturformel eines DNA-Ausschnittes (Wikipedia)

Discovery of the structure of the DNA double helix (Watson, Crick, Franklin).

Prove the semi-conservative nature of dna replication (Meselson, Stahl)

First dna triplet is decoded (Matthei, Nierenberg)

Development of recombinant DNA technology, which permits isolation of defined fragments of DNA; prior to this, the only accessible samples for sequencing were from bacteriophage or virus DNA.

The first gene is sequenced

The first complete DNA genome to be sequenced is that of bacteriophage φX174

Allan Maxam and Walter Gilbert publish “DNA sequencing by chemical degradation” [4].
Fred Sanger, independently, publishes “DNA sequencing by enzymatic synthesis”.

Fred Sanger and Wally Gilbert receive the Nobel Prize in Chemistry

Genbank starts as a public repository of DNA sequences.

Andre Marion and Sam Eletr from Hewlett Packard start Applied Biosystems in May, which comes to dominate automated sequencing.

Akiyoshi Wada proposes automated sequencing and gets support to build robots with help from Hitachi.

Restriction fragment length polymorphism fingerprinting (Jeffreys)

Medical Research Council scientists decipher the complete DNA sequence of the Epstein-Barr virus, 170 kb.

Kary Mullis and colleagues develop the polymerase chain reaction, a technique to replicate small fragments of DNA

Leroy E. Hood’s laboratory at the California Institute of Technology and Smith announce the first semi-automated DNA sequencing machine.

Applied Biosystems markets this first automated sequencing machine, the model ABI 370.

Walter Gilbert leaves the U.S. National Research Council genome panel to start Genome Corp., with the goal of sequencing and commercializing the data.

The U.S. National Institutes of Health (NIS) begins large-scale sequencing trials on Mycoplasma capricolum, Escherichia coli, Caenorhabditis elegans, and Saccharomyces cerevisiae (at 75 cents (US)/base).

BLAST algorithm for aligning sequences published (Lipman, Myers).

Capillary electrophoresis published (Barry Karger, Lloyd Smith, Norman Dovichi).

Official start of the Human Genome Project

Craig Venter develops strategy to find expressed genes with ESTs (Expressed Sequence Tags).

Uberbacher develops GRAIL, a gene-prediction program.

Craig Venter leaves NIH to set up The Institute for Genomic Research (TIGR).

William Haseltine heads Human Genome Sciences, to commercialize TIGR products.

Wellcome Trust begins participation in the Human Genome Project.

Simon et al. develop BACs (Bacterial Artificial Chromosomes) for cloning.

First chromosome physical maps published:
-Page et al. – Y chromosome[28];
-Cohen et al. chromosome 21[29].
-Lander – complete mouse genetic map[30];
-Weissenbach – complete human genetic map[31].

Wellcome Trust Sanger Institute (original file)

Wellcome Trust and MRC open Sanger Centre, near Cambridge, UK.

The GenBank database migrates from Los Alamos (DOE) to NCBI (NIH).

Venter, Fraser and Smith publish first sequence of free-living organism, Haemophilus influenzae (genome size of 1.8 Mb).

Richard Mathies et al. publish on sequencing dyes (PNAS, May)[32].

Michael Reeve and Carl Fuller, thermostable polymerase for sequencing[8].

International HGP partners agree to release sequence data into public databases within 24 hours.

International consortium releases genome sequence of yeast S. cerevisiae (genome size of 12.1 Mb).

Yoshihide Hayashizaki’s at RIKEN completes the first set of full-length mouse cDNAs.

Blattner, Plunkett et al. publish the sequence of E. coli (genome size of 5 Mb)[33]

First cloned animal, Sheep “Dolly”, is born (Wilmut)

Phil Green and Brent Ewing of Washington University publish ìphredî for interpreting sequencer data (in use since ë95)[34].

Venter starts new company (Celera), will sequence HG in 3 yrs for $300m.

Wellcome Trust doubles support for the HGP to $330 million for 1/3 of the sequencing.

NIH & DOE goal: “working draft” of the human genome by 2001.

Sulston, Waterston et al finish sequence of C. elegans (genome size of 97Mb)[35].

NIH moves up completion date for rough draft, to spring 2000.

NIH launches the mouse genome sequencing project.

First sequence of human chromosome 22 published[36].

Celera and collaborators sequence fruit fly Drosophila melanogaster (genome size of 180Mb) – validation of Venter’s shotgun method. HGP and Celera debate issues related to data release.

HGP consortium publishes sequence of chromosome 21.[37]

HGP & Celera jointly announce working drafts of HG sequence, promise joint publication.

Estimates for the number of genes in the human genome range from 35,000 to 120,000.

International consortium completes first plant sequence, Arabidopsis thaliana (genome size of 125 Mb).

HGP consortium publishes Human Genome Sequence draft in Nature (15 Feb)[38].

Celera publishes the Human Genome sequence[39].

HapMap project initiated to decipher human genetic variation

420,000 VariantSEQr human resequencing primer sequences published on new NCBI Probe database.

Genographic project launched to study human migration

A set of closely related species (12 Drosophilidae) are sequenced, launching the era of phylogenomics.

Craig Venter publishes his full diploid genome


Source: Wikipedia and ABI