The ability to sequence nucleic acids, including DNA and RNA, has revolutionized our understanding of genetics and has vast implications across numerous scientific disciplines, particularly in disease research and organismal biology. By determining the composition and order of nucleic acids, researchers can gain profound insights into an organism's genetic makeup. Sequencing not only decodes the genetic instructions that guide protein synthesis but also enables scientists to manipulate and verify DNA sequences through various molecular technologies. This capability allows for the design of custom recombinant proteins with specific functional elements, which are widely used to explore critical biological processes and answer a broad array of research questions. For more information on whole-genome sequencing reagents and resources, please visit the Next-Generation Sequencing page.
Early on, analytical chemistry methods were able to determine nucleic acid composition. However, Fred Sanger and colleagues developed methods - initially using radiolabeled digested fragments and two-dimensional fractionation - to provide some of the first complete nucleic sequences. Advancements in the field continued for several decades, with each improvement adding to a growing library of sequenced proteins and genomes. These improvements include the separation of nucleotides by length using gel electrophoresis followed by capillary electrophoresis. Additionally, the incorporation of fluorescently-labeled nucleotides and automated computer analysis of each nucleic acid fragment all now define the modern Sanger sequencing method.
While the methods to sequence DNA have continued to improve since the origin of this technology, several fundamental components are still necessary and used by modern DNA sequencing platforms. DNA preparation typically includes isolation and purification of the DNA from the host organism. Additional critical components, including free DNA bases, DNA primers, modified DNA bases containing fluorescent tags (terminator bases), and DNA polymerase are added together into a single vessel. The vessel containing all these components through a series of heating and cooling steps will produce a library of small DNA sequences relative to the full-length DNA sequence of interest, each ending with a fluorescently end-labeled terminator base. The new DNA strands containing the fluorescently end-labeled nucleotides are separated by length, passed through a capillary tube, and arranged by size. A laser is then used to excite the fluorescent base on each strand while a camera captures the signal. Lastly, a computer is typically used to assemble the collected information into the full-length DNA sequence.