Next-generation sequencing (NGS) is a high-throughput method that allows scientists to sequence millions of DNA (or RNA) fragments simultaneously. It’s a game-changer compared to older Sanger sequencing, which handled one fragment at a time. Here’s how it works, step by step, in a way that’s clear but doesn’t skimp on the essentials:

  1. Sample Preparation
    DNA Fragmentation: The DNA sample (could be genomic DNA, ctDNA from a liquid biopsy, etc.) is broken into smaller pieces, typically 100-500 base pairs long. This can be done mechanically (e.g., sonication) or enzymatically.
    Library Prep: Adapters—short, known DNA sequences—are attached to both ends of these fragments. These adapters act like handles for the sequencing machine to recognize and amplify the DNA. If you’re sequencing RNA (like for gene expression), it’s first converted to cDNA using reverse transcription.
  2. Amplification
    PCR or Cluster Generation: To get enough material to sequence, the fragments are amplified. In platforms like Illumina (the most common NGS tech), this happens on a solid surface (a flow cell). Each fragment is bound to the surface via its adapters, and PCR creates clusters of identical DNA copies—think millions of tiny DNA colonies. This boosts the signal for detection.
  3. Sequencing
    Sequencing by Synthesis: Most NGS platforms, like Illumina, use this method. Here’s the gist:
    A primer attaches to the adapter, and DNA polymerase starts adding fluorescently labeled nucleotides (A, T, C, G) one by one to build the complementary strand.
    Each nucleotide has a unique color tag and a reversible terminator. When it’s added, the terminator stops the chain, the machine takes a picture to record the color (indicating the base), then the terminator is cleaved off, and the process repeats.
    Cameras capture these fluorescent signals across millions of clusters simultaneously, generating “reads”—short sequences of bases.
    Other Methods: Alternatives exist, like pyrosequencing (Roche 454, now less common) or nanopore sequencing (Oxford Nanopore), which reads DNA as it passes through a tiny pore, measuring electrical changes. But Illumina’s approach dominates due to its accuracy and scale.
  4. Data Output
    Reads: The machine spits out millions to billions of short reads (e.g., 150 base pairs each). The number of reads determines “coverage”—how many times each part of the genome is sequenced. More coverage = higher confidence.
    Alignment: These reads are computationally mapped back to a reference genome (or assembled de novo if there’s no reference) to figure out where they belong. Software like BWA or Bowtie handles this.
  5. Analysis
    Variant Calling: Differences between the reads and the reference—like mutations, insertions, or deletions—are identified using tools like GATK or VarScan. This is key for finding cancer mutations or genetic variants.
    Applications: Depending on the goal (e.g., whole-genome sequencing, targeted panels, RNA-seq), the data might reveal gene expression levels, structural changes, or specific disease markers.
    Why It’s Powerful
    Scale: NGS can sequence an entire human genome (3 billion base pairs) in a day or two for a few hundred bucks, compared to years and millions of dollars with older methods.
    Parallelism: By sequencing millions of fragments at once, it’s insanely efficient.
    Flexibility: It’s used for everything—diagnosing rare diseases, tracking cancer evolution, studying microbiomes, even ancient DNA.
    For example, in a liquid biopsy, NGS might sequence ctDNA to spot mutations like EGFR in lung cancer, with enough sensitivity to detect changes at 0.1% frequency if the coverage is deep enough. The catch? It generates massive data—think terabytes—so bioinformatics is half the battle.

Leave a Reply

Your email address will not be published. Required fields are marked *

Trending