Targeted Sequencing: Principle, Steps, Methods, Uses, Diagram

Targeted Sequencing: Principle and Overview

Targeted next-generation sequencing (tNGS), also known as targeted resequencing, is a powerful and precise molecular technique that has become a cornerstone of both clinical diagnostics and research genomics. The fundamental principle of targeted sequencing involves focusing the immense sequencing capacity of Next-Generation Sequencing (NGS) platforms only on a specific, preselected subset of the genome. These regions of interest can include the protein-coding regions (exons, as in exome sequencing), a curated panel of disease-associated genes, specific genomic hotspots, or even non-coding regulatory elements. This selective approach is what differentiates it most profoundly from Whole Genome Sequencing (WGS), which attempts to sequence the entire genome, and Whole Exome Sequencing (WES), which sequences all coding regions.

The primary advantage of this targeted focus is the ability to achieve significantly deeper sequencing coverage over the regions of interest with a smaller total number of sequencing reads. This ultra-deep coverage is crucial for confidently detecting genetic variants, particularly rare alleles, somatic mutations in tumors, or low-frequency mutations present in challenging samples like cell-free DNA (cfDNA) from liquid biopsies or formalin-fixed paraffin-embedded (FFPE) tissues. By concentrating resources, tNGS reduces the overall time, cost, and computational burden associated with data analysis compared to broader sequencing methods, making it an efficient choice for focused studies and routine clinical application.

The Core Methodology: Key Steps

While targeted sequencing utilizes the same core NGS technology as WGS, it incorporates an essential additional stage known as target enrichment. The process is generally broken down into four main stages:

Firstly, **Sample Preparation** involves the careful extraction of high-quality nucleic acids (DNA or RNA) from the biological sample. The integrity and purity of the nucleic acid are paramount, especially when dealing with degraded or low-input samples.

Secondly, **Library Preparation** prepares the DNA fragments for the sequencing platform. The isolated nucleic acids are fragmented into smaller, manageable pieces (typically through mechanical shearing or enzymatic digestion). Universal sequencing adapter sequences are then ligated to both ends of these fragments. Often, unique molecular identifiers (UMIs) and index sequences (barcodes) are added during this phase, which allows multiple samples to be pooled and sequenced simultaneously in a single run—a process called multiplexing—with the ability to trace each read back to its original sample.

Thirdly, **Target Enrichment** is the defining step of tNGS. It is here that the specific genomic regions of interest are isolated and enriched from the entire library. This step ensures that only the desired fragments are carried forward for sequencing, maximizing data utility. The two main strategies for this enrichment are detailed below.

Finally, **Sequencing and Data Analysis** involves loading the enriched library onto an NGS platform (like an Illumina system) for massively parallel sequencing. The output data (millions of short reads) is then computationally analyzed, which includes quality control, aligning the reads to a reference genome, and performing variant calling to identify single nucleotide variants (SNVs), small insertions/deletions (Indels), and other structural changes within the target regions.

Methodologies for Target Enrichment

The choice of target enrichment method impacts the workflow, cost, and ultimately, the quality and coverage homogeneity of the final data. The two principle methods are Hybridization Capture and Amplicon-Based Sequencing:

The **Hybridization Capture** method is a sophisticated, solution-based technique that relies on molecular binding. Biotinylated oligonucleotide probes are meticulously designed to be complementary to the specific genomic regions of interest. The adapter-ligated library is incubated with these probes, allowing the probes to hybridize (bind) to their target DNA fragments. The resulting probe/target hybrids are then selectively pulled down and isolated using streptavidin-coated magnetic beads, which bind strongly to the biotin tag on the probes. Non-specific and unbound DNA fragments are washed away, resulting in a sequencing-ready library highly enriched for the target DNA. Hybridization capture is favored for very large target regions, such as the whole exome, or large, custom gene panels, offering excellent coverage homogeneity, but it is typically more complex and costly per sample than the amplicon approach.

**Amplicon-Based Sequencing** (or Multiplex PCR) relies on the selective power of the Polymerase Chain Reaction (PCR). Primer pairs are designed to flank each region of interest. A highly multiplexed PCR reaction is then performed, simultaneously amplifying hundreds to thousands of target regions (amplicons) from the sample DNA. This method is faster, requires less input DNA, and involves fewer steps than hybridization capture, making it highly suitable for sequencing small target regions, such as a panel of cancer hot spots or pathogen genomes (like SARS-CoV-2). While extremely efficient and cost-effective, it can be susceptible to amplification bias and dropout if primer design is not perfectly optimized, making it less ideal for detecting copy number variations or sequencing highly polymorphic regions like the human leucocyte antigen (HLA) genes.

Clinical and Research Applications of Targeted Sequencing

Targeted sequencing has rapidly become indispensable across many fields due to its high accuracy and depth:

In **Oncology**, targeted panels are used to sequence genes known to be associated with cancer, allowing for the sensitive detection of low-frequency somatic mutations in tumors and tracking minimal residual disease via liquid biopsy. The deep coverage is critical for accurate variant calling in rare tumor subclones.

In **Rare and Inherited Disease Research**, tNGS is a cost-effective alternative to WGS. Researchers focus on panels containing known disease-causing genes, enabling the rapid and accurate identification of genetic variants responsible for a specific suspected condition.

For **Transplantation Medicine**, targeted resequencing of the polymorphic human leucocyte antigen (HLA) gene loci is crucial for high-resolution HLA typing, which is essential for matching donors and recipients in hematopoietic stem cell and solid organ transplantation.

Finally, its rapid adaptability makes it effective in **Infectious Disease Surveillance**, such as quickly developing panels to sequence and monitor specific variants or entire genomes of emerging pathogens like the SARS-CoV-2 virus.

Note on Diagram Generation

I am unable to generate a visual diagram illustrating the targeted sequencing process, as my current capabilities do not extend to creating charts, graphs, or visual representations.

×

Download PDF

Enter your email address to unlock the full PDF download.

Generating PDF...

Leave a Comment