Advance Sequencing Technology
Advance Sequencing Technology
Advance Sequencing Technology
Dr. M.Vinoth,
Animal biotechnology
0
INTRODUCTION
1
Recently, another two massively parallel systems were announced: the Helicos
HeliscopeTM (www.helicosbio.com) and Pacific Biosciences SMRT
(www.pacificbiosciences.com) instruments.
Application
Discover all types of genetic variation: SNPs, insertions, deletions, copy
number variants, and rearrangements
Use targeted sequencing of association or linkage peaks to identify
variants that cause disease
Characterize new bacterial isolates by de novo sequencing and re-
sequencing
Resequence a collection of samples from any population or species
Profile DNA methylation status across the entire genome
Define somatic variations in cancer
Characterize complex RNA populations for new genes and transcript
structures
Create new applications enabled by massively parallel sequencing
Gene expression
Structural variation analysis
DNA Sequencing
Transcriptome Analysis
2
In 2008, 454 Sequencing launched the GS FLX Titanium ability to sequence
400-600 million base pairs per run with 400-500 base pair read lengths.
Sequencing Chemistry
In pyrosequencing
3
If a nucleotide complementary to the template strand is flowed into a
well, the polymerase extends the existing DNA strand by adding
nucelotide(s).
Demerit
However, the calibrated base calling cannot properly interpret long stretches
(>6) of the same nucleotide (homopolymer run), so these areas are prone to
base insertion and deletion errors during base calling.
Each incorporation step is nucleotide specific, substitution errors are
rarely encountered in Roche/454 sequence reads. The FLX instrument currently
provides 100 flows of each nucleotide during an 8-h run, which produces an
average read length of 250 .These raw reads are processed by the 454 analysis
software and then screened by various quality filters to remove poor-quality
sequences.
The resulting reads yield 100 Mb of quality data on
average.,FLXreads are of sufficient length to assemble small genomes such as
bacterial and viral genomes to high quality and contiguity
4
Figure 1
The method used by the Roche/454 sequencer to amplify single-stranded
that contain PCR reactants surrounded by oil, and pipetted into a 96-well
DNA.
5
6
VisiGen Biotechnologies
VisiGen Biotechnologies, Inc. is developing a revolutionary approach
for DNA sequencing that will enable rapid, inexpensive whole genome
sequencing.
7
Requires minimal sample amount that is prepared in a non-
destructive way.
Employs direct analysis and eliminates the need for cloning or
amplification.
Eliminates sequencing reaction processing and electrophoresis.
Does not destroy the original template material.
Does not produce an altered DNA polymer that may negatively
impact polymerase activity
8
cluster fragments specifically, the nucleotides carry a base-unique
fluorescent label and the 3_-OH group is chemically blocked such that
each incorporation is a unique event.
An imaging step follows each base incorporation step, during which each
flow cell lane is imaged in three 100-tile segments by the instrument
optics at a cluster density per tile of 30,000.
After each imaging step, the 3_ blocking group is chemically removed to
prepare each strand for the next incorporation by DNA polymerase.
This series of steps continues for a specific number of cycles, as
determined by user-defined instrument settings, which permits discrete
read lengths of 25–35 bases.
A base-calling algorithm assigns sequences and associated quality values
to each read and a quality checking pipeline evaluates the Illumina data
from each run, removing poor-quality sequences.
Figure 2
The Illumina sequencing-by-synthesis approach. Cluster strands created by
bridge amplification are primed and all four fluorescently labeled, 3_-OH
blocked nucleotides are added to the flow cell with DNA polymerase. The
cluster strands are extended by one nucleotide. Following the incorporation
step, the unused nucleotides and DNA polymerase molecules are washed away,
a scan buffer is added to the flow cell, and the optics system scans each lane of
the flow cell by imaging units called tiles. Once imaging is completed,
chemicals that effect cleavage of the fluorescent labels and the 3_-OH blocking
groups are added to the flow cell, which prepares the cluster strands for another
round of fluorescent nucleotide incorporation.
9
Illumina Genome Analyzer
10
11
12
Applied Biosystems SOLiDTM Sequencer
The SOLiD platform uses an adapter-ligated fragment library similar to
those of the other next-generation platforms, and uses an emulsion PCR
approach with small magnetic beads to amplify the fragments for
sequencing.
Unlike the other platforms, SOLiD uses DNA ligase and a unique
approach to sequence the amplified fragments .
Two flow cells are processed per instrument run, each of which can be
divided to contain different libraries in up to four quadrants.
Read lengths for SOLiD are user defined between 25–35 bp, and each
sequencing run yields between 2–4 Gb of DNA sequence data.
Once the reads are base called, have quality values, and low-quality
sequences have been removed, the reads are aligned to a reference
genome to enable a second tier of quality evaluation called two-base
encoding.
Disadvantage
(a) The length of a sequence read from all current nextgeneration platforms is
much shorter than that from a capillary sequencer
(b) Each nextgeneration read type has a unique error model different from that
already established for capillary sequence reads.
Both differences affect how the reads are utilized in bioinformatic analyses,
depending upon the application. For example,in strain-to-reference comparisons
(resequencing), the typical definition of repeat content must be revised in the
context of the shorter read length. In addition, a much higher read coverage or
sampling depth is required for comprehensive resequencing with short reads to
adequately cover the reference sequence at the depth and low gap size needed.
13
Figure 3
(a) The ligase-mediated sequencing approach of the Applied Biosystems
sequencing reactions, and are then deposited onto a flow cell slide. Ligase-
sequences on each amplified fragment, and then DNA ligase is provided along
with specific fluorescentlabeled 8mers, whose 4th and 5th bases are encoded by
detection, after which a regeneration step removes bases from the ligated 8mer
ligated 8mer identifies a two-base combination, the resulting sequence reads can
reference sequence.
14
Applied Biosystems SOLiDTM Sequencer
15
16
Comparison of Roche, Illumina, and Applied Biosystems
NGS Platforms.
17
HeliScope™ Single Molecule Sequencer
Principle:
Within two flow cells, billions of single molecules of sample DNA are captured
on an application-specific proprietary surface. These captured strands serve as
templates for the sequencing-by-synthesis process:
Polymerase and one fluorescently labeled nucleotide (C, G, A or T) are
added.
The polymerase catalyzes the sequence-specific incorporation of
fluorescent nucleotides into nascent complementary strands on all the
templates.
After a wash step, which removes all free nucleotides, the incorporated
nucleotides are imaged and their positions recorded.
The fluorescent group is removed in a highly efficient cleavage process,
leaving behind the incorporated nucleotide.
The process continues through each of the other three bases.
Multiple four-base cycles result in complementary strands greater than 25
bases in length synthesized on billions of templates—providing a greater
than 25-base read from each of those individual templates.
18
images to observe sequencing-by-synthesis reactions for billions of individual
DNA molecules in parallel.
Advantage :
Detects incorporation of individual bases on individual strands,
eliminating the need for amplification and its associated biases and errors.
Produces “digital data”, avoiding errors caused by de-phasing or intensity
averaging techniques, and thereby generates consistently low error rates
across all read lengths, resulting in high quality experiments.
Offers simple sample preparation and high sample capacity per run,
A throughput performance almost 100X greater than Sanger methods, and
faster than any of the "next-generation" methodologies.
19
A schematic of the sequencing-by-
synthesis approach used by Helicos
Biosciences. (Courtesy of Helicos
Biosciences.)
Instead of inspecting DNA copies after polymerase has done its work, SMRT
sequencing watches the enzyme in real time as it races along and copies an
individual strand stuck to the bottom of a tiny well. Every nucleotide used to
make the copy is attached to its own fluorescent molecule that lights up when
the nucleotide is incorporated. This light is spotted by a detector that identifies
the color and the nucleotide -- A, C, G, or T. By repeating this process
simultaneously in many wells, the technology hopes to bring about a substantial
boost in sequencing speed.
20
Advantage
The SMRT Cell, which enables single molecule, real-time observation of
individual fluorophores against a dense background of labeled nucleotides while
maintaining a high signal-to-noise ratio.
Phospholinked nucleotides, which enable long readlengths by producing a
completely natural DNA strand through fast, accurate, and processive DNA
synthesis, and
A novel detection platform that enables single molecule, real-time detection
as well as flexibility in run configurations and applications
21
Both excitation and detection occur without interruption through the transparent
glass bottom of the SMRT Cell.
CONCLUSION
22
Reference
http://www.454.com/enablingtechnology/the-system.asp
http://www.illumina.com/pages.ilmn?ID=203
http://marketing.appliedbiosystems.com
www.helicosbio.com
www.pacificbiosciences.com
http://www.dnamicroarray.com/products_stemcells.html
Arjournals.annualreviews.org
23