Alignment Methods

Sequence alignment is a method for arranging DNA, RNA, or protein sequences to identify similarities and infer relationships. It includes global and local alignment techniques, computed with algorithms such as Needleman-Wunsch and Smith-Waterman. Applications of sequence alignment include function prediction, gene finding, and database searching.

Uploaded by

anis442643

Sequence Alignment

Dr. Shazia Rehman

Sequence Alignment
Sequence alignment is a way of arranging DNA, RNA, or protein sequences to identify regions of similarity.
It helps in inferring functional, structural, or evolutionary relationships between the sequences.
Sequence alignment methods are used to find the best-matching sequences.
An alignment is made between a known sequence and an unknown sequence, or between two unknown sequences.
The known sequence is called the reference sequence; the unknown sequence is called the query sequence.
Sequence alignment is important for:

prediction of function
database searching
gene finding
sequence divergence
sequence assembly
Scoring system

• Simple alignment scores

• A simple way (but not the best) to score an alignment is to count 1
for each match and 0 for each mismatch.
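The simple scheme above can be sketched in a few lines of Python. This is a minimal illustration, not part of the original slides: the function name `simple_score` and the use of `-` as the gap character are assumptions made for the example.

```python
def simple_score(row_a, row_b, gap="-"):
    """Score two aligned rows: 1 for each matching column, 0 otherwise.

    Both rows must already be aligned, i.e. padded with gap characters
    to the same length. Gap columns score 0, like mismatches.
    """
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    return sum(1 for a, b in zip(row_a, row_b) if a == b and a != gap)


if __name__ == "__main__":
    # Two aligned rows with a three-column gap in the first one.
    print(simple_score("CT---AA", "CTCGCAA"))
```

Because mismatches and gaps both cost nothing here, this scheme cannot tell a poor alignment with many gaps from a compact one with the same number of matches, which is why it is "not the best".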
Types of Sequence Alignment
Based on sequence length – According to the length of the sequence region being compared, there are two types:
1) Global sequence alignment
2) Local sequence alignment
Global alignment programs are based on the Needleman-Wunsch algorithm and local alignment programs on the Smith-Waterman algorithm. Both algorithms are derived from the basic dynamic programming approach.
1) Global sequence alignment – In this method, the entire length of both sequences is considered and the sequences are matched end to end to obtain the best alignment. Global alignments are typically computed with the Needleman-Wunsch algorithm.
It can be of two types:
PSA (Pairwise sequence alignment)
MSA (Multiple sequence alignment)
A global alignment of two sequences X and Y is obtained by inserting gaps (spaces) into X and Y until both have the same length, so that every position in one sequence is paired with a position or a gap in the other.
Gaps
• A gap is a run of consecutive indels (insertions or deletions) in an alignment
• C T - - - A A
• C T C G C A A
Gaps represent
a) insertion or deletion events
b) sites with missing information
Scoring system
For example, consider the sequences
X = ACGCTGAT and Y = CAGCTAT.
[Figure: one possible global alignment of X and Y; not recovered]
With a scoring scheme of match score = 1, mismatch score = 0, and gap penalty = 0, the overall score of an alignment is the number of matching columns.
Needleman-Wunsch Algorithm

• The Needleman-Wunsch algorithm is one of the algorithms that use dynamic programming to obtain a global alignment.
• It was published by Needleman and Wunsch in 1970.
• It finds the best-scoring global alignment between two sequences under a given scoring scheme.
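A minimal Python sketch of the Needleman-Wunsch idea follows, assuming a linear gap penalty and the slide's default scheme (match = 1, mismatch = 0, gap = 0). The function and parameter names are chosen for this illustration only. When several alignments share the best score, the traceback returns just one of them.

```python
def needleman_wunsch(x, y, match=1, mismatch=0, gap=0):
    """Return (score, aligned_x, aligned_y) for one optimal global alignment."""
    n, m = len(x), len(y)

    def sub(i, j):
        # Substitution score for pairing x[i-1] with y[j-1].
        return match if x[i - 1] == y[j - 1] else mismatch

    # score[i][j] = best score for aligning x[:i] with y[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap          # x[:i] against gaps only
    for j in range(1, m + 1):
        score[0][j] = j * gap          # y[:j] against gaps only
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + sub(i, j),  # pair x[i-1] with y[j-1]
                score[i - 1][j] + gap,            # x[i-1] against a gap
                score[i][j - 1] + gap,            # y[j-1] against a gap
            )

    # Trace back from the bottom-right corner to recover one alignment.
    ax, ay = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(i, j):
            ax.append(x[i - 1]); ay.append(y[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            ax.append(x[i - 1]); ay.append("-"); i -= 1
        else:
            ax.append("-"); ay.append(y[j - 1]); j -= 1
    return score[n][m], "".join(reversed(ax)), "".join(reversed(ay))


if __name__ == "__main__":
    best, row_x, row_y = needleman_wunsch("ACGCTGAT", "CAGCTAT")
    print(best)
    print(row_x)
    print(row_y)
```

Under the 1/0/0 scheme the optimal score equals the length of the longest common subsequence, which is 6 for X = ACGCTGAT and Y = CAGCTAT. In practice a negative gap penalty is usually chosen so that alignments with fewer gaps are preferred.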
2) Local sequence alignment – In this method, the sequences are aligned to find the regions of strongest similarity between them, rather than being matched end to end.

• For example, consider two sequences X = GGTCTGATG and Y = AAACGATC.

[Figure: the subsequences to be considered (shown in bold on the slide) and the best local alignment; not recovered]
Scoring system
• With a scoring scheme of match score = 1, mismatch score = 0, and gap penalty = 0, the overall score of a local alignment is the number of matching columns.
Smith–Waterman algorithm
The Smith–Waterman algorithm is a well-known algorithm for local sequence alignment, that is, for determining similar regions between two nucleotide or protein sequences.
Instead of looking at the entire sequences, the Smith–Waterman algorithm compares segments of all possible lengths and optimizes the similarity measure.
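A minimal Python sketch of Smith–Waterman follows. Local alignment only becomes meaningful when mismatches and gaps cost something, so this sketch assumes match = 2, mismatch = -1, and a linear gap penalty of -1. These values and the function name are assumptions for the illustration, not the slide's scheme.

```python
def smith_waterman(x, y, match=2, mismatch=-1, gap=-1):
    """Return (score, aligned_x, aligned_y) for one best local alignment."""
    n, m = len(x), len(y)

    def sub(i, j):
        # Substitution score for pairing x[i-1] with y[j-1].
        return match if x[i - 1] == y[j - 1] else mismatch

    # h[i][j] = best score of a local alignment ending at x[i-1], y[j-1].
    # Clamping at 0 lets an alignment start anywhere.
    h = [[0] * (m + 1) for _ in range(n + 1)]
    best, best_pos = 0, (0, 0)
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            h[i][j] = max(
                0,
                h[i - 1][j - 1] + sub(i, j),
                h[i - 1][j] + gap,
                h[i][j - 1] + gap,
            )
            if h[i][j] > best:
                best, best_pos = h[i][j], (i, j)

    # Trace back from the highest-scoring cell until a zero is reached.
    ax, ay = [], []
    i, j = best_pos
    while i > 0 and j > 0 and h[i][j] > 0:
        if h[i][j] == h[i - 1][j - 1] + sub(i, j):
            ax.append(x[i - 1]); ay.append(y[j - 1]); i -= 1; j -= 1
        elif h[i][j] == h[i - 1][j] + gap:
            ax.append(x[i - 1]); ay.append("-"); i -= 1
        else:
            ax.append("-"); ay.append(y[j - 1]); j -= 1
    return best, "".join(reversed(ax)), "".join(reversed(ay))


if __name__ == "__main__":
    best, row_x, row_y = smith_waterman("GGTCTGATG", "AAACGATC")
    print(best)
    print(row_x)
    print(row_y)
```

The two differences from the global case are the floor of 0 in each cell and the traceback that starts at the maximum cell instead of the bottom-right corner.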
Based on number of sequences – According to the number of sequences being compared, there are two types:
1) Pairwise sequence alignment – This involves aligning two sequences to find the best region of similarity.
Seq 1 - 1 KTSSGNGAEDS 11
          |||||||||||
Seq 2 - 1 KTSSGNGAEDS 11
Pairwise alignment
1. Collect the two sequences
2. Align the sequences
3. Count the mutations in the alignment
4. Score the alignment
A pairwise alignment consists of a series of paired positions, one from each sequence.
There are three types of pairs:
(1) matches = the same nucleotide appears in both sequences.
(2) mismatches = different nucleotides are found in the two sequences.
(3) gaps = a base in one sequence is paired with a null (gap) in the other.
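The three pair types can be counted directly from two aligned rows and then turned into a score. The sketch below is illustrative only: the names `classify_columns` and `weighted_score`, the `-` gap character, and the default weights are assumptions, not part of the slides.

```python
from collections import Counter


def classify_columns(row_a, row_b, gap="-"):
    """Count match, mismatch, and gap columns in a pairwise alignment."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    counts = Counter(match=0, mismatch=0, gap=0)
    for a, b in zip(row_a, row_b):
        if a == gap or b == gap:
            counts["gap"] += 1        # a base paired with a null
        elif a == b:
            counts["match"] += 1      # same base in both sequences
        else:
            counts["mismatch"] += 1   # different bases
    return counts


def weighted_score(counts, match=1, mismatch=0, gap=0):
    """Combine the column counts into a single alignment score."""
    return (counts["match"] * match
            + counts["mismatch"] * mismatch
            + counts["gap"] * gap)


if __name__ == "__main__":
    counts = classify_columns("CT---AA", "CTCGCAA")
    print(dict(counts))
    print(weighted_score(counts, match=1, mismatch=-1, gap=-2))
```

Separating counting from scoring makes it easy to see how the same alignment ranks under different scoring schemes.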
Methods for pairwise alignment
Various methods used for pairwise alignment of nucleotide and protein
sequences are:
1) Dot plot – A graphical method for comparing two sequences that identifies regions of similarity and dissimilarity by the presence and absence of dots.
A dot matrix is a grid in which positions where the two DNA sequences carry the same nucleotide are marked with dots.
It is a pairwise sequence comparison generated by computer.
[Figure: dot plot illustration]
In a dot matrix, the nucleotides of one sequence are written from left to right along the top row, and those of the other sequence are written from top to bottom along the left column.
At every point where the two nucleotides are the same, a dot is placed at the intersection of the row and column.
Runs of dots along a diagonal indicate regions of similarity; the resulting picture is called a dot plot.
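The construction described above can be sketched as a small Python function. This is a plain, unfiltered dot matrix; the names `dot_matrix` and `render` are made up for this example, and real dot plot tools usually add a sliding window and threshold to suppress noise.

```python
def dot_matrix(top, side):
    """Return a grid of booleans: grid[r][c] is True where side[r] == top[c]."""
    return [[s == t for t in top] for s in side]


def render(top, side):
    """Draw the matrix as text, with '*' for a dot and '.' for an empty cell."""
    lines = ["  " + " ".join(top)]
    for s, row in zip(side, dot_matrix(top, side)):
        lines.append(s + " " + " ".join("*" if dot else "." for dot in row))
    return "\n".join(lines)


if __name__ == "__main__":
    # Sequence written along the top row, then the one down the left column.
    print(render("GGTCTGATG", "AAACGATC"))
```

Diagonal runs of `*` in the printed grid correspond to stretches that the two sequences share.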
Dynamic Programming Method

Dynamic programming is a method that determines the optimal alignment by comparing all possible pairs of characters between the two sequences.
It is fundamentally similar to the dot matrix method in that it also creates a two-dimensional alignment grid.
However, it finds the alignment in a more quantitative way, by assigning scores to matches, mismatches, and gaps.
Heuristic method – When a single sequence is to be compared against a whole database, heuristic methods such as BLAST and FASTA are used.
Multiple Sequence Alignment
Multiple sequence alignment – This involves aligning more than two (protein or DNA) sequences, and is used to assess sequence conservation in protein domains and protein structures.
It is an extension of pairwise sequence alignment in which several similar sequences are aligned together.
Example –
Seq 1 - PQGGGGWGQ
Seq 2 - PHGGGWGQ
Seq 3 - PHGGGWGQ
Seq 4 - PHGGGWGQ
Seq 5 - PHGGGWGQ
Tools and software for MSA
• Many tools:
• Clustal (ClustalW, ClustalX, Clustal Omega, etc.)
• T-Coffee
• MAFFT
• MUSCLE
Software
• MEGA
• BioEdit
CLUSTAL program
Working of Clustal
There are two Clustal (ver. 2) programs:
(1) ClustalW (has a command-line user interface)
and (2) ClustalX (has a GUI)
Clustal Omega is the latest addition to the Clustal family.
This high-capacity program can align hundreds of thousands of sequences in only a few hours.
It is preferable to work with protein sequences rather than nucleotide sequences.
ClustalW
ClustalW uses a progressive method of alignment:
All pairs of sequences are aligned separately in order to calculate a distance matrix giving the distance between each pair of sequences.
A guide tree is calculated from the distance matrix.
The sequences are progressively aligned according to the branching order in the guide tree.
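The first two steps (distance matrix, then guide tree) can be illustrated with a toy Python sketch. This is not ClustalW's actual code. As loudly simplified stand-ins, the pairwise distance is taken as 1 minus the longest-common-subsequence length divided by the longer sequence length, and the tree is built by simple average-linkage clustering. All function names are hypothetical.

```python
from itertools import combinations


def lcs_len(a, b):
    """Length of the longest common subsequence of a and b."""
    prev = [0] * (len(b) + 1)
    for ca in a:
        cur = [0]
        for j, cb in enumerate(b, 1):
            cur.append(prev[j - 1] + 1 if ca == cb else max(prev[j], cur[j - 1]))
        prev = cur
    return prev[-1]


def distance(a, b):
    """Stand-in pairwise distance: 0 for identical sequences, up to 1."""
    return 1 - lcs_len(a, b) / max(len(a), len(b))


def guide_tree(seqs):
    """Build a nested-tuple guide tree from a {name: sequence} dict."""
    # Step 1: distance matrix over all pairs of sequences.
    d = {frozenset(p): distance(seqs[p[0]], seqs[p[1]])
         for p in combinations(seqs, 2)}

    def avg(c1, c2):
        # Average distance between the members of two clusters.
        return sum(d[frozenset((a, b))] for a in c1 for b in c2) / (len(c1) * len(c2))

    # Step 2: repeatedly merge the two closest clusters.
    clusters = [(name,) for name in seqs]
    tree = {c: c[0] for c in clusters}
    while len(clusters) > 1:
        c1, c2 = min(combinations(clusters, 2), key=lambda p: avg(*p))
        clusters.remove(c1)
        clusters.remove(c2)
        merged = c1 + c2
        tree[merged] = (tree[c1], tree[c2])
        clusters.append(merged)
    return tree[clusters[0]]


if __name__ == "__main__":
    example = {"s1": "PQGGGGWGQ", "s2": "PHGGGWGQ", "s3": "PHGGGWGQ"}
    print(guide_tree(example))
```

The nested tuples give the branching order: the innermost pair would be aligned first, and the remaining sequences or groups are then added progressively, which is the third step described above.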
[Figures: ClustalW steps; ClustalW output]
The bottom row of the ClustalW multiple sequence alignment output contains stars (*), colons (:), and dots (.).
A star (*) below a column indicates a fully conserved (invariant) amino acid residue.
A colon (:) denotes that all the residues in the column have roughly the same size and hydrophobicity (strongly similar properties).
A dot (.) signifies that the residues in the column are similar in either size or hydrophobicity (weakly similar properties), while the lack of a symbol indicates that the residues in the column differ in both size and hydrophobicity.
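The star row can be reproduced with a short Python sketch. Only the fully conserved marker (*) is computed here. The colon and dot markers depend on residue similarity groups that are not given in the slides, so they are deliberately left out. The function name and the hand-aligned example rows are assumptions for the illustration.

```python
def conservation_row(aligned_rows, gap="-"):
    """Return a string with '*' under every fully conserved column.

    aligned_rows must all have the same length. Columns containing a gap
    or more than one distinct residue get a space.
    """
    if len({len(row) for row in aligned_rows}) != 1:
        raise ValueError("all aligned rows must have the same length")
    marks = []
    for column in zip(*aligned_rows):
        conserved = len(set(column)) == 1 and gap not in column
        marks.append("*" if conserved else " ")
    return "".join(marks)


if __name__ == "__main__":
    # Hand-aligned rows (an assumed alignment, with one gap inserted).
    rows = ["PQGGGGWGQ", "PHGGG-WGQ", "PHGGG-WGQ"]
    for row in rows:
        print(row)
    print(conservation_row(rows))
```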
Application of Sequence alignment
