Bioinformatics Chapter 1
Begin?
Algorithmic Warm-Up
ACAAATTTGCATAATTTCGGGAAATTTCCT
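One way to read the warm-up is as a counting exercise: how many times does a short pattern occur in the string above, overlaps included? The following is a minimal Python sketch. The function name `pattern_count` and the choice of AATTT as the pattern are assumptions made for illustration, not taken from the slides.

```python
def pattern_count(text: str, pattern: str) -> int:
    """Count occurrences of pattern in text, allowing overlapping matches."""
    k = len(pattern)
    # Slide a window of length k across text and compare at every start index.
    return sum(1 for i in range(len(text) - k + 1) if text[i:i + k] == pattern)


text = "ACAAATTTGCATAATTTCGGGAAATTTCCT"
print(pattern_count(text, "AATTT"))  # 3
```

A sliding window is used instead of `str.count` because `str.count` skips overlapping matches.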
atcaatgatcaacgtaagcttctaagcatgatcaaggtgctcacacagtttatccacaacctgagtgg
atgacatcaagataggtcgttgtatctccttcctctcgtactctcatgaccacggaaagatgatcaag
agaggatgatttcttggccatatcgcaatgaatacttgtgacttgtgcttccaattgacatcttcagc
gccatattgcgctggccaaggtgacggagcgggattacgaaagcatgatcatggctgttgttctgttt
atcttgttttgactgagacttgttaggatagacggtttttcatcactgactagccaaagccttactct
gcctgacatcgaccgtaaattgataatgaatttacatgcttccgcgacgatttacctcttgatcatcg
atccgattgaagatcttcaattgttaattctcttgcctcgactcatagccatgatgagctcttgatca
tgtttccttaaccctctattttttacggaagaatgatcaagctgctgctcttgatcatcgtttc
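A natural first experiment on a region like this is to list its most frequent k-mers. The sketch below is one simple way to do that, assuming a plain counting approach. The name `frequent_words` is hypothetical, and the demo uses the short warm-up string instead of the full region.

```python
from collections import Counter


def frequent_words(text: str, k: int) -> list[str]:
    """Return every k-mer that occurs the maximum number of times in text."""
    text = text.upper()  # treat lower-case and highlighted upper-case bases alike
    counts = Counter(text[i:i + k] for i in range(len(text) - k + 1))
    if not counts:  # k is longer than the text
        return []
    best = max(counts.values())
    return sorted(kmer for kmer, n in counts.items() if n == best)


print(frequent_words("ACAAATTTGCATAATTTCGGGAAATTTCCT", 5))  # ['AATTT']
```

To try it on the region above, join the sequence lines into one string and call the function with k = 9.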
Too Many Frequent Words – Which One is a Hidden Message?
atcaatgatcaacgtaagcttctaagcATGATCAAGgtgctcacacagtttatccacaacctgagtgg
atgacatcaagataggtcgttgtatctccttcctctcgtactctcatgaccacggaaagATGATCAAG
agaggatgatttcttggccatatcgcaatgaatacttgtgacttgtgcttccaattgacatcttcagc
gccatattgcgctggccaaggtgacggagcgggattacgaaagcatgatcatggctgttgttctgttt
atcttgttttgactgagacttgttaggatagacggtttttcatcactgactagccaaagccttactct
gcctgacatcgaccgtaaattgataatgaatttacatgcttccgcgacgatttacctCTTGATCATcg
atccgattgaagatcttcaattgttaattctcttgcctcgactcatagccatgatgagctCTTGATCA
TgtttccttaaccctctattttttacggaagaATGATCAAGctgctgctCTTGATCATcgtttc
ATGATCAAG
|||||||||
TACTAGTTC
ATGATCAAG and CTTGATCAT are reverse complements and likely DnaA boxes (DnaA does not care which strand it binds to).
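The pairing can be checked mechanically: complement each base, then reverse the result. A small Python sketch, where the names `COMPLEMENT` and `reverse_complement` are chosen here for illustration:

```python
# Translation table mapping each base to its complement (both letter cases).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Complement every base, then reverse so the result reads 5' to 3'."""
    return dna.translate(COMPLEMENT)[::-1]


print(reverse_complement("ATGATCAAG"))  # CTTGATCAT
```

Because a k-mer and its reverse complement mark the same site on opposite strands, their counts can be added together when looking for DnaA boxes.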
DNA Strands Have Directions
[Figure: the two DNA strands, labeled 5’ to 3’ and 3’ to 5’, with oriC and terC marked]
Four DNA Polymerases Do the Job
[Figure: oriC and terC, with strands labeled 5’ to 3’ and 3’ to 5’]
Continue as Replication Fork Enlarges
[Figure: strands labeled 5’ to 3’ and 3’ to 5’]
Big problem replicating forward half-strands
No problem replicating reverse half-strands (thin
If You Were a UNIDIRECTIONAL DNA Polymerase, How Would You Replicate a Genome?
[Figure: strands labeled 5’ to 3’ and 3’ to 5’]
Wait until the Fork Opens and Replicate
[Figure: strands labeled 5’ to 3’ and 3’ to 5’]
Wait until the Fork Opens Even More and… Replicate
[Figure label: Okazaki fragments]
Wait until the Fork Opens Even More and… REPLICATE!
[Figure label: Okazaki fragments]
[Figure labels: C high / G low; C low / G high]
You walk along the genome and see that #G - #C has been decreasing and then suddenly starts increasing.
CATGGGCATCGGCCATACGCC
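The running value of #G - #C along this example string can be computed in a single pass. A minimal Python sketch, assuming the skew starts at 0 before the first base and that A and T leave it unchanged. The function name `skew` is an illustrative choice.

```python
def skew(genome: str) -> list[int]:
    """Return the running #G - #C after each prefix of genome (starting at 0)."""
    values = [0]
    for base in genome.upper():
        if base == "G":
            delta = 1
        elif base == "C":
            delta = -1
        else:  # A, T, or any other symbol leaves the skew unchanged
            delta = 0
        values.append(values[-1] + delta)
    return values


print(skew("CATGGGCATCGGCCATACGCC"))
# [0, -1, -1, -1, 0, 1, 2, 1, 1, 1, 0, 1, 2, 1, 0, 0, 0, 0, -1, 0, -1, -2]
```

Plotting these values against position gives the skew diagram.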
Skew Diagram of E. coli: Where is the Origin of Replication?
[Figure label: oriC]
You walk along the genome and see that #G - #C has been decreasing
and then suddenly starts increasing: WHERE ARE YOU IN THE
We Found the Replication Origin in E. coli BUT…
The minimum of the Skew Diagram
points to this region in E. coli:
aatgatgatgacgtcaaaaggatccggataaaacatggtgattgcctcgcataacgcggta
tgaaaatggattgaagcccgggccgtggattctactcaactttgtcggcttgagaaagacc
tgggatcctgggtattaaaaagaagatctatttatttagagatctgttctattgtgatctc
ttattaggatcgcactgccctgtggataacaaggatccggcttttaagatcaacaacctgg
aaaggatcattaactgtgaatgatcggtgatcctggaccgtataagctgggatcagaatga
ggggttatacacaactcaaaaactgaacaacagttgttctttggataactaccggttgatc
caagcttcctgacagagttatccacagtagatcgcacgatctgtatacttatttgagtaaa
ttaacccacgatcccagccattcttctgccggatcttccggaatgtcgtgatcaagaatgt
tgatcttcagtg
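Locating the minimum of a skew diagram means finding every position where the running #G - #C reaches its lowest value. The following Python sketch does this in one pass without storing the whole diagram. The name `min_skew_positions` is hypothetical, and positions are reported as the number of bases read so far (0 means before the first base).

```python
def min_skew_positions(genome: str) -> list[int]:
    """Return all positions at which the running #G - #C is at its minimum."""
    running = 0      # current skew value
    best = 0         # smallest skew seen so far (skew is 0 at position 0)
    positions = [0]  # positions where the skew equals best
    for i, base in enumerate(genome.upper(), start=1):
        if base == "G":
            running += 1
        elif base == "C":
            running -= 1
        if running < best:      # new minimum: restart the list
            best = running
            positions = [i]
        elif running == best:   # tie with the current minimum
            positions.append(i)
    return positions


print(min_skew_positions("CATGGGCATCGGCCATACGCC"))  # [21]
```

On a full bacterial genome the reported position is only an approximate guide to the origin; a window around it still has to be inspected for candidate DnaA boxes.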
atcaatgatcaacgtaagcttctaagcATGATCAAGgtgctcacacagtttatccacaac
ctgagtggatgacatcaagataggtcgttgtatctccttcctctcgtactctcatgacca
cggaaagATGATCAAGagaggatgatttcttggccatatcgcaatgaatacttgtgactt
gtgcttccaattgacatcttcagcgccatattgcgctggccaaggtgacggagcgggatt
acgaaagcatgatcatggctgttgttctgtttatcttgttttgactgagacttgttagga
tagacggtttttcatcactgactagccaaagccttactctgcctgacatcgaccgtaaat
tgataatgaatttacatgcttccgcgacgatttacctCTTGATCATcgatccgattgaag
atcttcaattgttaattctcttgcctcgactcatagccatgatgagctCTTGATCATgtt
tccttaaccctctattttttacggaagaATGATCAAGctgctgctCTTGATCATcgtttc
oriC
Project Director Mikhail Gelfand
The skew diagram for Sulfolobus solfataricus
Project Director Uri Keich
Happy Rosalind!