Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
SlideShare a Scribd company logo
Akash Mahadev Iyer
S4 M.Sc Biochemistry
Department of Biochemistry
University of Kerala
 All human share about 99 % of their DNA.
 The remaining 1% accounts for variability between us- unless you are
an identical twin!.
 Repetitive DNA -two patterns of distribution in the chromosomes.
The two major classes of repetitive elements are
 Interspersed elements and
 Tandem arrays.
1. Interspersed repeated elements are usually present as single copies
and distributed widely throughout the genome.
2. Tandemly-repeated DNA may be further classified according to the
length and copy number of the basic repeat units as well as its
genomic localization;
3 main types
 Satellite
 Minisatellite
 Microsatellite
 A satellite is a highly repetitive DNA sequence with each repeated
sequence ranging from a thousand to several thousand base pairs.
 The entire satellite can be up to 100 million base pairs long,
Occur in regions of heterochromatin.
Satellite DNA - clustered in discrete
areas such as centromere.
 Differ in density from the remaining chromosomal DNA and are separable as
minor band in CsCl Density Gradient Centrifugation, called as SATELLITE
DNA.
 A minisatellite is an array of tandem repeats, with each repeat ranging
from nine to 100 base pairs.
 The entire array is usually 500 to 30,000 base pairs long.
 Most commonly found in euchromatin regions of the chromosome.
 Like satellite DNA sequences, they are also not transcribed
 A microsatellite is an array of very short repeats
 2-6 base pairs each, with the entire array ranging from 10,000 to 100,000
base pairs in length.
 Found in the euchromatin regions of vertebrate, insect and plant
chromosomes. The number of repeats varies among individuals in a
population, making microsatellites particularly useful to the population
geneticist.
VNTR- Minisatellite
 Tandem repeats occur in DNA when a pattern of one or more nucleotides
is repeated and the repetitions are directly adjacent to each other.
 An example would be:
AATTTTCCGGCCCCAAAATTCCAATTTTCCGGCCCCAAAATTCCAATTTTCCGGCCCCAAAATTCC
 These can be found on many chromosomes, and often show variations in
length between individuals.
 The number of elements in a given region may vary, hence they are known
as variable number tandem repeats.
 Each variant acts as an inherited allele, allowing them to be used for
personal or parental identification.
 Scientists use polymorphic loci that are known to
contain VNTRs/STRs in order to differentiate people based on their DNA.
Variable Number Tandem Repeats
 In standard genetics, we often think of an allele as a variation
of gene that would result in a difference in a physical manifestation
of that gene.
 In the case of VNTRs, these alleles are simply a difference in
number of repeats.
 The length of DNA within this locus is either longer or shorter and gives
rise to many different alleles.
 A locus in a population is polymorphic if it has more than one allele.
 This is often used in forensic science or in maternity/paternity cases.
 Any variation of a locus is referred to as an allele.
• VNTR regions are not genes, and our interest in them is solely related to
their use for identifying individuals. We therefore refer to them
as markers.
 Because different alleles consist of different numbers of repeats, VNTR
alleles can be identified by their lengths.
 If DNA fragments of different lengths are placed on a semisolid medium
(gel) in an electric field, they migrate at different rates;
 Different-sized fragments can therefore be identified by the distance
they travel between electrodes in such a gel
 First human minisatellite was discovered in 1980 by A.R. Wyman and
R. White
 Alec Jeffreys coined the term ‘minisatellite’
 14-100 bp sequences which repeats 20-50 times
 Hypervariable minisatellites, highly polymorphic, contain 100-1000
repeats
 HVMS DNA sequences; Not transcribed (ex; MUC1 locus)
 The variability is not just limited to the number of repeat but also the
sequence of repeats can vary in different members of an array
 Of the 300 minisatellite families, which have been typed less than 10
are hypervariable
 Individual repeats can be removed from (or added to) the VNTR via
recombination or replication errors.
 Different individuals within a population may have different numbers of
repeats.
 For example, one person might carry 3 tandem repeats in a particular
gene, while another person might bear 15 repeats
• Due to the repetitive nature of the sequence have allowed further
increases/decreases in the number of repeats resulting in the
polymorphisms observed
 These can be isolated from an individual’s DNA and therefore relatively
easy to map.
VNTRs have a disadvantage as genetic markers:
 They tend to bunch together at the ends of chromosomes, leaving the
interiors of the chromosomes relatively devoid of markers.
VNTR- Minisatellite
While the repeated sequences
themselves are usually the same from
person to person, the number of times
they are repeated tends to vary.
In order to create a DNA fingerprint
that is unique to each person
scientists had to identify areas in the
genome that are highly variable.
VNTRs are highly polymorphic-
VNTRs tends to differ among different
people.
The number of times the sequence is repeated can differ
within and between individuals therefore making these
VNTRs a polymorphic entity.
 VNTR loci tend to be hypermutable through an insertion/deletion of
short nucleotide repeat sequences.
 They are classified a hypermutable only when their mean mutation
frequency is greater than 0.5%
 Mutation frequency is defined as the frequency at which a specific kind
of mutation is found in a population of cells
 Great variation can be generated –as the mutation rates thousands-
millions, of times faster than the rest of the genome
 Short arrays of repeats can be stable over millions of years, while long
alleles can have an extremely high frequency of mutational changes (up
to 15%).
 The most highly mutable minisatellite locus described so
far is CEB1 (D2S90)- 13% mutation rate in the male
germline (Vergnaud et.al)
Organellar location;
 Minisatellites are not just restricted to nuclear DNA, they are
also present in mtDNA, cpDNA
VNTRs are thought to have arisen due to
 Slippages during DNA replication or
 As a consequence of Unequal crossing-over,
 During replication the DNA polymerase copies the template and sometimes
stutters in areas where tandem repeats are located.
 As a result the number of repeats increases. This is called backward
replication slippage.
 If a tandem repeat region contains many repeats a mechanism called
forward replication slippage may occur.
 During replication when the DNA is single stranded, the repeat region
forms loops.
 The DNA polymerase may accidentally skip this looped region and as a
result the replicated stand contains a decreased number of tandem
repeats.
VNTRs are very rarely found in polypeptide encoding DNA.
 Exception; MUC1 locus at 1q21 encodes a highly polymorphic glycoproteins-
mucins
 Mucins are HMW GPs
 Found in several epithelial tissues, mucous membranes and body
fluids(maintains integrity and differentiation of EC, promotes tumor cell
proliferation and metastasis)
 Mucins can be either secretory (MUC2, MUC6, MUC7) or membrane bound
(MUC1, MUC12, MUC13)
 The extracellular portion of mucins contain large no of TR repeats of similar
amino acids- 20 aa (Ser, Thr; O-linked glycosylation sites)
VNTR- Minisatellite
Variation at the insulin gene variable number of tandem
repeats (INS VNTR)
 Minisatellite has been shown to influence pancreatic insulin gene
transcription, both in the fetus and in adulthood
 The 5’ regulatory region of INS contains a variable number of tandem repeat
(VNTR) locus that has been associated with T1D.
 A(C/T)AGGGGT(G/C)C(G/C/T)(G/A/T)(G/A/T)G(G/C/T);
 14-15 bp consensus sequence situated at the 5’region of INS gene
 The VNTR lies 596 bp upstream of the insulin gene on chromosome
11p15.5, and has been suggested to influence transcription rate of these
genes.
 Known as Insulin gene minisatellite or ILPR- Insulin linked polymorphic
region
 No of repeats in the alleles ranges from 26 to >200
Three classes of alleles identified on the basis of overall size;
 Class I (26-63 repeats- associated with IDDM suscepibility)
 Class ll (80 repeats)
 Class lll (140-200 repeats)- cause transcription of INS gene in
developing thymus, thus increases efficiency of deletion of insulin-
reactive T-cell clones during development of immune system, reducing
risk of autoimmune attack
Low thymic insulin- hampered negative selection
Insulin Gene (INS)
Class I VNTR
26-63 repeats
21 alleles
Predisposing
Insulin Gene (INS)
Class III VNTR
140-200 repeats
15 alleles
Protective
VNTR = Variable Number of Tandem Repeats
Applications
 VNTR fingerprinting has been applied for subtyping of Salmonella
typhi and S.typhimurium
 VNTR typing as the next gold standard in the molecular
epidemiology of tuberculosis - Genotyping for early diagnosis of
M. tuberculosis super-infection or mixed infection
 Used as genetic markers in linkage analysis and population studies
 Highly valuable new tool in population ecology
 Fingerprinting
 Conservation biology
 Parentage
 Forensics
 Genetic identity
Importance in human genome
 Minisatellites have been found in association with important features of
human genome biology such as gene regulation, chromosomal fragile
sites, and imprinting
 Some are part of an open reading frame, which may or may not display
polymorphism in the human population
 Minisatellites located in the 5′ region of genes participate in the
regulation of transcription.
 Others located within introns interfere with splicing
 Minisatellites at imprinted loci are thought to play a role in the imprint
control
 Finally, minisatellites may constitute chromosome fragile sites and
have been found in the vicinity of a number of recurrent translocation
breakpoints and in the switch recombination site in immunoglobulin
heavy chain genes
References
 Human Molecular Genetics, Tom Strachan, 3rd edition, Garland
Science
 Molecular Markers and Plant Biotechnology-By Rukam S. Tomar,
New India Publishing
 Computational Systems-Biology and Bioinformatics- edited by
Jonathan H. Chan,
 Human Molecular Biology Laboratory Manual- by Stefan
Surzycki, Wiley
 Mini- and Microsatellites by Claes Ramel, Stockholm University
 Slideshare
 NCBI
VNTR- Minisatellite

More Related Content

VNTR- Minisatellite

  • 1. Akash Mahadev Iyer S4 M.Sc Biochemistry Department of Biochemistry University of Kerala
  • 2.  All human share about 99 % of their DNA.  The remaining 1% accounts for variability between us- unless you are an identical twin!.
  • 3.  Repetitive DNA -two patterns of distribution in the chromosomes. The two major classes of repetitive elements are  Interspersed elements and  Tandem arrays.
  • 4. 1. Interspersed repeated elements are usually present as single copies and distributed widely throughout the genome. 2. Tandemly-repeated DNA may be further classified according to the length and copy number of the basic repeat units as well as its genomic localization; 3 main types  Satellite  Minisatellite  Microsatellite
  • 5.  A satellite is a highly repetitive DNA sequence with each repeated sequence ranging from a thousand to several thousand base pairs.  The entire satellite can be up to 100 million base pairs long, Occur in regions of heterochromatin. Satellite DNA - clustered in discrete areas such as centromere.
  • 6.  Differ in density from the remaining chromosomal DNA and are separable as minor band in CsCl Density Gradient Centrifugation, called as SATELLITE DNA.
  • 7.  A minisatellite is an array of tandem repeats, with each repeat ranging from nine to 100 base pairs.  The entire array is usually 500 to 30,000 base pairs long.  Most commonly found in euchromatin regions of the chromosome.  Like satellite DNA sequences, they are also not transcribed  A microsatellite is an array of very short repeats  2-6 base pairs each, with the entire array ranging from 10,000 to 100,000 base pairs in length.  Found in the euchromatin regions of vertebrate, insect and plant chromosomes. The number of repeats varies among individuals in a population, making microsatellites particularly useful to the population geneticist.
  • 9.  Tandem repeats occur in DNA when a pattern of one or more nucleotides is repeated and the repetitions are directly adjacent to each other.  An example would be: AATTTTCCGGCCCCAAAATTCCAATTTTCCGGCCCCAAAATTCCAATTTTCCGGCCCCAAAATTCC  These can be found on many chromosomes, and often show variations in length between individuals.  The number of elements in a given region may vary, hence they are known as variable number tandem repeats.  Each variant acts as an inherited allele, allowing them to be used for personal or parental identification.  Scientists use polymorphic loci that are known to contain VNTRs/STRs in order to differentiate people based on their DNA. Variable Number Tandem Repeats
  • 10.  In standard genetics, we often think of an allele as a variation of gene that would result in a difference in a physical manifestation of that gene.  In the case of VNTRs, these alleles are simply a difference in number of repeats.  The length of DNA within this locus is either longer or shorter and gives rise to many different alleles.  A locus in a population is polymorphic if it has more than one allele.  This is often used in forensic science or in maternity/paternity cases.  Any variation of a locus is referred to as an allele. • VNTR regions are not genes, and our interest in them is solely related to their use for identifying individuals. We therefore refer to them as markers.
  • 11.  Because different alleles consist of different numbers of repeats, VNTR alleles can be identified by their lengths.  If DNA fragments of different lengths are placed on a semisolid medium (gel) in an electric field, they migrate at different rates;  Different-sized fragments can therefore be identified by the distance they travel between electrodes in such a gel
  • 12.  First human minisatellite was discovered in 1980 by A.R. Wyman and R. White  Alec Jeffreys coined the term ‘minisatellite’  14-100 bp sequences which repeats 20-50 times  Hypervariable minisatellites, highly polymorphic, contain 100-1000 repeats  HVMS DNA sequences; Not transcribed (ex; MUC1 locus)  The variability is not just limited to the number of repeat but also the sequence of repeats can vary in different members of an array  Of the 300 minisatellite families, which have been typed less than 10 are hypervariable
  • 13.  Individual repeats can be removed from (or added to) the VNTR via recombination or replication errors.  Different individuals within a population may have different numbers of repeats.  For example, one person might carry 3 tandem repeats in a particular gene, while another person might bear 15 repeats
  • 14. • Due to the repetitive nature of the sequence have allowed further increases/decreases in the number of repeats resulting in the polymorphisms observed  These can be isolated from an individual’s DNA and therefore relatively easy to map. VNTRs have a disadvantage as genetic markers:  They tend to bunch together at the ends of chromosomes, leaving the interiors of the chromosomes relatively devoid of markers.
  • 16. While the repeated sequences themselves are usually the same from person to person, the number of times they are repeated tends to vary. In order to create a DNA fingerprint that is unique to each person scientists had to identify areas in the genome that are highly variable. VNTRs are highly polymorphic- VNTRs tends to differ among different people. The number of times the sequence is repeated can differ within and between individuals therefore making these VNTRs a polymorphic entity.
  • 17.  VNTR loci tend to be hypermutable through an insertion/deletion of short nucleotide repeat sequences.  They are classified a hypermutable only when their mean mutation frequency is greater than 0.5%  Mutation frequency is defined as the frequency at which a specific kind of mutation is found in a population of cells  Great variation can be generated –as the mutation rates thousands- millions, of times faster than the rest of the genome  Short arrays of repeats can be stable over millions of years, while long alleles can have an extremely high frequency of mutational changes (up to 15%).
  • 18.  The most highly mutable minisatellite locus described so far is CEB1 (D2S90)- 13% mutation rate in the male germline (Vergnaud et.al) Organellar location;  Minisatellites are not just restricted to nuclear DNA, they are also present in mtDNA, cpDNA VNTRs are thought to have arisen due to  Slippages during DNA replication or  As a consequence of Unequal crossing-over,
  • 19.  During replication the DNA polymerase copies the template and sometimes stutters in areas where tandem repeats are located.  As a result the number of repeats increases. This is called backward replication slippage.
  • 20.  If a tandem repeat region contains many repeats a mechanism called forward replication slippage may occur.  During replication when the DNA is single stranded, the repeat region forms loops.  The DNA polymerase may accidentally skip this looped region and as a result the replicated stand contains a decreased number of tandem repeats.
  • 21. VNTRs are very rarely found in polypeptide encoding DNA.  Exception; MUC1 locus at 1q21 encodes a highly polymorphic glycoproteins- mucins  Mucins are HMW GPs  Found in several epithelial tissues, mucous membranes and body fluids(maintains integrity and differentiation of EC, promotes tumor cell proliferation and metastasis)  Mucins can be either secretory (MUC2, MUC6, MUC7) or membrane bound (MUC1, MUC12, MUC13)  The extracellular portion of mucins contain large no of TR repeats of similar amino acids- 20 aa (Ser, Thr; O-linked glycosylation sites)
  • 23. Variation at the insulin gene variable number of tandem repeats (INS VNTR)  Minisatellite has been shown to influence pancreatic insulin gene transcription, both in the fetus and in adulthood  The 5’ regulatory region of INS contains a variable number of tandem repeat (VNTR) locus that has been associated with T1D.  A(C/T)AGGGGT(G/C)C(G/C/T)(G/A/T)(G/A/T)G(G/C/T);  14-15 bp consensus sequence situated at the 5’region of INS gene  The VNTR lies 596 bp upstream of the insulin gene on chromosome 11p15.5, and has been suggested to influence transcription rate of these genes.  Known as Insulin gene minisatellite or ILPR- Insulin linked polymorphic region  No of repeats in the alleles ranges from 26 to >200
  • 24. Three classes of alleles identified on the basis of overall size;  Class I (26-63 repeats- associated with IDDM suscepibility)  Class ll (80 repeats)  Class lll (140-200 repeats)- cause transcription of INS gene in developing thymus, thus increases efficiency of deletion of insulin- reactive T-cell clones during development of immune system, reducing risk of autoimmune attack Low thymic insulin- hampered negative selection
  • 25. Insulin Gene (INS) Class I VNTR 26-63 repeats 21 alleles Predisposing Insulin Gene (INS) Class III VNTR 140-200 repeats 15 alleles Protective VNTR = Variable Number of Tandem Repeats
  • 26. Applications  VNTR fingerprinting has been applied for subtyping of Salmonella typhi and S.typhimurium  VNTR typing as the next gold standard in the molecular epidemiology of tuberculosis - Genotyping for early diagnosis of M. tuberculosis super-infection or mixed infection  Used as genetic markers in linkage analysis and population studies  Highly valuable new tool in population ecology  Fingerprinting  Conservation biology  Parentage  Forensics  Genetic identity
  • 27. Importance in human genome  Minisatellites have been found in association with important features of human genome biology such as gene regulation, chromosomal fragile sites, and imprinting  Some are part of an open reading frame, which may or may not display polymorphism in the human population  Minisatellites located in the 5′ region of genes participate in the regulation of transcription.  Others located within introns interfere with splicing  Minisatellites at imprinted loci are thought to play a role in the imprint control  Finally, minisatellites may constitute chromosome fragile sites and have been found in the vicinity of a number of recurrent translocation breakpoints and in the switch recombination site in immunoglobulin heavy chain genes
  • 28. References  Human Molecular Genetics, Tom Strachan, 3rd edition, Garland Science  Molecular Markers and Plant Biotechnology-By Rukam S. Tomar, New India Publishing  Computational Systems-Biology and Bioinformatics- edited by Jonathan H. Chan,  Human Molecular Biology Laboratory Manual- by Stefan Surzycki, Wiley  Mini- and Microsatellites by Claes Ramel, Stockholm University  Slideshare  NCBI

Editor's Notes

  1. Satellite DNA, -named after its separability from bulk DNA by buoyant density gradient centrifugation. consist of very high numbers of repetitions (b/w 1000 and more than 100,000 copies) of a basic sequence motif. sizes may range from two to several thousand bp, but 100 to 300 bp are most common. heterochromatic in nature, -often located in subtelomeric or centromeric regions. Satellite DNAs are only rarely used as molecular markers (e.g., for species identification). Minisatellites- intermediate-sized DNA motifs (about 10 to 60 bp), and show a lower degree of repetition at a given locus compared with satellites. minisatellites form families of related sequences that occur at many hundred loci in the nuclear genome. Tandem repeats made up from very short (i.e., about 1 to 6 bp) motifs were called simple sequences- microsatellites, simple repetitive sequences (SRS), simple sequence repeats (SSRs), or simple tandem repeats (STRs). \ characterized by a low degree of repetition at a particular locus.
  2. Gc cpontent-
  3. enetic markers are sequences of DNA which have been traced tospecific locations on the chromosomes and associated with particulartraits. A location in genome where a short nucleotide sequence is organized as a tandem repeat.
  4. Minisatellites form arrays of several hundred units of 7 to 100 bp in length. They are present everywhere with an increasing concentration toward the telomeres. They differ from satellites in that they are found only in moderate numbers of tandem repeats and because of their high degree of dispersion throughout chromosomes. Microsatellites, or simple sequence repeats (SSRs), are composed of units of one to six nucleotides, repeated up to a length of 100 bp or more. One-third are simple "polyadenylated" repeats, composed of nothing but adenine nucleotides. Other examples of abundant microsatellites are (AC)n, (AAAN)n, (AAAAN)n, and (AAN)n, where N represents any nucleotide and n is the number of repeats. Less abundant, but important because of their direct involvement in the generation of disease, are the (CAG/CTG)nand (CGG/CCG)n trinucleotide (or triplet) repeats. Telomeric and subtelomeric repeats are present at the end of the telomeres and are composed of short tandem repeats (STRs) of (TTAGGG)n, up to 30,000 bp long. This sequence is "highly conserved," meaning it has changed very little over evolutionary time, indicating it likely plays a very important role. These STRs function as caps or ends of the long linear chromosomal DNA molecule and are crucial to the maintenance of intact eukaryotic chromosomes. Subtelomeric repeats act as transitions between the boundary of the telomere and the rest of the chromosome. They contain units similar to the TTAGGG, but they are not conserved
  5. Marker is a dna sequnece genetically linked to the gene of interest Behave as tags Located near gene of interest
  6. Different mechanisms of germline length changes of minisatellites can be visualized-replication slippage, intramolecular recombination, unequal sister chromatid exchange, and unequal interallelic recombination or gene conversion
  7. Proposed function of regulating or binding carbohydrate sidechains.Interpro abstract (IPR000082):The SEA domain has been named after the first three proteins in which it was identified (Sperm protein, Enterokinase and Agrin). The SEA domain has around 120 residues, it is an extracellular domain whose function is not known. It is found in one or two copies in mosaic extracellular or transmembrane proteins. The SEA domain is closely associated with regions receiving extensive O-glycosylation. It has been proposed that carbohydrates are required to stabilise SEA domains and protect them against proteolytic degradation and that the extent of substitution may control proteolytic processing [(PUBMED:7670383), (PUBMED:9762901)]. The SEA domain contains an about 80-residue conserved region and an about 40-residue segment that separates the conserved region from the subsequent C-terminal domains. Secondary structure predictions and circular dichroism suggest an alternating conformation of beta sheets and alpha helices for the SEA domain [(PUBMED:7670383), (PUBMED:9030729)]. Some proteins known to contain a SEA domain include: Vertebrate agrin, an heparan sulfate proteoglycan of the basal lamina of the neuromuscular junction. It is responsible for the clustering of acetylcholine receptors (AChRs) and other proteins at the neuromuscular junction. Mammalian enterokinase. It catalyses the conversion of trypsinogen to trypsin which in turn activates other proenzymes, including chymotrypsinogen, procarboxypeptidases, and proelastases. 63 kDa sea urchin sperm protein (SP63). It might mediate sperm-egg or sperm-matrix interactions. Animal perlecan, a heparan sulfate containing proteoglycan found in all basement membranes. It interacts with other basement membrane components such as laminin and collagen type IV and serves as an attachment substrate for cells. Some vertebrate epithelial mucins. They form a family of secreted and cell surface glycoproteins expressed by epithelial tissues and implicated in epithelial cell protection, adhesion modulation and signaling. Mammalian cell surface antigen 114/A10, an integral transmembrane protein that is highly expressed in hematopoietic progenitor cells and IL-3-dependent cell lines