Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content
Background Evolutionary processes, such as gene family evolution or parasite-host co-speciation, can often be viewed as a tree evolving inside another tree. Relating two given trees under such a constraint is known as reconciling them.... more
Background Evolutionary processes, such as gene family evolution or parasite-host co-speciation, can often be viewed as a tree evolving inside another tree. Relating two given trees under such a constraint is known as reconciling them. Adequate software tools for generating illustrations of tree reconciliations are instrumental for presenting and communicating results and ideas regarding these phenomena. Available visualization tools have been limited to illustrations of the most parsimonious reconciliation.
Abstract Phylogeny is both a fundamental tool in biology and a rich source of fascinating modeling and algorithmic problems. Today's wealth of sequenced genomes makes it increasingly important to understand evolutionary events such as... more
Abstract Phylogeny is both a fundamental tool in biology and a rich source of fascinating modeling and algorithmic problems. Today's wealth of sequenced genomes makes it increasingly important to understand evolutionary events such as duplications, losses, transpositions, inversions, lateral transfers, and domain shuffling. We focus on the gene duplication event, that constitutes a major force in the creation of genes with new function [Ohno 1970; Lynch and Force 2000] and, thereby also, of biodiversity.
Abstract The microtubule network, the major organelle of the eukaryotic cytoskeleton, is involved in cell division and differentiation but also with many other cellular functions. In plants, microtubules seem to be involved in the ordered... more
Abstract The microtubule network, the major organelle of the eukaryotic cytoskeleton, is involved in cell division and differentiation but also with many other cellular functions. In plants, microtubules seem to be involved in the ordered deposition of cellulose microfibrils by a so far unknown mechanism. Microtubule-associated proteins (MAP) typically contain various domains targeting or binding proteins with different functions to microtubules.
Abstract Motivation: New generation sequencing technologies producing increasingly complex datasets demand new efficient and specialized sequence analysis algorithms. Often, it is only the 'novel'sequences in a complex dataset that are of... more
Abstract Motivation: New generation sequencing technologies producing increasingly complex datasets demand new efficient and specialized sequence analysis algorithms. Often, it is only the 'novel'sequences in a complex dataset that are of interest and the superfluous sequences need to be removed. Results: A novel algorithm, fast and accurate classification of sequences (FACSs), is introduced that can accurately and rapidly classify sequences as belonging or not belonging to a reference sequence.
Abstract Distance-based methods are popular for reconstructing evolutionary trees of protein sequences, mainly because of their speed and generality. A number of variants of the classical neighbor-joining (NJ) algorithm have been... more
Abstract Distance-based methods are popular for reconstructing evolutionary trees of protein sequences, mainly because of their speed and generality. A number of variants of the classical neighbor-joining (NJ) algorithm have been proposed, as well as a number of methods to estimate protein distances. We here present a large-scale assessment of performance in reconstructing the correct tree topology for the most popular algorithms.
We report the generation, assembly and annotation of expressed sequence tags (ESTs) from four chicken cDNA libraries, constructed from brain and testis tissue dissected from red junglefowl and White Leghorn. 21,285 5′-end ESTs were... more
We report the generation, assembly and annotation of expressed sequence tags (ESTs) from four chicken cDNA libraries, constructed from brain and testis tissue dissected from red junglefowl and White Leghorn. 21,285 5′-end ESTs were generated and assembled into 2,813 contigs and 9,737 singletons, giving 12,549 tentative unique transcripts. The transcripts were annotated using BLAST by matching to known chicken genes or to putative homologues in other species using the major gene/protein databases.
Abstract The debate over the prevalence of lateral gene transfers (LGTs) has been intense. There is now to a large extent consensus around the view that LGT is an important evolutionary force as well as regarding its relative importance... more
Abstract The debate over the prevalence of lateral gene transfers (LGTs) has been intense. There is now to a large extent consensus around the view that LGT is an important evolutionary force as well as regarding its relative importance across species. This consensus relies, however, mainly on studies of individual gene families. Up until now, the gold standard for identifying LGTs has been phylogenetic methods where LGTs are inferred from incongruities between a species tree and an associated gene tree.
Abstract Aligning biological sequences, DNA or proteins, is to identify positions in sequences by inserting blanks in a way that maximizes an objective function. The purpose is to understand how different sequences are evolutionary... more
Abstract Aligning biological sequences, DNA or proteins, is to identify positions in sequences by inserting blanks in a way that maximizes an objective function. The purpose is to understand how different sequences are evolutionary related and the objective function typically measure the amount of sequence similarity. This thesis presents variations on the sequence alignment theme.
The problem of aligning two DNA sequences with respect to the fact that they are coding for proteins is discussed. Criteria for a good alignment of coding DNA, together with an algorithm that satisfies them, are presented. The algorithm... more
The problem of aligning two DNA sequences with respect to the fact that they are coding for proteins is discussed. Criteria for a good alignment of coding DNA, together with an algorithm that satisfies them, are presented. The algorithm is robust against frame-shifts and forgiving towards silent substitutions. The important choice of objective function is examined and several variants are proposed.
The mitochondrial (mt) DNA control region (CR) of dogs and wolves contains an array of imperfect 10 bp tandem repeats. This region was studied for 14 domestic dogs representing the four major phylogenetic groups of nonrepetitive CR and... more
The mitochondrial (mt) DNA control region (CR) of dogs and wolves contains an array of imperfect 10 bp tandem repeats. This region was studied for 14 domestic dogs representing the four major phylogenetic groups of nonrepetitive CR and for 5 wolves. Three repeat types were found among these individuals, distributed so that different sequences of the repeat types were formed in different molecules. This enabled a detailed study of the arrays and of the mutation events that they undergo. Extensive heteroplasmy was observed in all individuals; 85 different array types were found in one individual, and the total number of types was estimated at 384. Among unrelated individuals, no identical molecules were found, indicating a high rate of evolution of the region. By performing a pedigree analysis, array types which had been inherited from mother to offspring and array types which were the result of somatic mutations, respectively, could be identified, showing that about 20% of the molecules within an individual had somatic mutations. By direct pairwise comparison of the mutated and the original array types, the physiognomy of the inserted or deleted elements (indels) and the approximate positions of the mutations could be determined. All mutations could be explained by replication slippage or point mutations. The majority of the indels were 1-5 repeats long, but deletions of up to 17 repeats were found. Mutations were found in all parts of the arrays, but at a higher frequency in the 5' end. Furthermore, the inherited array types within the mother-offspring pair were aligned and compared so that germ line mutations could be studied. The pattern of the germ line mutations was approximately the same as that of the somatic mutations.
A highly variable and heteroplasmic tandem repeat region situated in the mitochondrial mt DNA control region (CR) in domestic dogs and wolves was studied to evaluate its suitability as a forensic genetic marker for analysis of single... more
A highly variable and heteroplasmic tandem repeat region situated in the mitochondrial mt DNA control region (CR) in domestic dogs and wolves was studied to evaluate its suitability as a forensic genetic marker for analysis of single hairs. The tandem repeat array is composed of three 10-bp repeat types that are distributed so that a secondary DNA sequence is formed. Thus, the region presents two levels of variation: variation in the number of repeats and variation in the secondary DNA sequence of repeat types. Two analysis methods were therefore tested; fragment length analysis and analysis of the sequence of repeat types. Fragment analysis produced unique profiles that could be used to discriminate between blood samples from maternally closely related individuals. However, different hairs from one individual did not have the same fragment profile, and the method is, therefore, not suitable for analysis of single hairs. In contrast, analysis of the repeat type sequences (array types) is highly informative. When different hairs from one individual were studied, identical array types were found. The repeat-type sequence variation was studied among individuals having identical nonrepetitive CR mtDNA sequence variants. Seven, six, and two individuals, representing three different sequence variants, respectively, were analyzed. All these individuals had different array types, which implies a very high genetic variation between individuals in this region. The analysis method considerably improves the exclusion capacity of mtDNA analysis of domestic dogs compared with sequence analysis of non-repetitive DNA.