Molecular phylogenetic and phylogeographic reconstructions generally assume time-homogeneous subs... more Molecular phylogenetic and phylogeographic reconstructions generally assume time-homogeneous substitution processes. Motivated by computational convenience, this assumption sacrifices biological realism and offers little opportunity to uncover the temporal dynamics in evolutionary histories. Here, we propose an evolutionary approach that explicitly relaxes the time-homogeneity assumption by allowing the specification of different infinitesimal substitution rate matrices across different time intervals, called epochs, along the evolutionary history. We focus on an epoch model implementation in a Bayesian inference framework that offers great modeling flexibility in drawing inference about any discrete data type characterized as a continuous-time Markov chain, including phylogeographic traits. To alleviate the computational burden that the additional temporal heterogeneity imposes, we adopt a massively parallel approach that achieves both fine- and coarse-grain parallelization of the ...
Coronaviruses are enveloped, positive-stranded RNA viruses with a genome of approximately 30 kb. ... more Coronaviruses are enveloped, positive-stranded RNA viruses with a genome of approximately 30 kb. Based on genetic similarities, coronaviruses are classified into three groups. Two group 2 coronaviruses, human coronavirus OC43 (HCoV-OC43) and bovine coronavirus (BCoV), show remarkable antigenic and genetic similarities. In this study, we report the first complete genome sequence (30,738 nucleotides) of the prototype HCoV-OC43 strain (ATCC VR759). Complete genome and open reading frame (ORF) analyses were performed in comparison to the BCoV genome. In the region between the spike and membrane protein genes, a 290-nucleotide deletion is present, corresponding to the absence of BCoV ORFs ns4.9 and ns4.8. Nucleotide and amino acid similarity percentages were determined for the major HCoV-OC43 ORFs and for those of other group 2 coronaviruses. The highest degree of similarity is demonstrated between HCoV-OC43 and BCoV in all ORFs with the exception of the E gene. Molecular clock analysis ...
Like other RNA viruses, coxsackievirus B5 (CVB5) exists as circulating heterogeneous populations ... more Like other RNA viruses, coxsackievirus B5 (CVB5) exists as circulating heterogeneous populations of genetic variants. In this study, we present the reconstruction and characterization of a probable ancestral virion of CVB5. Phylogenetic analyses based on capsid protein-encoding regions (the VP1 gene of 41 clinical isolates and the entire P1 region of eight clinical isolates) of CVB5 revealed two major cocirculating lineages. Ancestral capsid sequences were inferred from sequences of these contemporary CVB5 isolates by using maximum likelihood methods. By using Bayesian phylodynamic analysis, the inferred VP1 ancestral sequence dated back to 1854 (1807 to 1898). In order to study the properties of the putative ancestral capsid, the entire ancestral P1 sequence was synthesized de novo and inserted into the replicative backbone of an infectious CVB5 cDNA clone. Characterization of the recombinant virus in cell culture showed that fully functional infectious virus particles were assembl...
The inflammatory bowel diseases (IBD), Crohn&... more The inflammatory bowel diseases (IBD), Crohn's disease (CD), and ulcerative colitis (UC), are complex multifactorial traits involving both environmental and genetic factors. Mannan-binding lectin (MBL) plays an important role in non-specific immunity and complement activation. Point mutations in codons 52, 54 and 57 of exon 1 of the MBL gene are associated with decreased MBL plasma concentrations and increased susceptibility
Proceedings. Biological sciences / The Royal Society, Jan 22, 2015
The frequency and global impact of infectious disease outbreaks, particularly those caused by eme... more The frequency and global impact of infectious disease outbreaks, particularly those caused by emerging viruses, demonstrate the need for a better understanding of how spatial ecology and pathogen evolution jointly shape epidemic dynamics. Advances in computational techniques and the increasing availability of genetic and geospatial data are helping to address this problem, particularly when both information sources are combined. Here, we review research at the intersection of evolutionary biology, human geography and epidemiology that is working towards an integrated view of spatial incidence, host mobility and viral genetic diversity. We first discuss how empirical studies have combined viral spatial and genetic data, focusing particularly on the contribution of evolutionary analyses to epidemiology and disease control. Second, we explore the interplay between virus evolution and global dispersal in more depth for two pathogens: human influenza A virus and chikungunya virus. We dis...
Phylogeographic approaches help uncover the imprint that spatial epidemiological processes leave ... more Phylogeographic approaches help uncover the imprint that spatial epidemiological processes leave in the genomes of fast evolving viruses. Recent Bayesian inference methods that consider phylogenetic diffusion of discretely and continuously distributed traits offer a unique opportunity to explore genotypic and phenotypic evolution in greater detail. To provide a taste of the recent advances in viral diffusion approaches, we highlight key findings arising at the intrahost, local and global epidemiological scales. We also outline future areas of research and discuss how these may contribute to a quantitative understanding of the phylodynamics of RNA viruses.
Proceedings of the National Academy of Sciences, 2012
We introduce a conceptual bridge between the previously unlinked fields of phylogenetics and math... more We introduce a conceptual bridge between the previously unlinked fields of phylogenetics and mathematical spatial ecology, which enables the spatial parameters of an emerging epidemic to be directly estimated from sampled pathogen genome sequences. By using phylogenetic history to correct for spatial autocorrelation, we illustrate how a fundamental spatial variable, the diffusion coefficient, can be estimated using robust nonparametric statistics, and how heterogeneity in dispersal can be readily quantified. We apply this framework to the spread of the West Nile virus across North America, an important recent instance of spatial invasion by an emerging infectious disease. We demonstrate that the dispersal of West Nile virus is greater and far more variable than previously measured, such that its dissemination was critically determined by rare, long-range movements that are unlikely to be discerned during field observations. Our results indicate that, by ignoring this heterogeneity, ...
Philosophical Transactions of the Royal Society B: Biological Sciences, 2013
The factors that determine the origin and fate of cross-species transmission events remain unclea... more The factors that determine the origin and fate of cross-species transmission events remain unclear for the majority of human pathogens, despite being central for the development of predictive models and assessing the efficacy of prevention strategies. Here, we describe a flexible Bayesian statistical framework to reconstruct virus transmission between different host species based on viral gene sequences, while simultaneously testing and estimating the contribution of several potential predictors of cross-species transmission. Specifically, we use a generalized linear model extension of phylogenetic diffusion to perform Bayesian model averaging over candidate predictors. By further extending this model with branch partitioning, we allow for distinct host transition processes on external and internal branches, thus discriminating between recent cross-species transmissions, many of which are likely to result in dead-end infections, and host shifts that reflect successful onwards transm...
Molecular phylogenetic and phylogeographic reconstructions generally assume time-homogeneous subs... more Molecular phylogenetic and phylogeographic reconstructions generally assume time-homogeneous substitution processes. Motivated by computational convenience, this assumption sacrifices biological realism and offers little opportunity to uncover the temporal dynamics in evolutionary histories. Here, we propose an evolutionary approach that explicitly relaxes the time-homogeneity assumption by allowing the specification of different infinitesimal substitution rate matrices across different time intervals, called epochs, along the evolutionary history. We focus on an epoch model implementation in a Bayesian inference framework that offers great modeling flexibility in drawing inference about any discrete data type characterized as a continuous-time Markov chain, including phylogeographic traits. To alleviate the computational burden that the additional temporal heterogeneity imposes, we adopt a massively parallel approach that achieves both fine- and coarse-grain parallelization of the ...
Coronaviruses are enveloped, positive-stranded RNA viruses with a genome of approximately 30 kb. ... more Coronaviruses are enveloped, positive-stranded RNA viruses with a genome of approximately 30 kb. Based on genetic similarities, coronaviruses are classified into three groups. Two group 2 coronaviruses, human coronavirus OC43 (HCoV-OC43) and bovine coronavirus (BCoV), show remarkable antigenic and genetic similarities. In this study, we report the first complete genome sequence (30,738 nucleotides) of the prototype HCoV-OC43 strain (ATCC VR759). Complete genome and open reading frame (ORF) analyses were performed in comparison to the BCoV genome. In the region between the spike and membrane protein genes, a 290-nucleotide deletion is present, corresponding to the absence of BCoV ORFs ns4.9 and ns4.8. Nucleotide and amino acid similarity percentages were determined for the major HCoV-OC43 ORFs and for those of other group 2 coronaviruses. The highest degree of similarity is demonstrated between HCoV-OC43 and BCoV in all ORFs with the exception of the E gene. Molecular clock analysis ...
Like other RNA viruses, coxsackievirus B5 (CVB5) exists as circulating heterogeneous populations ... more Like other RNA viruses, coxsackievirus B5 (CVB5) exists as circulating heterogeneous populations of genetic variants. In this study, we present the reconstruction and characterization of a probable ancestral virion of CVB5. Phylogenetic analyses based on capsid protein-encoding regions (the VP1 gene of 41 clinical isolates and the entire P1 region of eight clinical isolates) of CVB5 revealed two major cocirculating lineages. Ancestral capsid sequences were inferred from sequences of these contemporary CVB5 isolates by using maximum likelihood methods. By using Bayesian phylodynamic analysis, the inferred VP1 ancestral sequence dated back to 1854 (1807 to 1898). In order to study the properties of the putative ancestral capsid, the entire ancestral P1 sequence was synthesized de novo and inserted into the replicative backbone of an infectious CVB5 cDNA clone. Characterization of the recombinant virus in cell culture showed that fully functional infectious virus particles were assembl...
The inflammatory bowel diseases (IBD), Crohn&... more The inflammatory bowel diseases (IBD), Crohn's disease (CD), and ulcerative colitis (UC), are complex multifactorial traits involving both environmental and genetic factors. Mannan-binding lectin (MBL) plays an important role in non-specific immunity and complement activation. Point mutations in codons 52, 54 and 57 of exon 1 of the MBL gene are associated with decreased MBL plasma concentrations and increased susceptibility
Proceedings. Biological sciences / The Royal Society, Jan 22, 2015
The frequency and global impact of infectious disease outbreaks, particularly those caused by eme... more The frequency and global impact of infectious disease outbreaks, particularly those caused by emerging viruses, demonstrate the need for a better understanding of how spatial ecology and pathogen evolution jointly shape epidemic dynamics. Advances in computational techniques and the increasing availability of genetic and geospatial data are helping to address this problem, particularly when both information sources are combined. Here, we review research at the intersection of evolutionary biology, human geography and epidemiology that is working towards an integrated view of spatial incidence, host mobility and viral genetic diversity. We first discuss how empirical studies have combined viral spatial and genetic data, focusing particularly on the contribution of evolutionary analyses to epidemiology and disease control. Second, we explore the interplay between virus evolution and global dispersal in more depth for two pathogens: human influenza A virus and chikungunya virus. We dis...
Phylogeographic approaches help uncover the imprint that spatial epidemiological processes leave ... more Phylogeographic approaches help uncover the imprint that spatial epidemiological processes leave in the genomes of fast evolving viruses. Recent Bayesian inference methods that consider phylogenetic diffusion of discretely and continuously distributed traits offer a unique opportunity to explore genotypic and phenotypic evolution in greater detail. To provide a taste of the recent advances in viral diffusion approaches, we highlight key findings arising at the intrahost, local and global epidemiological scales. We also outline future areas of research and discuss how these may contribute to a quantitative understanding of the phylodynamics of RNA viruses.
Proceedings of the National Academy of Sciences, 2012
We introduce a conceptual bridge between the previously unlinked fields of phylogenetics and math... more We introduce a conceptual bridge between the previously unlinked fields of phylogenetics and mathematical spatial ecology, which enables the spatial parameters of an emerging epidemic to be directly estimated from sampled pathogen genome sequences. By using phylogenetic history to correct for spatial autocorrelation, we illustrate how a fundamental spatial variable, the diffusion coefficient, can be estimated using robust nonparametric statistics, and how heterogeneity in dispersal can be readily quantified. We apply this framework to the spread of the West Nile virus across North America, an important recent instance of spatial invasion by an emerging infectious disease. We demonstrate that the dispersal of West Nile virus is greater and far more variable than previously measured, such that its dissemination was critically determined by rare, long-range movements that are unlikely to be discerned during field observations. Our results indicate that, by ignoring this heterogeneity, ...
Philosophical Transactions of the Royal Society B: Biological Sciences, 2013
The factors that determine the origin and fate of cross-species transmission events remain unclea... more The factors that determine the origin and fate of cross-species transmission events remain unclear for the majority of human pathogens, despite being central for the development of predictive models and assessing the efficacy of prevention strategies. Here, we describe a flexible Bayesian statistical framework to reconstruct virus transmission between different host species based on viral gene sequences, while simultaneously testing and estimating the contribution of several potential predictors of cross-species transmission. Specifically, we use a generalized linear model extension of phylogenetic diffusion to perform Bayesian model averaging over candidate predictors. By further extending this model with branch partitioning, we allow for distinct host transition processes on external and internal branches, thus discriminating between recent cross-species transmissions, many of which are likely to result in dead-end infections, and host shifts that reflect successful onwards transm...
Uploads
Papers by Philippe Lemey