Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
MEEGID 2321 No. of Pages 9, Model 5G 25 April 2015 Infection, Genetics and Evolution xxx (2015) xxx–xxx 1 Contents lists available at ScienceDirect Infection, Genetics and Evolution journal homepage: www.elsevier.com/locate/meegid 5 6 3 4 7 8 9 10 11 12 13 14 15 16 1 3 8 4 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 A molecular phylogenetics-based approach for identifying recent hepatitis C virus transmission events Andrea D. Olmstead a,b,⇑, Jeffrey B. Joy c, Vincent Montoya a,b, Iris Luo b, Art F.Y. Poon c, Brendan Jacka d, François Lamoury d, Tanya Applegate d, Julio Montaner b,c, Yury Khudyakov e, Jason Grebely d, Darrel Cook a, P. Richard Harrigan c, Mel Krajden a,b a BC Centre for Disease Control, Vancouver, BC, Canada University of British Columbia, Vancouver, BC, Canada BC Centre for Excellence in HIV/AIDS, St Paul’s Hospital, Vancouver, BC, Canada d The Kirby Institute, UNSW Australia, Sydney, NSW, Australia e Centers for Disease Control and Prevention, Atlanta, Georgia, USA b c a r t i c l e i n f o Article history: Received 21 February 2015 Received in revised form 9 April 2015 Accepted 17 April 2015 Available online xxxx Keywords: Hepatitis C virus Recent transmission clusters British Columbia Epidemiology Sequencing Phylogenetics Surveillance Seroconversion a b s t r a c t Improved surveillance methods are needed to better understand the current hepatitis C virus (HCV) disease burden and to monitor the impact of prevention and treatment interventions on HCV transmission dynamics. Sanger sequencing (HCV NS5B, HVR1 and Core-E1-HVR1) and phylogenetics were applied to samples from individuals diagnosed with HCV in British Columbia, Canada in 2011. This included individuals with two or three sequential samples collected <1 year apart. Patristic distances between sequential samples were used to set cutoffs to identify recent transmission clusters. Factors associated with transmission clustering were analyzed using logistic regression. From 618 individuals, 646 sequences were obtained. Depending on the cutoff used, 63 (10%) to 92 (15%) unique individuals were identified within transmission clusters of predicted recent origin. Clustered individuals were more likely to be <40 years old (Adjusted Odds Ratio (AOR) 2.12, 95% CI 1.21–3.73), infected with genotype 1a (AOR 6.60, 95% CI 1.98–41.0), and to be seroconverters with estimated infection duration of <1 year (AOR 3.13, 95% CI 1.29–7.36) or >1 year (AOR 2.19, 95% CI 1.22–3.97). Conclusion: Systematic application of molecular phylogenetics may be used to enhance traditional surveillance methods through identification of recent transmission clusters. Ó 2015 Published by Elsevier B.V. 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 1. Introduction 55 Hepatitis C virus (HCV) has infected between 115 and 185 million individuals globally (Gower et al., 2014; Mohd Hanafiah et al., 2013). As of December 2013, 73,500 (1.6%) individuals in British Columbia (BC) were diagnosed as anti-HCV positive (BC Centre for Disease Control, 2013). About two-thirds of anti-HCV positive 56 57 58 59 Abbreviations: HCV, hepatitis C virus; PWID, people who inject drugs; BC, British Columbia; NS5B, non-structural protein-5B; E1, Envelope-1; HVR1, hypervariable region-1; cDNA, complementary DNA; RT-PCR, reverse transcriptase polymerase chain reaction; E2, Envelope-2; IU, international units; AOR, Adjusted Odds Ratio; CI, Confidence Interval; SD, Standard Deviation; OR, Odds Ratio; MSM, men who have sex with men. ⇑ Corresponding author at: The BC Centre for Disease Control/Department of Pathology, University of British Columbia, 2155 – 655 W 12th Ave, Vancouver, BC V5Z 4R4, Canada. E-mail address: andrea.olmstead@bccdc.ca (A.D. Olmstead). individuals in BC are baby boomers born between 1945 and 1965. Most were infected decades ago and are at risk of developing progressive liver disease as they age. HCV-infected British Columbians have about a 5-fold increased risk of all-cause and 20-fold increased risk of liver-related mortality (Yu et al., 2013). HCV continues to be transmitted by people who inject drugs (PWID), who account for >80% of new infections in many countries (Hajarizadeh et al., 2013; Nelson et al., 2011). Reducing the population disease burden will require enhanced screening, engagement into care and treatment as well as the potential use of treatment-as-prevention to reduce transmission among PWID (Martin et al., 2013; Smith et al., 2012; Wiessing et al., 2014). Informing effective public health interventions requires a clear understanding of the past and ongoing dynamics of viral epidemics. Phylogenetic trees constructed from viral sequences allow the historical relationships between infected individuals in a population to be modeled (Lam et al., 2010; Pybus and Rambaut, 2009; http://dx.doi.org/10.1016/j.meegid.2015.04.017 1567-1348/Ó 2015 Published by Elsevier B.V. Please cite this article in press as: Olmstead, A.D., et al. A molecular phylogenetics-based approach for identifying recent hepatitis C virus transmission events. Infect. Genet. Evol. (2015), http://dx.doi.org/10.1016/j.meegid.2015.04.017 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 MEEGID 2321 No. of Pages 9, Model 5G 25 April 2015 2 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 A.D. Olmstead et al. / Infection, Genetics and Evolution xxx (2015) xxx–xxx Volz et al., 2013). Distinct transmission clusters, i.e., groups of closely related infections, can be identified from phylogenies along with spatial, epidemiological and temporal information linked to their expansion (Grabowski and Redd, 2014; Hue et al., 2005; Lewis et al., 2008; van de Laar et al., 2009a,b). The ability to quantify mutation rates enables recent transmission clusters to be identified and characteristics associated with high rates of transmission to be inferred (Aldous et al., 2012; Poon et al., 2014). Phylogenetic cluster analyses are widespread in HIV research and have revealed that early/acute HIV infections play a major role in driving the epidemic and also that drug-resistant HIV variants can be transmitted and establish infection (Brenner et al., 2007; Kaye et al., 2008). HCV phylogenetic analyses have modeled the expansion of HCV throughout the 20th century in various global regions and have identified factors that likely contributed to the worldwide epidemic (Magiorkinis et al., 2009; Pybus et al., 2003, 2005; Tanaka et al., 2002). HCV phylogenetic cluster analyses have been used to confirm transmission events originating in health care settings and to investigate HCV clustering among HIV-infected individuals, men who have sex with men (MSM) and in PWID (Danta et al., 2007; Lanini et al., 2010; Oliveira et al., 2006; Sacks-Davis et al., 2012; Shemer-Avni et al., 2007; Urbanus et al., 2009; van de Laar et al., 2009a,b). The criteria for defining transmission clusters differs for both HIV and HCV-based phylogenetics but when strict cutoffs are imposed, clusters that represent more recent transmission events can be identified. Tracking recent transmission clusters over time at a population level may be used to support HCV incidence estimations and to identify sub-populations at high risk of HCV transmission. BC’s centralized laboratory testing and consistent use of demographic identifiers enables identification of a subset of recent HCV infections by linking new anti-HCV positive to previous anti-HCV negative test results. However, most HCV-infected individuals have no prior testing history, precluding estimation of when transmission occurred. The aim of this study was to develop a method for identifying recent transmission clusters in a population. The HCV NS5B, core, E1 and HVR-1 regions were sequenced in a proportion of individuals who tested anti-HCV positive in BC in 2011. The genetic distances between sequential specimens from the same individuals collected less than one year apart were used to assign cutoffs to identify recent phylogenetic transmission clusters. This study provides a proof-of-principle for applying molecular phylogenetics to support traditional surveillance methods through identification of recent transmission clusters in a population. 122 2. Methods 123 2.1. Study population and design 124 The BC Public Health Microbiology and Reference Laboratory performs approximately 95% of HCV serological screening in BC (Yu et al., 2013). Automated linkage of anti-HCV positive tests with past results enables identification of seroconverters i.e., individuals who previously tested anti-HCV negative and are anti-HCV positive on subsequent testing. For seroconverters the estimated infection duration was defined as the time between the specimen collection date and the midpoint between an individual’s last negative and first positive anti-HCV test. Individuals who tested anti-HCV positive in BC in 2011 were eligible for assessment. Individuals were classified as: (1) first time positives: individuals who tested positive for the first time in 2011 and had no previous testing history; thus, their infection duration was unknown; (2) past seroconverters: individuals who seroconverted prior to 2011 but tested positive again in 2011 and thus 125 126 127 128 129 130 131 132 133 134 135 136 137 138 had an estimated infection duration of >1 year; and (3) recent seroconverters: individuals who tested positive for the first time in 2011 and had an estimated infection duration of <1 year. The University of British Columbia Clinical Research Ethics Board approved this study. 139 2.2. HCV RNA testing and sequencing 144 HCV RNA was extracted from anti-HCV positive serum using the MagMAX™-96 Viral RNA Isolation kit (Life Technologies, Carlsbad, CA, USA). HCV viral loads were quantified using a semi-quantitative HCV RT-PCR assay (Meng and Li, 2010). Complementary DNA (cDNA) was synthesized using the SuperScriptÒ VILO™ cDNA Synthesis Kit (Life Technologies). An 828 bp fragment containing a portion of the HCV polymerase (Non-Structural-5B; NS5B) and a 1514 bp fragment containing the HCV Core, Envelope-1, hypervariable region-1 (HVR1) and the beginning of Envelope-2 (E2) were amplified using Velocity DNA polymerase (Bioline, London, UK). Amplicons were purified using Agencourt Ampure XP beads (Beckman-Coulter, Mississauga, Ontario) and were Sanger sequenced using the BigDye Terminator Cycle Sequencing kit (Life Technologies). HCV RNA extraction, PCR, Sanger sequencing and thermal cycling conditions are described in detail in the Supplementary information. 145 2.3. Sequence analysis 161 Sequences were assembled and edited using Geneious v.6.1.7 (Biomatters, Aukland, New Zealand; http://www.geneious.com/). The NS5B sequences were trimmed to 650 bp. The Core-E2 sequences were trimmed to 920 bp and are referred to hereafter as Core-HVR1. Samples that had a partial but not a full CoreHVR1 sequence available were trimmed to 100 bp and contained the HVR1 region. Reference sequences obtained from the Los Alamos HCV sequence database (hcv.lanl.gov; Kuiken et al., 2005) (see Supplementary information) were included in the analysis. Sequences were aligned using MUSCLE (Edgar, 2004) and alignments were edited in Geneious. The program jModeltest-2.1.4 (Darriba et al., 2012) was used to determine the nucleotide substitution model that best described the aligned sequences, which was determined to be the General Time Reversible model with gamma distributed rate variation among sites and a proportion of invariable sites. Maximum likelihood phylogenetic trees were generated using FastTree2 (Price et al., 2010). Path-o-gen v.1.4 was used to estimate the best fitting root for the phylogenetic trees, which were visualized and annotated using FigTree v.1.4.0 (http://tree. bio.ed.ac.uk/software/figtree/). 162 2.4. Cluster analysis 182 Phylogenies included sequences from seroconverters who had sequential HCV positive samples collected <1 year apart. The patristic distances (i.e., genetic distance between two sequences in a phylogenetic tree measured in nucleotide substitutions per site) between these sequential samples represents HCV sequence evolution expected to occur within one year. Geneious software was used to determine the patristic distances between each pair of sequences in a phylogenetic tree, which include the pairwise intra-person (two sequences from one individual) and the within-genotype inter-person (two sequences from two different individuals) distances. The intra-person distances were used to assign cutoffs that were applied to phylogenies using a custom python script (Poon et al., 2014). Sequences from different individuals with patristic distances falling below these cutoffs were interpreted as transmission clusters. The lower cutoff was set close to the maximum 183 Please cite this article in press as: Olmstead, A.D., et al. A molecular phylogenetics-based approach for identifying recent hepatitis C virus transmission events. Infect. Genet. Evol. (2015), http://dx.doi.org/10.1016/j.meegid.2015.04.017 140 141 142 143 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 MEEGID 2321 No. of Pages 9, Model 5G 25 April 2015 A.D. Olmstead et al. / Infection, Genetics and Evolution xxx (2015) xxx–xxx 223 observed intra-patient patristic distance (0.018 for NS5B, 0.03 for Core-HVR1 and 0.11 for HVR1). A second, higher cutoff was also assigned that excludes clusters between BC sequences and the Los Alamos reference sequences (0.020 for NS5B, 0.06 for CoreHVR1 and 0.15 for HVR1). A sensitivity analysis was performed using a third cutoff (0.022 for NS5B, 0.09 for Core-HVR1 and 0.19 for HVR1). Clusters within phylogenetic trees were manually examined; sequences from clusters that did not group together with branch support >80% were split into smaller phylogenetically linked clusters or excluded from the analysis. Clusters were visualized using Graphviz v.2.36 (Gansner and North, 1999). To provide additional validation for the selected patristic distance cutoffs, six NS5B reference sequences were obtained from Genbank (Supplementary material) representing three pairs of confirmed transmission events (2 individuals/transmission) (Nakayama et al., 2005; Toda et al., 2009). The sampling intervals between each of the two individuals in the reported transmission pairs were between 2 weeks and 3 months. At least one individual from each transmission pair was diagnosed with acute HCV at the time of sampling i.e., transmission occurred approximately 6 months or less prior to sample collection. A phylogenetic and cluster analysis using these additional sequences (along with the BC sequences and Los Alamos reference sequences) was performed to determine the patristic distances between individuals in transmission pairs. 224 2.5. Statistical analysis 225 238 Logistic regression was used to identify factors hypothesized to be associated with recent infection, including age, sex, HCV genotype, seroconversion status, HIV status and geographic location (Jacka et al., 2014; Miller et al., 2002; Oliveira et al., 2009; SacksDavis et al., 2012). As 84% of sequences were genotypes 1a and 3a, the remaining genotypes were grouped together for analysis. Age was dichotomized to <40 vs. P40 years since unadjusted analysis demonstrated that individuals <30, 30–34 and 35–39 years were more likely to be in clusters than those >61 years (data not shown). In multivariate analyses, factors that were significant at p < 0.20 in the unadjusted analysis were considered as potential independent variables associated with clustering. For individuals with sequential samples, the first sample per individual was used. Analyses were performed in R, version 3.03 (R Core Team, 2013). 239 3. Results 240 3.1. HCV sequencing 241 In 2011, of 3356 individuals that tested anti-HCV positive in BC, 51% (n = 1715) were first-time anti-HCV positive (unknown infection duration), 38% (n = 1267) seroconverted between 1993 and 2010 (estimated infection duration >1 year), and 11% (n = 374) seroconverted in 2011 (132 had an estimated infection duration <1 year and 242 had an estimated infection duration >1 year). For this study, 1546 of the 3356 anti-HCV positive samples were selected for analysis, including: (1) 897 randomly selected samples from the 1715 first time positive individuals; (2) 222 randomly selected samples from the 1267 seroconverters diagnosed prior to 2011 (many of whom are likely chronically infected), and (3) all available samples from 351 of 374 individuals who seroconverted in 2011 (283 had one sample and 68 had two or three sequential samples; total samples = 427) (Fig. 1A). Of the 1546 samples selected, 1138 (74%) had detectable HCV RNA (>200 IU/ml), which yielded 646 sequences from 618 individuals (593 NS5B and 415 Core-HVR1) (Fig. 1A). For 52 additional samples, only a partial Core-HVR1 sequence was available; for these, the HVR1 sequence alone was examined and compared to 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 226 227 228 229 230 231 232 233 234 235 236 237 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 3 all available HVR1 sequences (n = 466). Of the 68 individuals with sequential samples, sequences were obtained from 25 individuals. There were 363 samples that yielded sequences for all three regions (Fig. 1B). HCV RNA viral load of >20,000 IU/ml and being a seroconverter were associated with successful sequencing (Supplementary Table S1). Study subject characteristics are summarized in Table 1. 260 3.2. HCV phylogenetics 267 Phylogenies for NS5B, Core-HVR1 and HVR1 were constructed (Supplementary Figs. S1–S3). As expected, HCV sequences grouped together by genotype in both the NS5B and Core-HVR1 phylogenies. When HVR1 was examined alone, the majority of sequences also grouped by genotype, however there were several sequences that did not group with the predicted genotype. This likely reflects the high heterogeneity of HVR1 and the fact that many mutations in this region are driven by host immune pressure (Dowd et al., 2009; Liu et al., 2010). Also apparent from the phylogenetic trees is that the genotype 1a sequences form two large and distinct phylogenetic clusters. 268 3.3. Identification of HCV transmission clusters 279 Within each phylogeny, the patristic distances between pairs of intra-person sequential samples were determined and used to set cutoffs for identifying clusters as described in the Methods. For NS5B, 593 sequences were obtained from 574 individuals; 17 individuals had sequential samples with a median (interquartile range) of 75 (21–161) days between sample collections. The interperson patristic distances ranged from 0 to 0.35 (median 0.10) (Fig. 2A). Note that the presence of the bimodal hump in the distribution presented in Fig. 2A reflects the presence of the two genotype 1a phylogenetic groups. The intra-person patristic distances ranged from 0 to 0.017 (Fig. 2A). Of note, three sequential samples from one individual from 2009 and 2010 (23–454 days apart) also had patristic distances between 0 and 0.017. Applying the lower (0.018) and higher (0.020) cutoffs to the phylogeny identified 55 individuals (10%) in 25 clusters and 73 individuals (13%) in 31 clusters, respectively (Fig. 3A and Table 2). To provide additional validation, an analysis of three confirmed transmission pairs using published NS5B sequences was performed (Nakayama et al., 2005; Toda et al., 2009). The three transmission pairs were found to cluster with patristic distances of 0.002, 0.006 and 0.01, respectively. For Core-HVR1, 415 sequences were obtained from 396 individuals; 18 individuals had sequential samples with a median (interquartile range) of 71.5 (22.5–122) days between sample collections. The inter-person patristic distances ranged from 0.004 to 0.55 (median 0.22) (Fig. 2B). The intra-person patristic distances ranged from 0.0003 to 0.026 (Fig. 2B). Applying the lower (0.03) and higher (0.06) cutoffs to the phylogeny identified 24 individuals (6%) in 12 clusters and 48 individuals (12%) in 23 clusters, respectively (Fig. 3B and Table 2). For HVR1, 466 sequences were obtained from 445 individuals; 20 individuals had sequential samples with a median (interquartile range) of 71.5 (21.5–117.8) days between sample collections. The inter-person patristic distances ranged from 0.01 to 2.91 (median 0.78) (Fig. 2C). The intra-person patristic distances ranged from 0 to 0.109 (Fig. 2C). Applying the lower (0.15) and higher (0.19) cutoffs to the phylogeny identified 19 individuals (4%) in 9 clusters and 31 individuals (7%) in 15 clusters respectively (Fig. 3C and Table 2). When the sequences for all three regions were combined, 63 (10%; lower cutoffs) and 92 (15%; higher cutoffs) unique individuals were identified as belonging to clusters. In all, 27 individuals in 13 clusters were identified within both the NS5B and HVR1 280 Please cite this article in press as: Olmstead, A.D., et al. A molecular phylogenetics-based approach for identifying recent hepatitis C virus transmission events. Infect. Genet. Evol. (2015), http://dx.doi.org/10.1016/j.meegid.2015.04.017 261 262 263 264 265 266 269 270 271 272 273 274 275 276 277 278 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 MEEGID 2321 No. of Pages 9, Model 5G 25 April 2015 4 A.D. Olmstead et al. / Infection, Genetics and Evolution xxx (2015) xxx–xxx B A Fig. 1. Sequences obtained for HCV NS5B, Core-HVR1 and HVR1 regions. (A) Number of samples extracted, number samples with detectable HCV (>200 IU/ml), and number of sequences from each HCV genomic region. (B) Distribution of sequences for each of the HCV genomic regions. Table 1 Characteristics of individuals with available HCV sequences (n = 618). a b c Characteristic No. of individuals Sex Female Male Unknown 202 415 1 Age Mean (SD) <40 year P40 year (baby boomers) 46.2 (12.5) 258 360 HCV genotype 1a 1b 2a 2b 3a Other 393 35 22 37 124 7 BC geographic region Interior Fraser Vancouver Coastal Vancouver Island Northern Unknown 114 207 140 82 60 15 Test group Recent seroconvertera Past seroconverterb First-time HCV positivec 52 203 363 HIV co-infection Yes No Unknown 16 436 166 Individuals with estimated infection duration <1 year. Individuals with estimated infection duration >1 year. Individuals with unknown infection duration. 325 or Core-HVR1 regions (Fig. 3). Of note, 29 first time positive individuals with unknown infection duration were identified within clusters, suggesting that they were involved in recent transmission events. 326 3.4. Factors associated with transmission clusters 327 Factors associated with clustering were examined using the NS5B region, as this region has the greatest amount of data 322 323 324 328 available. Using unadjusted analysis, factors associated with NS5B clusters identified using the higher cutoff (0.02) included age, HCV genotype 1a and 3a, and being a seroconverter (recent or past) (Supplementary Table S2). In adjusted analysis, age <40 years vs. P40 years (Adjusted Odds Ratio (AOR) 2.12, 95% CI 1.21–3.73), genotype 1a infection vs. other genotypes (AOR 6.60, 95% CI 1.98–41.0), and being a recent seroconverter vs. first time positive (AOR 3.13, 95% CI 1.29–7.36) or a past seroconverter vs. first time positive (AOR 2.19, 95% CI 1.22–3.97) were independently associated with clustering (Table 3 and Supplementary Table S2). Unadjusted analysis of NS5B clusters identified using the lower cutoff (0.018) yielded similar results to those identified using the higher cutoff (Table 3 and Supplementary Table S3). In the adjusted analysis, genotype 1a and seroconverter status were independently associated with clustering. Factors associated with clustering in the Core-HVR1 and HVR1 regions were also analyzed and the results are comparable to those obtained using NS5B; however, some minor differences were observed dependent on the region and cutoff examined (Table 3). In a sensitivity analysis, factors associated with clustering were similar to those identified using the higher or lower cutoffs (Table 3). 329 4. Discussion 351 This study demonstrates the feasibility of using sequencedbased phylogenetics to identify recent HCV transmission clusters, which could support surveillance of population-level transmission dynamics. A convenience sample of sequential HCV RNA positive specimens from individual seroconverters permitted measurement of the expected intra-person HCV sequence evolution within 1 year. The maximum intra-person patristic distances were used as cutoffs to identify recent transmission clusters, and a higher cutoff was also applied to account for greater inter-person sequence variation that results from a change in host environments and immune pressures following transmission (Bull et al., 2011; Kuntzen et al., 2007; Mcallister et al., 1998; Merani et al., 2010). Using the lower and higher cutoffs, 63–92 unique individuals, were identified within clusters. Twenty-nine individuals in clusters were first-time positive testers with unknown infection duration and their identification within clusters indicates their involvement in a recent transmission event. Individuals in clusters were more likely to have characteristics associated with incident infection in PWID (young age, 352 Please cite this article in press as: Olmstead, A.D., et al. A molecular phylogenetics-based approach for identifying recent hepatitis C virus transmission events. Infect. Genet. Evol. (2015), http://dx.doi.org/10.1016/j.meegid.2015.04.017 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 MEEGID 2321 No. of Pages 9, Model 5G 25 April 2015 5 Cutoffs 0.250 0.200 0.150 0.100 0.050 Core-HVR1 Cutoffs 0.50 0.40 0.30 0.20 0.10 0.06 0.03 Inter-person Intra-person HVR1 2.50 2.00 1.50 1.00 0.50 Inter-person Intra-person 0.00 0.0 0.2 0.4 0.6 0.8 1.0 C 0.018 0.020 0.000 Cutoffs 0.00 Scaled density 0.0 0.2 0.4 0.6 0.8 1.0 B NS5B Inter-person Intra-person 0.11 0.15 A 0.0 0.2 0.4 0.6 0.8 1.0 A.D. Olmstead et al. / Infection, Genetics and Evolution xxx (2015) xxx–xxx Patristic distance Fig. 2. Patristic distance frequency distributions. Frequency distribution of patristic distances between (A) 574 independent NS5B sequences obtained from different individuals and between 22 pairs of sequential sequences from 17 individuals (B) 396 Core-HVR1 sequences obtained from different individuals and between 20 pairs of sequential sequences from 18 individuals. (C) 445 independent HVR1 sequences obtained from different individuals and between 22 pairs of sequential sequences from 20 individuals. Blue = inter-person; red = intra-person. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.) 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 seroconversion, Genotype 1a and 3a) (Jacka et al., 2014; Miller et al., 2002; Oliveira et al., 2009; Sacks-Davis et al., 2012) and were also more likely to have infection duration <1 year, supporting the hypothesis that individuals in clusters were involved in recent transmission events. In BC, >80% of seroconverters report injection drug use as their primary HCV acquisition risk factor. Consistent with these findings, a recent study by Jacka et al. (2014) demonstrated that individuals in clusters from a BC PWID cohort were more likely to be <40 years and to be seroconverters. These findings complement BC surveillance data, which shows that acute HCV infections are infrequent in baby boomers (Kuo et al., 2015) and substantiates a lower onward transmission risk in this birth cohort. Three regions, the HVR1, Core-HVR1 and NS5B, were examined because of their different genetic evolutionary rates, which allow transmissions over different time scales to be examined. Evolution of HVR1 is not linear (Mcallister et al., 1998) and mutations accumulate rapidly (Dowd et al., 2009; Kuntzen et al., 2007; Liu et al., 2010) making this region useful for verifying person-toperson transmissions and for monitoring HCV genomic changes over short but not longer time periods (Escobar-Gutierrez et al., 2012; Weiner et al., 1993). Use of the highly conserved Core region combined with E1 and HVR1 increases the capacity to identify transmissions that may not be apparent with HVR1 alone, and as expected, a larger number of clusters were identified using CoreHVR1. The NS5B region evolves at an intermediate and more stable rate, which allows transmissions over longer time periods to be examined while still supporting identification of recent transmission events (Cavalheiro Nde et al., 2009; Hmaied et al., 2007). The NS5B amplicon used for this study was longer than that typically used (650 bp vs. 380 bp) (Jacka et al., 2013), which may have improved its ability to identify clusters. The largest number of clusters was identified in NS5B and while many sequences were unavailable for comparison in the other regions, several NS5B clusters were also observed in HVR1 and/or Core-HVR1. Identification of clusters in two regions provides stronger support for transmission events. Some NS5B clusters had highly divergent Core-HVR1 or HVR1 sequences, suggesting no recent transmissions occurred between these individuals. In other NS5B clusters, the patristic distances between HVR1 and/or CoreHVR1 sequence pairs were only slightly above the cutoffs, suggesting that the cutoff in highly variable regions could be relaxed further to capture additional individuals within a transmission network. No clusters were identified between BC samples and the Los Alamos reference sequences, which supports the stringency of our method. Additional validation was provided by the use of confirmed transmission pairs, which were found to cluster in the NS5B region using the assigned cutoffs. It is important to note that various factors can influence the rate of HCV evolution over time (i.e. acute vs. chronic HCV infection, co-infection with HIV and immune status (Jabara et al., 2014; Merani et al., 2010; Ramachandran et al., 2011; Ray et al., 2005)), which will influence the patristic distance cutoffs and the identification of recent transmission clusters. Future work will further assess the utility of using Please cite this article in press as: Olmstead, A.D., et al. A molecular phylogenetics-based approach for identifying recent hepatitis C virus transmission events. Infect. Genet. Evol. (2015), http://dx.doi.org/10.1016/j.meegid.2015.04.017 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 MEEGID 2321 No. of Pages 9, Model 5G 25 April 2015 6 A.D. Olmstead et al. / Infection, Genetics and Evolution xxx (2015) xxx–xxx A NS5B Clusters <0.018 NS5B Clusters <0.02 B Core-HVR1 Clusters <0.03 Core-HVR1 Clusters <0.06 C HVR1 Clusters <0.11 HVR1 Clusters <0.15 Common cluster Genotype 1a Genotype 3a Genotype 2b Fig. 3. Transmission clusters. (A) Inter-person (between different individuals) NS5B sequence clusters with patristic distances <0.018 (left) or <0.02 (right). (B) Inter-person Core-HVR1 sequence clusters (between different individuals) with patristic distances <0.03 (left) or <0.06 (right). (C) Inter-person Core-HVR1 sequence clusters (between different individuals) with patristic distances <0.11 (left) or <0.15 (right). Nodes represent individual sequences and the connector length is proportional to the patristic distance between sequences. Common clusters (bold outline) are those that occur in both NS5B and Core-HVR1 or HVR1 alone. Table 2 Number of clusters and individuals in clusters with various patristic distance cutoffs. Cutoff Total n n in clusters % clustering Number of clusters 574 55 73 82 10 13 14 25 31 34 Core-HVR1 0.03 415 0.06 0.09b 24 48 88 6 12 21 12 23 38 HVR1 0.11 0.15 0.19b 19 31 45 4 7 10 9 15 20 NS5B 0.018 0.020 0.022b a b 425 426 427 428 429 430 431 432 433 434 435 436 437 438 a 445 n = Number of individuals. Sensitivity analysis. various regions and cutoffs for characterizing population level transmission dynamics. A limitation to this study is that the serum samples used were collected in 2011 for the purposes of antibody screening, and were not stored under optimal conditions. For real-time population studies, these protocols would be applied to fresh serum samples within a day of collection or to blood samples containing EDTA to increase HCV RNA integrity, which should improve sequence yield. A further limitation to this study was the use of the midpoint between the last negative and first positive test date to estimate infection duration in seroconverters. For HIV, seroconverters identified through routine public health screening were more likely to have been infected closer to the time of the first positive test (Skar et al., 2013). Thus, individuals with long gaps between the last negative and first positive tests and an estimated HCV infection duration >1 year may have in fact been infected more recently. This likely explains, in part, the association between clustering and being a past seroconverter with infection duration >1 year. On the other hand, unlike with HIV, it has not been demonstrated that acute/early infections disproportionately drive HCV transmission. Many HCV infected individuals go undiagnosed for years or even decades (Rein et al., 2012; Shah et al., 2013), also HCV viral loads can remain high and stable throughout the course of infection, and the majority of HCV infected individuals remain untreated. This suggests that chronically infected individuals may be a significant source of onward HCV transmission and thus it is unsurprising that past seroconverters are also linked with transmission clusters. The cluster analyses presented here cannot be directly extrapolated to create population incidence estimates given that this was a proof-of-principle study with a selection bias for seroconverters to maximize the identification of transmission clusters. In the future, these methods may be used, in combination with other strategies (i.e. tracking seroconverters and using diversity to estimate transmission timing (Montoya et al., 2015)) to obtain a comprehensive view of HCV incidence in BC. Every year >2000 individuals undergo HCV genotyping at the BC Centre for Disease Control and it would be feasible to apply routine sequencing and phylogenetics to these individuals given the declining cost of sequencing. Implementing this type of strategy could help to identify a subset of individuals with recent infection with no prior negative test available. It can also be used to identify clusters of individuals that are transmitting HCV and who may be targets for HCV prevention strategies. This study supports the application of sequencing, phylogenetics, and the use of patristic distance cutoffs to identify recent transmission clusters between HCV infected individuals with unknown Please cite this article in press as: Olmstead, A.D., et al. A molecular phylogenetics-based approach for identifying recent hepatitis C virus transmission events. Infect. Genet. Evol. (2015), http://dx.doi.org/10.1016/j.meegid.2015.04.017 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 MEEGID 2321 No. of Pages 9, Model 5G 25 April 2015 (0.43, 1.82) (2.27, 20.15) (1.55, 8.35) (0.78, 17.85) (0.58, 16.02) 0.89 6.74 3.50 2.79 2.37 1.16 6.58 4.83 1.34 2.12 1.53 (0.55, 4.38) 3.80 (0.80, 16.97) 2.53 (0.81, 8.76) N/Ab N/A (1.07, 3.18) (1.47, 7.81) (0.92, 3.06) (1.70, 20.81) (1.70, 23.65) Abbreviations: AOR: Adjusted Odds Ratio, CI: Confidence Interval, SD: Standard Deviation. Numbers in bold have a p-value < 0.05. a Sensitivity analysis. b N/A indicates the fitted probabilities numerically 0 or 1 occurred when the regression models included genotype. Characteristic Age Recent seroconverter Past seroconverter Genotype 1a Genotype 3a 1.85 3.72 2.34 4.29 3.79 (0.98, 3.50) (1.41, 9.47) (1.20, 4.62) (1.26, 26.9) (0.93, 25.6) 2.12 3.13 2.19 6.60 3.85 (1.21, 3.73) (1.29, 7.36) (1.22, 3.97) (1.98, 41.0) (0.95, 26.0) 2.28 2.25 1.60 8.11 4.28 (1.33, 3.93) (0.94, 5.16) (0.91, 2.81) (2.45, 50.2) (1.06, 28.8) 1.27 (0.51, 3.19) 9.77 (2.53, 42.33) 4.33 (1.41, 16.17) N/Ab N/A 1.98 6.54 3.64 5.69 6.07 (1.01, 3.95) (2.27, 19.77) (1.57, 9.24) (1.13, 103.69) (1.12, 113.01) 1.85 3.38 1.68 4.91 5.34 HVR1 0.09a 0.06 Core-HVR1 0.03 0.022a 0.02 0.018 NS5B Adjusted Odds Ratio (95% CI) Table 3 Multivariate regression analysis of factors associated with clustering in NS5B alone, Core-HVR1 alone, and HVR1 alone using multiple cutoffs. 0.11 0.15 (0.52, 2.63) (1.71, 26.79) (1.73, 15.79) (0.34, 8.85) (0.51, 14.56) 0.19a A.D. Olmstead et al. / Infection, Genetics and Evolution xxx (2015) xxx–xxx 7 infection histories. This technique, when applied at the population level and performed on an ongoing basis, may allow monitoring of HCV transmission networks and the changing dynamics of clusters over time, i.e. if new clusters arise from pre-existing clusters and if the number of clusters increases or decreases over time. This in turn could enable monitoring of transmission patterns in response to increased HCV screening and treatment uptake. It may also facilitate the tracking and evaluation of prevention interventions such as harm-reduction and treatment-as-prevention. 471 Funding 480 This work was supported by the Canadian Institutes for Health Research (CIHR) [Grant No. HES115697] and by the National Canadian Research Training Program in Hepatitis C (NCRTP-HepC) [ADO]. 481 Author contributions 485 ADO and MK designed this study with input from JJG, BJ, AFYP, JBJ, PRH, VM, YK and JM. ADO and VM performed all of the data analysis for this study with input from MK, JG, JBJ and AFYP. ADO, VM and IL performed all laboratory work for this study. The manuscript was written by ADO with assistance from MK and DC and with input from all contributing authors. 486 Potential conflicts of interest 492 MK has received grant funding via his institution from Roche Molecular Systems, Boehringer Ingelheim, Merck, Siemens Healthcare Diagnostics and Hologic Inc. JG is a consultant/advisor and has received research grants from Abbvie, Bristol Myers Squibb, Gilead and Merck. PRH has received grants from, served as an ad hoc advisor to, or spoken at various events sponsored by: Pfizer, Glaxo-SmithKline, Abbott, Merck, Tobira Therapeutics, Virco and Quest Diagnostics and served as a consultant for ViiV Health Care, Tobira Therapeutics, Selah Genomics Inc, and Quest Diagnostics. He holds stock in Merck, Illumina, Gilead and EKF Diagnostics. JM has received grants from Abbott, BoehringerIngelheim, Bristol-Myers Squibb, Gilead Sciences, Janssen, Merck and ViiV Healthcare. All other authors have no conflicts of interest to declare. 493 Acknowledgments 507 The authors thank Amanda Yu for performing data linkage and data extraction, and the BC Centre Disease Control laboratory staff for retrieving the samples used in this study. 508 Appendix A. Supplementary data 511 Supplementary data associated with this article can be found, in the online version, at http://dx.doi.org/10.1016/j.meegid.2015.04. 017. 512 References 515 Aldous, J.L., Pond, S.K., Poon, A., Jain, S., Qin, H., Kahn, J.S., Kitahata, M., Rodriguez, B., Dennis, A.M., Boswell, S.L., Haubrich, R., Smith, D.M., 2012. Characterizing HIV transmission networks across the United States. Clin. Infect. Dis. 55, 1135–1143. BC Centre for Disease Control, 2013. British Columbia Annual Summary of Reportable Diseases. Brenner, B.G., Roger, M., Routy, J.-P., Moisi, D., Ntemgwa, M., Matte, C., Baril, J.-G., Thomas, R., Rouleau, D., Bruneau, J., Leblanc, R., Legault, M., Tremblay, C., Charest, H., Wainberg, M.a., 2007. High rates of forward transmission events after acute/early HIV-1 infection. J. Infect. Dis. 195, 951–959. 516 517 518 519 520 521 522 523 524 Please cite this article in press as: Olmstead, A.D., et al. A molecular phylogenetics-based approach for identifying recent hepatitis C virus transmission events. Infect. Genet. Evol. (2015), http://dx.doi.org/10.1016/j.meegid.2015.04.017 472 473 474 475 476 477 478 479 482 483 484 487 488 489 490 491 494 495 496 497 498 499 500 501 502 503 504 505 506 509 510 513 514 MEEGID 2321 No. of Pages 9, Model 5G 25 April 2015 8 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 A.D. Olmstead et al. / Infection, Genetics and Evolution xxx (2015) xxx–xxx Bull, R.A., Luciani, F., McElroy, K., Gaudieri, S., Pham, S.T., Chopra, A., Cameron, B., Maher, L., Dore, G.J., White, P.A., Lloyd, A.R., 2011. Sequential bottlenecks drive viral evolution in early acute hepatitis C virus infection. PLoS Pathog. 7, e1002243. Cavalheiro Nde, P., De La Rosa, A., Elagin, S., Tengan, F.M., Araujo, E.S., Barone, A.A., 2009. Hepatitis C: sexual or intrafamilial transmission? Epidemiological and phylogenetic analysis of hepatitis C virus in 24 infected couples. Rev. Soc. Bras. Med. Trop. 42, 239–244. Danta, M., Brown, D., Bhagani, S., Pybus, O.G., Sabin, C.a., Nelson, M., Fisher, M., Johnson, A.M., Dusheiko, G.M., 2007. Recent epidemic of acute hepatitis C virus in HIV-positive men who have sex with men linked to high-risk sexual behaviours. AIDS 21, 983–991. Darriba, D., Taboada, G.L., Doallo, R., Posada, D., 2012. JModelTest 2: more models, new heuristics and parallel computing. Nat. Methods 9, 772. Dowd, K.A., Netski, D.M., Wang, X.-H., Cox, A.L., Ray, S.C., 2009. Selection pressure from neutralizing antibodies drives sequence evolution during acute infection with hepatitis C virus. Gastroenterology 136, 2377–2386. Edgar, R.C., 2004. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32, 1792–1797. Escobar-Gutierrez, A., Vazquez-Pichardo, M., Cruz-Rivera, M., Rivera-Osorio, P., Carpio-Pedroza, J.C., Ruiz-Pacheco, J.A., Ruiz-Tovar, K., Vaughan, G., 2012. Identification of Hepatitis C virus transmission using a next generation sequencing approach. J. Clin. Microbiol. 50, 1461–1463. Gansner, E.R., North, S.C., 1999. An open graph visualization system and its applications to software engineering. Software: Pract. Exp. 00, 1–5. Gower, E., Estes, C.C., Hindman, S., Razavi-Shearer, K., Razavi, H., 2014. Global epidemiology and genotype distribution of the hepatitis C virus. J. Hepatol. http://dx.doi.org/10.1002/hep.27259. Grabowski, M.K., Redd, A.D., 2014. Molecular tools for studying HIV transmission in sexual networks. Curr. Opin. HIV AIDS 9, 126–133. Hajarizadeh, B., Grebely, J., Dore, G.J., 2013. Epidemiology and natural history of HCV infection. Nat. Rev. Gastroenterol. Hepatol. 10, 553–562. Hmaied, F., Mamou, M., Dubois, M., Pasquier, C., Sandres-Saune, K., Rostaing, L., Slim, A., Arrouji, Z., Ben Redjeb, S., Izopet, J., 2007. Determining the source of nosocomial transmission in hemodialysis units in Tunisia by sequencing NS5B and E2 sequences of HCV. J. Med. Virol. 79, 1089–1094. Hue, S., Pillay, D., Clewley, J.P., Pybus, O.G., 2005. Genetic analysis reveals the complex structure of HIV-1 transmission within defined risk groups. Proc. Natl. Acad. Sci. U.S.A. 102, 4425–4429. Jabara, C.B., Hu, F., Mollan, K.R., Williford, S.E., Menezes, P., Yang, Y., Eron, J.J., Fried, M.W., Hudgens, M.G., Jones, C.D., Swanstrom, R., Lemon, S.M., 2014. Hepatitis C Virus (HCV) NS3 sequence diversity and antiviral resistance-associated variant frequency in HCV/HIV coinfection. Antimicrob. Agents Chemother. 58, 6079– 6092. Jacka, B., Applegate, T., Krajden, M., Olmstead, A.R.H., Marshall, B., DeBeck, K., Milloy, M.-J., Lamoury, F., Pybus, O., Lima, V., Magiorkinis, G., Montoya, V., Montaner, J., Joy, J., Woods, C., Dobrer, S., Dore, G., Poon, A., Grebely, J., 2014. Phylogenetic clustering of hepatitis C virus among people who inject drugs in Vancouver, Canada. Hepatology 60, 1571–1580. Jacka, B., Lamoury, F., Simmonds, P., Dore, G.J., Grebely, J., Applegate, T., 2013. Sequencing of the hepatitis C virus: a systematic review. PLoS One 8, e67073. Kaye, M., Chibo, D., Birch, C., 2008. Phylogenetic investigation of transmission pathways of drug-resistant HIV-1 utilizing pol sequences derived from resistance genotyping. J. Acquir. Immune Defic. Syndr. 49, 9–16. Kuiken, C., Yusim, K., Boykin, L., Richardson, R., 2005. The Los Alamos HCV sequence database. Bioinformatics 21, 379–384. Kuntzen, T., Timm, J., Berical, A., Lewis-Ximenez, L.L., Jones, A., Nolan, B., Schulze zur Wiesch, J., Li, B., Schneidewind, A., Kim, A.Y., Chung, R.T., Lauer, G.M., Allen, T.M., 2007. Viral sequence evolution in acute hepatitis C virus infection. J. Virol. 81, 11658–11668. Kuo, M., Janjua, N., Burchell, A., Buxton, J.A., Krajden, M., Gilbert, M., 2015. Decreasing hepatitis C incidence among a population of repeat anti-HCV testers, British Columbia, Canada, 1993–2011. Am. J. Public Health (in press). Lam, T.T., Hon, C.C., Tang, J.W., 2010. Use of phylogenetics in the molecular epidemiology and evolutionary studies of viral infections. Crit. Rev. Clin. Lab. Sci. 47, 5–49. Lanini, S., Abbate, I., Puro, V., Soscia, F., Albertoni, F., Battisti, W., Ruta, A., Capobianchi, M.R., Ippolito, G., 2010. Molecular epidemiology of a hepatitis C virus epidemic in a haemodialysis unit: outbreak investigation and infection outcome. BMC Infect. Dis. 10, 257. Lewis, F., Hughes, G.J., Rambaut, A., Pozniak, A., Leigh Brown, A.J., 2008. Episodic sexual transmission of HIV revealed by molecular phylodynamics. PLoS Med. 5, e50. Liu, L., Fisher, B.E., Dowd, K., Astemborski, J., Cox, A.L., Ray, S.C., 2010. Acceleration of hepatitis C virus envelope evolution in humans is consistent with progressive humoral immune selection during the transition from acute to chronic infection. J. Virol. 84, 5067–5077. Magiorkinis, G., Magiorkinis, E., Paraskevis, D., Ho, S.Y.W., Shapiro, B., Pybus, O.G., Allain, J.-P., Hatzakis, A., 2009. The global spread of hepatitis C virus 1a and 1b: a phylodynamic and phylogeographic analysis. PLoS Med. 6, e1000198. Martin, N.K., Vickerman, P., Grebely, J., Hellard, M., Hutchinson, S.J., Lima, V.D., Foster, G.R., Dillon, J.F., Goldberg, D.J., Dore, G.J., Hickman, M., 2013. Hepatitis C virus treatment for prevention among people who inject drugs: modeling treatment scale-up in the age of direct-acting antivirals. Hepatology 58, 1598– 1609. Mcallister, J., Casino, C., Davidson, F., Power, J., Lawlor, E., Yap, P.L., Simmonds, P., Donald, B., Allister, J.M.C., Yap, P.L.E.E., Smith, D.B., 1998. Long-term evolution of the hypervariable region of hepatitis C virus in a common-source-infected cohort long-term evolution of the hypervariable region of hepatitis C virus in a common-source-infected cohort. J. Virol. 72, 4893. Meng, S., Li, J., 2010. A novel duplex real-time reverse transcriptase-polymerase chain reaction assay for the detection of hepatitis C viral RNA with armored RNA as internal control. Virol. J. 7, 117. Merani, S., Petrovic, D., James, I., Chopra, A., Cooper, D., Freitas, E., Rauch, A., Iulio, J., John, M., Lucas, M., Fitzmaurice, K., Mckiernan, S., Norris, S., Kelleher, D., Klenerman, P., Gaudieri, S., 2010. Effect of immune pressure on hepatitis C virus evolution: insights from a single-source outbreak. Hepatology 53, 396–405. Miller, C.L., Johnston, C., Spittal, P.M., Li, K., Laliberté, N., Montaner, J.S.G., Schechter, M.T., 2002. Opportunities for prevention: hepatitis C prevalence and incidence in a cohort of young injection drug users. Hepatology 36, 737–742. Mohd Hanafiah, K., Groeger, J., Flaxman, A.D., Wiersma, S.T., 2013. Global epidemiology of hepatitis C virus infection: new estimates of age-specific antibody to HCV seroprevalence. Hepatology 57, 1333–1342. Montoya, V., Olmstead, A., Janjua, N.Z., Tang, P., Grebely, J., Cook, D., Harrigan, P.R., Krajden, M., 2015. Differentiation of acute from chronic hepatitis C virus infection by NS5B deep sequencing: a population-level tool for incidence estimation. Hepatology. http://dx.doi.org/10.1002/hep.27734. Nakayama, H., Sugai, Y., Ikeya, S., Inoue, J., Nishizawa, T., Okamoto, H., 2005. Molecular investigation of interspousal transmission of hepatitis C virus in two Japanese patients who acquired acute hepatitis C after 40 or 42 years of marriage. J. Med. Virol. 75, 258–266. Nelson, P.K., Mathers, B.M., Cowie, B., Hagan, H., Des Jarlais, D., Horyniak, D., Degenhardt, L., 2011. Global epidemiology of hepatitis B and hepatitis C in people who inject drugs: results of systematic reviews. Lancet 378, 571–583. Oliveira, T.de, Pybus, O.G., Rambaut, A., Salemi, M., Cassol, S., Ciccozzi, M., Rezza, G., Gattinara, G.C., D’Arrigo, R., Amicosante, M., Perrin, L., Colizzi, V., Perno, C.F.Benghazi Study Group, 2006. HIV-1 and HCV sequences from Libyan outbreak. Nature 444, 836–837. Oliveira, M.D.L.A., Bastos, F.I., Telles, P.R., Hacker, M.D.A., Oliveira, S.A.N.De., Miguel, J.C., Yoshida, C.F.T., 2009. Epidemiological and genetic analyses of Hepatitis C virus transmission among young/short- and long-term injecting drug users from Rio de Janeiro, Brazil. J. Clin. Virol. 44, 200–206. Poon, A., Joy, J., Woods, C., Shurgold, S., Colley, G., Brumme, C., Hogg, R., Montaner, J., Harrigan, P., 2014. The impact of clinical, demographic and risk factors on rates of HIV transmission. A population-based phylogenetic analysis in British Columbia, Canada. J. Infect. Dis. http://dx.doi.org/10.1093/infdis/jiu560. Price, M.N., Dehal, P.S., Arkin, A.P., 2010. FastTree 2 – approximately maximumlikelihood trees for large alignments. PLoS One 5, e9490. Pybus, O.G., Cochrane, A., Holmes, E.C., Simmonds, P., 2005. The hepatitis C virus epidemic among injecting drug users. Infect. Genet. Evol. 5, 131–139. Pybus, O.G., Drummond, a.J., Nakano, T., Robertson, B.H., Rambaut, a., 2003. The epidemiology and iatrogenic transmission of hepatitis C virus in Egypt: a Bayesian coalescent approach. Mol. Biol. Evol. 20, 381–387. Pybus, O.G., Rambaut, A., 2009. Evolutionary analysis of the dynamics of viral infectious disease. Nat. Rev. Genet. 10, 540–550. R Core Team, 2013. R: A Language and Environment for Statistical Computing. Vienna, Austria. Ramachandran, S., Campo, D.S., Dimitrova, Z.E., Xia, G.L., Purdy, M.A., Khudyakov, Y.E., 2011. Temporal variations in the hepatitis C virus intrahost population during chronic infection. J. Virol. 85, 6369–6380. Ray, S.C., Fanning, L., Wang, X.-H., Netski, D.M., Kenny-Walsh, E., Thomas, D.L., 2005. Divergent and convergent evolution after a common-source outbreak of hepatitis C virus. J. Exp. Med. 201, 1753–1759. Rein, D.B., Smith, B.D., Wittenborn, J.S., Lesesne, S.B., Wagner, L.D., Roblin, D.W., Patel, N., Ward, J.W., Weinbaum, C.M., 2012. The cost-effectiveness of birthcohort screening for hepatitis C antibody in U.S. primary care settings. Ann. Intern. Med. 156, 263–270. Sacks-Davis, R., Daraganova, G., Aitken, C., Higgs, P., Tracy, L., Bowden, S., Jenkinson, R., Rolls, D., Pattison, P., Robins, G., Grebely, J., Barry, A., Hellard, M., 2012. Hepatitis C virus phylogenetic clustering is associated with the social-injecting network in a cohort of people who inject drugs. PLoS One 7, e47335. Shah, H.a., Heathcote, J., Feld, J.J., 2013. A Canadian screening program for hepatitis C: is now the time? CMAJ 185, 1325–1328. Shemer-Avni, Y., Cohen, M., Keren-Naus, A., Sikuler, E., Hanuka, N., Yaari, A., Hayam, E., Bachmatov, L., Zemel, R., Tur-Kaspa, R., 2007. Iatrogenic transmission of hepatitis C virus (HCV) by an anesthesiologist: comparative molecular analysis of the HCV-E1 and HCV-E2 hypervariable regions. Clin. Infect. Dis. 45, e32–e38. Skar, H., Albert, J., Leitner, T., 2013. Towards estimation of HIV-1 date of infection: a time-continuous IgG-model shows that seroconversion does not occur at the midpoint between negative and positive tests. PLoS One 8, e60906. Smith, B.D., Morgan, R.L., Beckett, G.A., Falck-Ytter, Y., Holtzman, D., Teo, C.G., Jewett, A., Baack, B., Rein, D.B., Patel, N., Alter, M., Yartel, A., Ward, J.W., 2012. Recommendations for the identification of chronic hepatitis C virus infection among persons born during 1945–1965. MMWR Recomm. Rep. 61, 1–32. Tanaka, Y., Hanada, K., Mizokami, M., Yeo, A.E.T., Shih, J.W.-K., Gojobori, T., Alter, H.J., 2002. A comparison of the molecular clock of hepatitis C virus in the United States and Japan predicts that hepatocellular carcinoma incidence in the United States will increase over the next two decades. Proc. Natl. Acad. Sci. U.S.A. 99, 15584–15589. Please cite this article in press as: Olmstead, A.D., et al. A molecular phylogenetics-based approach for identifying recent hepatitis C virus transmission events. Infect. Genet. Evol. (2015), http://dx.doi.org/10.1016/j.meegid.2015.04.017 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 MEEGID 2321 No. of Pages 9, Model 5G 25 April 2015 A.D. Olmstead et al. / Infection, Genetics and Evolution xxx (2015) xxx–xxx 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 Toda, T., Mitsui, T., Tsukamoto, Y., Ebara, T., Hirose, A., Masuko, K., Nagashima, S., Takahashi, M., Okamoto, H., 2009. Molecular analysis of transmission of hepatitis C virus in a nurse who acquired acute hepatitis C after caring for a viremic patient with epistaxis 1370, 1363–1370. Urbanus, A.T., van de Laar, T.J., Stolte, I.G., Schinkel, J., Heijman, T., Coutinho, R.a., Prins, M., 2009. Hepatitis C virus infections among HIV-infected men who have sex with men: an expanding epidemic. AIDS 23, F1–F7. Van de Laar, T., Pybus, O., Bruisten, S., Brown, D., Nelson, M., Bhagani, S., Vogel, M., } tz, H., Matthews, G.V., Neifer, S., Baumgarten, A., Chaix, M.-L., Fisher, M., Go White, P., Rawlinson, W., Pol, S., Rockstroh, J., Coutinho, R., Dore, G.J., Dusheiko, G.M., Danta, M., 2009a. Evidence of a large, international network of HCV transmission in HIV-positive men who have sex with men. Gastroenterology 136, 1609–1617. Van de Laar, T., Pybus, O., Bruisten, S., Brown, D., Nelson, M., Bhagani, S., Vogel, M., Baumgarten, A., Chaix, M.L., Fisher, M., Gotz, H., Matthews, G.V., Neifer, S., White, P., Rawlinson, W., Pol, S., Rockstroh, J., Coutinho, R., Dore, G.J., Dusheiko, 9 G.M., Danta, M., 2009b. Evidence of a large, international network of HCV transmission in HIV-positive men who have sex with men. Gastroenterology 136, 1609–1617. Volz, E.M., Koelle, K., Bedford, T., 2013. Viral phylodynamics. PLoS Comput. Biol. 9, e1002947. Weiner, A.J., Thaler, M.M., Crawford, K., Ching, K., Kansopon, J., Chien, D.Y., Hall, J.E., Hu, F., Houghton, M., 1993. A unique, predominant hepatitis C virus variant found in an infant born to a mother with multiple variants. J. Virol. 67, 4365– 4368. Wiessing, L., Ferri, M., Grady, B., Kantzanou, M., Sperle, I., Cullen, K.J., Hatzakis, A., Prins, M., Vickerman, P., Lazarus, J.V., Hope, V.D., Matheï, C., 2014. Hepatitis C virus infection epidemiology among people who inject drugs in Europe: a systematic review of data for scaling up treatment and prevention. PLoS One 9, e103345. Yu, A., Spinelli, J.J., Cook, D.A., Buxton, J.A., Krajden, M., 2013. Mortality among British Columbians testing for hepatitis C antibody. BMC Public Health 13, 291. 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 Please cite this article in press as: Olmstead, A.D., et al. A molecular phylogenetics-based approach for identifying recent hepatitis C virus transmission events. Infect. Genet. Evol. (2015), http://dx.doi.org/10.1016/j.meegid.2015.04.017