MEEGID 2321
No. of Pages 9, Model 5G
25 April 2015
Infection, Genetics and Evolution xxx (2015) xxx–xxx
1
Contents lists available at ScienceDirect
Infection, Genetics and Evolution
journal homepage: www.elsevier.com/locate/meegid
5
6
3
4
7
8
9
10
11
12
13
14
15
16
1
3 8
4
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
A molecular phylogenetics-based approach for identifying recent
hepatitis C virus transmission events
Andrea D. Olmstead a,b,⇑, Jeffrey B. Joy c, Vincent Montoya a,b, Iris Luo b, Art F.Y. Poon c, Brendan Jacka d,
François Lamoury d, Tanya Applegate d, Julio Montaner b,c, Yury Khudyakov e, Jason Grebely d,
Darrel Cook a, P. Richard Harrigan c, Mel Krajden a,b
a
BC Centre for Disease Control, Vancouver, BC, Canada
University of British Columbia, Vancouver, BC, Canada
BC Centre for Excellence in HIV/AIDS, St Paul’s Hospital, Vancouver, BC, Canada
d
The Kirby Institute, UNSW Australia, Sydney, NSW, Australia
e
Centers for Disease Control and Prevention, Atlanta, Georgia, USA
b
c
a r t i c l e
i n f o
Article history:
Received 21 February 2015
Received in revised form 9 April 2015
Accepted 17 April 2015
Available online xxxx
Keywords:
Hepatitis C virus
Recent transmission clusters
British Columbia
Epidemiology
Sequencing
Phylogenetics
Surveillance
Seroconversion
a b s t r a c t
Improved surveillance methods are needed to better understand the current hepatitis C virus (HCV) disease burden and to monitor the impact of prevention and treatment interventions on HCV transmission
dynamics. Sanger sequencing (HCV NS5B, HVR1 and Core-E1-HVR1) and phylogenetics were applied to
samples from individuals diagnosed with HCV in British Columbia, Canada in 2011. This included individuals with two or three sequential samples collected <1 year apart. Patristic distances between sequential
samples were used to set cutoffs to identify recent transmission clusters. Factors associated with transmission clustering were analyzed using logistic regression. From 618 individuals, 646 sequences were
obtained. Depending on the cutoff used, 63 (10%) to 92 (15%) unique individuals were identified within
transmission clusters of predicted recent origin. Clustered individuals were more likely to be <40 years
old (Adjusted Odds Ratio (AOR) 2.12, 95% CI 1.21–3.73), infected with genotype 1a (AOR 6.60, 95% CI
1.98–41.0), and to be seroconverters with estimated infection duration of <1 year (AOR 3.13, 95% CI
1.29–7.36) or >1 year (AOR 2.19, 95% CI 1.22–3.97).
Conclusion: Systematic application of molecular phylogenetics may be used to enhance traditional
surveillance methods through identification of recent transmission clusters.
Ó 2015 Published by Elsevier B.V.
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
1. Introduction
55
Hepatitis C virus (HCV) has infected between 115 and 185 million individuals globally (Gower et al., 2014; Mohd Hanafiah et al.,
2013). As of December 2013, 73,500 (1.6%) individuals in British
Columbia (BC) were diagnosed as anti-HCV positive (BC Centre
for Disease Control, 2013). About two-thirds of anti-HCV positive
56
57
58
59
Abbreviations: HCV, hepatitis C virus; PWID, people who inject drugs; BC, British
Columbia; NS5B, non-structural protein-5B; E1, Envelope-1; HVR1, hypervariable
region-1; cDNA, complementary DNA; RT-PCR, reverse transcriptase polymerase
chain reaction; E2, Envelope-2; IU, international units; AOR, Adjusted Odds Ratio;
CI, Confidence Interval; SD, Standard Deviation; OR, Odds Ratio; MSM, men who
have sex with men.
⇑ Corresponding author at: The BC Centre for Disease Control/Department of
Pathology, University of British Columbia, 2155 – 655 W 12th Ave, Vancouver, BC
V5Z 4R4, Canada.
E-mail address: andrea.olmstead@bccdc.ca (A.D. Olmstead).
individuals in BC are baby boomers born between 1945 and
1965. Most were infected decades ago and are at risk of developing
progressive liver disease as they age. HCV-infected British
Columbians have about a 5-fold increased risk of all-cause and
20-fold increased risk of liver-related mortality (Yu et al., 2013).
HCV continues to be transmitted by people who inject drugs
(PWID), who account for >80% of new infections in many countries
(Hajarizadeh et al., 2013; Nelson et al., 2011). Reducing the population disease burden will require enhanced screening, engagement into care and treatment as well as the potential use of
treatment-as-prevention to reduce transmission among PWID
(Martin et al., 2013; Smith et al., 2012; Wiessing et al., 2014).
Informing effective public health interventions requires a clear
understanding of the past and ongoing dynamics of viral epidemics. Phylogenetic trees constructed from viral sequences allow
the historical relationships between infected individuals in a population to be modeled (Lam et al., 2010; Pybus and Rambaut, 2009;
http://dx.doi.org/10.1016/j.meegid.2015.04.017
1567-1348/Ó 2015 Published by Elsevier B.V.
Please cite this article in press as: Olmstead, A.D., et al. A molecular phylogenetics-based approach for identifying recent hepatitis C virus transmission
events. Infect. Genet. Evol. (2015), http://dx.doi.org/10.1016/j.meegid.2015.04.017
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
MEEGID 2321
No. of Pages 9, Model 5G
25 April 2015
2
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
A.D. Olmstead et al. / Infection, Genetics and Evolution xxx (2015) xxx–xxx
Volz et al., 2013). Distinct transmission clusters, i.e., groups of closely related infections, can be identified from phylogenies along
with spatial, epidemiological and temporal information linked to
their expansion (Grabowski and Redd, 2014; Hue et al., 2005;
Lewis et al., 2008; van de Laar et al., 2009a,b). The ability to quantify mutation rates enables recent transmission clusters to be identified and characteristics associated with high rates of transmission
to be inferred (Aldous et al., 2012; Poon et al., 2014).
Phylogenetic cluster analyses are widespread in HIV research
and have revealed that early/acute HIV infections play a major role
in driving the epidemic and also that drug-resistant HIV variants
can be transmitted and establish infection (Brenner et al., 2007;
Kaye et al., 2008). HCV phylogenetic analyses have modeled the
expansion of HCV throughout the 20th century in various global
regions and have identified factors that likely contributed to the
worldwide epidemic (Magiorkinis et al., 2009; Pybus et al., 2003,
2005; Tanaka et al., 2002). HCV phylogenetic cluster analyses have
been used to confirm transmission events originating in health
care settings and to investigate HCV clustering among HIV-infected
individuals, men who have sex with men (MSM) and in PWID
(Danta et al., 2007; Lanini et al., 2010; Oliveira et al., 2006;
Sacks-Davis et al., 2012; Shemer-Avni et al., 2007; Urbanus et al.,
2009; van de Laar et al., 2009a,b). The criteria for defining transmission clusters differs for both HIV and HCV-based phylogenetics
but when strict cutoffs are imposed, clusters that represent more
recent transmission events can be identified. Tracking recent transmission clusters over time at a population level may be used to
support HCV incidence estimations and to identify sub-populations at high risk of HCV transmission.
BC’s centralized laboratory testing and consistent use of demographic identifiers enables identification of a subset of recent HCV
infections by linking new anti-HCV positive to previous anti-HCV
negative test results. However, most HCV-infected individuals
have no prior testing history, precluding estimation of when transmission occurred. The aim of this study was to develop a method
for identifying recent transmission clusters in a population. The
HCV NS5B, core, E1 and HVR-1 regions were sequenced in a proportion of individuals who tested anti-HCV positive in BC in
2011. The genetic distances between sequential specimens from
the same individuals collected less than one year apart were used
to assign cutoffs to identify recent phylogenetic transmission
clusters. This study provides a proof-of-principle for applying
molecular phylogenetics to support traditional surveillance
methods through identification of recent transmission clusters in
a population.
122
2. Methods
123
2.1. Study population and design
124
The BC Public Health Microbiology and Reference Laboratory
performs approximately 95% of HCV serological screening in BC
(Yu et al., 2013). Automated linkage of anti-HCV positive tests with
past results enables identification of seroconverters i.e., individuals
who previously tested anti-HCV negative and are anti-HCV positive
on subsequent testing. For seroconverters the estimated infection
duration was defined as the time between the specimen collection
date and the midpoint between an individual’s last negative and
first positive anti-HCV test.
Individuals who tested anti-HCV positive in BC in 2011 were
eligible for assessment. Individuals were classified as: (1) first time
positives: individuals who tested positive for the first time in 2011
and had no previous testing history; thus, their infection duration
was unknown; (2) past seroconverters: individuals who seroconverted prior to 2011 but tested positive again in 2011 and thus
125
126
127
128
129
130
131
132
133
134
135
136
137
138
had an estimated infection duration of >1 year; and (3) recent seroconverters: individuals who tested positive for the first time in
2011 and had an estimated infection duration of <1 year. The
University of British Columbia Clinical Research Ethics Board
approved this study.
139
2.2. HCV RNA testing and sequencing
144
HCV RNA was extracted from anti-HCV positive serum using the
MagMAX™-96 Viral RNA Isolation kit (Life Technologies, Carlsbad,
CA, USA). HCV viral loads were quantified using a semi-quantitative HCV RT-PCR assay (Meng and Li, 2010). Complementary DNA
(cDNA) was synthesized using the SuperScriptÒ VILO™ cDNA
Synthesis Kit (Life Technologies). An 828 bp fragment containing
a portion of the HCV polymerase (Non-Structural-5B; NS5B) and
a 1514 bp fragment containing the HCV Core, Envelope-1, hypervariable region-1 (HVR1) and the beginning of Envelope-2 (E2)
were amplified using Velocity DNA polymerase (Bioline, London,
UK). Amplicons were purified using Agencourt Ampure XP beads
(Beckman-Coulter, Mississauga, Ontario) and were Sanger
sequenced using the BigDye Terminator Cycle Sequencing kit
(Life Technologies). HCV RNA extraction, PCR, Sanger sequencing
and thermal cycling conditions are described in detail in the
Supplementary information.
145
2.3. Sequence analysis
161
Sequences were assembled and edited using Geneious v.6.1.7
(Biomatters, Aukland, New Zealand; http://www.geneious.com/).
The NS5B sequences were trimmed to 650 bp. The Core-E2
sequences were trimmed to 920 bp and are referred to hereafter
as Core-HVR1. Samples that had a partial but not a full CoreHVR1 sequence available were trimmed to 100 bp and contained
the HVR1 region. Reference sequences obtained from the Los
Alamos HCV sequence database (hcv.lanl.gov; Kuiken et al., 2005)
(see Supplementary information) were included in the analysis.
Sequences were aligned using MUSCLE (Edgar, 2004) and alignments were edited in Geneious. The program jModeltest-2.1.4
(Darriba et al., 2012) was used to determine the nucleotide substitution model that best described the aligned sequences, which was
determined to be the General Time Reversible model with gamma
distributed rate variation among sites and a proportion of invariable sites. Maximum likelihood phylogenetic trees were generated
using FastTree2 (Price et al., 2010). Path-o-gen v.1.4 was used to
estimate the best fitting root for the phylogenetic trees, which
were visualized and annotated using FigTree v.1.4.0 (http://tree.
bio.ed.ac.uk/software/figtree/).
162
2.4. Cluster analysis
182
Phylogenies included sequences from seroconverters who had
sequential HCV positive samples collected <1 year apart. The
patristic distances (i.e., genetic distance between two sequences
in a phylogenetic tree measured in nucleotide substitutions per
site) between these sequential samples represents HCV sequence
evolution expected to occur within one year. Geneious software
was used to determine the patristic distances between each pair
of sequences in a phylogenetic tree, which include the pairwise
intra-person (two sequences from one individual) and the
within-genotype inter-person (two sequences from two different
individuals) distances.
The intra-person distances were used to assign cutoffs that
were applied to phylogenies using a custom python script (Poon
et al., 2014). Sequences from different individuals with patristic
distances falling below these cutoffs were interpreted as transmission clusters. The lower cutoff was set close to the maximum
183
Please cite this article in press as: Olmstead, A.D., et al. A molecular phylogenetics-based approach for identifying recent hepatitis C virus transmission
events. Infect. Genet. Evol. (2015), http://dx.doi.org/10.1016/j.meegid.2015.04.017
140
141
142
143
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
MEEGID 2321
No. of Pages 9, Model 5G
25 April 2015
A.D. Olmstead et al. / Infection, Genetics and Evolution xxx (2015) xxx–xxx
223
observed intra-patient patristic distance (0.018 for NS5B, 0.03 for
Core-HVR1 and 0.11 for HVR1). A second, higher cutoff was also
assigned that excludes clusters between BC sequences and the
Los Alamos reference sequences (0.020 for NS5B, 0.06 for CoreHVR1 and 0.15 for HVR1). A sensitivity analysis was performed
using a third cutoff (0.022 for NS5B, 0.09 for Core-HVR1 and 0.19
for HVR1). Clusters within phylogenetic trees were manually
examined; sequences from clusters that did not group together
with branch support >80% were split into smaller phylogenetically
linked clusters or excluded from the analysis. Clusters were visualized using Graphviz v.2.36 (Gansner and North, 1999).
To provide additional validation for the selected patristic
distance cutoffs, six NS5B reference sequences were obtained from
Genbank (Supplementary material) representing three pairs of
confirmed transmission events (2 individuals/transmission)
(Nakayama et al., 2005; Toda et al., 2009). The sampling intervals
between each of the two individuals in the reported transmission
pairs were between 2 weeks and 3 months. At least one individual
from each transmission pair was diagnosed with acute HCV at the
time of sampling i.e., transmission occurred approximately
6 months or less prior to sample collection. A phylogenetic and
cluster analysis using these additional sequences (along with the
BC sequences and Los Alamos reference sequences) was performed
to determine the patristic distances between individuals in transmission pairs.
224
2.5. Statistical analysis
225
238
Logistic regression was used to identify factors hypothesized to
be associated with recent infection, including age, sex, HCV genotype, seroconversion status, HIV status and geographic location
(Jacka et al., 2014; Miller et al., 2002; Oliveira et al., 2009; SacksDavis et al., 2012). As 84% of sequences were genotypes 1a and
3a, the remaining genotypes were grouped together for analysis.
Age was dichotomized to <40 vs. P40 years since unadjusted analysis demonstrated that individuals <30, 30–34 and 35–39 years
were more likely to be in clusters than those >61 years (data not
shown). In multivariate analyses, factors that were significant at
p < 0.20 in the unadjusted analysis were considered as potential
independent variables associated with clustering. For individuals
with sequential samples, the first sample per individual was used.
Analyses were performed in R, version 3.03 (R Core Team, 2013).
239
3. Results
240
3.1. HCV sequencing
241
In 2011, of 3356 individuals that tested anti-HCV positive in BC,
51% (n = 1715) were first-time anti-HCV positive (unknown infection duration), 38% (n = 1267) seroconverted between 1993 and
2010 (estimated infection duration >1 year), and 11% (n = 374)
seroconverted in 2011 (132 had an estimated infection duration
<1 year and 242 had an estimated infection duration >1 year).
For this study, 1546 of the 3356 anti-HCV positive samples were
selected for analysis, including: (1) 897 randomly selected samples
from the 1715 first time positive individuals; (2) 222 randomly
selected samples from the 1267 seroconverters diagnosed prior
to 2011 (many of whom are likely chronically infected), and
(3) all available samples from 351 of 374 individuals who seroconverted in 2011 (283 had one sample and 68 had two or three
sequential samples; total samples = 427) (Fig. 1A).
Of the 1546 samples selected, 1138 (74%) had detectable HCV
RNA (>200 IU/ml), which yielded 646 sequences from 618 individuals (593 NS5B and 415 Core-HVR1) (Fig. 1A). For 52 additional
samples, only a partial Core-HVR1 sequence was available; for
these, the HVR1 sequence alone was examined and compared to
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
226
227
228
229
230
231
232
233
234
235
236
237
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
3
all available HVR1 sequences (n = 466). Of the 68 individuals with
sequential samples, sequences were obtained from 25 individuals.
There were 363 samples that yielded sequences for all three
regions (Fig. 1B). HCV RNA viral load of >20,000 IU/ml and being
a seroconverter were associated with successful sequencing
(Supplementary Table S1). Study subject characteristics are summarized in Table 1.
260
3.2. HCV phylogenetics
267
Phylogenies for NS5B, Core-HVR1 and HVR1 were constructed
(Supplementary Figs. S1–S3). As expected, HCV sequences grouped
together by genotype in both the NS5B and Core-HVR1 phylogenies. When HVR1 was examined alone, the majority of sequences
also grouped by genotype, however there were several sequences
that did not group with the predicted genotype. This likely reflects
the high heterogeneity of HVR1 and the fact that many mutations
in this region are driven by host immune pressure (Dowd et al.,
2009; Liu et al., 2010). Also apparent from the phylogenetic trees
is that the genotype 1a sequences form two large and distinct
phylogenetic clusters.
268
3.3. Identification of HCV transmission clusters
279
Within each phylogeny, the patristic distances between pairs of
intra-person sequential samples were determined and used to set
cutoffs for identifying clusters as described in the Methods.
For NS5B, 593 sequences were obtained from 574 individuals;
17 individuals had sequential samples with a median (interquartile
range) of 75 (21–161) days between sample collections. The interperson patristic distances ranged from 0 to 0.35 (median 0.10)
(Fig. 2A). Note that the presence of the bimodal hump in the distribution presented in Fig. 2A reflects the presence of the two genotype 1a phylogenetic groups. The intra-person patristic distances
ranged from 0 to 0.017 (Fig. 2A). Of note, three sequential samples
from one individual from 2009 and 2010 (23–454 days apart) also
had patristic distances between 0 and 0.017. Applying the lower
(0.018) and higher (0.020) cutoffs to the phylogeny identified
55 individuals (10%) in 25 clusters and 73 individuals (13%) in
31 clusters, respectively (Fig. 3A and Table 2).
To provide additional validation, an analysis of three confirmed
transmission pairs using published NS5B sequences was performed
(Nakayama et al., 2005; Toda et al., 2009). The three transmission
pairs were found to cluster with patristic distances of 0.002, 0.006
and 0.01, respectively.
For Core-HVR1, 415 sequences were obtained from 396 individuals; 18 individuals had sequential samples with a median
(interquartile range) of 71.5 (22.5–122) days between sample collections. The inter-person patristic distances ranged from 0.004 to
0.55 (median 0.22) (Fig. 2B). The intra-person patristic distances
ranged from 0.0003 to 0.026 (Fig. 2B). Applying the lower (0.03)
and higher (0.06) cutoffs to the phylogeny identified 24 individuals
(6%) in 12 clusters and 48 individuals (12%) in 23 clusters, respectively (Fig. 3B and Table 2).
For HVR1, 466 sequences were obtained from 445 individuals;
20 individuals had sequential samples with a median (interquartile
range) of 71.5 (21.5–117.8) days between sample collections. The
inter-person patristic distances ranged from 0.01 to 2.91 (median
0.78) (Fig. 2C). The intra-person patristic distances ranged from
0 to 0.109 (Fig. 2C). Applying the lower (0.15) and higher (0.19) cutoffs to the phylogeny identified 19 individuals (4%) in 9 clusters and
31 individuals (7%) in 15 clusters respectively (Fig. 3C and Table 2).
When the sequences for all three regions were combined,
63 (10%; lower cutoffs) and 92 (15%; higher cutoffs) unique individuals were identified as belonging to clusters. In all, 27 individuals in 13 clusters were identified within both the NS5B and HVR1
280
Please cite this article in press as: Olmstead, A.D., et al. A molecular phylogenetics-based approach for identifying recent hepatitis C virus transmission
events. Infect. Genet. Evol. (2015), http://dx.doi.org/10.1016/j.meegid.2015.04.017
261
262
263
264
265
266
269
270
271
272
273
274
275
276
277
278
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
MEEGID 2321
No. of Pages 9, Model 5G
25 April 2015
4
A.D. Olmstead et al. / Infection, Genetics and Evolution xxx (2015) xxx–xxx
B
A
Fig. 1. Sequences obtained for HCV NS5B, Core-HVR1 and HVR1 regions. (A) Number of samples extracted, number samples with detectable HCV (>200 IU/ml), and number of
sequences from each HCV genomic region. (B) Distribution of sequences for each of the HCV genomic regions.
Table 1
Characteristics of individuals with available HCV sequences (n = 618).
a
b
c
Characteristic
No. of individuals
Sex
Female
Male
Unknown
202
415
1
Age
Mean (SD)
<40 year
P40 year (baby boomers)
46.2 (12.5)
258
360
HCV genotype
1a
1b
2a
2b
3a
Other
393
35
22
37
124
7
BC geographic region
Interior
Fraser
Vancouver Coastal
Vancouver Island
Northern
Unknown
114
207
140
82
60
15
Test group
Recent seroconvertera
Past seroconverterb
First-time HCV positivec
52
203
363
HIV co-infection
Yes
No
Unknown
16
436
166
Individuals with estimated infection duration <1 year.
Individuals with estimated infection duration >1 year.
Individuals with unknown infection duration.
325
or Core-HVR1 regions (Fig. 3). Of note, 29 first time positive individuals with unknown infection duration were identified within
clusters, suggesting that they were involved in recent transmission
events.
326
3.4. Factors associated with transmission clusters
327
Factors associated with clustering were examined using the
NS5B region, as this region has the greatest amount of data
322
323
324
328
available. Using unadjusted analysis, factors associated with
NS5B clusters identified using the higher cutoff (0.02) included
age, HCV genotype 1a and 3a, and being a seroconverter (recent
or past) (Supplementary Table S2). In adjusted analysis, age
<40 years vs. P40 years (Adjusted Odds Ratio (AOR) 2.12, 95% CI
1.21–3.73), genotype 1a infection vs. other genotypes (AOR 6.60,
95% CI 1.98–41.0), and being a recent seroconverter vs. first time
positive (AOR 3.13, 95% CI 1.29–7.36) or a past seroconverter vs.
first time positive (AOR 2.19, 95% CI 1.22–3.97) were independently associated with clustering (Table 3 and Supplementary
Table S2). Unadjusted analysis of NS5B clusters identified using
the lower cutoff (0.018) yielded similar results to those identified
using the higher cutoff (Table 3 and Supplementary Table S3). In
the adjusted analysis, genotype 1a and seroconverter status were
independently associated with clustering.
Factors associated with clustering in the Core-HVR1 and HVR1
regions were also analyzed and the results are comparable to those
obtained using NS5B; however, some minor differences were
observed dependent on the region and cutoff examined (Table 3).
In a sensitivity analysis, factors associated with clustering were
similar to those identified using the higher or lower cutoffs
(Table 3).
329
4. Discussion
351
This study demonstrates the feasibility of using sequencedbased phylogenetics to identify recent HCV transmission clusters,
which could support surveillance of population-level transmission
dynamics. A convenience sample of sequential HCV RNA positive
specimens from individual seroconverters permitted measurement
of the expected intra-person HCV sequence evolution within
1 year. The maximum intra-person patristic distances were used
as cutoffs to identify recent transmission clusters, and a higher cutoff was also applied to account for greater inter-person sequence
variation that results from a change in host environments and
immune pressures following transmission (Bull et al., 2011;
Kuntzen et al., 2007; Mcallister et al., 1998; Merani et al., 2010).
Using the lower and higher cutoffs, 63–92 unique individuals, were
identified within clusters. Twenty-nine individuals in clusters were
first-time positive testers with unknown infection duration and
their identification within clusters indicates their involvement in
a recent transmission event.
Individuals in clusters were more likely to have characteristics
associated with incident infection in PWID (young age,
352
Please cite this article in press as: Olmstead, A.D., et al. A molecular phylogenetics-based approach for identifying recent hepatitis C virus transmission
events. Infect. Genet. Evol. (2015), http://dx.doi.org/10.1016/j.meegid.2015.04.017
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
MEEGID 2321
No. of Pages 9, Model 5G
25 April 2015
5
Cutoffs
0.250
0.200
0.150
0.100
0.050
Core-HVR1
Cutoffs
0.50
0.40
0.30
0.20
0.10
0.06
0.03
Inter-person
Intra-person
HVR1
2.50
2.00
1.50
1.00
0.50
Inter-person
Intra-person
0.00
0.0 0.2 0.4 0.6 0.8 1.0
C
0.018
0.020
0.000
Cutoffs
0.00
Scaled density
0.0 0.2 0.4 0.6 0.8 1.0
B
NS5B
Inter-person
Intra-person
0.11
0.15
A
0.0 0.2 0.4 0.6 0.8 1.0
A.D. Olmstead et al. / Infection, Genetics and Evolution xxx (2015) xxx–xxx
Patristic distance
Fig. 2. Patristic distance frequency distributions. Frequency distribution of patristic distances between (A) 574 independent NS5B sequences obtained from different
individuals and between 22 pairs of sequential sequences from 17 individuals (B) 396 Core-HVR1 sequences obtained from different individuals and between 20 pairs of
sequential sequences from 18 individuals. (C) 445 independent HVR1 sequences obtained from different individuals and between 22 pairs of sequential sequences from 20
individuals. Blue = inter-person; red = intra-person. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
seroconversion, Genotype 1a and 3a) (Jacka et al., 2014; Miller
et al., 2002; Oliveira et al., 2009; Sacks-Davis et al., 2012) and were
also more likely to have infection duration <1 year, supporting the
hypothesis that individuals in clusters were involved in recent
transmission events. In BC, >80% of seroconverters report injection
drug use as their primary HCV acquisition risk factor. Consistent
with these findings, a recent study by Jacka et al. (2014) demonstrated that individuals in clusters from a BC PWID cohort were
more likely to be <40 years and to be seroconverters. These findings complement BC surveillance data, which shows that acute
HCV infections are infrequent in baby boomers (Kuo et al., 2015)
and substantiates a lower onward transmission risk in this birth
cohort.
Three regions, the HVR1, Core-HVR1 and NS5B, were examined
because of their different genetic evolutionary rates, which allow
transmissions over different time scales to be examined.
Evolution of HVR1 is not linear (Mcallister et al., 1998) and mutations accumulate rapidly (Dowd et al., 2009; Kuntzen et al., 2007;
Liu et al., 2010) making this region useful for verifying person-toperson transmissions and for monitoring HCV genomic changes
over short but not longer time periods (Escobar-Gutierrez et al.,
2012; Weiner et al., 1993). Use of the highly conserved Core region
combined with E1 and HVR1 increases the capacity to identify
transmissions that may not be apparent with HVR1 alone, and as
expected, a larger number of clusters were identified using CoreHVR1. The NS5B region evolves at an intermediate and more stable
rate, which allows transmissions over longer time periods to be
examined while still supporting identification of recent transmission events (Cavalheiro Nde et al., 2009; Hmaied et al., 2007).
The NS5B amplicon used for this study was longer than that typically used (650 bp vs. 380 bp) (Jacka et al., 2013), which may have
improved its ability to identify clusters.
The largest number of clusters was identified in NS5B and while
many sequences were unavailable for comparison in the other
regions, several NS5B clusters were also observed in HVR1 and/or
Core-HVR1. Identification of clusters in two regions provides stronger support for transmission events. Some NS5B clusters had
highly divergent Core-HVR1 or HVR1 sequences, suggesting no
recent transmissions occurred between these individuals. In other
NS5B clusters, the patristic distances between HVR1 and/or CoreHVR1 sequence pairs were only slightly above the cutoffs, suggesting that the cutoff in highly variable regions could be relaxed further to capture additional individuals within a transmission
network. No clusters were identified between BC samples and
the Los Alamos reference sequences, which supports the stringency
of our method. Additional validation was provided by the use of
confirmed transmission pairs, which were found to cluster in the
NS5B region using the assigned cutoffs. It is important to note that
various factors can influence the rate of HCV evolution over time
(i.e. acute vs. chronic HCV infection, co-infection with HIV and
immune status (Jabara et al., 2014; Merani et al., 2010;
Ramachandran et al., 2011; Ray et al., 2005)), which will influence
the patristic distance cutoffs and the identification of recent transmission clusters. Future work will further assess the utility of using
Please cite this article in press as: Olmstead, A.D., et al. A molecular phylogenetics-based approach for identifying recent hepatitis C virus transmission
events. Infect. Genet. Evol. (2015), http://dx.doi.org/10.1016/j.meegid.2015.04.017
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
MEEGID 2321
No. of Pages 9, Model 5G
25 April 2015
6
A.D. Olmstead et al. / Infection, Genetics and Evolution xxx (2015) xxx–xxx
A
NS5B Clusters <0.018
NS5B Clusters <0.02
B
Core-HVR1 Clusters <0.03
Core-HVR1 Clusters <0.06
C
HVR1 Clusters <0.11
HVR1 Clusters <0.15
Common cluster
Genotype 1a
Genotype 3a
Genotype 2b
Fig. 3. Transmission clusters. (A) Inter-person (between different individuals) NS5B sequence clusters with patristic distances <0.018 (left) or <0.02 (right). (B) Inter-person
Core-HVR1 sequence clusters (between different individuals) with patristic distances <0.03 (left) or <0.06 (right). (C) Inter-person Core-HVR1 sequence clusters (between
different individuals) with patristic distances <0.11 (left) or <0.15 (right). Nodes represent individual sequences and the connector length is proportional to the patristic
distance between sequences. Common clusters (bold outline) are those that occur in both NS5B and Core-HVR1 or HVR1 alone.
Table 2
Number of clusters and individuals in clusters with various patristic distance cutoffs.
Cutoff
Total n
n in clusters
% clustering
Number of clusters
574
55
73
82
10
13
14
25
31
34
Core-HVR1
0.03
415
0.06
0.09b
24
48
88
6
12
21
12
23
38
HVR1
0.11
0.15
0.19b
19
31
45
4
7
10
9
15
20
NS5B
0.018
0.020
0.022b
a
b
425
426
427
428
429
430
431
432
433
434
435
436
437
438
a
445
n = Number of individuals.
Sensitivity analysis.
various regions and cutoffs for characterizing population level
transmission dynamics.
A limitation to this study is that the serum samples used were
collected in 2011 for the purposes of antibody screening, and were
not stored under optimal conditions. For real-time population
studies, these protocols would be applied to fresh serum samples
within a day of collection or to blood samples containing EDTA
to increase HCV RNA integrity, which should improve sequence
yield. A further limitation to this study was the use of the midpoint
between the last negative and first positive test date to estimate
infection duration in seroconverters. For HIV, seroconverters identified through routine public health screening were more likely to
have been infected closer to the time of the first positive test (Skar
et al., 2013). Thus, individuals with long gaps between the last
negative and first positive tests and an estimated HCV infection
duration >1 year may have in fact been infected more recently.
This likely explains, in part, the association between clustering
and being a past seroconverter with infection duration >1 year.
On the other hand, unlike with HIV, it has not been demonstrated
that acute/early infections disproportionately drive HCV transmission. Many HCV infected individuals go undiagnosed for years or
even decades (Rein et al., 2012; Shah et al., 2013), also HCV viral
loads can remain high and stable throughout the course of infection, and the majority of HCV infected individuals remain
untreated. This suggests that chronically infected individuals may
be a significant source of onward HCV transmission and thus it is
unsurprising that past seroconverters are also linked with transmission clusters.
The cluster analyses presented here cannot be directly extrapolated to create population incidence estimates given that this was a
proof-of-principle study with a selection bias for seroconverters to
maximize the identification of transmission clusters. In the future,
these methods may be used, in combination with other strategies
(i.e. tracking seroconverters and using diversity to estimate transmission timing (Montoya et al., 2015)) to obtain a comprehensive
view of HCV incidence in BC. Every year >2000 individuals undergo
HCV genotyping at the BC Centre for Disease Control and it would
be feasible to apply routine sequencing and phylogenetics to these
individuals given the declining cost of sequencing. Implementing
this type of strategy could help to identify a subset of individuals
with recent infection with no prior negative test available. It can
also be used to identify clusters of individuals that are transmitting
HCV and who may be targets for HCV prevention strategies.
This study supports the application of sequencing, phylogenetics, and the use of patristic distance cutoffs to identify recent transmission clusters between HCV infected individuals with unknown
Please cite this article in press as: Olmstead, A.D., et al. A molecular phylogenetics-based approach for identifying recent hepatitis C virus transmission
events. Infect. Genet. Evol. (2015), http://dx.doi.org/10.1016/j.meegid.2015.04.017
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
MEEGID 2321
No. of Pages 9, Model 5G
25 April 2015
(0.43, 1.82)
(2.27, 20.15)
(1.55, 8.35)
(0.78, 17.85)
(0.58, 16.02)
0.89
6.74
3.50
2.79
2.37
1.16
6.58
4.83
1.34
2.12
1.53 (0.55, 4.38)
3.80 (0.80, 16.97)
2.53 (0.81, 8.76)
N/Ab
N/A
(1.07, 3.18)
(1.47, 7.81)
(0.92, 3.06)
(1.70, 20.81)
(1.70, 23.65)
Abbreviations: AOR: Adjusted Odds Ratio, CI: Confidence Interval, SD: Standard Deviation.
Numbers in bold have a p-value < 0.05.
a
Sensitivity analysis.
b
N/A indicates the fitted probabilities numerically 0 or 1 occurred when the regression models included genotype.
Characteristic
Age
Recent seroconverter
Past seroconverter
Genotype 1a
Genotype 3a
1.85
3.72
2.34
4.29
3.79
(0.98, 3.50)
(1.41, 9.47)
(1.20, 4.62)
(1.26, 26.9)
(0.93, 25.6)
2.12
3.13
2.19
6.60
3.85
(1.21, 3.73)
(1.29, 7.36)
(1.22, 3.97)
(1.98, 41.0)
(0.95, 26.0)
2.28
2.25
1.60
8.11
4.28
(1.33, 3.93)
(0.94, 5.16)
(0.91, 2.81)
(2.45, 50.2)
(1.06, 28.8)
1.27 (0.51, 3.19)
9.77 (2.53, 42.33)
4.33 (1.41, 16.17)
N/Ab
N/A
1.98
6.54
3.64
5.69
6.07
(1.01, 3.95)
(2.27, 19.77)
(1.57, 9.24)
(1.13, 103.69)
(1.12, 113.01)
1.85
3.38
1.68
4.91
5.34
HVR1
0.09a
0.06
Core-HVR1
0.03
0.022a
0.02
0.018
NS5B
Adjusted Odds Ratio (95% CI)
Table 3
Multivariate regression analysis of factors associated with clustering in NS5B alone, Core-HVR1 alone, and HVR1 alone using multiple cutoffs.
0.11
0.15
(0.52, 2.63)
(1.71, 26.79)
(1.73, 15.79)
(0.34, 8.85)
(0.51, 14.56)
0.19a
A.D. Olmstead et al. / Infection, Genetics and Evolution xxx (2015) xxx–xxx
7
infection histories. This technique, when applied at the population
level and performed on an ongoing basis, may allow monitoring of
HCV transmission networks and the changing dynamics of clusters
over time, i.e. if new clusters arise from pre-existing clusters and if
the number of clusters increases or decreases over time. This in
turn could enable monitoring of transmission patterns in response
to increased HCV screening and treatment uptake. It may also facilitate the tracking and evaluation of prevention interventions such
as harm-reduction and treatment-as-prevention.
471
Funding
480
This work was supported by the Canadian Institutes for
Health Research (CIHR) [Grant No. HES115697] and by the
National Canadian Research Training Program in Hepatitis C
(NCRTP-HepC) [ADO].
481
Author contributions
485
ADO and MK designed this study with input from JJG, BJ, AFYP,
JBJ, PRH, VM, YK and JM. ADO and VM performed all of the data
analysis for this study with input from MK, JG, JBJ and AFYP.
ADO, VM and IL performed all laboratory work for this study. The
manuscript was written by ADO with assistance from MK and DC
and with input from all contributing authors.
486
Potential conflicts of interest
492
MK has received grant funding via his institution from Roche
Molecular Systems, Boehringer Ingelheim, Merck, Siemens
Healthcare Diagnostics and Hologic Inc. JG is a consultant/advisor
and has received research grants from Abbvie, Bristol Myers
Squibb, Gilead and Merck. PRH has received grants from, served
as an ad hoc advisor to, or spoken at various events sponsored
by: Pfizer, Glaxo-SmithKline, Abbott, Merck, Tobira Therapeutics,
Virco and Quest Diagnostics and served as a consultant for ViiV
Health Care, Tobira Therapeutics, Selah Genomics Inc, and Quest
Diagnostics. He holds stock in Merck, Illumina, Gilead and EKF
Diagnostics. JM has received grants from Abbott, BoehringerIngelheim, Bristol-Myers Squibb, Gilead Sciences, Janssen, Merck
and ViiV Healthcare. All other authors have no conflicts of interest
to declare.
493
Acknowledgments
507
The authors thank Amanda Yu for performing data linkage and
data extraction, and the BC Centre Disease Control laboratory staff
for retrieving the samples used in this study.
508
Appendix A. Supplementary data
511
Supplementary data associated with this article can be found, in
the online version, at http://dx.doi.org/10.1016/j.meegid.2015.04.
017.
512
References
515
Aldous, J.L., Pond, S.K., Poon, A., Jain, S., Qin, H., Kahn, J.S., Kitahata, M., Rodriguez, B.,
Dennis, A.M., Boswell, S.L., Haubrich, R., Smith, D.M., 2012. Characterizing HIV
transmission networks across the United States. Clin. Infect. Dis. 55, 1135–1143.
BC Centre for Disease Control, 2013. British Columbia Annual Summary of
Reportable Diseases.
Brenner, B.G., Roger, M., Routy, J.-P., Moisi, D., Ntemgwa, M., Matte, C., Baril, J.-G.,
Thomas, R., Rouleau, D., Bruneau, J., Leblanc, R., Legault, M., Tremblay, C.,
Charest, H., Wainberg, M.a., 2007. High rates of forward transmission events
after acute/early HIV-1 infection. J. Infect. Dis. 195, 951–959.
516
517
518
519
520
521
522
523
524
Please cite this article in press as: Olmstead, A.D., et al. A molecular phylogenetics-based approach for identifying recent hepatitis C virus transmission
events. Infect. Genet. Evol. (2015), http://dx.doi.org/10.1016/j.meegid.2015.04.017
472
473
474
475
476
477
478
479
482
483
484
487
488
489
490
491
494
495
496
497
498
499
500
501
502
503
504
505
506
509
510
513
514
MEEGID 2321
No. of Pages 9, Model 5G
25 April 2015
8
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
A.D. Olmstead et al. / Infection, Genetics and Evolution xxx (2015) xxx–xxx
Bull, R.A., Luciani, F., McElroy, K., Gaudieri, S., Pham, S.T., Chopra, A., Cameron, B.,
Maher, L., Dore, G.J., White, P.A., Lloyd, A.R., 2011. Sequential bottlenecks drive
viral evolution in early acute hepatitis C virus infection. PLoS Pathog. 7,
e1002243.
Cavalheiro Nde, P., De La Rosa, A., Elagin, S., Tengan, F.M., Araujo, E.S., Barone, A.A.,
2009. Hepatitis C: sexual or intrafamilial transmission? Epidemiological and
phylogenetic analysis of hepatitis C virus in 24 infected couples. Rev. Soc. Bras.
Med. Trop. 42, 239–244.
Danta, M., Brown, D., Bhagani, S., Pybus, O.G., Sabin, C.a., Nelson, M., Fisher, M.,
Johnson, A.M., Dusheiko, G.M., 2007. Recent epidemic of acute hepatitis C virus
in HIV-positive men who have sex with men linked to high-risk sexual
behaviours. AIDS 21, 983–991.
Darriba, D., Taboada, G.L., Doallo, R., Posada, D., 2012. JModelTest 2: more models,
new heuristics and parallel computing. Nat. Methods 9, 772.
Dowd, K.A., Netski, D.M., Wang, X.-H., Cox, A.L., Ray, S.C., 2009. Selection pressure
from neutralizing antibodies drives sequence evolution during acute infection
with hepatitis C virus. Gastroenterology 136, 2377–2386.
Edgar, R.C., 2004. MUSCLE: multiple sequence alignment with high accuracy and
high throughput. Nucleic Acids Res. 32, 1792–1797.
Escobar-Gutierrez, A., Vazquez-Pichardo, M., Cruz-Rivera, M., Rivera-Osorio, P.,
Carpio-Pedroza, J.C., Ruiz-Pacheco, J.A., Ruiz-Tovar, K., Vaughan, G., 2012.
Identification of Hepatitis C virus transmission using a next generation
sequencing approach. J. Clin. Microbiol. 50, 1461–1463.
Gansner, E.R., North, S.C., 1999. An open graph visualization system and its
applications to software engineering. Software: Pract. Exp. 00, 1–5.
Gower, E., Estes, C.C., Hindman, S., Razavi-Shearer, K., Razavi, H., 2014. Global
epidemiology and genotype distribution of the hepatitis C virus. J. Hepatol.
http://dx.doi.org/10.1002/hep.27259.
Grabowski, M.K., Redd, A.D., 2014. Molecular tools for studying HIV transmission in
sexual networks. Curr. Opin. HIV AIDS 9, 126–133.
Hajarizadeh, B., Grebely, J., Dore, G.J., 2013. Epidemiology and natural history of
HCV infection. Nat. Rev. Gastroenterol. Hepatol. 10, 553–562.
Hmaied, F., Mamou, M., Dubois, M., Pasquier, C., Sandres-Saune, K., Rostaing, L.,
Slim, A., Arrouji, Z., Ben Redjeb, S., Izopet, J., 2007. Determining the source of
nosocomial transmission in hemodialysis units in Tunisia by sequencing NS5B
and E2 sequences of HCV. J. Med. Virol. 79, 1089–1094.
Hue, S., Pillay, D., Clewley, J.P., Pybus, O.G., 2005. Genetic analysis reveals the
complex structure of HIV-1 transmission within defined risk groups. Proc. Natl.
Acad. Sci. U.S.A. 102, 4425–4429.
Jabara, C.B., Hu, F., Mollan, K.R., Williford, S.E., Menezes, P., Yang, Y., Eron, J.J., Fried,
M.W., Hudgens, M.G., Jones, C.D., Swanstrom, R., Lemon, S.M., 2014. Hepatitis C
Virus (HCV) NS3 sequence diversity and antiviral resistance-associated variant
frequency in HCV/HIV coinfection. Antimicrob. Agents Chemother. 58, 6079–
6092.
Jacka, B., Applegate, T., Krajden, M., Olmstead, A.R.H., Marshall, B., DeBeck, K.,
Milloy, M.-J., Lamoury, F., Pybus, O., Lima, V., Magiorkinis, G., Montoya, V.,
Montaner, J., Joy, J., Woods, C., Dobrer, S., Dore, G., Poon, A., Grebely, J., 2014.
Phylogenetic clustering of hepatitis C virus among people who inject drugs in
Vancouver, Canada. Hepatology 60, 1571–1580.
Jacka, B., Lamoury, F., Simmonds, P., Dore, G.J., Grebely, J., Applegate, T., 2013.
Sequencing of the hepatitis C virus: a systematic review. PLoS One 8, e67073.
Kaye, M., Chibo, D., Birch, C., 2008. Phylogenetic investigation of transmission
pathways of drug-resistant HIV-1 utilizing pol sequences derived from
resistance genotyping. J. Acquir. Immune Defic. Syndr. 49, 9–16.
Kuiken, C., Yusim, K., Boykin, L., Richardson, R., 2005. The Los Alamos HCV sequence
database. Bioinformatics 21, 379–384.
Kuntzen, T., Timm, J., Berical, A., Lewis-Ximenez, L.L., Jones, A., Nolan, B., Schulze zur
Wiesch, J., Li, B., Schneidewind, A., Kim, A.Y., Chung, R.T., Lauer, G.M., Allen, T.M.,
2007. Viral sequence evolution in acute hepatitis C virus infection. J. Virol. 81,
11658–11668.
Kuo, M., Janjua, N., Burchell, A., Buxton, J.A., Krajden, M., Gilbert, M., 2015.
Decreasing hepatitis C incidence among a population of repeat anti-HCV testers,
British Columbia, Canada, 1993–2011. Am. J. Public Health (in press).
Lam, T.T., Hon, C.C., Tang, J.W., 2010. Use of phylogenetics in the molecular
epidemiology and evolutionary studies of viral infections. Crit. Rev. Clin. Lab.
Sci. 47, 5–49.
Lanini, S., Abbate, I., Puro, V., Soscia, F., Albertoni, F., Battisti, W., Ruta, A.,
Capobianchi, M.R., Ippolito, G., 2010. Molecular epidemiology of a hepatitis C
virus epidemic in a haemodialysis unit: outbreak investigation and infection
outcome. BMC Infect. Dis. 10, 257.
Lewis, F., Hughes, G.J., Rambaut, A., Pozniak, A., Leigh Brown, A.J., 2008. Episodic
sexual transmission of HIV revealed by molecular phylodynamics. PLoS Med. 5,
e50.
Liu, L., Fisher, B.E., Dowd, K., Astemborski, J., Cox, A.L., Ray, S.C., 2010. Acceleration of
hepatitis C virus envelope evolution in humans is consistent with progressive
humoral immune selection during the transition from acute to chronic
infection. J. Virol. 84, 5067–5077.
Magiorkinis, G., Magiorkinis, E., Paraskevis, D., Ho, S.Y.W., Shapiro, B., Pybus, O.G.,
Allain, J.-P., Hatzakis, A., 2009. The global spread of hepatitis C virus 1a and 1b: a
phylodynamic and phylogeographic analysis. PLoS Med. 6, e1000198.
Martin, N.K., Vickerman, P., Grebely, J., Hellard, M., Hutchinson, S.J., Lima, V.D.,
Foster, G.R., Dillon, J.F., Goldberg, D.J., Dore, G.J., Hickman, M., 2013. Hepatitis C
virus treatment for prevention among people who inject drugs: modeling
treatment scale-up in the age of direct-acting antivirals. Hepatology 58, 1598–
1609.
Mcallister, J., Casino, C., Davidson, F., Power, J., Lawlor, E., Yap, P.L., Simmonds, P.,
Donald, B., Allister, J.M.C., Yap, P.L.E.E., Smith, D.B., 1998. Long-term evolution of
the hypervariable region of hepatitis C virus in a common-source-infected
cohort long-term evolution of the hypervariable region of hepatitis C virus in a
common-source-infected cohort. J. Virol. 72, 4893.
Meng, S., Li, J., 2010. A novel duplex real-time reverse transcriptase-polymerase
chain reaction assay for the detection of hepatitis C viral RNA with armored
RNA as internal control. Virol. J. 7, 117.
Merani, S., Petrovic, D., James, I., Chopra, A., Cooper, D., Freitas, E., Rauch, A., Iulio, J.,
John, M., Lucas, M., Fitzmaurice, K., Mckiernan, S., Norris, S., Kelleher, D.,
Klenerman, P., Gaudieri, S., 2010. Effect of immune pressure on hepatitis
C virus evolution: insights from a single-source outbreak. Hepatology 53,
396–405.
Miller, C.L., Johnston, C., Spittal, P.M., Li, K., Laliberté, N., Montaner, J.S.G., Schechter,
M.T., 2002. Opportunities for prevention: hepatitis C prevalence and incidence
in a cohort of young injection drug users. Hepatology 36, 737–742.
Mohd Hanafiah, K., Groeger, J., Flaxman, A.D., Wiersma, S.T., 2013. Global
epidemiology of hepatitis C virus infection: new estimates of age-specific
antibody to HCV seroprevalence. Hepatology 57, 1333–1342.
Montoya, V., Olmstead, A., Janjua, N.Z., Tang, P., Grebely, J., Cook, D., Harrigan, P.R.,
Krajden, M., 2015. Differentiation of acute from chronic hepatitis C virus
infection by NS5B deep sequencing: a population-level tool for incidence
estimation. Hepatology. http://dx.doi.org/10.1002/hep.27734.
Nakayama, H., Sugai, Y., Ikeya, S., Inoue, J., Nishizawa, T., Okamoto, H., 2005.
Molecular investigation of interspousal transmission of hepatitis C virus in two
Japanese patients who acquired acute hepatitis C after 40 or 42 years of
marriage. J. Med. Virol. 75, 258–266.
Nelson, P.K., Mathers, B.M., Cowie, B., Hagan, H., Des Jarlais, D., Horyniak, D.,
Degenhardt, L., 2011. Global epidemiology of hepatitis B and hepatitis C in
people who inject drugs: results of systematic reviews. Lancet 378, 571–583.
Oliveira, T.de, Pybus, O.G., Rambaut, A., Salemi, M., Cassol, S., Ciccozzi, M., Rezza, G.,
Gattinara, G.C., D’Arrigo, R., Amicosante, M., Perrin, L., Colizzi, V., Perno,
C.F.Benghazi Study Group, 2006. HIV-1 and HCV sequences from Libyan
outbreak. Nature 444, 836–837.
Oliveira, M.D.L.A., Bastos, F.I., Telles, P.R., Hacker, M.D.A., Oliveira, S.A.N.De., Miguel,
J.C., Yoshida, C.F.T., 2009. Epidemiological and genetic analyses of Hepatitis C
virus transmission among young/short- and long-term injecting drug users
from Rio de Janeiro, Brazil. J. Clin. Virol. 44, 200–206.
Poon, A., Joy, J., Woods, C., Shurgold, S., Colley, G., Brumme, C., Hogg, R., Montaner, J.,
Harrigan, P., 2014. The impact of clinical, demographic and risk factors on rates
of HIV transmission. A population-based phylogenetic analysis in British
Columbia, Canada. J. Infect. Dis. http://dx.doi.org/10.1093/infdis/jiu560.
Price, M.N., Dehal, P.S., Arkin, A.P., 2010. FastTree 2 – approximately maximumlikelihood trees for large alignments. PLoS One 5, e9490.
Pybus, O.G., Cochrane, A., Holmes, E.C., Simmonds, P., 2005. The hepatitis C virus
epidemic among injecting drug users. Infect. Genet. Evol. 5, 131–139.
Pybus, O.G., Drummond, a.J., Nakano, T., Robertson, B.H., Rambaut, a., 2003. The
epidemiology and iatrogenic transmission of hepatitis C virus in Egypt: a
Bayesian coalescent approach. Mol. Biol. Evol. 20, 381–387.
Pybus, O.G., Rambaut, A., 2009. Evolutionary analysis of the dynamics of viral
infectious disease. Nat. Rev. Genet. 10, 540–550.
R Core Team, 2013. R: A Language and Environment for Statistical Computing.
Vienna, Austria.
Ramachandran, S., Campo, D.S., Dimitrova, Z.E., Xia, G.L., Purdy, M.A., Khudyakov,
Y.E., 2011. Temporal variations in the hepatitis C virus intrahost population
during chronic infection. J. Virol. 85, 6369–6380.
Ray, S.C., Fanning, L., Wang, X.-H., Netski, D.M., Kenny-Walsh, E., Thomas, D.L., 2005.
Divergent and convergent evolution after a common-source outbreak of
hepatitis C virus. J. Exp. Med. 201, 1753–1759.
Rein, D.B., Smith, B.D., Wittenborn, J.S., Lesesne, S.B., Wagner, L.D., Roblin, D.W.,
Patel, N., Ward, J.W., Weinbaum, C.M., 2012. The cost-effectiveness of birthcohort screening for hepatitis C antibody in U.S. primary care settings. Ann.
Intern. Med. 156, 263–270.
Sacks-Davis, R., Daraganova, G., Aitken, C., Higgs, P., Tracy, L., Bowden, S., Jenkinson,
R., Rolls, D., Pattison, P., Robins, G., Grebely, J., Barry, A., Hellard, M., 2012.
Hepatitis C virus phylogenetic clustering is associated with the social-injecting
network in a cohort of people who inject drugs. PLoS One 7, e47335.
Shah, H.a., Heathcote, J., Feld, J.J., 2013. A Canadian screening program for hepatitis
C: is now the time? CMAJ 185, 1325–1328.
Shemer-Avni, Y., Cohen, M., Keren-Naus, A., Sikuler, E., Hanuka, N., Yaari, A., Hayam,
E., Bachmatov, L., Zemel, R., Tur-Kaspa, R., 2007. Iatrogenic transmission of
hepatitis C virus (HCV) by an anesthesiologist: comparative molecular analysis
of the HCV-E1 and HCV-E2 hypervariable regions. Clin. Infect. Dis. 45, e32–e38.
Skar, H., Albert, J., Leitner, T., 2013. Towards estimation of HIV-1 date of infection: a
time-continuous IgG-model shows that seroconversion does not occur at the
midpoint between negative and positive tests. PLoS One 8, e60906.
Smith, B.D., Morgan, R.L., Beckett, G.A., Falck-Ytter, Y., Holtzman, D., Teo, C.G.,
Jewett, A., Baack, B., Rein, D.B., Patel, N., Alter, M., Yartel, A., Ward, J.W., 2012.
Recommendations for the identification of chronic hepatitis C virus infection
among persons born during 1945–1965. MMWR Recomm. Rep. 61, 1–32.
Tanaka, Y., Hanada, K., Mizokami, M., Yeo, A.E.T., Shih, J.W.-K., Gojobori, T., Alter,
H.J., 2002. A comparison of the molecular clock of hepatitis C virus in the United
States and Japan predicts that hepatocellular carcinoma incidence in the United
States will increase over the next two decades. Proc. Natl. Acad. Sci. U.S.A. 99,
15584–15589.
Please cite this article in press as: Olmstead, A.D., et al. A molecular phylogenetics-based approach for identifying recent hepatitis C virus transmission
events. Infect. Genet. Evol. (2015), http://dx.doi.org/10.1016/j.meegid.2015.04.017
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
MEEGID 2321
No. of Pages 9, Model 5G
25 April 2015
A.D. Olmstead et al. / Infection, Genetics and Evolution xxx (2015) xxx–xxx
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
Toda, T., Mitsui, T., Tsukamoto, Y., Ebara, T., Hirose, A., Masuko, K., Nagashima, S.,
Takahashi, M., Okamoto, H., 2009. Molecular analysis of transmission of
hepatitis C virus in a nurse who acquired acute hepatitis C after caring for a
viremic patient with epistaxis 1370, 1363–1370.
Urbanus, A.T., van de Laar, T.J., Stolte, I.G., Schinkel, J., Heijman, T., Coutinho, R.a.,
Prins, M., 2009. Hepatitis C virus infections among HIV-infected men who have
sex with men: an expanding epidemic. AIDS 23, F1–F7.
Van de Laar, T., Pybus, O., Bruisten, S., Brown, D., Nelson, M., Bhagani, S., Vogel, M.,
} tz, H., Matthews, G.V., Neifer, S.,
Baumgarten, A., Chaix, M.-L., Fisher, M., Go
White, P., Rawlinson, W., Pol, S., Rockstroh, J., Coutinho, R., Dore, G.J., Dusheiko,
G.M., Danta, M., 2009a. Evidence of a large, international network of HCV
transmission in HIV-positive men who have sex with men. Gastroenterology
136, 1609–1617.
Van de Laar, T., Pybus, O., Bruisten, S., Brown, D., Nelson, M., Bhagani, S., Vogel, M.,
Baumgarten, A., Chaix, M.L., Fisher, M., Gotz, H., Matthews, G.V., Neifer, S.,
White, P., Rawlinson, W., Pol, S., Rockstroh, J., Coutinho, R., Dore, G.J., Dusheiko,
9
G.M., Danta, M., 2009b. Evidence of a large, international network of HCV
transmission in HIV-positive men who have sex with men. Gastroenterology
136, 1609–1617.
Volz, E.M., Koelle, K., Bedford, T., 2013. Viral phylodynamics. PLoS Comput. Biol. 9,
e1002947.
Weiner, A.J., Thaler, M.M., Crawford, K., Ching, K., Kansopon, J., Chien, D.Y., Hall, J.E.,
Hu, F., Houghton, M., 1993. A unique, predominant hepatitis C virus variant
found in an infant born to a mother with multiple variants. J. Virol. 67, 4365–
4368.
Wiessing, L., Ferri, M., Grady, B., Kantzanou, M., Sperle, I., Cullen, K.J., Hatzakis, A.,
Prins, M., Vickerman, P., Lazarus, J.V., Hope, V.D., Matheï, C., 2014. Hepatitis C
virus infection epidemiology among people who inject drugs in Europe: a
systematic review of data for scaling up treatment and prevention. PLoS One 9,
e103345.
Yu, A., Spinelli, J.J., Cook, D.A., Buxton, J.A., Krajden, M., 2013. Mortality among
British Columbians testing for hepatitis C antibody. BMC Public Health 13, 291.
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
Please cite this article in press as: Olmstead, A.D., et al. A molecular phylogenetics-based approach for identifying recent hepatitis C virus transmission
events. Infect. Genet. Evol. (2015), http://dx.doi.org/10.1016/j.meegid.2015.04.017