Abstract
Phages exert profound evolutionary pressure on bacteria by interacting with receptors on the cell surface to initiate infection. While the majority of phages use chromosomally encoded cell surface structures as receptors, plasmid-dependent phages exploit plasmid-encoded conjugation proteins, making their host range dependent on horizontal transfer of the plasmid. Despite their unique biology and biotechnological significance, only a small number of plasmid-dependent phages have been characterized. Here we systematically search for new plasmid-dependent phages targeting IncP and IncF plasmids using a targeted discovery platform, and find that they are common and abundant in wastewater, and largely unexplored in terms of their genetic diversity. Plasmid-dependent phages are enriched in non-canonical types of phages, and all but one of the 65 phages we isolated were non-tailed, and members of the lipid-containing tectiviruses, ssDNA filamentous phages or ssRNA phages. We show that plasmid-dependent tectiviruses exhibit profound differences in their host range which is associated with variation in the phage holin protein. Despite their relatively high abundance in wastewater, plasmid-dependent tectiviruses are missed by metaviromic analyses, underscoring the continued importance of culture-based phage discovery. Finally, we identify a tailed phage dependent on the IncF plasmid, and find related structural genes in phages that use the orthogonal type 4 pilus as a receptor, highlighting the evolutionarily promiscuous use of these distinct contractile structures by multiple groups of phages. Taken together, these results indicate plasmid-dependent phages play an under-appreciated evolutionary role in constraining horizontal gene transfer via conjugative plasmids.
Introduction
Viral infections pose a constant threat to the majority of life on Earth1,2. Viruses recognize their hosts by interacting with structures (receptors) on the cell surface3. For viruses that infect bacteria (phages), these receptors are usually encoded on the chromosome, and are part of core cellular processes including transport proteins or structurally integral lipopolysaccharides4. However, certain mobile genetic elements, such as conjugative plasmids, also contribute to the cell surface landscape by building secretory structures (e.g., type 4 secretion systems) which enable transfer into neighboring bacterial cells5,6. Plasmid-dependent phages (PDPs) have evolved to use these plasmid-encoded structures as receptors, and can only infect plasmid-containing bacteria7. Conjugative plasmids can often transmit between distantly related bacterial cells, creating new phage-susceptible hosts by horizontal transfer of receptors8.
Almost all previously identified PDPs belong to unusual "non-tailed" groups of phages, some of which have more in common with eukaryotic viruses than the "tailed" phages that make up the majority of bacterial virus collections9,10. This includes the dsDNA alphatectiviruses, and members of the ssDNA inoviruses and ssRNA fiersviruses. The handful of known PDPs have had profound impacts on the field of molecular biology, enabling phage display technology11 (F plasmid-dependent phage M13), and in vivo RNA imaging12 (F plasmid-dependent phage MS2). PDPs have also aided in our understanding of the origin of viruses: tectiviruses are thought to represent ancient ancestors of adenoviruses13.
Predation by PDPs exerts strong selection on bacteria to lose conjugative plasmids, or to mutate/repress the conjugation machinery including the pilus14,15,16,17. As antibiotic resistance genes are frequently carried and spread by conjugative plasmids18,19,20,21, selection against plasmid carriage functionally selects against antibiotic resistance in many instances. The extent to which this is a significant evolutionary pressure on antibiotic resistance depends on how frequent these phages are in nature.
Despite the remarkable properties of these phages and their intriguing association with conjugative plasmids, only a handful of PDPs exist in culture. In the 1970s–80s at least 39 different PDPs were reported targeting 17 different plasmid types (classified by "incompatibility" groups)7. However, most of these reports predated the era of genome sequencing, and to our knowledge, most of the reported PDPs have been lost to science. Here, we use a targeted discovery approach to show that PDPs are easily discoverable in the environment, and associated with unappreciated genetic and phenotypic diversity.
Results
Co-culture enables direct discovery of plasmid-dependent phages
PDPs have historically been identified and quantified by taking phage collections that were isolated on bacteria containing conjugative plasmids and screening them on isogenic plasmid-free bacteria, to look for plasmid-specific phenotypes22. As PDPs use the proteins expressed by conjugative plasmids as receptors, their host range mirrors plasmid host range, and typically crosses bacterial genera7. Exploiting this property, some studies have used multi-species enrichment methods to increase the likelihood of finding PDPs that can infect multiple different plasmid-containing hosts23,24. Alternatively, relative enrichment of PDPs by the depletion of species-specific phages, so-called "somatic" phages, has been described25. However, these enrichment methods do not allow the direct quantification of PDPs relative to somatic phages in a sample, and suffer from other drawbacks such as increased likelihood of repeated isolation of the same phage, and bias against PDPs that may use both species-specific and plasmid-encoded receptors. In order to isolate and directly assess the abundance and diversity of PDPs in the environment, we set out to develop a targeted isolation approach. The challenge of targeted isolation is discriminating PDPs, in a direct, non-labor-intensive way, from somatic phages, which may be more or less abundant than PDPs depending on the environmental sample, the plasmid in question and the host species26.
To differentiate PDPs, we co-cultured Salmonella enterica and Pseudomonas putida, a pair of taxonomically distinct bacteria with no known shared phages, that grew well in coculture. We also selected a known PDP, the Alphatectivirus PRD1, which depends on IncP group conjugative plasmids such as RP4 and pKJK5, and can infect S. enterica and P. putida provided they contain an IncP plasmid. We made a modification to the traditional phage plaque assay, by co-culturing these strains with differential fluorescent tags, together in the same soft-agar lawn. After applying dilutions of phages, the plaquing phenotype of the PDP PRD1, which efficiently killed both fluorescently labeled strains in the lawn (resulting in no fluorescent signal) was immediately discernible from species-specific phage 9NA (infecting S. enterica) and SVOΦ44 (infecting P. putida) (Fig. 1a). This observation formed the basis of the targeted phage discovery method we termed "Phage discovery by coculture" (Phage DisCo) (Fig. 1b).
To directly isolate PDPs dependent on the IncP plasmids using Phage DisCo, environmental samples putatively containing PDPs can be mixed together with fluorescently labeled S. enterica and P. putida strains containing the IncP conjugative plasmid RP4 (Fig. 1b). After growth of the bacterial lawn, phages are immediately identifiable by the fluorescence phenotype of their plaques: P. putida phages appear as red plaques where only S. enterica RP4 (red) is able to grow, S. enterica phages appear as blue plaques where only P. putida RP4 (blue) is able to grow, and PDPs make dark plaques where both bacteria in the lawn are killed (Fig. 1b). As a proof of principle, we mixed equimolar amounts of the test phages, 9NA, SVOΦ44, and PRD1, to simulate an environmental sample containing both species-specific phages and PDPs (Fig. 1c). After incubation and growth of the bacteria in the lawn, the plate was photographed using a custom fluorescence imaging setup. Once the two fluorescent image channels were digitally merged, plaques of all three phages were easy to identify by fluorescence phenotype, and importantly, the PRD1 plaques could be easily discerned from the plaques made by the two species-specific phages.
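The plaque phenotypes described above amount to a small decision rule, which can be sketched in code. The following is only an illustration, not the imaging pipeline used in the study: the function name `classify_plaque`, the lawn-normalized intensity scale, and the 0.5 threshold are all hypothetical choices made for this sketch.

```python
def classify_plaque(red: float, blue: float, threshold: float = 0.5) -> str:
    """Classify a plaque from its two fluorescence channels.

    red  : signal from red-labeled S. enterica RP4 inside the plaque
    blue : signal from blue-labeled P. putida RP4 inside the plaque
    Both values are assumed to be normalized to the surrounding lawn
    (0 = no growth, 1 = intact lawn). The threshold is a hypothetical
    cutoff, not a value reported in the study.
    """
    red_grows = red >= threshold    # S. enterica survives in the plaque
    blue_grows = blue >= threshold  # P. putida survives in the plaque

    if red_grows and blue_grows:
        return "no plaque"          # neither host was cleared
    if red_grows:
        return "P. putida phage"    # red plaque: only S. enterica grows
    if blue_grows:
        return "S. enterica phage"  # blue plaque: only P. putida grows
    return "PDP"                    # dark plaque: both hosts killed


if __name__ == "__main__":
    for red, blue in [(0.05, 0.04), (0.9, 0.1), (0.1, 0.8)]:
        print(red, blue, classify_plaque(red, blue))
```

In practice the cutoff would need to be calibrated against control plaques of known phages such as 9NA, SVOΦ44, and PRD1.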
Having established the efficacy of the phage DisCo method using phages we had in culture, we set out to look for new PDPs in samples collected from compost, farm waste, and wastewater in the Greater Boston area (MA, USA) (Supplementary Dataset 1e). We chose to focus on phages depending on conjugative plasmids of the IncP and IncF incompatibility groups. IncP and IncF plasmids are also associated with extensive antibiotic resistance gene cargo and are frequently isolated from environmental27 or clinical28,29 samples, respectively. The archetypal IncF plasmid, the F plasmid originally isolated from E. coli K12, has a narrower host range than IncP plasmids, so we changed the DisCo host strains to E. coli and S. enterica. As S. enterica strains natively encode an IncF plasmid, we used a derivative that had been cured of all plasmids and prophages to mitigate any interference from these elements. Initially we collected 50 novel unique phages dependent on the IncP plasmid, and 13 dependent on the IncF plasmid. In order to identify any narrow-host range phages that may have been missed in our IncP-search using S. enterica and P. putida, we capitalized on the low abundance of P. putida phages in wastewater to perform a traditional single-host phage screen that captured an additional two narrow-host range phages dependent on the IncP plasmid in P. putida. Therefore, in total, we collected 65 novel PDPs in this study (Fig. 1d). All phages were further characterized by genome sequencing and we adopted a naming system wherein each phage was given a unique color identifier with a prefix consistent with previously isolated PDPs.
IncP plasmid-dependent tectiviruses from a limited geographic area fully encompass the previously known global diversity
Genomic analysis revealed that 51 of the 52 IncP-dependent phages in our collection belonged to the Alphatectivirus genus in the Tectiviridae family, and are related to Enterobacteriophage PRD1. Surprisingly, despite our sampling being limited to a small geographical area and short time frame, the phages we isolated represented significantly more genomic diversity than the six previously known plasmid-dependent tectiviruses that were isolated across multiple continents, suggesting these phages are greatly undersampled. We estimate our collection expands the genus Alphatectivirus from two species (represented by type isolates PRD1 and PR4) to 12, as determined by pairwise nucleotide identity of all alphatectiviruses, including the six previously known alphatectiviruses and our 51 new isolates (Fig. S1a) (species cutoff <95% nucleotide identity, according to guidelines published by the International Committee on Taxonomy of Viruses (ICTV))30.
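To make the species-counting step concrete, the sketch below groups genomes from a pairwise nucleotide identity matrix using the 95% cutoff mentioned above. The single-linkage grouping, the function name `count_species`, and the toy matrix are assumptions made for illustration; the text does not specify how identities were clustered, and the values shown are not data from the study.

```python
def count_species(identity: list[list[float]], cutoff: float = 95.0) -> int:
    """Count species-level groups in a symmetric pairwise identity matrix.

    Two genomes are placed in the same group when their identity (in %)
    is at or above the cutoff. Groups are merged transitively
    (single linkage), which is an assumption of this sketch.
    """
    n = len(identity)
    parent = list(range(n))  # union-find forest, one node per genome

    def find(i: int) -> int:
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i in range(n):
        for j in range(i + 1, n):
            if identity[i][j] >= cutoff:
                parent[find(i)] = find(j)  # merge the two groups

    return len({find(i) for i in range(n)})


if __name__ == "__main__":
    # Hypothetical identities (%) for four genomes A, B, C, D.
    toy = [
        [100.0, 97.0, 90.0, 85.0],
        [97.0, 100.0, 91.0, 84.0],
        [90.0, 91.0, 100.0, 96.0],
        [85.0, 84.0, 96.0, 100.0],
    ]
    print(count_species(toy))  # A+B and C+D form two groups
```

Single linkage can chain distant genomes together through intermediates, so a stricter rule (for example, requiring every pair in a group to exceed the cutoff) could give a different count on real data.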
Additionally, by querying genome databases we identified one published tectivirus genome, Burkholderia phage BCE1, closely related to PRD1 by whole genome phylogeny (Fig. 2a). As Burkholderia sp. are known hosts of IncP-type conjugative plasmids31 we expect that the Burkholderia cenocepacia host used to isolate BCE1 carried such a plasmid (highlighting the serendipitous nature by which PDPs are often found) and we include BCE1 in our known plasmid-dependent tectivirus phylogeny. Novel conjugative plasmids have recently been detected in Burkholderia contaminans isolates, after their existence was implicated by the isolation of alphatectiviruses on these strains32, suggesting Burkholderia species may be common hosts for these plasmids and phages.
While the new plasmid-dependent tectiviruses we report expand the known diversity of this group of phages from two to twelve proposed species, we found that all 51 phages in our collection had perfectly conserved gene synteny (Fig. S1b). Just like the six previously known alphatectiviruses, they have no accessory genome and contain homologs of all 31 predicted coding genes of the PRD1 reference genome, suggesting strong constraints on genomic expansion in this group of phages. However, the isolates in our collection contain a large number of single nucleotide polymorphisms (SNPs) distributed across the entire ~15 kb genome (Fig. 2c), and isolates ranged from 82.5% to 99% average pairwise nucleotide identity. Certain regions of the genome are highly associated with polymorphism across our collection, such as the center and C-terminus of the DNA polymerase gene, I (P1). Two small genes toward the end of the phage genome, XXXVII (P37) and XIX (P19), are especially associated with nucleotide polymorphisms across our genome collection. Interestingly, XXXVII (also called gp v, P37) is the outer-membrane unit of a two-component spanin system thought to be responsible for fusion of the inner and outer membrane in the final stages of cell lysis33.
Newly isolated ssRNA and ssDNA phages targeting IncF and IncP plasmids
In total, we isolated eight novel ssRNA phages (Fig. 1d), seven targeting the IncF plasmid and one targeting IncP. All eight ssRNA phages were related to phages in the Fiersviridae family of ssRNA phages and had mostly syntenic genome architectures. Analysis of the sequence of the RNA-dependent RNA polymerase or replicase, rep, protein homologs of the eight phages in context of other reference phage sequences showed that three of the IncF-dependent phages belonged to the Emesvirus genus and were closely related to phage MS2. The other four IncF-dependent ssRNA phages were related to Qbeta, in the Qubevirus genus. Finally, the IncP-dependent ssRNA phage, PRRlime, was closely related to phage PRR1, suggesting it is the same species and the second isolate of the Perrunavirus genus. Interestingly, although PRRlime was the only IncP-dependent ssRNA phage we isolated, by amino acid identity of the replicase protein it is more closely related to the MS2 phage group, than the MS2 and Qbeta groups are to each other (Fig. 3a). None of the new ssRNA phages in our collection exhibited <80% replicase protein amino acid identity to the closest reference isolate, and therefore do not meet proposed cutoff criteria for new ssRNA phage species34.
Finally, we isolated five new ssDNA phages targeting the IncF plasmid. All five phages were found to be related to the filamentous phage M13, within the Inovirus genus (Fig. 3b). One of the novel inoviruses, FfLavender, was significantly different from others in our analysis and shared 88% identity to phage M13 at the nucleic acid level across the whole genome. In line with current taxonomic guidelines, we propose that FfLavender is the first isolate of the novel species Inovirus lavender. In general, we observed less relative diversity in the IncF-dependent phages than the IncP-dependent phages described above.
A novel tailed phage targeting the F plasmid is related to phages targeting an orthogonal contractile pilus
The final IncF PDP we isolated, which we named FtMidnight, was found to have a 40,995 bp dsDNA genome containing putative tail genes (Fig. 3c). This finding distinguishes FtMidnight from any known PDP, which all belong to non-tailed ssRNA or ssDNA phage groups. Transmission electron microscopy confirmed that FtMidnight is a tailed phage resembling the morphological class of flexible tailed siphoviruses (Fig. 3d). To confirm the interaction of FtMidnight with the conjugation machinery of the F plasmid specifically, we sampled phage resistant micro-colonies from within FtMidnight plaques to obtain two resistant mutants that still encoded the plasmid-borne antibiotic resistance marker. The FtMidnight-resistant mutants were collaterally resistant to F-plasmid dependent phages MS2 and Qbeta (Fig. S2c), and sequencing revealed the two mutants had independent SNPs causing a frameshift and a premature truncation of conjugation proteins TraA and TraF, respectively (Fig. S2b). Both these proteins are essential components of the conjugative pilus, suggesting that ablation of the conjugative pilus renders cells resistant to FtMidnight, and that the phage interacts directly with the pilus.
As there are no phage tail proteins known to interact with plasmid-encoded conjugation machinery, identification of the FtMidnight receptor-binding protein might have long-term applications in the development of alternative antimicrobial therapies35. To this end, we used structure-guided homology search to infer the probable structure and function of the FtMidnight tail proteins. Based on similarity to the distantly related marine roseophage vB_DshS-R4C36, we identified a cluster of five proteins that are predicted to compose the distal, receptor-interacting, end of the FtMidnight tail, gp18–22 (Fig. 3c and Fig. S3a). By searching for homologs of these genes, we detected a number of siphoviral phage genomes that possess related distal-tail regions to FtMidnight (Fig. S3b). Intriguingly, many of these FtMidnight-related phages, which were isolated on hosts including Pseudomonas, Xylella, and Stenotrophomonas, have been documented to use the type 4 pilus (also called T4P or type IV pili) as their receptors (Fig. S3b, Supplementary Dataset 2), an orthogonal contractile pilus that is thought to be unrelated to the conjugal pilus (a type 4 secretion system)37. Phylogenetic analysis of the five putative distal-tail proteins of FtMidnight implicates gp18 as most likely to be involved in receptor recognition, as it is much more divergent from homologs in T4P-associated phages than the other four proteins, and we speculate it is specifically adapted to facilitate adsorption to the F plasmid pilus (Fig. S3c).
Remarkably, both the ssRNA Fiersviridae and the ssDNA filamentous Inoviridae families include phages that use either the conjugative pilus or chromosomally encoded T4P as receptors38,39. Therefore, the FtMidnight-like group of phages is the third example of a phage group that can adapt to use either contractile pilus structure in short evolutionary timescales. This suggests that from a phage entry perspective, there is a high degree of functional overlap between the conjugative pilus and the T4P, which we hypothesize to be their biophysical, contractile nature.
Plasmid-dependent tectiviruses show substantial phenotypic differences despite perfectly syntenic genomes with no accessory genes
PDP host range is dependent on plasmid host range, and therefore phages dependent on broad-host range plasmids are required to replicate in diverse host cells. Indeed, tectiviruses (alphatectiviruses) dependent on the broad-host range IncP plasmid exhibit a remarkably wide host range40, surpassing the host breadth of any other described group of phages. This ability comes in stark contrast with their small genome size, perfectly conserved gene synteny, and lack of accessory genome. While the broad-host-range phenotype of PRD1-like phages has been long appreciated41, the six previously isolated PRD1-like phages have been assumed to be mostly phenotypically homogeneous, perhaps due to the high level of conservation between their genomes. However, subtle differences in efficiency of plating (EoP) of some alphatectiviruses on different host strains were previously reported42. To explore the extent to which this constrained genomic diversity permits phenotypic variation in our larger collection of PDPs, we constructed a set of 13 hosts representing diverse Gammaproteobacteria, carrying the IncP conjugative plasmid pKJK5 (indicated by P). We initially observed that PDPs exhibited substantial differences in plating efficiency across hosts (Fig. 4a). For example, while PRD1 is able to plaque efficiently in all but one of the hosts, PRDcerulean can only efficiently form plaques on Pseudomonas hosts, representing a decrease in plaquing efficiency of at least four orders of magnitude in most other hosts. In contrast, PRDchartreuse and PRDjuniper decrease their plaquing efficiency by a similar magnitude in P. putidaP when compared against P. fluorescensP. Notably, these isolates share >95% nucleotide identity to PRD1 and have no variation in gene content (Fig. S1).
We quantified host preference differences of all 51 phages on all 13 bacterial species using a high throughput liquid growth assay43. For each phage-host pair we calculated a liquid assay score, which represents the bacterial growth inhibition incurred by a fixed phage concentration, normalized as a percentage relative to the bacterial growth in a phage-free control (Fig. 4b). We found that, consistent with earlier plaque assays (Fig. 4a), the growth inhibition phenotype was highly variable across phage isolates (Fig. 4c). We identified more examples of phages such as PRDmint and PRDcanary that displayed a host-specialist behavior, akin to that of PRDcerulean, while others, like PRDobsidian and PRDamber appeared to robustly inhibit the growth of a wide range of hosts (host generalism). Surprisingly, when looking at the data broadly, we found that neither the phage nor the host phylogenetic relationships were strong predictors of host preference. To rule out that these host-preference differences are caused by sequence-specific anti-phage systems, we characterized the CRISPR-Cas and restriction modification (RM) systems encoded in the host genomes. We found that only four of the hosts encoded a complete Cas operon, and that none of the spacers in the CRISPR arrays matched any of the phages in our collection (Fig. S4, Supplementary Dataset 3). We also found that all but two of the bacterial hosts harbor at least one RM system. Although interactions between plasmid-dependent tectiviruses and RM systems might play a role in host range, it cannot fully explain the differences we observe. Instead, we speculate that these host preference patterns reflect adaptation of the plasmid-dependent tectiviruses to the physiologies of different host cells. The composition of natural polymicrobial communities containing IncP plasmids likely require PDPs to rapidly adapt to infect particular assortments of taxonomically distant hosts.
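As a rough illustration of how a liquid assay score of this kind could be computed, the sketch below compares the area under the growth curve with and without phage. This is an assumption-laden example: the text only states that the score is growth inhibition normalized as a percentage of the phage-free control, so the use of area under the curve, the function names, and the toy optical density readings are all hypothetical.

```python
def trapezoid_auc(times: list[float], values: list[float]) -> float:
    """Area under a curve by the trapezoidal rule."""
    area = 0.0
    for i in range(1, len(times)):
        area += (times[i] - times[i - 1]) * (values[i] + values[i - 1]) / 2
    return area


def liquid_assay_score(times: list[float],
                       od_phage: list[float],
                       od_control: list[float]) -> float:
    """Percent growth inhibition relative to a phage-free control.

    0 means the phage had no effect on growth; 100 means growth was
    fully suppressed. Using curve area as the growth measure is an
    assumption of this sketch.
    """
    control_area = trapezoid_auc(times, od_control)
    if control_area <= 0:
        raise ValueError("control culture shows no growth")
    phage_area = trapezoid_auc(times, od_phage)
    return 100.0 * (1.0 - phage_area / control_area)


if __name__ == "__main__":
    hours = [0.0, 1.0, 2.0]        # hypothetical time points
    control = [0.0, 1.0, 2.0]      # hypothetical OD, no phage
    with_phage = [0.0, 0.5, 1.0]   # hypothetical OD, fixed phage dose
    print(liquid_assay_score(hours, with_phage, control))
```

Computing this score for every phage-host pair gives a matrix with one row per phage and one column per host, which is the form of data a heat map such as Fig. 4c summarizes.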
Holin protein variation contributes to host range differences in plasmid-dependent tectiviruses
To explore the genetic basis of the host-range preferences, we focused on PRDcerulean, which was the only tectivirus isolated in our narrow-host IncP plasmid screen. PRDcerulean displays the most restricted host range of our collection, and only replicates efficiently on P. putidaP. On S. entericaP (and most IncP plasmid-containing hosts in our screen) PRDcerulean does not make plaques even at high titers (Fig. 4a). We reasoned that differences in host range between these phages are unlikely to be due to adsorption failure, as the receptor is encoded by an identical conjugative plasmid present in each host strain, and if there were pilus elaboration problems in any of the hosts, this should affect all phages in our collection rather than some phages individually. In line with this assumption, there was no difference in adsorption efficiency between PRD1 and PRDcerulean on S. entericaP (Fig. S5a), indicating that the replication defect in S. entericaP is receptor independent, and probably occurs once the phage chromosome has reached the interior of the cell. To understand where the replication defect occurs, we conducted high multiplicity of infection (MOI) experiments of PRDcerulean on S. entericaP in order to isolate a spontaneous escape mutant that could form plaques. We identified a single rare mutant of PRDcerulean, cer1, which was able to make plaques on S. entericaP. Sequencing revealed cer1 contained a number of mutations relative to wildtype PRDcerulean, notably in the holin gene, which encodes the protein (P35) responsible for triggering the destruction of the bacterial cell wall during cell lysis44. To confirm this observation, we generated a chimeric phage, cer6, by recombining the P35–P36 region of PRDcerulean with the respective sequence from PRD1. Strikingly, exchange of these two proteins restored the plating efficiency of PRDcerulean on S. entericaP, although we note that the cer6 recombinant phage formed smaller plaques than PRD1 (Fig. 5a).
Analysis of the holin protein indicated that it is predicted to form three transmembrane domains (TMDs), with a short N-terminal periplasmic segment, and a longer disordered C-terminal region that extends into the cytoplasm (Fig. 5b), a topology characteristic of class I holins. The holin protein of PRDcerulean has a distinct five amino acid motif at the C-terminal end, shared only with one other phage in our collection, PRDfuschia (Fig. S5d). Despite this similarity at the C-terminus, the TMD1 and TMD2 of PRDcerulean differ significantly from those of PRDfuschia, which is able to replicate efficiently on S. entericaP. As the TMDs have been shown to be especially important for holin function45, we hypothesized that variations in these regions could be associated with PRDcerulean's reduced host range, and we attempted to individually replace each of the regions corresponding to the TMDs of the PRDcerulean holin with the respective sequences of PRDfuschia, by recombination. Recombination with TMD1 did not yield any chimeric phages that could plaque on S. entericaP, but recombinants of TMD2 yielded phages that plaqued on S. entericaP almost as efficiently as cer6 (Fig. 5a and Fig. S5b). Additionally, to recapitulate the original variant we had seen in the spontaneous mutant, we replaced the TMD3 from PRDcerulean with that of cer1.
Sequencing of the resulting recombinant phages revealed that rather than recombining the entire region corresponding to the TMDs, the phages had recombined specific SNPs from within the donor TMD sequences, allowing us to pinpoint individual variants in the PRDcerulean holin that expand host range to S. entericaP. Cer9 had two amino acid changes within TMD2 and cer10 had a single amino acid change inside TMD3. Furthermore, the cer10 recombinant, which was associated with poor EoP and plaque size on S. entericaP, proved to be unstable in culture, and larger plaques spontaneously appeared after one round of replication. Re-sequencing of the larger plaque mutant, cer11, revealed it had acquired a frameshift mutation in the C-terminal end of the protein, which reverts the C-terminal motif close to the longer motif found in most variants in our collection, e.g., PRDaquamarine (Fig. S5e). This indicates that the mutation in the C-terminal cytoplasmic end of the protein might have an epistatic interaction with the TMD mutations.
Mapping of these mutations onto the holin membrane topology prediction showed that they are clearly located within membrane-embedded portions of the holin protein, and further AlphaFold modeling of the holin secondary structure indicated that the TMD2&3 mutations may be spatially proximal in the native protein structure (Fig. S5c). Overall, these results indicate that one of the largest hurdles for plasmid-dependent tectiviruses to achieve infection of diverse bacterial hosts may be adapting the phage lysis components to various host cell physiology, e.g., inner membrane composition. We speculate that the PRDcerulean lysis proteins may be specifically adapted to work in Pseudomonad hosts, and note that though the phage appears to have a narrow-host range in our limited screen, we may simply not be testing the natural hosts of PRDcerulean. Notably, none of the holin mutations affected replication on P. putidaP (Fig. 5a and Fig. S5b). The finding that altered host range is accessible within a small number of point mutations suggests there is immense functional flexibility encoded within the proteins of plasmid-dependent tectiviruses, and while there appears to be a strict constraint on genome size, these viruses may acquire accessory function through protein variation rather than gene gain, in contrast to canonical tailed phages which are associated with extensive mosaicism.
Metagenomic approaches fail to recover plasmid-dependent tectiviruses
Given the small number of plasmid-dependent tectiviruses known prior to this study (6, excluding BCE1) we were surprised by how readily discoverable these phages were in our samples (though we note that low numbers of characterized representative phages do not necessarily reflect low environmental abundance). To quantify their absolute abundance, we used Phage DisCo to estimate the concentration of IncP PDPs in fresh influent from two wastewater sites in Massachusetts, USA, relative to species-specific phages of E. coli, S. enterica, and P. putida (Fig. 6a). Phages dependent on the IncP plasmid RP4 were present in wastewater at ~1000 phages/mL, the same order of magnitude as species-specific phages of E. coli at ~4000 phages/mL. Species-specific phages of S. enterica and P. putida were less abundant than IncP-PDPs, present at ~100 phages/mL and ~5 phages/mL, respectively. While this absolute quantification is limited by the use of a single strain to capture all species-specific phages, PDP quantification may be similarly limited by use of a single plasmid and two host species to capture all IncP PDPs. Nevertheless, wastewater is considered one of the best samples in which to find E. coli and S. enterica phages, and therefore despite the limitations of this relative abundance metric, the data show that these phages are common, at least in built environments (human-made environments). The extent to which this abundance is a characteristic of phages dependent on IncP-type plasmids as opposed to PDPs in general remains to be seen, although these estimates are in line with reports of phage abundance for multiple different plasmid types in wastewater from Denmark and Sweden26.
Metagenomic-based viral discovery techniques have been extremely successful in expanding known viral diversity46,47,48. Although some studies have identified tectiviruses in metagenomic datasets49 and metagenomic-assembled genomes50, alphatectiviruses have yet to be found in metagenomic analyses, at odds with the relatively high abundance of the plasmid-dependent alphatectiviruses in wastewater (Fig. 6a). With the increasing availability of metagenomic datasets, we decided to reexamine the presence of this group of phages in assembled metagenome collections. We queried the JGI IMG/VR database of uncultivated viral genomes (UViGs) and retrieved genomes with a match to the Pfam model PF09018, which corresponds to the PRD1 coat protein, which is conserved across all known tectiviruses. This search retrieved a set of diverse genomes in which, using refined models built from our alphatectivirus collection, we identified homology to diagnostic tectivirus proteins14, such as DNA polymerase (P1), DNA packaging ATPase (P9), and tail tube DNA translocation proteins (P18, P32) in addition to the coat protein (P3) used for the retrieval of these sequences (Fig. S6b). However, none of the UViGs appear to belong to any of the pre-existing groups of isolated tectiviruses (Fig. S6c) suggesting there is large unexplored diversity in the Tectiviridae family.
We tested if we could recover alphatectivirus sequences through metagenomic sequencing of our own wastewater samples, where we knew these phages were present at high abundance (around 1000 PFU/mL) (Fig. 6a). We processed our samples by filtration, and further concentrated the viral fraction by 100-fold, before performing DNA extraction and bulk sequencing (average of 2M reads per sample). We classified our metagenomics dataset with Kraken2 and found that a very small proportion of the reads (<0.001%) could be assigned to the Alphatectivirus taxonomic group, which would not be sufficient for assembly (Fig. 6b). This implied that, despite there being no assembled alphatectiviruses in public databases, they may still be identifiable in raw reads.
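A back-of-the-envelope calculation helps show why so few reads cannot support assembly. The sketch below uses the figures given in the text (about 2M reads per sample, under 0.001% assigned to Alphatectivirus, a genome of roughly 15 kb) together with a read length of 150 bp, which is an assumption of this example and is not stated in the text.

```python
def expected_coverage(total_reads: int,
                      assigned_fraction: float,
                      read_length_bp: int,
                      genome_length_bp: int) -> float:
    """Mean sequencing depth over a genome from its share of the reads."""
    assigned_reads = total_reads * assigned_fraction
    return assigned_reads * read_length_bp / genome_length_bp


if __name__ == "__main__":
    depth = expected_coverage(
        total_reads=2_000_000,       # average reads per sample (from the text)
        assigned_fraction=0.00001,   # 0.001%, the stated upper bound
        read_length_bp=150,          # hypothetical read length
        genome_length_bp=15_000,     # approximate alphatectivirus genome size
    )
    print(f"upper-bound mean depth: {depth:.2f}x")
```

Under these assumptions the upper bound is about 20 reads, or roughly 0.2x mean depth, so most positions in the genome would not be covered by even a single read.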
We then looked at additional published wastewater metagenomic sequencing datasets, and processed samples from diverse projects, representing different sequencing depths, locations, and sample processing methods, comprising a total of 290 samples and more than 5 billion reads total (Supplementary Dataset 2). Over 75% of the samples contained five or fewer reads assigned to alphatectiviruses (Fig. 6b). However, we found some alphatectivirus reads, primarily from the larger datasets, which directly mapped to the PRD1 reference genome (Fig. S6a). The recovered reads appeared to be bona fide alphatectivirus sequences, as shown by the high mapping quality to the reference, a conservative approach that would fail to identify isolates with higher variation. Taken together, no single dataset we analyzed contains enough reads to assemble a complete alphatectivirus genome. We hypothesize that a combination of a low relative abundance, small genome size, and highly polymorphic population might be responsible for the absence of alphatectiviruses in metagenomic assembled genome collections. Overall, this finding points to a discordance between culture-based and metagenomic-based virus surveillance.
Discussion
Our finding that phages exploiting conjugative plasmid-encoded receptors are common and abundant in the urban environment suggests that PDPs act as an important and underappreciated constraint on the spread of conjugative plasmids. Though studies have shown that conjugative plasmids can rapidly evolve resistance to PDPs17,51, these studies also suggest that resistance comes with a tradeoff in conjugation efficiency, such that phage-resistant plasmids cannot easily spread to new hosts. This suggests that with further study and discovery, PDPs could be exploited to manipulate the dynamics of conjugative plasmid mobility, and thus the spread of antibiotic resistance genes in high-risk environments. PDPs may be particularly applicable to controlling epidemics of plasmid acquired resistance, for example the current epidemic of carbapenem-resistant Enterobacteriaceae mobilized by IncX3 conjugative plasmids52,53,54.
A challenge in the potential translational application of PDPs may be ensuring that phage host range is sufficiently broad to avoid the formation of plasmid reservoirs in bacterial hosts that cannot be infected by phages. Our finding that plasmid-dependent tectiviruses have highly variable host range preferences reinforces the significance of this hurdle. However, our investigation into the host range of these phages showed that this phenotype is, to some extent, genetically encoded in the phage lysis machinery. Further study is necessary to better understand the genetic basis of host range in plasmid-dependent tectiviruses and PDPs more broadly, but the expansion of host range of PRDcerulean via gene exchange may be an exciting step toward predicting and engineering the host range of these phages. From a virus evolution perspective, this finding illustrates the great functional flexibility contained within PDP lysis proteins, which we speculate may be necessary for rapid adaptation to new host cells as their associated conjugative plasmids transmit across communities of diverse bacteria.
Our discovery of FtMidnight, along with the significant expansion of other known conjugative PDP families, highlights the power of Phage DisCo to uncover new phage diversity. A more comprehensive understanding of the diversity of PDPs may shed light on outstanding questions as to the evolution of plasmid-dependency in phages. Indeed, our discovery that the F pilus-dependent phage FtMidnight is related to type 4 pilus targeting phages suggests that there may be some functional, if not evolutionary, relationship between these purportedly unrelated structures. It remains to be seen how this finding translates to other groups of PDPs. For example, only the Alphatectivirus genus within the broader Tectiviridae family are known to depend on plasmid-encoded receptors, and the receptors of other genera, such as the betatectiviruses that infect Bacillus species, are thought to be components of the cell wall55, although we note there is very little similarity between the spike proteins of PRD1 and the most characterized Betatectivirus Bam3556. Likewise, a recent study identified a new pair of short tailed phages that are dependent on conjugative plasmids belonging to the IncN group57 but the receptor dependency of phages with related tail proteins is unknown. Further study of such viruses that are evolutionarily adjacent to plasmid-dependent groups may reveal parallel evolutionary routes to plasmid-dependency. Additionally, further characterization of the diversity of phage receptor-binding proteins that interact with plasmid-encoded pili could eventually facilitate the engineering of plasmid-targeting phenotypes into genetically engineered phages or phage-derived particles, which may offer long-term promise as alternative antimicrobial therapies35.
The relatively high abundance of IncP PDPs in wastewater as measured by culture-based methods contrasts with their absence from metagenomic datasets, indicating a blind spot in bulk sequencing-based approaches to detect certain groups of viruses. The biochemical properties of some viruses have been suggested to play a role in their depletion from metagenomic datasets, such as DNA genomes with covalently bound terminal proteins58. Though we cannot rule out that a similar phenomenon is responsible for the lack of plasmid-dependent tectiviruses in some metagenomic samples, our metagenomic extractions were protease treated yet had comparable abundance of plasmid-dependent tectiviruses relative to public datasets. We speculate that other factors might play a role, including the small genome size of PDPs relative to other viruses, low relative abundance compared to other viruses, and high within-sample sequence diversity interfering with consensus-assembly based methods. Consistent with our observations, high strain heterogeneity has previously been shown to hinder metagenomic assembly of abundant marine viruses59, and benchmarking studies with simulated metagenomic data have found this to be an intrinsic limitation of both viromic and metagenomic sequencing studies60,61. These discrepancies point to the continued need for systematic culture-based viral discovery and method innovation.
Though we chose to focus this initial study on conjugative plasmids that are already known to be targeted by PDPs, we anticipate that the Phage DisCo method will be generally applicable to identifying phages dependent on other conjugative plasmid systems, as well as translatable to further specialized phage discovery screens. The diversity and abundance of the PDPs we detected in the urban environment leads us to hypothesize that the interplay between phages and conjugative plasmids, both selfish genetic elements, may be driving the diversification of the conjugation systems mediating horizontal gene transfer in bacteria. This work represents a major first step in the large-scale exploration of this functional group of phages, and much remains to be discovered about their ecology and biology, including how they interact with the plethora of defense systems present in bacteria62.
Methods
Strains and growth conditions
Details of all bacterial strains, plasmids, phages, and primers used and constructed in this study are available in Supplementary Dataset 1a–d. Unless stated otherwise, bacteria were grown at 37 or 30 °C in autoclaved LB Lennox broth (LB: 10 g/L Bacto Tryptone, 5 g/L Bacto Yeast Extract, 5 g/L NaCl) with aeration (shaking at 200 rpm) or on LB agar plates, solidified with 2% Bacto Agar, at 37 or 30 °C. Salt-free LBO media contained 10 g/L Bacto Tryptone and 5 g/L Bacto Yeast Extract. When required, antibiotics were added at the following concentrations: 50 µg/mL kanamycin monosulfate (Km), 100 µg/mL ampicillin sodium (Ap), 20 µg/mL tetracycline hydrochloride (Tc), 30 µg/mL trimethoprim (Tm), 20 µg/mL chloramphenicol (Cm), and 20 µg/mL gentamicin sulfate (Gm).
Phage replication
Replication host strains for all phages used in this study are detailed in Supplementary Dataset 1c. High titer phage stocks were produced by adding ~10⁵ Plaque Forming Units (PFU) to exponential phase cultures at ~OD600 0.1, and infected cultures were incubated for at least 3 h at 37 °C (with aeration). Phage lysates were centrifuged (10,000 × g, 1 min) and supernatants were sterilized with 0.22 µm filters. Phage lysates were serial-diluted (decimal dilutions) with SM buffer and PFU enumeration was performed by double-layer overlay plaque assay63, as follows. Bacterial lawns were prepared with stationary phase cultures of the host strains, diluted 40-fold with warm top agar (0.5% agar in LB, 55 °C). The seeded top agar was poured on an LB 2% agar bottom layer: 3 mL for 8.6 cm diameter petri dishes or 5 mL for 8.6 × 12 cm rectangular petri dishes. When required, antibiotics were added to the top agar at concentrations specified above.
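As an illustration of how a titer follows from a plaque count on a decimal dilution, a minimal sketch is given below. The function name, the plaque count, and the plated volume are hypothetical values chosen for the example and are not taken from this study.

```python
def titer_pfu_per_ml(plaque_count, dilution_exponent, plated_volume_ml):
    """Back-calculate the titer of the undiluted stock.

    dilution_exponent is the number of decimal dilution steps, so a value
    of 6 means the plated sample was diluted 10^-6 relative to the stock.
    """
    if plated_volume_ml <= 0:
        raise ValueError("plated volume must be positive")
    return plaque_count * 10 ** dilution_exponent / plated_volume_ml


# Hypothetical example: 42 plaques from 0.1 mL of the 10^-6 dilution.
titer = titer_pfu_per_ml(plaque_count=42, dilution_exponent=6, plated_volume_ml=0.1)
print(f"Stock titer: {titer:.2e} PFU/mL")
```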
Plasmid construction
The F plasmid from strain SVO150 was modified via recombineering to encode a gfp locus and kanamycin resistance locus (aph) for selection (FΔfinO::aph-Plac-gfp) to aid in conjugation and rapid identification of plasmid+ colonies. Briefly, SVO150 was electroporated with the pSIM5tet recombineering plasmid (Supplementary Dataset 1b), and the native IS3-interrupted finO locus was replaced with the aph-Plac-gfp cassette from pKJK5 using primers NQO2_9 and NQO2_12. The replaced region was amplified with primers NQO2_5 and NQO2_6 and sent for Sanger sequencing to confirm the correct replacement.
Strain construction
For differential identification of plaques in coculture and transconjugant selection, constitutive sgfp2* or mScarlet-I loci along with a chloramphenicol resistance locus were added to E. coli, S. enterica, and P. putida strains (Supplementary Dataset 1a). Tn7 transposons from pMRE-Tn7-145 and pMRE-Tn7-152 were introduced into the attTn7 site via conjugation from an auxotrophic E. coli donor strain as previously described64.
The RP4 plasmid was introduced into chromosomally tagged S. enterica and P. putida via conjugation using the BL103 donor strain. Overnight liquid cultures of donor and recipient strains were mixed at a 1:10 (donor:recipient) ratio and concentrated into a volume of 20 µl by centrifugation. The cell slurry was transferred to the top of a 12 mm, 0.45 µm nitrocellulose membrane on the surface of an LB agar plate for 4 h at the temperature optimal for the recipient strain (see Supplementary Dataset 1a) to permit conjugation. Transconjugants were selected by plating on LB supplemented with chloramphenicol and kanamycin. For FΔfinO::aph-Plac-gfp, a plasmid and prophage-cured S. enterica strain (SNW555, D23580 ΔΦ ΔpSLT-BT ΔpBT1 ΔpBT2 ΔpBT365) was used to mitigate any interference from the IncF Salmonella virulence plasmid (pSLT) and native prophages. The FΔfinO::aph-Plac-gfp plasmid was introduced into SNW555 and NQO62 via conjugation, exactly as described above.
For IncP-PDP host range experiments, the pKJK5 plasmid was transconjugated into P. putida KT2440, Pectobacterium atrosepticum SCRI1043, Shewanella oneidensis MR1, Serratia marcescens ATCC 1388, Enterobacter cloacae ATCC 13047, Pseudomonas fluorescens Pf0-1, Klebsiella pneumoniae PCI 602, Citrobacter werkmanii IC19Y, Citrobacter freundii ATCC 8090, Edwardsiella tarda ATCC 15947, Proteus mirabilis BB2000 Δugd (immobile mutant), and S. enterica serovar Typhimurium LT2 via the cross streak method. The pKJK5 plasmid contains gfp under the control of the Plac promoter, which results in derepressed fluorescence in non-E. coli (lac negative) hosts66. Additionally, the pKJK5 donor strain, NQO38, constitutively expresses mCherry, permitting easy identification of transconjugants without need for dual selection. Briefly, an overnight liquid culture of the donor strain NQO38 was applied vertically in a single streak down the center of an LB agar plate. Subsequently, an overnight liquid culture of a recipient strain was streaked horizontally across the plate, crossing over the donor streak. After incubation at the recipient optimal temperature, transconjugant colonies were purified on the basis of green fluorescence signal.
Optimization of PDP detection by fluorescence-enabled coculture
To validate the use of fluorescence-enabled coculture to detect PDPs, a S. enterica-specific phage (9NA), a P. putida-specific phage (SVOΦ44), and an IncP PDP (PRD1) were mixed at equal concentration (~10³ PFU/mL). In total, 100 µL each of overnight liquid cultures of S. enterica LT2 attTn7::Tn7-mScarlet-I + RP4 (NQO89) and P. putida attTn7::Tn7-SGFP2* + RP4 (NQO80) was added to 3 mL molten LB top agar, along with 10 µL of the phage mixture, and poured onto an LB agar plate. Plates were incubated overnight at 30 °C and then imaged in brightfield, red fluorescence channel, and green fluorescence channel using a custom imaging platform.
The custom imaging setup has a Canon EOS R camera with a Canon 100 mm lens with LEDs paired with excitation and emission filters (Green: 490–515 nm LED with 494 nm EX and 540/50 nm EM filters; Red: 567 nm LED with 562 nm EX and 641/75 nm EM filters). Excitation filters are held in a Starlight Xpress emission filter wheel. The camera, LEDs, and filter wheel are all controlled with custom software. Exposure times were 0.25 s [green] and 0.5 s [red], with camera set to ISO-200 and f/3.5 as experimentally determined to maximize dynamic range. Imaging parameters were selected such that when green and red fluorescence channel images were merged, all three phages could be easily identified by fluorescent plaque phenotype: 9NA phages were visible as green plaques (only P. putida attTn7::Tn7-SGFP2* + RP4 grows in these areas), SVOΦ44 plaques were visible as red plaques (only S. enterica LT2 attTn7::Tn7-mScarlet-I + RP4 grows in these areas) and PRD1 plaques had no fluorescent signal (neither species grew in these areas). The red and green channels were separated from their raw images, their exposure linearly rescaled, and remapped to the red and blue channels respectively (to enhance visual color contrast). All image manipulations were done with scikit-image v0.17.267.
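The rescaling and channel remapping step can be sketched as follows. This is a minimal illustration using plain Python lists in place of image arrays, not the scikit-image code used in the study; the function names, the toy pixel values, and the intensity bounds are all hypothetical.

```python
def rescale_linear(channel, low, high):
    """Linearly map intensities in [low, high] to [0.0, 1.0], clipping values outside."""
    span = high - low
    if span <= 0:
        raise ValueError("high must be greater than low")
    return [[min(1.0, max(0.0, (v - low) / span)) for v in row] for row in channel]


def merge_red_green_as_red_blue(red, green):
    """Build RGB pixels with the red signal in R and the green signal in B."""
    return [
        [(r, 0.0, g) for r, g in zip(red_row, green_row)]
        for red_row, green_row in zip(red, green)
    ]


# Toy 2x2 "images" (hypothetical raw intensities for each fluorescence channel).
red_raw = [[0, 50], [100, 200]]
green_raw = [[10, 10], [30, 20]]

red_scaled = rescale_linear(red_raw, low=0, high=100)
green_scaled = rescale_linear(green_raw, low=10, high=30)
merged = merge_red_green_as_red_blue(red_scaled, green_scaled)
print(merged)
```

In such a merged image, a pixel with neither signal stays black, which corresponds to the non-fluorescent plaque phenotype expected for a PDP.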
Collection and processing of environmental samples
For phage isolation, wastewater primary influent was collected from a total of four sites in Massachusetts, along with soil, animal waste, and compost from farms, community gardens and parks close to Boston, USA. Sample collection details can be found in Supplementary Dataset 1e (Environmental Samples). All samples were resuspended (if predominantly solid matter) in up to 25 mL of sterile water and incubated at 4 °C for 12 h with frequent vortexing to encourage suspension and homogenization of viral particles. The resuspended samples were centrifuged at 4000 × g for 30 min to pellet large biomass, and the clarified supernatant was filter sterilized using a 0.22 µm vacuum driven filtration unit to remove bacteria. Filtered samples were stored at 4 °C. For metaviromic sequencing and phage enumeration in wastewater influent, two 100 mL samples were collected in September 2022 from two separate intake sources of wastewater at a treatment plant in Boston, MA. Samples were processed by filtration as described above, except that processing was initiated immediately upon sample collection to avoid any sample degradation.
Isolation of novel environmental PDPs by fluorescence-enabled coculture
For high throughput discovery of PDPs targeting the IncP plasmid pilus, co-culture lawns of S. enterica LT2 attTn7::Tn7-mScarlet-I + RP4 (NQO89) and P. putida attTn7::Tn7-SGFP2* + RP4 (NQO80) were prepared as described earlier, except that 100 µl of filtered environmental samples containing putative novel phages were added instead of the reference phages. In cases where phage load in samples was too high, and the resulting lawn did not grow uniformly due to widespread lysis, the filtered sample added to the lawns was serially diluted 10-fold until single plaques were obtained. Putative PDP plaques (exhibiting no fluorescence) were sampled using sterile filter tips, diluted and re-plated for single plaques at least twice to ensure purity. For the IncF plasmid-targeting phages, the procedure was the same, except that strains SVO348 (E. coli MG1655 attTn7::mScarlet-I-gmR + FΔfinO::aph-gfp) and NQO87 (S. enterica D23580 ΔΦ ΔpSLT-BT ΔpBT1 ΔpBT2 ΔpBT3 + FΔfinO::aph-gfp) were used in the lawns. The plasmid and prophage-cured strain of S. enterica was used for the IncF-dependent phage screen to mitigate interference from the native Salmonella virulence plasmid (which belongs to incompatibility group F68) and prophages.
Once putative novel PDPs had been purified from environmental samples, 5 µl drops of tenfold dilutions were plated on lawns of isogenic plasmid-free host strains (BL131, SVO126, SVO50, or SNW555) to confirm plasmid-dependency. We note that false positives (i.e., plasmid-independent phages that infected both species in the coculture) were occasionally obtained during the IncF PDP isolation, due to the phylogenetic proximity between E. coli and S. enterica, suggesting that use of more distinct host strains (if possible for the plasmid of interest) maximizes assay efficiency.
Phage DNA and RNA extraction and sequencing
Pure phage stocks that had undergone at least two rounds of purification from single plaques and had titers of at least 10⁹ PFU/mL were used for nucleic acid extraction. The Invitrogen Purelink viral RNA/DNA mini kit was used to extract genetic material from all phages according to manufacturer instructions. High absorbance ratios (260/280) of 2.0–2.2 were considered indicative of RNA phage genomes. To remove host material contamination, putative RNA samples were incubated with DNase I (NEB) for 1 h at 37 °C and inactivated afterwards with EDTA at a final concentration of 5 mM. RNA was reverse transcribed using SuperScript™ IV VILO™ (Invitrogen™) for first strand synthesis, per the manufacturer's instructions. Second strand synthesis was performed by incubating the cDNA with DNA Ligase, DNA Polymerase I, and RNase H in NEBNext® Second Strand Synthesis Reaction Buffer (NEB) at 16 °C for 3 h. cDNA was then used in downstream library preparation. Additionally, as all known non-RNA IncF PDPs have ssDNA genomes which are incompatible with tagmentation-based library preparation, any putative DNA sample from IncF PDPs was subjected to second strand synthesis as described above. Illumina sequencing libraries of the DNA and cDNA samples were prepared as previously described69. Sequencing was carried out on the Illumina Novaseq or iSeq to produce 150 bp paired end reads. To improve the assembly quality of the RNA phage genomes, we conducted a second round of sequencing of the same RNA samples using the NGS provider SeqCoast Genomics. RNA samples were prepared for whole genome sequencing using an Illumina Stranded Total RNA Prep Ligation with Ribo-Zero Plus Microbiome and unique dual indexes. Sequencing was performed on the Illumina NextSeq2000 platform using a 300 cycle flow cell kit to produce 150 bp paired reads. The genetic composition (dsDNA vs ssDNA) for phage FtMidnight was inferred via fluorescence signal using the Quant-IT dsDNA kit (Invitrogen).
For metaviromic DNA extraction, 45 mL of freshly filtered influent from each of the two extraction sites was concentrated 100× into 500 µl using 100 kDa molecular weight cutoff centrifugal filter units (Amicon). Nucleic acids were extracted from 200 µl of concentrated filtrate, and sent to SeqCenter for library preparation and Illumina sequencing. Sample libraries were prepared using the Illumina DNA Prep kit and IDT 10 bp UDI indices, and sequenced on an Illumina NextSeq 2000, producing 2 × 151 bp reads.
Phage genome assembly and annotation
Sequencing reads were adapter trimmed (NexteraPE adapters) and quality filtered with Trimmomatic v.0.3970. For samples with very high read depth, filtered reads were subsampled with rasusa v.0.5.071 to an ~200× coverage to facilitate assembly. The reads were then assembled with Unicycler v.0.4.872 or rnaviralSPAdes v.3.15.573. The annotations from curated PRD1, MS2, Qbeta, and M13 reference genomes were transferred to the resulting assemblies with RATT v.1.0.374 and manually curated for completion. Phage isolates with redundant genomes were removed from the analysis and all phages included in this study represent unique isolates. Reads are deposited in the NCBI Sequence Read Archive (SRA). All accession numbers for previously published genomes and those generated in this study are listed in Supplementary Dataset 1c.
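The arithmetic behind subsampling to a target coverage can be sketched as below. This is an illustration of the calculation only, not a description of how rasusa is implemented or invoked; the function name and the example read count, read length, and genome size are hypothetical.

```python
def subsample_fraction(n_reads, mean_read_length_bp, genome_size_bp, target_coverage=200.0):
    """Fraction of reads to keep so that mean coverage is close to the target.

    Returns 1.0 when the dataset is already at or below the target coverage.
    """
    current_coverage = n_reads * mean_read_length_bp / genome_size_bp
    if current_coverage <= target_coverage:
        return 1.0
    return target_coverage / current_coverage


# Hypothetical deep sample: 1 million 150 bp reads for a 15 kb phage genome.
fraction = subsample_fraction(n_reads=1_000_000, mean_read_length_bp=150, genome_size_bp=15_000)
print(f"Keep {fraction:.1%} of reads")
```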
Nucleotide diversity
To calculate nucleotide diversity among the alphatectiviruses, all the assembled isolates were aligned to the PRD1 reference genome with minimap2 v2.2475. Resulting alignments were processed with bcftools v1.976 and samtools v1.677 to then calculate nucleotide diversity with vcftools v0.1.1678 with a sliding window of size 100 bp. Results were plotted with seaborn v0.12.279 and matplotlib80. Novel species classifications for alphatectiviruses were proposed where average pairwise nucleotide identity was less than 95%30.
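For readers unfamiliar with the statistic, windowed nucleotide diversity (the mean number of pairwise differences per site) can be sketched as follows. This is a simplified illustration operating directly on pre-aligned, gap-free sequences of equal length with consecutive non-overlapping windows; it is not the vcftools implementation used in this study, and the function names and toy sequences are hypothetical.

```python
from itertools import combinations


def pairwise_differences(seq_a, seq_b):
    """Count positions at which two aligned sequences differ."""
    return sum(1 for a, b in zip(seq_a, seq_b) if a != b)


def windowed_pi(aligned_seqs, window=100):
    """Return (window_start, pi) tuples for consecutive windows along the alignment."""
    length = len(aligned_seqs[0])
    if any(len(s) != length for s in aligned_seqs):
        raise ValueError("all sequences must have the same aligned length")
    pairs = list(combinations(aligned_seqs, 2))
    if not pairs:
        raise ValueError("at least two sequences are required")
    results = []
    for start in range(0, length, window):
        end = min(start + window, length)
        diffs = sum(pairwise_differences(a[start:end], b[start:end]) for a, b in pairs)
        results.append((start, diffs / (len(pairs) * (end - start))))
    return results


# Toy alignment of three 4 bp sequences, analysed in 2 bp windows.
pi_values = windowed_pi(["AAAA", "AAAT", "AATT"], window=2)
print(pi_values)
```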
Phage enumeration in wastewater by plaque assay
Two freshly filtered wastewater influent samples were processed as previously described (see “Collection and processing of environmental samples”) and the concentration of phages in volumes of 10, 100, and 500 μL were enumerated by single-host plaque assay on strains SVO50, BL131, and SVO126 and by fluorescence-enabled co-culture plaque assay on NQO89 and NQO80. All phage enumeration was performed with three biological replicates. Titers per milliliter were calculated and plotted for both sites.
Determination of phage host range
Host range of the IncP-PDPs was assessed by traditional EoP assay or by killing in liquid culture by OD660 measurement, based on a previously described method43. All the phages were challenged against the following bacteria containing the pKJK5 plasmid: P. putida KT2440, Pectobacterium atrosepticum SCRI1043, Shewanella oneidensis MR1, Serratia marcescens ATCC 1388, Enterobacter cloacae ATCC 13047, Pseudomonas fluorescens Pf0-1, Klebsiella pneumoniae PCI 602, Citrobacter werkmanii IC19Y, Citrobacter freundii ATCC 8090, Edwardsiella tarda ATCC 15947, Proteus mirabilis BB2000 Δugd, and S. enterica serovar Typhimurium LT2. These hosts were chosen as they all showed some degree of susceptibility to IncP-dependent phages when transconjugated with the pKJK5 plasmid, indicating proper elaboration of the IncP pilus.
For the high throughput determination of host range, phages were normalized to a titer of 10⁷ PFU/mL as measured in strain NQO36, with the exception of PRDchartreuse, PRDcanary, PRDjuniper, and PRDmamacita, which were normalized to the same titer in NQO37, due to their inability to replicate to high titers in NQO36. Growth curve experiments were set up in 96-well plates with each well containing 180 µL of bacterial culture at OD600 of ~0.1 and 20 µL of phage stock when appropriate, for a final concentration of 10⁶ PFU/mL. They were grown in a plate reader (Tecan Sunrise™) for 10 h with shaking, at the optimal temperature for the strain (see Supplementary Dataset 1a), measuring the optical density at 660 nm, every 5 min. Each 96-well plate had a phage-free control, cell-free control, and the strain-phage condition in triplicate. To calculate the liquid assay score of each host-phage pair we followed the method described previously43. Briefly, we calculate the area under the growth curve for each host-phage pair, as well as for its corresponding phage-free control grown in the same plate. The mean area under the curve value is then normalized as a percentage of the mean area under the curve in the phage-free control. Growth curves are plotted with shading representing the standard error. Liquid assay scores are plotted as a heatmap, and are vertically sorted according to the previously computed alphatectivirus tree and horizontally sorted according to a 16S tree of the bacterial hosts (see Supplementary Dataset 1a).
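The area-under-the-curve normalization described above can be sketched as follows. This minimal example integrates with the trapezoid rule and reports growth with phage as a percentage of the phage-free control, following the wording above; the cited method may define the final score differently (for example as the complement of this value), so that choice, the function names, and the toy readings are assumptions.

```python
def area_under_curve(times, ods):
    """Trapezoid-rule area under a growth curve."""
    if len(times) != len(ods) or len(times) < 2:
        raise ValueError("need matching time and OD series with at least two points")
    area = 0.0
    for i in range(1, len(times)):
        area += (times[i] - times[i - 1]) * (ods[i] + ods[i - 1]) / 2
    return area


def growth_percent_of_control(times, phage_replicates, control_replicates):
    """Mean AUC with phage, expressed as a percentage of the mean control AUC."""
    phage_aucs = [area_under_curve(times, r) for r in phage_replicates]
    control_aucs = [area_under_curve(times, r) for r in control_replicates]
    mean_phage = sum(phage_aucs) / len(phage_aucs)
    mean_control = sum(control_aucs) / len(control_aucs)
    return 100.0 * mean_phage / mean_control


# Toy data (hypothetical OD readings at 0, 1 and 2 h).
times_h = [0, 1, 2]
control = [[0.0, 1.0, 2.0], [0.0, 1.0, 2.0]]
with_phage = [[0.0, 0.5, 1.0], [0.0, 0.5, 1.0]]
score = growth_percent_of_control(times_h, with_phage, control)
print(f"Growth relative to control: {score:.1f}%")
```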
Adsorption assay
In total, 50 μl of exponentially growing SVO126 and NQO37 cells at a density of ~10⁸ CFU/ml were mixed in a 96-well plate with 50 μl of PRD1 or PRDcerulean at a density of 10⁶ PFU/ml to achieve an MOI of ~0.01. Adsorption was done in triplicate for each strain-phage combination, and cell-free media controls were used in place of cells to quantify the maximum unadsorbed concentration of phage. After 10 min adsorption time at 37 °C, the 96-well plate containing the cell-phage mixtures was mounted on top of a sterile 96-well MultiScreenHTS GV Filter Plate with sterile 0.22 µm membrane (Millipore) and centrifuged at 4000 × g to remove cells and adsorbed phages. Unadsorbed phage was quantified by serial dilution of the filtrate and plaque assay as described in Phage Replication above. Unadsorbed phage were represented as a percent of the maximum unadsorbed phage derived from cell-free media controls.
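The two calculations in this assay, the multiplicity of infection and the percentage of unadsorbed phage, can be sketched as below. The cell and phage densities and volumes are those stated above; the function names and the example filtrate and control titers are hypothetical.

```python
def multiplicity_of_infection(phage_pfu_per_ml, phage_volume_ml, cells_cfu_per_ml, cell_volume_ml):
    """Ratio of phage particles to cells in the mixture."""
    return (phage_pfu_per_ml * phage_volume_ml) / (cells_cfu_per_ml * cell_volume_ml)


def percent_unadsorbed(filtrate_titer, cell_free_control_titer):
    """Phage left in the filtrate as a percentage of the cell-free control."""
    if cell_free_control_titer <= 0:
        raise ValueError("control titer must be positive")
    return 100.0 * filtrate_titer / cell_free_control_titer


# Values from the text: 50 ul of phage at 1e6 PFU/ml mixed with 50 ul of cells at ~1e8 CFU/ml.
moi = multiplicity_of_infection(1e6, 0.05, 1e8, 0.05)

# Hypothetical titers measured after filtration.
remaining = percent_unadsorbed(filtrate_titer=1e5, cell_free_control_titer=5e5)
print(f"MOI = {moi:.2f}; unadsorbed = {remaining:.0f}%")
```

Because the filtrate and the cell-free control are diluted identically on mixing, the twofold dilution cancels out of the percentage.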
Phage recombination
To replace the holin gene of PRDcerulean, S. enterica-RP4 with recombineering plasmid pSIM5tet (SVO296) was used. Briefly, bacteria were grown to exponential phase in LB at 30 °C, with selective antibiotics for both plasmids, as specified in Supplementary Dataset 1b. The culture was then infected with high titer PRDcerulean lysate to a final concentration of ~10⁷ PFU/mL for 15 min. The culture was then induced for recombination for 15 min at 42 °C. Electro-competent cells were then prepared by cooling the cells for 10 min followed by three washes with cold sterile Milli-Q water at a 1:1 volume, and concentrated 50 times in cold sterile Milli-Q water. Competent cells were then mixed with ~100 ng of DNA in 1 mm gap cuvettes and electroporated (1.8 kV, 25 µF, 200 Ω). Electroporated bacteria were mixed with 100 µL of fresh overnight liquid culture of S. enterica RP4 (NQO89) and the mixture was plated in a double-layer overlay plaque assay as described above. Only successful recombinant phages formed plaques on the bacterial lawn, and these were isolated and purified for further analysis. The holin gene DNA substrates for recombination were obtained with primers NQO3_13–NQO3_20.
Holin structure prediction
The topology of the PRDcerulean holin protein (P35cer) was predicted with the CCTOP v1.1.0 web server81 and drawn with Protter v.1.082. The structure was predicted with ColabFold v1.5.383, and rendered with PyMol v.2.5.684. Model parameters are specified in the code repository. The holin multiple sequence alignment was generated with clustalo v1.2.485 and visualized with UGENE v.38.186.
FtMidnight-resistant mutants
To isolate mutants of the F-plasmid that were spontaneously resistant to FtMidnight, a dilution series of phage was plated on SVO348, with kanamycin in the top agar in order to select against phage resistance via plasmid loss (the F plasmid derivative FΔfinO::aph-Plac-gfp contains a kanamycin resistance marker), as already described in Phage Replication above. Plates were incubated for 24 h at 37 °C, and then a further 24 h at room temperature, after which phage resistant micro-colonies were visible within the zones of phage lysis. Two independent colonies were picked and restreaked onto LB kanamycin agar. Restreaking was repeated once more to ensure purity. To understand the resistance phenotype, the mutants were screened by plaque assay for susceptibility to FtMidnight, MS2, and Qbeta. To find the causative mutations, genomic DNA was extracted using the Quick-DNA Miniprep Plus Kit (Zymo) according to the manufacturer's instructions, and sequenced to >30× genome coverage with Nanopore technology using v14 library preparation chemistry and an R10.4.1 flow cell by the provider Plasmidsaurus' bacterial genome sequencing service.
FtMidnight structural annotation and homology search
The genome of FtMidnight was originally annotated with prokka v1.14.687 using the PHROGs database88. Annotations were manually curated to identify specific structural components by template-based homology search against the PDB_mmCIF70 database with HHpred through the MPI Bioinformatics Toolkit89. The structure model for FtMidnight was based on these structural hits, which can be found in Supplementary Dataset 2. A list of phage genomes containing homologs of the gp18-gp22 proteins was collected by searching with the tblastn90 web server against the nucleotide collection. A selection of phages with a conserved distal-tail region were visualized with clinker91, and the receptor was annotated if found in the literature. Accessions and references can be found in Supplementary Dataset 2.
Annotation of CRISPR-Cas and RM in bacterial genomes
CRISPR-Cas systems and spacers were annotated with CRISPRCasTyper v1.8.092, and RMs were annotated with DefenseFinder v1.0.993. All the spacers were then searched with blastn v2.15.094 against the complete alphatectivirus genomes, but no hits were recovered from this search. Accessions for all bacterial genomes can be found in Supplementary Dataset 1a, and results of this search are included in Supplementary Dataset 3.
Search and comparison of tectiviruses in metagenomic assembled genomes
To collect metagenomic assembled genomes of tectiviruses, a search was performed in the JGI IMG/VR95 for UViGs matching Pfam model PF0901896, which corresponds to the tectivirus capsid protein. The recovered assemblies were annotated with prokka v1.14.687 using the PHROGs database88. To refine these annotations, our large collection of alphatectiviruses was used to build protein alignments for each protein in the PRD1 genome, using clustalo v1.2.485 and manually curated for quality. These alignments were then used to build hmm profile models with HMMER v3.3.197, to search them against the collected tectivirus MAGs. A representative selection of annotated MAGs was selected and visualized with clinker v0.0.2891 and colored to show homology. Shaded connectors represent proteins with >0.3 sequence identity, while annotations with the same color represent significant (p < 0.01) homologs according to the HMMER search.
Search for alphatectiviruses in metagenomic reads
Kraken2 v2.1.298 was used to search for the presence of alphatectivirus reads in metagenomic datasets. A custom database was built by adding our new alphatectivirus assemblies to the default RefSeq viral reference library. With this database, a collection of reads from wastewater sequencing projects was searched. The SRA BioProject accession numbers of this collection can be found in Supplementary Dataset 2. The individual reads from each sequencing run that were classified as belonging to alphatectiviruses according to Kraken2 were extracted and mapped to the PRD1 reference genome with minimap2 v2.22. The resulting mapped reads were processed with samtools v1.6 and visualized with IGV v2.11.499.
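The read extraction step can be sketched as a filter over the standard Kraken2 per-read output, which is tab-delimited with the classification status, read ID, and assigned taxonomy ID in the first three columns. This is a minimal illustration rather than the code used in this study (which is in the code repository); the function name, the example lines, and the taxonomy IDs are made up for the example.

```python
def classified_read_ids(kraken_lines, target_taxids):
    """Return IDs of reads classified ('C') to any taxonomy ID in target_taxids."""
    read_ids = []
    for line in kraken_lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 3:
            continue  # skip malformed or empty lines
        status, read_id, taxid = fields[0], fields[1], fields[2]
        if status == "C" and taxid in target_taxids:
            read_ids.append(read_id)
    return read_ids


# Hypothetical taxonomy IDs standing in for the genus and one of its species.
alphatectivirus_taxids = {"999001", "999002"}

example_lines = [
    "C\tread1\t999001\t150\t999001:116",
    "U\tread2\t0\t150\t0:116",
    "C\tread3\t12345\t150\t12345:116",
    "C\tread4\t999002\t150\t999002:116",
]
hits = classified_read_ids(example_lines, alphatectivirus_taxids)
print(hits)
```

Reads assigned to taxa below the genus carry their own taxonomy IDs, so the target set would need to include every descendant of interest.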
Phylogenetic trees
For the alphatectivirus tree, all previously published genomes and those collected in this study were aligned with clustalo v1.2.4. The resulting multiple sequence alignment was manually curated to ensure quality of the alignment. The tree was then built with iqtree v2.2.0.3100, and visualized with iTOL v6.7101.
For the Fiersviridae and Inovirus trees, the protein sequence of the RNA-dependent RNA polymerase (replicase) or the whole nucleotide content, respectively, were aligned. For the FtMidnight distal-tail protein trees, one alignment per protein was generated. All alignments were performed with clustalo v1.2.4, the trees were then generated with phyml v3.2.0102 and visualized with FigTree v.1.4.4103.
For the tectivirus ATPase tree, the amino acid sequences for protein P9 (ATPase) from all known tectiviruses were aligned with clustalo v1.2.4. This alignment was used to create an hmm profile model with HMMER, which was then used to search the amino acid sequences extracted from the annotated MAGs (see “Search and comparison of tectiviruses in metagenomic assembled genomes”). Significant hits were extracted and aligned to the model with HMMER. We also included in this alignment the previously metagenomic-assembled tectiviruses listed in Yutin et al.50, and a selection of characterized representatives of the five tectivirus genera. A tree of the resulting ATPase alignment was built with phyml v3.2.0, and visualized with iTOL v6.7.
Specific alignment and tree building parameters can be found in the code repository. All accession numbers of sequences used to build these trees are listed in Supplementary Dataset 2.
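As an illustration of the hit-filtering step in the ATPase profile search described above, the following is a minimal sketch that selects significant hits from an HMMER `--tblout` table. It is not the authors' implementation. It assumes the standard whitespace-delimited target table in which the first column is the target name, the fifth is the full-sequence E-value and the sixth is the full-sequence bit score. The function name `significant_hits`, the example records and the E-value cutoff of 1e-5 are hypothetical; the parameters actually used are in the code repository.

```python
from typing import Iterable, List, Tuple


def significant_hits(tblout_lines: Iterable[str], max_evalue: float = 1e-5) -> List[Tuple[str, float, float]]:
    """Return (target, E-value, bit score) for hits at or below the E-value cutoff."""
    hits = []
    for line in tblout_lines:
        # Comment and header lines in the target table start with '#'
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.split()
        if len(fields) < 6:
            continue  # not a complete hit record
        target = fields[0]
        evalue = float(fields[4])  # full-sequence E-value
        score = float(fields[5])   # full-sequence bit score
        if evalue <= max_evalue:
            hits.append((target, evalue, score))
    return hits


# Hypothetical example records in target-table layout
example = [
    "# target name  accession  query name  accession  E-value  score  bias ...",
    "prot_A - P9_ATPase - 1.2e-40 140.3 0.1 1.5e-40 140.0 0.1 1.0 1 0 0 1 1 1 1 hypothetical protein",
    "prot_B - P9_ATPase - 0.5 10.2 0.0 0.7 9.8 0.0 1.2 1 0 0 1 1 0 0 -",
]
for target, evalue, score in significant_hits(example):
    print(target, evalue, score)
```

The names of the retained targets could then be used to pull the corresponding protein sequences from the MAG annotations before aligning them to the model.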
Electron microscopy
Carbon grids were glow discharged using an EMS100x Glow Discharge Unit for 30 s at 25 mA. High-titer phage stocks were diluted 1:10 in water and 5 µL was adsorbed to the glow-discharged carbon grid for 1 min. Excess sample was blotted with filter paper and the grids were washed once with water before staining with 1% uranyl acetate for 20 s. Excess stain was blotted with filter paper and the grids were air dried prior to examination with a Tecnai G2 Spirit BioTWIN Transmission Electron Microscope at the Harvard Medical School Electron Microscopy Facility.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
Raw sequencing reads have been deposited in the NCBI BioProject database under accession number PRJNA954020. Accession numbers for novel phage genomes generated in this study can be found in Supplementary Dataset 1c. Raw data used in figures are available in a GitHub repository.
Code availability
All code is available in a GitHub repository: https://github.com/baymlab/2023_QuinonesOlvera-Owen.
References
Greene, S. E. & Reid, A. Viruses Throughout Life & Time: Friends, Foes, Change Agents: A Report on an American Academy of Microbiology Colloquium San Francisco // July 2013 (American Society for Microbiology, 2013).
Lefkowitz, E. J. et al. Virus taxonomy: the database of the International Committee on Taxonomy of Viruses (ICTV). Nucleic Acids Res. 46, D708–D717 (2018).
Dimitrov, D. S. Virus entry: molecular mechanisms and biomedical applications. Nat. Rev. Microbiol. 2, 109–122 (2004).
Bertozzi Silva, J., Storms, Z. & Sauvageau, D. Host receptors for bacteriophage adsorption. FEMS Microbiol. Lett. 363, fnw002 (2016).
Waksman, G. From conjugation to T4S systems in gram-negative bacteria: a mechanistic biology perspective. EMBO Rep. 20, e47012 (2019).
Goessweiner-Mohr, N., Arends, K., Keller, W. & Grohmann, E. Conjugation in gram-positive bacteria. Microbiol. Spectr. 2, PLAS-0004-2013 (2014).
Frost, L. S. Conjugative pili and pilus-specific phages. in Bacterial Conjugation (ed. Clewell, D. B.) 189–221 (Springer, US, 1993).
Bottery, M. J. Ecological dynamics of plasmid transfer and persistence in microbial communities. Curr. Opin. Microbiol. 68, 102152 (2022).
Mäntynen, S., Sundberg, L.-R., Oksanen, H. M. & Poranen, M. M. Half a century of research on membrane-containing bacteriophages: bringing new concepts to modern virology. Viruses 11, 76 (2019).
Wolf, Y. I. et al. Origins and evolution of the global RNA virome. mBio 9, e02329-18 (2018).
Barderas, R. & Benito-Peña, E. The 2018 Nobel Prize in Chemistry: phage display of peptides and antibodies. Anal. Bioanal. Chem. 411, 2475–2479 (2019).
George, L., Indig, F. E., Abdelmohsen, K. & Gorospe, M. Intracellular RNA-tracking methods. Open Biol. 8, 180104 (2018).
Koonin, E. V., Krupovic, M. & Yutin, N. Evolution of double-stranded DNA viruses of eukaryotes: from bacteriophages to transposons to giant viruses. Ann. N. Y. Acad. Sci. 1341, 10–24 (2015).
Jalasvuori, M., Friman, V.-P., Nieminen, A., Bamford, J. K. H. & Buckling, A. Bacteriophage selection against a plasmid-encoded sex apparatus leads to the loss of antibiotic-resistance plasmids. Biol. Lett. 7, 902–905 (2011).
Colom, J. et al. Sex pilus specific bacteriophage to drive bacterial population towards antibiotic sensitivity. Sci. Rep. 9, 12616 (2019).
Penttinen, R., Given, C. & Jalasvuori, M. Indirect selection against antibiotic resistance via specialized plasmid-dependent bacteriophages. Microorganisms 9, 280 (2021).
Ojala, V., Laitalainen, J. & Jalasvuori, M. Fight evolution with evolution: plasmid-dependent phages with a wide host range prevent the spread of antibiotic resistance. Evol. Appl. 6, 925–932 (2013).
DelaFuente, J. et al. Within-patient evolution of plasmid-mediated antimicrobial resistance. Nat. Ecol. Evol. 6, 1980–1991 (2022).
Anderson, R. M. The pandemic of antibiotic resistance. Nat. Med. 5, 147–149 (1999).
Getino, M. & de la Cruz, F. Natural and artificial strategies to control the conjugative transmission of plasmids. Microbiol. Spectr. 6, 6.1.03 (2018).
Conlan, S. et al. Plasmid dynamics in KPC-positive Klebsiella pneumoniae during long-term patient colonization. mBio 7, e00742-16 (2016).
Vinjé, J., Oudejans, S. J. G., Stewart, J. R., Sobsey, M. D. & Long, S. C. Molecular detection and genotyping of male-specific coliphages by reverse transcription-PCR and reverse line blot hybridization. Appl. Environ. Microbiol. 70, 5996 (2004).
Meynell, G. G. & Lawn, A. M. Filamentous phages specific for the I sex factor. Nature 217, 1184–1186 (1968).
Bradley, D. E. et al. Broad host range, filamentous bacterial virus. Microbiology 126, 389–396 (1981).
Nuttall, D., Maker, D. & Colleran, E. A method for the direct isolation of IncH plasmid-dependent bacteriophages. Lett. Appl. Microbiol. 5, 37–40 (1987).
He, Z., Parra, B., Nesme, J., Smets, B. F. & Dechesne, A. Quantification and fate of plasmid-specific bacteriophages in wastewater: beyond the F-coliphages. Water Res. 227, 119320 (2022).
Popowska, M. & Krawczyk-Balska, A. Broad-host-range IncP-1 plasmids and their resistance potential. Front. Microbiol. 4, 40688 (2013).
Turton, J. F. et al. Wide distribution of Escherichia coli carrying IncF plasmids containing blaNDM-5 and rmtB resistance genes from hospitalized patients in England. J. Med. Microbiol. 71, 001569 (2022).
Woodford, N. et al. Complete nucleotide sequences of plasmids pEK204, pEK499, and pEK516, encoding CTX-M enzymes in three major Escherichia coli lineages from the United Kingdom, all belonging to the international O25:H4-ST131 clone. Antimicrob. Agents Chemother. 53, 4472–4482 (2009).
Adriaenssens, E. & Brister, J. R. How to name and classify your phage: an informal guide. Viruses 9, 70 (2017).
Brooks, L. E., Kaze, M. & Sistrom, M. Where the plasmids roam: large-scale sequence analysis reveals plasmids with large host ranges. Microb. Genomics 5, e000244 (2019).
Stanton, C. R., Petrovski, S. & Batinovic, S. Isolation of a PRD1-like phage uncovers the carriage of three putative conjugative plasmids in clinical Burkholderia contaminans. Res. Microbiol. https://doi.org/10.1016/j.resmic.2024.104202 (2024).
Krupovič, M., Cvirkaitė-Krupovič, V. & Bamford, D. H. Identification and functional analysis of the Rz/Rz1-like accessory lysis genes in the membrane-containing bacteriophage PRD1. Mol. Microbiol. 68, 492–503 (2008).
Callanan, J. et al. Expansion of known ssRNA phage genomes: from tens to over a thousand. Sci. Adv. 6, eaay5981 (2020).
MacNair, C. R., Rutherford, S. T. & Tan, M.-W. Alternative therapeutic strategies to treat antibiotic-resistant pathogens. Nat. Rev. Microbiol. 1–14 https://doi.org/10.1038/s41579-023-00993-0 (2023).
Huang, Y. et al. Structure and proposed DNA delivery mechanism of a marine roseophage. Nat. Commun. 14, 3609 (2023).
Hospenthal, M. K., Costa, T. R. D. & Waksman, G. A comprehensive guide to pilus biogenesis in gram-negative bacteria. Nat. Rev. Microbiol. 15, 365–379 (2017).
Hay, I. D. & Lithgow, T. Filamentous phages: masters of a microbial sharing economy. EMBO Rep. 20, e47427 (2019).
Tittes, C., Schwarzer, S. & Quax, T. E. F. Viral hijack of filamentous surface structures in archaea and bacteria. Viruses 13, 164 (2021).
Olsen, R. H., Siak, J.-S. & Gray, R. H. Characteristics of PRD1, a plasmid-dependent broad host range DNA bacteriophage. J. Virol. 14, 689–699 (1974).
Bamford, D. H., Caldentey, J. & Bamford, J. K. H. Bacteriophage PRD1: a broad host range dsDNA tectivirus with an internal membrane. in Advances in Virus Research, Vol. 45 (eds Maramorosch, K., Murphy, F. A. & Shatkin, A. J.) 281–319 (Academic Press, 1995).
Bamford, D. H., Rouhiainen, L., Takkinen, K. & Soderlund, H. Comparison of the lipid-containing bacteriophages PRD1, PR3, PR4, PR5 and L17. J. Gen. Virol. 57, 365–373 (1981).
Xie, Y., Wahab, L. & Gill, J. J. Development and validation of a microtiter plate-based assay for determination of bacteriophage host range and virulence. Viruses 10, 189 (2018).
Wang, I. N., Smith, D. L. & Young, R. Holins: the protein clocks of bacteriophage infections. Annu. Rev. Microbiol. 54, 799–825 (2000).
Bläsi, U., Fraisl, P., Chang, C.-Y., Zhang, N. & Young, R. The C-terminal sequence of the λ holin constitutes a cytoplasmic regulatory domain. J. Bacteriol. 181, 2922–2929 (1999).
Nayfach, S. et al. A genomic catalog of Earth’s microbiomes. Nat. Biotechnol. 1–11 https://doi.org/10.1038/s41587-020-0718-6 (2020).
Roux, S. et al. Cryptic inoviruses revealed as pervasive in bacteria and archaea across Earth’s biomes. Nat. Microbiol. 4, 1895–1906 (2019).
Edgar, R. C. et al. Petabase-scale sequence alignment catalyses viral discovery. Nature 602, 142–147 (2022).
Strange, J. E. S., Leekitcharoenphon, P., Møller, F. D. & Aarestrup, F. M. Metagenomics analysis of bacteriophages and antimicrobial resistance from global urban sewage. Sci. Rep. 11, 1600 (2021).
Yutin, N., Bäckström, D., Ettema, T. J. G., Krupovic, M. & Koonin, E. V. Vast diversity of prokaryotic virus genomes encoding double jelly-roll major capsid proteins uncovered by genomic and metagenomic sequence analysis. Virol. J. 15, 67 (2018).
Grahn, A. M., Haase, J., Lanka, E. & Bamford, D. H. Assembly of a functional phage PRD1 receptor depends on 11 genes of the IncP plasmid mating pair formation complex. J. Bacteriol. 179, 4733–4740 (1997).
Guo, X. et al. Global prevalence, characteristics, and future prospects of IncX3 plasmids: a review. Front. Microbiol. 13, 979558 (2022).
Mouftah, S. F. et al. Epidemic IncX3 plasmids spreading carbapenemase genes in the United Arab Emirates and worldwide. Infect. Drug Resist. 12, 1729–1742 (2019).
Liakopoulos, A. et al. Genomic and functional characterisation of IncX3 plasmids encoding blaSHV-12 in Escherichia coli from human and animal origin. Sci. Rep. 8, 7674 (2018).
Gaidelytė, A., Cvirkaitė-Krupovic, V., Daugelavicius, R., Bamford, J. K. H. & Bamford, D. H. The entry mechanism of membrane-containing phage Bam35 infecting Bacillus thuringiensis. J. Bacteriol. 188, 5925 (2006).
Laurinmäki, P. A., Huiskonen, J. T., Bamford, D. H. & Butcher, S. J. Membrane proteins modulate the bilayer curvature in the bacterial virus Bam35. Structure 13, 1819–1828 (2005).
Parra, B. et al. Isolation and characterization of novel plasmid-dependent phages infecting bacteria carrying diverse conjugative plasmids. Microbiol. Spectr. 12, e0253723 (2023).
Kauffman, K. M. et al. A major lineage of non-tailed dsDNA viruses as unrecognized killers of marine bacteria. Nature 554, 118–122 (2018).
Martinez-Hernandez, F. et al. Single-virus genomics reveals hidden cosmopolitan and abundant viruses. Nat. Commun. 8, 15892 (2017).
Sczyrba, A. et al. Critical Assessment of Metagenome Interpretation-a benchmark of metagenomics software. Nat. Methods 14, 1063–1071 (2017).
Roux, S., Emerson, J. B., Eloe-Fadrosh, E. A. & Sullivan, M. B. Benchmarking viromics: an in silico evaluation of metagenome-enabled estimates of viral community composition and diversity. PeerJ 5, e3817 (2017).
Bernheim, A. & Sorek, R. The pan-immune system of bacteria: antiviral defence as a community resource. Nat. Rev. Microbiol. 18, 113â119 (2020).
Kropinski, A. M., Mazzocco, A., Waddell, T. E., Lingohr, E. & Johnson, R. P. Enumeration of bacteriophages by double agar overlay plaque assay. in Methods in Molecular Biology. https://doi.org/10.1007/978-1-60327-164-6_7 (Clifton, NJ, 2009).
Schlechter, R. O. et al. Chromatic bacteria – a broad host-range plasmid and chromosomal insertion toolbox for fluorescent protein expression in bacteria. Front. Microbiol. 9, 3052 (2018).
Owen, S.V. et al. Prophages encode phage-defense systems with cognate self-immunity. Cell Host Microbe 29, 1620–1633 (2021).
Klümper, U. et al. Broad host range plasmids can invade an unexpectedly diverse fraction of a soil bacterial community. ISME J. 9, 934–945 (2015).
van der Walt, S. et al. scikit-image: image processing in Python. PeerJ 2, e453 (2014).
Villa, L., García-Fernández, A., Fortini, D. & Carattoli, A. Replicon sequence typing of IncF plasmids carrying virulence and resistance determinants. J. Antimicrob. Chemother. 65, 2518–2529 (2010).
Baym, M. et al. Inexpensive multiplexed library preparation for megabase-sized genomes. PLoS ONE 10, e0128036 (2015).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Hall, M. Rasusa: randomly subsample sequencing reads to a specified coverage. JOSS 7, 3941 (2022).
Wick, R. R., Judd, L. M., Gorrie, C. L. & Holt, K. E. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput. Biol. 13, e1005595 (2017).
Meleshko, D., Hajirasouliha, I. & Korobeynikov, A. coronaSPAdes: from biosynthetic gene clusters to RNA viral assemblies. Bioinformatics 38, 1–8 (2021).
Otto, T. D., Dillon, G. P., Degrave, W. S. & Berriman, M. RATT: rapid annotation transfer tool. Nucleic Acids Res. 39, e57 (2011).
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Danecek, P. et al. Twelve years of SAMtools and BCFtools. GigaScience 10, giab008 (2021).
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–2158 (2011).
Waskom, M. seaborn: statistical data visualization. JOSS 6, 3021 (2021).
Hunter, J. D. Matplotlib: A 2D graphics environment. Comput. Sci. Eng. 9, 90–95 (2007).
Dobson, L., Reményi, I. & Tusnády, G. E. CCTOP: a Consensus Constrained TOPology prediction web server. Nucleic Acids Res. 43, W408–W412 (2015).
Omasits, U., Ahrens, C. H., Müller, S. & Wollscheid, B. Protter: interactive protein feature visualization and integration with experimental proteomic data. Bioinformatics 30, 884–886 (2014).
Mirdita, M. et al. ColabFold: making protein folding accessible to all. Nat. Methods 19, 679–682 (2022).
Schrödinger, L. & DeLano, W. The PyMOL molecular graphics system. https://pymol.org/support.html (2010).
Sievers, F. et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol. Syst. Biol. 7, 539 (2011).
Okonechnikov, K., Golosova, O. & Fursov, M. UGENE team Unipro UGENE: a unified bioinformatics toolkit. Bioinformatics 28, 1166–1167 (2012).
Seemann, T. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30, 2068–2069 (2014).
Terzian, P. et al. PHROG: families of prokaryotic virus proteins clustered using remote homology. NAR Genom. Bioinform. 3, lqab067 (2021).
Zimmermann, L. et al. A completely reimplemented MPI bioinformatics toolkit with a new HHpred server at its core. J. Mol. Biol. 430, 2237–2243 (2018).
Sayers, E. W. et al. Database resources of the national center for biotechnology information. Nucleic Acids Res. 50, D20–D26 (2022).
Gilchrist, C. L. M. & Chooi, Y.-H. clinker & clustermap.js: automatic generation of gene cluster comparison figures. Bioinformatics 37, 2473–2475 (2021).
Russel, J., Pinilla-Redondo, R., Mayo-Muñoz, D., Shah, S. A. & Sørensen, S. J. CRISPRCasTyper: automated identification, annotation, and classification of CRISPR-Cas loci. CRISPR J. 3, 462–469 (2020).
Tesson, F. et al. Systematic and quantitative view of the antiviral arsenal of prokaryotes. Nat. Commun. 13, 2561 (2022).
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
Camargo, A. P. et al. IMG/VR v4: an expanded database of uncultivated virus genomes within a framework of extensive functional, taxonomic, and ecological metadata. Nucleic Acids Res. 51, D733–D743 (2023).
El-Gebali, S. et al. The Pfam protein families database in 2019. Nucleic Acids Res. 47, D427–D432 (2019).
Eddy, S. R. Accelerated profile HMM searches. PLoS Comput. Biol. 7, e1002195 (2011).
Wood, D. E., Lu, J. & Langmead, B. Improved metagenomic analysis with Kraken 2. Genome Biol. 20, 257 (2019).
Thorvaldsdottir, H., Robinson, J. T. & Mesirov, J. P. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief. Bioinform. 14, 178–192 (2013).
Minh, B. Q. et al. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evolut. 37, 1530–1534 (2020).
Letunic, I. & Bork, P. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49, W293–W296 (2021).
Guindon, S. et al. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321 (2010).
Rambaut, A. FigTree v1.3.1. Institute of Evolutionary Biology (University of Edinburgh, Edinburgh, 2010).
Acknowledgements
We are grateful for the gifts of bacterial strains, plasmids, phages, or wastewater from the labs of Uli Klümper, Catherine Putonti, George O’Toole, Karine Gibbs, Jay Hinton, Pamela Silver, and Ameet Pinto. We thank the other instructors and students of the HMS Phages 2022 summer course: Thomas Bernhardt, Amelia McKitterick, Kate Hummels, Thomas Bartlett, Nawonh Charles, Melanie Justice, Tosin Bademosi, and Ahadu Molla, which was partially supported by the HHMI Science Education Alliance. N.Q.O. thanks the Marine Biological Laboratory at Woods Hole and all instructors from the 2019 Microbial Diversity course. Electron microscopy imaging and consultation were performed in the HMS Electron Microscopy Facility. Custom instrumentation was built with assistance from the Research Instrumentation core at Harvard Medical School. Computational work used the O2 cluster supported by the Research Computing Group at Harvard Medical School. This work was supported by the NIGMS of the National Institutes of Health (R35GM133700), the David and Lucile Packard Foundation, the Pew Charitable Trusts, Alfred P. Sloan Foundation and NSF grant IOS-2331228. N.Q.O. acknowledges support from Consejo Nacional de Ciencia y Tecnología (CONACYT, México). M.G.M., E.A.R., R.P., and J.S.P. acknowledge support from the Systems, Synthetic, and Quantitative Biology PhD program training award (T32GM135014). A.C.F. was supported in part by the NSF-Simons Center for Mathematical and Statistical Analysis of Biology at Harvard (award number #1764269), and the Harvard Quantitative Biology Initiative.
Author information
Contributions
N.Q.O., S.V.O., and M.B. conceived and designed the study. N.Q.O., S.V.O., L.M.M., A.C.F., O.J.M.D., K.P., C.E.S.C., R.P., and J.S.P. conducted experiments and acquired data. M.G.M. and E.A.R. contributed resources and data interpretation. N.Q.O., S.V.O., and M.B. analyzed the data and wrote the manuscript. All authors read and approved the manuscript.
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Quinones-Olvera, N., Owen, S.V., McCully, L.M. et al. Diverse and abundant phages exploit conjugative plasmids. Nat Commun 15, 3197 (2024). https://doi.org/10.1038/s41467-024-47416-z
DOI: https://doi.org/10.1038/s41467-024-47416-z