Average Nucleotide Identity (ANI)
Average Nucleotide Identity (ANI)
Average Nucleotide Identity (ANI)
Introduction
BBHs between a genome pair are computed as pairwise bidirectional best nSimScan
hits of genes having 70% or more identity and at least 70% coverage of the shorter
gene. Genome(s) can either be selected from IMG or uploaded as a nucleotide
sequence in FASTA format (using the Upload File button) to compute ANI to selected
genome(s) in IMG.
Pairwise ANI is precomputed for most genomes. Private genomes and newly added
genomes will be precomputed on-demand which will slow down the computation (Fig.
1b).
Figure 1b. Pairwise ANI - results
Same Species Plot
Selecting in the Species table under “Number of Genomes” for a species links to the
“Genomes for Species” page (Fig. 2b). Selecting “Number of Cliques” for a species links
to the “Cliques for Species” page (Fig. 2c).
Figure 2b. Genomes for Species
Each dot in the Same Species Plot represents the final ANI vs. final AF for a genome
pair present in a given species. Selecting a dot will bring up a dialog box with a table to
which the 2 genomes represented in the dot will be added. These genomes can then be
reviewed and added to a genome cart.
ANI Cliques
Clicking on the clique id in the All Cliques tab will link to the details page for that clique
(Fig. 4).
Figure 3b. ANI Cliques by Species
Figure 3c. ANI Cliques by Taxonomy
The leaf nodes in the Taxonomy tree display the Genus Species followed by the count
of cliques for that genus-species. Clicking on the count links to the Genus Species detail
page. The first tab of this page lists the cliques for this Species (Fig. 3d), while the
second tab lists the genomes in that Species (Fig. 3f).
Figure 3d. Genus Species Detail page - Cliques for Species
Any two genomes in a clique group will be connected by an edge in the graph if they
have an ANI >= 96.5 and an AF >= 0.6. Clicking on the clique graph representation links
to details page for that clique group (Fig. 4).
Clique Details
ANI information for a particular genome can be found on the genome detail page,
located to the right of Genome Statistics.