HIV-TRACE identifies potential transmission clusters within a supplied FASTA file using pairwise TN93 genetic distances.
pip install hivtracehivtrace -i INPUT.FASTA -a resolve -r HXB2_prrt -t .015 -m 500 -g .05 -cFASTA file with nucleotide sequences. Sequence names may include munged attributes, e.g. ISOLATE_XYZ|2005|SAN DIEGO|MSM.
Strategy for handling ambiguous nucleotides. See the MBE paper for details.
| Option | Description |
|---|---|
resolve |
Count any resolutions that match as a perfect match |
average |
Average all possible resolutions |
skip |
Skip all positions with ambiguities |
gapmm |
Count character-gap positions as 4-way mismatches, else average |
Reference sequence for alignment. Built-in options:
| Option | Region |
|---|---|
HXB2_prrt |
Protease + RT |
NL4-3_prrt |
Protease + RT (NL4-3) |
HXB2_pol |
Protease + RT + Integrase |
HXB2_pr |
Protease |
HXB2_rt |
Reverse Transcriptase |
HXB2_int |
Integrase |
HXB2_gag |
Gag |
HXB2_env |
Envelope |
HXB2_nef |
Nef |
HXB2_vif |
Vif |
HXB2_vpr |
Vpr |
HXB2_vpu |
Vpu |
HXB2_tat |
Tat |
HXB2_rev |
Rev |
Or provide a path to a custom reference FASTA file. See HIV-1 genome landmarks for reference.
Maximum pairwise TN93 distance for two sequences to be connected by an edge.
Minimum number of overlapping non-gap characters required for distance calculation. Aim for at least 2 / threshold aligned characters.
Maximum ambiguity fraction (0–1) for the resolve strategy. Sequences exceeding this fraction fall back to average.
Screen for contaminants.
| Option | Description |
|---|---|
remove |
Remove spurious edges from the network |
report |
Flag sequences sharing a cluster with the reference |
separately |
Flag sequences via secondary tn93 run |
none |
Do nothing |
Phylogenetic test of conditional independence to remove spurious transitive connections (A→B→C appearing as triangles).
| Option | Description |
|---|---|
remove |
Remove spurious transitive edges |
report |
Report spurious transitive edges |
Mask drug resistance-associated mutation (DRAM) codon sites.
| Option | Description |
|---|---|
lewis |
Sites from Lewis et al |
wheeler |
Sites from Wheeler et al |
Compare sequences to public sequences from the Los Alamos HIV Sequence Database.
Use input sequences as-is (assumes pre-aligned). Incompatible with --compare.
Path to cawlign executable (default: cawlign in PATH).
Score matrix for alignment (default: HIV_BETWEEN_F).
Output filename. Defaults to <input>.results.json.
hivtrace_viz results.jsonOr visit hivtrace-viz and click Load File.