Skip to main content Link Menu Expand (external link) Document Search Copy Copied

Resistance markers

This module can only be run with the v4 amplicon panel.

This module is designed to identify variants associated with drug resistance in Plasmodium falciparum by utilizing the PseudoCIGAR string extracted from allele_data.txt. It works by examining mutations within specific genomic coordinates of interest outlined in (resistance_markers_amplicon_v4.txt). Any new indels or single nucleotide polymorphisms (SNPs) that occur within these coordinates are reported.

In the case that there are multiple loci covering the same drug resistance marker being inspected these will be reported on separate lines. You can identify these as they will have the same GeneID, Gene, and CodonID, but will have different values in the locus column. In most cases we would expect the Codon and amino acid (AA) reported to be the same on both loci, however if you find divergence one option would be to take the one with the highest number of reads. Please reach out to the UCSF team if you have any questions about interpreting these files!

File Outputs

There are three files that output by this module:

resmarker_table.txt

This file contains all codons that are found in the ASV as specified by the genomic coordinates in the provided resistance marker table. The file summarizes what the 3-base sequence was in the sample, what was expected, and whether there was a synonomous or non-synomous amino acid change.

Column Description
SampleID The sample being reported
Locus The locus corresponding to the marker
GeneID A numeric identifier for the P. falciparum gene and gene position
Gene The name of the gene
CodonID The codon number
RefCodon The codon in the reference
Codon The codon in the ASV
CodonStart The codon start position
CodonRefAlt Can be ‘REF’ or ‘ALT’, depending on whether the ASV codon matches the reference codon (‘ALT’ if they do not match)
RefAA The amino acid coded by the RefCodon
AA The amino acid code by the Codon
AARefAlt Can be ‘REF’ or ‘ALT’, depending on whether the ASV amino acid matches the reference amino acid (‘ALT’ if they do not match)
Reads The number of reads that contain this Codon

resmarker_microhap_table.txt

This file provides the same information as the resistance marker table above, but it is more informative as it strings the mutations from each locus together into haplotypes.

Column Description
SampleID The sample being reported
Locus The locus corresponding to the marker
GeneID A numeric identifier for the P. falciparum gene and gene position
Gene The name of the gene
MicrohapIndex A collapsed verison of the ‘CodonID’ (see ‘resmarker_table.txt’). This will contain all codon IDs that are included in the microhaplotype.
RefMicrohap A collapsed version of the ‘Ref’ column that reports all reference amino acids in order by CodonID
Microhaplotype A collapsed version of the ‘Alt’ column that reports all ASV amino acids in order by CodonID
MicrohapRefAlt Can be ‘ALT’ or ‘REF’, depending on whether the ASV microhaplotype matches the reference microhaplotype (‘ALT’ if they do not match).
Reads Number of reads that have the Microhaplotype.

resmarker_new_mutations.txt

This file contains DNA mutations that were not within the specified genomic coordinates. The indels and SNPs here are listed in this file to allow end users an opportunity to see other mutations found. They should be interpreted with some caution as they are simply reported and not filtered in any way.

Column Description
SampleID The sample being reported
Locus The locus corresponding to the marker
GeneID A numeric identifier for the P. falciparum gene and gene position
Gene The name of the gene
CodonID The codon number
Position The position in the reference sequence where the indel or SNP was found
Alt The DNA base found at the position in the ASV
Ref The DNA base found at the position in the reference sequence
Reads The number of reads that support the Alt base