Datasets

The dataset is a subset of the 1000 M. tuberculosis isolates that were whole-genome sequenced by Casali et al. in "Evolution and transmission of drug-resistant tuberculosis in a Russian population". In the previous tutorial, we generated an initial set of variants from a single M. tuberculosis isolate. In this tutorial, we will repeat that process for additional isolates and then combine all of the resulting variants into a single dataset for further analysis.

File              Content
Mtb_repeats.bed   Precalculated coordinates of the repetitive regions in the reference M. tuberculosis genome
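
To make the BED coordinates concrete, here is a minimal Python sketch of how such a file can be read and used to test whether a variant position falls inside a repetitive region. It assumes that the repeats file is meant for flagging or excluding variants in those regions, which this section does not state. The helper names `parse_bed` and `in_repeat`, the contig name `Chromosome`, and the two example intervals are hypothetical and are not taken from Mtb_repeats.bed. The one fixed point is the BED convention itself: start coordinates are 0-based and intervals are half-open, whereas VCF positions are 1-based.

```python
from typing import Iterable, List, Tuple

# (chrom, start, end) with BED semantics: 0-based start, end not included.
Interval = Tuple[str, int, int]


def parse_bed(lines: Iterable[str]) -> List[Interval]:
    """Parse the first three BED columns (chrom, start, end) from text lines."""
    intervals = []
    for line in lines:
        line = line.strip()
        # Skip blank lines and the optional BED header/comment lines.
        if not line or line.startswith(("#", "track", "browser")):
            continue
        chrom, start, end = line.split()[:3]
        intervals.append((chrom, int(start), int(end)))
    return intervals


def in_repeat(chrom: str, pos_1based: int, intervals: List[Interval]) -> bool:
    """Return True if a 1-based position (as used in VCF) lies in any interval."""
    pos0 = pos_1based - 1  # convert to the 0-based coordinate system of BED
    return any(c == chrom and s <= pos0 < e for c, s, e in intervals)


# Hypothetical example records, NOT the real content of Mtb_repeats.bed.
example_bed = ["Chromosome\t100\t200", "Chromosome\t500\t650"]
repeats = parse_bed(example_bed)

print(in_repeat("Chromosome", 101, repeats))  # first base covered by 100-200
print(in_repeat("Chromosome", 201, repeats))  # first base after that interval
```

For a real file, the lines could come from `open("Mtb_repeats.bed")` instead of the in-memory list. A linear scan over all intervals is adequate for a short list of regions; an interval tree or a dedicated tool would be a better fit for large inputs.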