but it is replicated across the various kits, chips and machines used in this study. While we had originally intended to include the high G+C% organism, Deinococcus maricopensis ( Methods ), the inconsistency in read throughput across chips for this species necessitated its exclusion. However, we analysed the highest throughput run for D. maricopensis (kit - 200 bp Manual Kit, chip - 316) and observed an extreme bias against high G+C% regions, with lower G+C content regions (45%) having 2.5× the mean genomic coverage, but higher G+C regions (80%) receiving only 0.2× the mean coverage (Figure S6).