For all MethylC-Seq data sets, methylated cytosines were identified from the mapped and processed read data as described previously18. The bisulphite conversion rates for all samples were over 99% (Supplementary Table 1). Correction of any DNA methylation sites incorrectly categorized as non-CG owing to SNPs in the sample versus reference genomes was performed as described previously18.