SNPs were excluded from analysis for the following reasons (applied in order): call rate <95% (n = 1,910), minor allele frequency <1% (n = 280) and deviation from Hardy-Weinberg equilibrium (P < 0.0001, n = 5,819). Subsequent analyses were performed on a dataset of 309,494 SNPs.
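
Because the filters were applied in order, a SNP that fails more than one criterion is counted only under the first criterion it fails. The following is a minimal sketch of such a sequential filter, not the pipeline used in this study. The function name `apply_qc_filters`, the record fields (`call_rate`, `maf`, `hwe_p`) and the toy SNP records are hypothetical and chosen for illustration only; the thresholds are the ones stated above.

```python
def apply_qc_filters(snps, min_call_rate=0.95, min_maf=0.01, hwe_p_threshold=1e-4):
    """Apply the three exclusion criteria in order and count exclusions per step.

    `snps` is a list of dicts with hypothetical keys 'call_rate', 'maf'
    and 'hwe_p' (illustrative field names, not taken from the study).
    """
    # Each entry: (label, predicate that is True when the SNP fails the criterion).
    filters = [
        ("call_rate", lambda s: s["call_rate"] < min_call_rate),
        ("maf", lambda s: s["maf"] < min_maf),
        ("hwe", lambda s: s["hwe_p"] < hwe_p_threshold),
    ]
    excluded = {}
    kept = list(snps)
    for label, fails in filters:
        passing = [s for s in kept if not fails(s)]
        # Only SNPs that survived all earlier steps can be counted here.
        excluded[label] = len(kept) - len(passing)
        kept = passing
    return kept, excluded


# Toy records (invented values, for illustration only).
example_snps = [
    {"id": "snp1", "call_rate": 0.99, "maf": 0.200, "hwe_p": 0.5},
    {"id": "snp2", "call_rate": 0.90, "maf": 0.005, "hwe_p": 0.5},   # fails call rate first
    {"id": "snp3", "call_rate": 0.98, "maf": 0.005, "hwe_p": 0.3},   # fails allele frequency
    {"id": "snp4", "call_rate": 0.97, "maf": 0.100, "hwe_p": 1e-6},  # fails Hardy-Weinberg test
]
kept, excluded = apply_qc_filters(example_snps)
print([s["id"] for s in kept], excluded)
```

In the toy data, `snp2` fails both the call-rate and allele-frequency criteria but is attributed to the call-rate step only. Assuming no other exclusions were made, the reported counts imply a starting set of 309,494 + 1,910 + 280 + 5,819 = 317,503 SNPs.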