We also examined the effect that thinning the SNP data for LD had on the predictive ability of the scores. We employed the “LD based results clumping” routine from the PLINK software package [36] to generate the thinned data. Briefly, this routine orders the GWA meta-analysis association p values from strongest to weakest. SNPs are then selected in this order, with the proviso that a variant cannot be included, if it is in LD with a previously selected SNP. For the purposes of this analysis we defined LD as the variants being r2>0.2 and within 250 kb of each other.