Because the full sample is composed of families, and individuals within a family are correlated in both genotype and phenotype, we always kept members of the same family within the same subsample. This prevented the algorithm from, for example, deriving the SNP score on one twin and cross-validating it on the other. In that case the correlation between twins in phenotype and genotype would be expected to bias the estimate of prediction accuracy.
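The family-wise partitioning described above can be illustrated with a minimal sketch. It is not the procedure used in the study: the function name `assign_family_folds`, the parameter `n_folds`, the seed, and the greedy size-balancing step are all assumptions introduced for illustration. The only property taken from the text is that every member of a family lands in the same subsample.

```python
import random
from collections import defaultdict


def assign_family_folds(family_ids, n_folds=5, seed=0):
    """Return a fold index per individual, keeping each family in one fold.

    Hypothetical helper: the balancing strategy is an assumption,
    not the method described in the source.
    """
    # Group individual indices by their family identifier.
    members = defaultdict(list)
    for idx, fam in enumerate(family_ids):
        members[fam].append(idx)

    # Shuffle families (not individuals) reproducibly.
    families = sorted(members)
    random.Random(seed).shuffle(families)

    # Place larger families first, each into the currently smallest fold,
    # so that fold sizes stay roughly balanced (stable sort keeps the
    # shuffled order among families of equal size).
    families.sort(key=lambda fam: len(members[fam]), reverse=True)
    fold_sizes = [0] * n_folds
    folds = [None] * len(family_ids)
    for fam in families:
        target = fold_sizes.index(min(fold_sizes))
        for idx in members[fam]:
            folds[idx] = target
        fold_sizes[target] += len(members[fam])
    return folds


# Example: eight individuals from five families, split into three subsamples.
family_ids = ["F1", "F1", "F2", "F2", "F3", "F4", "F4", "F5"]
folds = assign_family_folds(family_ids, n_folds=3)
```

With this assignment, a score derived on all folds but one is evaluated only on individuals whose relatives were excluded from the derivation step, so a twin pair can never straddle the training and validation sets.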