In these analyses we have identified a set of highly informative SNPs. Previous studies have shown that similar sets of SNPs have been effective at verifying self-reported ethnicity in other samples [9,10]. The SNPs we have identified may serve as a "genomic control set" in these data. Runs using the 20 SNPs with the highest differences in allele frequencies between populations show 97% similarity to the STRP results (Table 3). Future studies could confirm the general applicability of these SNPs by replicating these results in other samples.