Subsets of the 128 marker set were chosen using the In algorithm (Rosenberg, et al., 2003) with the goal of finding the most informative markers distinguishing one or more of the following: 1) four continental populations EURA, AFR, AMI, and EAS; 2) three continental populations (EURA, AFR, and AMI); or 3) two continental populations (EURA and AFR or EURA and AMI). Each subset was determined using 80 subjects from each ethnic group (described in Statistical Methods) and marker selection was based on the most informative set for each analysis (provided in Supplementary Table S2).