To verify and correct the misclassification of self-reported race, we compared the GWAS data from all subjects with genotypes from the HapMap 3 reference CEU, YRI and CHB populations. Principal component (PC) analysis was conducted in the entire GWAS sample using Eigensoft19,20 and 145 472 SNPs that were common to the GWAS data set and HapMap panel (after pruning the GWAS SNPs for linkage disequilibrium (LD) (r2) >80%) to characterize the underlying genetic architecture of the samples. The first PC score distinguished AAs and EAs; these groups were subsequently analyzed separately. We then conducted PC analyses within the two groups, and the first three PCs were used in all subsequent analyses to correct for residual population stratification.