FlashPCA [30] was used to run principal component analysis (PCA) on the genotype data to infer genetic ancestry. The regression model assumed an additive genetic model and included the first three principal components (eigenvectors) from FlashPCA as covariates. For imputed data with a smaller sample size, which were added to our analysis later, we switched the test method from score to the EM algorithm to accommodate the reduced sample size.
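
As an illustration of the covariate adjustment described above, the following is a minimal sketch rather than the pipeline used in the analysis. It assumes simulated genotypes, substitutes a plain NumPy SVD for FlashPCA, and uses ordinary least squares because the text does not state the regression type; the function names `top_pcs` and `additive_association` are hypothetical.

```python
import numpy as np

def top_pcs(genotypes, k=3):
    """Return the top-k principal component scores (samples x k)."""
    # Standardize each SNP column; drop monomorphic SNPs to avoid dividing by zero.
    std = genotypes.std(axis=0)
    keep = std > 0
    z = (genotypes[:, keep] - genotypes[:, keep].mean(axis=0)) / std[keep]
    # Thin SVD: left singular vectors scaled by singular values give the PC scores.
    u, s, _ = np.linalg.svd(z, full_matrices=False)
    return u[:, :k] * s[:k]

def additive_association(dosage, phenotype, pcs):
    """Fit phenotype ~ intercept + dosage + PCs and return the per-allele effect."""
    # Additive model: dosage is the allele count (0, 1, 2) or an imputed dosage.
    design = np.column_stack([np.ones(len(dosage)), dosage, pcs])
    coef, *_ = np.linalg.lstsq(design, phenotype, rcond=None)
    return coef[1]

# Simulated data (assumption: 200 samples, 500 independent SNPs).
rng = np.random.default_rng(0)
n_samples, n_snps = 200, 500
freqs = rng.uniform(0.1, 0.9, size=n_snps)
genotypes = rng.binomial(2, freqs, size=(n_samples, n_snps)).astype(float)

pcs = top_pcs(genotypes, k=3)  # first three PCs used as covariates
phenotype = 0.5 * genotypes[:, 0] + rng.normal(size=n_samples)
beta_hat = additive_association(genotypes[:, 0], phenotype, pcs)
print(f"estimated per-allele effect: {beta_hat:.2f}")
```

In this sketch the principal component scores enter the design matrix next to the allele dosage, so the dosage coefficient is the per-allele effect adjusted for the ancestry axes captured by the first three components.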