alternate half chromosomes, and 3) half genome SNP sets. These sets were chosen to eliminate, in each test, any dependency between the two half datasets that could arise from linkage disequilibrium. Thus, correlation between the independent SNP sets should be due to shared substructure. In addition, the current study examined whether the scores of individuals on each principal component (PC) were normally distributed, using the Shapiro-Wilk W test for normality [42]. In the absence of population structure, the PC scores are expected to be normally distributed, which is the null hypothesis of the test.
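The two checks described above can be illustrated with a minimal sketch on synthetic data. It is not the pipeline used in the study. The helper names (`simulate_genotypes`, `top_pcs`, `half_set_correlation`, `pc_normality`), the toy allele frequencies, and the odd/even SNP split standing in for two independent half sets are all assumptions made for illustration. PCs are obtained here from a plain SVD of standardized genotypes, and normality is tested with `scipy.stats.shapiro`.

```python
import numpy as np
from scipy import stats


def simulate_genotypes(rng, n_per_pop, n_snps, delta):
    """Toy 0/1/2 genotypes for two groups whose allele frequencies differ by 2*delta.

    Purely synthetic (hypothetical helper); delta=0 gives one unstructured population.
    """
    base = rng.uniform(0.3, 0.7, size=n_snps)
    pop1 = rng.binomial(2, base + delta, size=(n_per_pop, n_snps))
    pop2 = rng.binomial(2, base - delta, size=(n_per_pop, n_snps))
    return np.vstack([pop1, pop2])


def top_pcs(genotypes, n_pcs=2):
    """Per-individual scores on the leading PCs of an (individuals x SNPs) matrix."""
    g = np.asarray(genotypes, dtype=float)
    g = g - g.mean(axis=0)              # center each SNP
    sd = g.std(axis=0)
    g = g[:, sd > 0] / sd[sd > 0]       # drop monomorphic SNPs, scale the rest
    u, s, _ = np.linalg.svd(g, full_matrices=False)
    return u[:, :n_pcs] * s[:n_pcs]


def half_set_correlation(scores_a, scores_b):
    """Absolute Pearson r between matching PCs from two SNP sets.

    The sign of a PC is arbitrary, so the absolute value is reported.
    """
    return [abs(np.corrcoef(scores_a[:, k], scores_b[:, k])[0, 1])
            for k in range(scores_a.shape[1])]


def pc_normality(scores):
    """Shapiro-Wilk (W, p-value) for each PC; the null hypothesis is normality."""
    results = []
    for k in range(scores.shape[1]):
        w, p = stats.shapiro(scores[:, k])
        results.append((float(w), float(p)))
    return results


rng = np.random.default_rng(0)
for label, delta in [("structured", 0.15), ("unstructured", 0.0)]:
    geno = simulate_genotypes(rng, n_per_pop=100, n_snps=400, delta=delta)
    # Odd/even columns stand in for two independent half SNP sets (assumption).
    pcs_a, pcs_b = top_pcs(geno[:, 0::2]), top_pcs(geno[:, 1::2])
    r = half_set_correlation(pcs_a, pcs_b)
    w, p = pc_normality(pcs_a)[0]
    print(f"{label}: |r| for PC1 = {r[0]:.2f}, Shapiro-Wilk W = {w:.3f}, p = {p:.3g}")
```

Under these assumptions, a PC that captures real substructure should correlate strongly between the two half sets and depart from normality, whereas a PC reflecting only sampling noise should do neither. Because the simulated SNPs are drawn independently, the sketch has no linkage disequilibrium to remove, which is the role played by the chromosome-based partitions in real data.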