Our core association analyses to identify schizophrenia risk factors focused on single-marker tests of the 312,565 QC-passed SNPs that were genotyped in both cohorts. To control for the possibility of spurious associations resulting from population stratification we used the EIGENSTRAT approach of Price et al [57]. This method derives the principal components of the correlations among gene variants and corrects for those correlations in the association tests. In principle, therefore the principal components in the analyses should reflect population ancestry. We have noticed however that some of the leading axes appear to depend on other sources of correlation, such as sets of variants near one another that show extended association. We have documented the potential for inversions to create this effect and it may be created by other causes of extended linkage disequilibrium as well (Text S1). For this reason we inspected the SNP ‘loadings’ for each of the leading axes to determine if they depended on many or relatively few SNPs, as would be expected if the given axis reflected population ancestry or a more localized linkage disequilibrium effect