In each dataset and for each p-value threshold used for the PGC-SCZ2 results, the schizophrenia polygenic risk score was regressed against the five substance use disorder phenotypes using logistic regression in R (44). Age, sex, and the first ten population-stratification principal components were included as covariates.