A full summary of the methods can be found in Supplementary Information. Briefly, all study participants gave signed consent for their data and samples to be used in studies that have been approved by the appropriate Research Ethics Committee or the GS access Committee. Genomic DNA from each individual was whole genome amplified in triplicate, the products pooled and amplified with primer pairs tiled across 528 kb of TRAX/DISC1 (hg18 chr1:229723339-230251606; hg19 chr1:231656716-232184983). For each sample, the pooled products were sheared, converted into paired-end Illumina libraries and sequenced on an Illumina GAII or HiSeq 2000 sequencer to >80% coverage and >30-fold depth. Sequences were aligned to the UCSC hg18 reference sequence, variants called using MAQ software49 and the variants in repeats removed. Ten percent of all remaining variants were validated using Sanger sequencing chemistry on an ABI3730 sequencer, and the derived information used to optimise the quality control filters. After quality control screening, all exonic and low frequency (MAF<1%) variants were also validated by Sanger chemistry sequencing as above. The variants were functionally annotated using SNPnexus50 (http://www.snp-nexus.org). Non-coding variants were