Chunk #15 — Method — Genome-wide Scoring Procedure

Source: Three mutually informative ways to understand the genetic relationships among behavioral disinhibition, alcohol use, drug use, nicotine use/dependence, and their co-occurrence: twin biometry, GCTA, and genome-wide scoring.
Embedded: yes

Text

Because the full sample is composed of families, and individuals within families are correlated with respect to genotypes and phenotypes, we always kept individuals from the same family within the same subsample. This prevented the algorithm, for example, from deriving the SNP score on one twin and cross-validating it on the other – clearly in that case we expect prediction bias given correlation between twins on the phenotype and the genotype.