In search of rare variants: preliminary results from whole genome sequencing of 1,325 individuals with psychophysiological endophenotypes.
- Authors
- Vrieze, Scott I; Malone, Stephen M; Vaidyanathan, Uma; Kwong, Alan; Kang, Hyun Min; Zhan, Xiaowei; Flickinger, Matthew; Irons, Daniel; Jun, Goo; Locke, Adam E; Pistis, Giorgio; Porcu, Eleonora; Levy, Shawn; Myers, Richard M; Oetting, William; McGue, Matt; Abecasis, Goncalo; Iacono, William G
- Year
- 2014
- Journal
- Psychophysiology
- PMID
- 25387710
- DOI
- 10.1111/psyp.12350
- PMCID
- PMC4231480
Whole genome sequencing was completed on 1,325 individuals from 602 families, identifying 27 million autosomal variants. Genetic association tests were conducted for those individuals who had been assessed for one or more of 17 endophenotypes (N range = 802-1,185). No significant associations were found. These 27 million variants were then imputed into the full sample of individuals with psychophysiological data (N range = 3,088-4,469) and again tested for associations with the 17 endophenotypes. No association was significant. Using a gene-based variable threshold burden test of nonsynonymous variants, we obtained five significant associations. These findings are preliminary and call for additional analysis of this rich sample. We argue that larger samples, alternative study designs, and additional bioinformatics approaches will be necessary to discover associations between these endophenotypes and genomic variation.
Schematic analysis overview. For additional details see Iacono et al., 2014.
LLM interpretation
This figure is a schematic flow chart outlining a genomic analysis pipeline. It details the sample selection process, starting from an MCTFR sample (n=8405) and a whole genome sequenced MTFS subsample (n=1328), moving through quality control and the inclusion of MZ twin data. The workflow culminates in two stages of association analyses: the first using the original MTFS sample and the second using a larger imputed MCTFR sample (n=3088-4469) with verified results via PCR.
Discordance rates between the integrated array genotypes and sequence genotypes. This plot provides a description of the accuracy of the genotype calls from whole genome sequencing. The bar chart along the bottom gives the fraction of genotypes that were homozygous reference (HomRef), heterozygous (Het), and homozygous alternate (HomAlt), for the full range of possible nonreference allele counts. For example, if an individual in the study was called homozygous reference on the array (i.e., homozygous for the same allele that exists on the reference genome GRCh37), then the red dots give the rate at which that individual was called something other than homozygous reference in the sequence data. For SNPs with a nonreference allele count of 1–10, the sequence error rate was approximately 1 in 10,000. For SNPs with nonreference allele count of 1,500 (MAF ~50%), the sequence error rate was approximately 1 in 1,000. Similarly, if an individual was called heterozygous for some SNP on the array, then the green dots give the rate at which individuals were called heterozygous in the sequences. For a site with nonreference allele count of 1–10, the sequence error rate was about 20%. (Note that this 20% is based on only 141 genotypes—individuals homozygous for an alternate allele are rare themselves.) For sites with nonreference allele counts of 1,500, the rate was a little over 1 in 1,000. In general, sequencing was highly accurate, with accuracy falling off for the rarest variants.
LLM interpretation
This figure consists of a scatter plot overlaid on a bar chart, comparing genotype discordance rates between array and sequence data across non-reference allele counts. The y-axis on the left (log scale) shows genotype discordance for homozygous reference (red), heterozygous (green), and homozygous alternate (blue) calls, while the right y-axis and bottom bar chart show the fraction of genotypes for each group. Discordance is lowest for HomRef calls at low allele counts and highest for Het calls at low allele counts, with all three groups converging toward a discordance rate of approximately 1 in 1,000 as the non-reference allele count increases.
Imputation accuracy comparison between MTFS sequence and 1000 Genomes reference panels. This plot provides a comparison of the imputation accuracy on Chromosome 20 obtained with 1000 Genomes compared to MCTFR sequence. Plotted in solid lines is the squared Pearson correlation between the imputed dosage and the genotyped minor allele counts for a range of allele frequencies, using MCTFR sequences (black) and 1000 Genomes sequences (blue) as imputation reference panels. To make a direct comparison, the SNPs are restricted imputation to sites on the genome-wide 660W-Quad SNP array, and then tested accuracy on rare variants on the exome chip. The number of SNPs contributing to each window are given in red, and each window was centered on each dot in the red line. The plots show that the MTFS reference panel provides imputation results for all MAFs, and especially better results for less common alleles.
LLM interpretation
This line graph compares imputation accuracy (Dosage $R^2$) between MTFS sequences (blue line) and 1000 Genomes (black line) across various Minor Allele Frequency (MAF) bins on the x-axis. The MTFS reference panel consistently shows higher $R^2$ values than the 1000 Genomes panel, with the performance gap being most pronounced at lower MAF percentages. A secondary y-axis on the right tracks the number of SNPs per window, represented by a red dashed line.
| Name | Type |
|---|---|
| 1000 Genomes Project | cohort |
| 1000 Genomes sequences local | cohort |
| 10× sequencing local | drug |
| 660W-Quad local | drug |
| alcohol | phenotype |
| alcohol dependence | phenotype |
| Alcohol Use | phenotype |
| ALK local | gene |
| ALK intronic variant local | variant |
| alpha EEG frequency at O1O2 local | phenotype |
| antisaccade local | phenotype |
| antisaccade error local | phenotype |
| ANXA3 local | gene |
| Applied Biosystems PRISM 7500 Real Time PCR System local | drug |
| autism | phenotype |
| autosomal biallelic SNPs | cohort |
| autosomal candidate gene local | gene |
| aversive difference startle modulation local | phenotype |
| basic brain function local | phenotype |
| behavioral traits | phenotype |
| beta oscillations | phenotype |
| bipolar disorder | phenotype |
| body mass index | phenotype |
| bone mineral density | phenotype |
| BRIDGES local | cohort |
| candidate genes | cohort |
| chr2:29978404 local | variant |
| chr2:29994680 local | variant |
| clinical phenotypes | phenotype |
| common variants | cohort |
| complex endophenotypes local | phenotype |
| complex traits | phenotype |
| doubletons local | variant |
| EEG alpha frequency local | phenotype |
| EMMAX | drug |
| endophenotype | phenotype |
| EPACTS local | drug |
| essential splice SNPs local | variant |
| European ancestry | cohort |
| exome array local | drug |
| exome chip variants local | variant |
| externalizing disorders | phenotype |
| Family cohort (48 samples) local | cohort |
| four associated genes local | gene |
| Full Array Genotyped Sample local | cohort |
| full sample | cohort |
| Gbx2 | gene |
| GEDI | cohort |
| genes | gene |
| height | phenotype |
| HumanExome array local | drug |
| Human Genome Project | cohort |
| imputation accuracy | drug |
| imputed SNPs | variant |
| indel | variant |
| integrated array local | drug |
| intermediate phenotypes | phenotype |
| KIF18A local | gene |
| less common variants local | variant |
| major depressive disorder | phenotype |
| mania | phenotype |
| MCTFR | cohort |
| MCTFR participants local | cohort |
| mid/hindbrain region local | anatomy |
| missense SNPs local | variant |
| monomorphic sites local | variant |
| MTFS | cohort |
| MTFS sequences local | cohort |
| mutations | variant |
| NIDA Center for Genetic Studies | cohort |
| NIDA Genes, Environment, and Development Initiative local | cohort |
| nonsense SNPs local | variant |
| O1O2 local | anatomy |
| P3 genetic factor local | phenotype |
| phenotype | phenotype |
| pleasant difference startle modulation local | phenotype |
| protein-coding gene | gene |
| psychiatric disorders | phenotype |
| Psychophysiological Data local | phenotype |
| psychophysiological endophenotypes local | phenotype |
| psychophysiological measurements local | phenotype |
| psychophysiology GWAS local | cohort |
| QT interval local | phenotype |
| quadrupletons local | variant |
| rare nonsynonymous variants local | variant |
| rare variant | cohort |
| RDoC | cohort |
| sample (N < 4,500) local | cohort |
| schizophrenia | phenotype |
| Sequenced cohort local | cohort |
| Sequenced Individuals local | cohort |
| Sequenced Variants local | variant |
| singleton local | variant |
| singletons | cohort |
| singleton variant local | variant |
| SLC27A6 local | gene |
| SNP | cohort |
| SNPs with MAF < 1% local | variant |
| somatic SNP mutation local | variant |
| stop-gain variant local | variant |
| stop-gain variants local | variant |
| structural variant | cohort |
| Study cohort of 1,325 individuals local | cohort |
| TaqMan Universal PCR Master Mix | drug |
| task-related brain activity | phenotype |
| total cholesterol | phenotype |
| tripletons local | variant |
| Twin cohort | cohort |
No uploaded files.
In this knowledge base
External
| Title | Authors | Journal | Year | Link |
|---|---|---|---|---|
| Mapping 60 Years of Psychophysiology: A Bibliometric Analysis of Journal Performance, Authorship Trends, and Thematic Evolution. | Panitz C et al. | — | 2025 | → |
| Using multivariate endophenotypes to identify psychophysiological mechanisms associated with polygenic scores for substance use, schizophrenia, and education attainment. | Harper J et al. | — | 2022 | → |
| Validating Online Measures of Cognitive Ability in Genes for Good, a Genetic Study of Health and Behavior. | Liu M et al. | — | 2020 | → |
| Exome Chip Meta-analysis Fine Maps Causal Variants and Elucidates the Genetic Architecture of Rare Coding Variants in Smoking and Alcohol Use. | Brazel DM et al. | — | 2019 | → |
| The utility of twins in developmental cognitive neuroscience research: How twins strengthen the ABCD research design. | Iacono WG et al. | — | 2018 | → |
| A functional U-statistic method for association analysis of sequencing data. | Jadhav S et al. | — | 2017 | → |
| Miscellanea Dependent generalized functional linear models. | Jadhav S et al. | — | 2017 | → |
| Psychophysiological endophenotypes to characterize mechanisms of known schizophrenia genetic loci. | Liu M et al. | — | 2017 | → |
| The genetics of anxiety-related negative valence system traits. | Savage JE et al. | — | 2017 | → |
| What can time-frequency and phase coherence measures tell us about the genetic basis of P3 amplitude? | Malone SM et al. | — | 2017 | → |
| Whole genome sequence association and ancestry-informed polygenic profile of EEG alpha in a Native American population. | Peng Q et al. | — | 2017 | → |
| A computational method for genotype calling in family-based sequencing data. | Chang LC et al. | — | 2016 | → |
| High-resolution DNA accessibility profiles increase the discovery and interpretability of genetic associations | Madar A et al. | — | 2016 | — |
| Next-generation genotype imputation service and methods. | Das S et al. | — | 2016 | → |
| Endophenotypes for Alcohol Use Disorder: An Update on the Field. | Salvatore JE et al. | — | 2015 | → |
| Longitudinal stability and predictive utility of the visual P3 response in adults with externalizing psychopathology. | Yoon HH et al. | — | 2015 | → |
| Rare variant association studies: considerations, challenges and opportunities. | Auer PL et al. | — | 2015 | → |
| The Power of Theory, Research Design, and Transdisciplinary Integration in Moving Psychopathology Forward. | Vaidyanathan U et al. | — | 2015 | → |
| Decomposing P300 to identify its genetic basis. | Ford JM | — | 2014 | → |
| Genetic associations of nonsynonymous exonic variants with psychophysiological endophenotypes. | Vrieze SI et al. | — | 2014 | → |
| Genome-wide scans of genetic variants for psychophysiological endophenotypes: a methodological overview. | Iacono WG et al. | — | 2014 | → |
| Heritability and molecular genetic basis of antisaccade eye tracking error rate: a genome-wide association study. | Vaidyanathan U et al. | — | 2014 | → |
| Heritability and molecular genetic basis of electrodermal activity: a genome-wide association study. | Vaidyanathan U et al. | — | 2014 | → |