Pervasive Downward Bias in Estimates of Liability-Scale Heritability in Genome-wide Association Study Meta-analysis: A Simple Solution.
- Authors
- Grotzinger, Andrew D; Fuente, Javier de la; PrivΓ©, Florian; Nivard, Michel G; Tucker-Drob, Elliot M
- Year
- 2023
- Journal
- Biological psychiatry
- PMID
- 35973856
- DOI
- 10.1016/j.biopsych.2022.05.029
- PMCID
- PMC10066905
BACKGROUND: Single nucleotide polymorphism-based heritability is a fundamental quantity in the genetic analysis of complex traits. For case-control phenotypes, for which the continuous distribution of risk in the population is unobserved, observed-scale heritability estimates must be transformed to the more interpretable liability scale. This article describes how the field standard approach incorrectly performs the liability correction in that it does not appropriately account for variation in the proportion of cases across the cohorts comprising the meta-analysis. We propose a simple solution that incorporates cohort-specific ascertainment using the summation of effective sample sizes across cohorts. This solution is applied at the stage of single nucleotide polymorphism-based heritability estimation and does not require generating updated meta-analytic genome-wide association study summary statistics. METHODS: We began by performing a series of simulations to examine the ability of the standard approach and our proposed approach to recapture liability-scale heritability in the population. We went on to examine the differences in estimates obtained from these 2 approaches for real data for 12 major case-control genome-wide association studies of psychiatric and neurologic traits. RESULTS: We found that the field standard approach for performing the liability conversion can downwardly bias estimates by as much as approximately 50% in simulation and approximately 30% in real data. CONCLUSIONS: Prior estimates of liability-scale heritability for genome-wide association study meta-analysis may be drastically underestimated. To this end, we strongly recommend using our proposed approach of using the sum of effective sample sizes across contributing cohorts to obtain unbiased estimates.
Simulation results across conditions. Panel (A) depicts the mean percentage bias on the y-axis across the 11 simulation conditions on the x-axis. Error bars depict Β±1 SD. Panels (BβL) depict the individual point estimates from the 100 simulations per condition across the 11 conditions. The red dashed line indicates the liability-scale h2 of 15% in the population. All panels depict the results from using βEffNk to account for cohort-specific ascertainment in green and the results from using vTotal for the liability correction in blue, which denotes using the total sample prevalence calculated using the aggregate number of cases and controls across cohorts. Because vTotal and βEffNk produced equivalent solutions for panels (B) and (L), the blue and green distributions are entirely overlapping.
LLM interpretation
This figure presents simulation results across 11 conditions comparing two ascertainment methods: Total Ascertainment (green) and Cohort-specific Ascertainment (blue). Panel A is a dot plot showing mean percentage bias with $\pm$1 SD error bars, where the blue method generally shows higher positive bias than the green method. Panels BβL are histograms showing the frequency of $h^2$ point estimates for each condition, with a red dashed line marking the true population value of 15%.
| Name | Type |
|---|---|
| ADGC local | cohort |
| alcohol dependence | phenotype |
| Alcohol Use Disorder | phenotype |
| Alz | phenotype |
| Alzheimerβs disease | phenotype |
| anorexia nervosa | phenotype |
| apoE | gene |
| attention deficit hyperactivity disorder | phenotype |
| autism spectrum disorder | phenotype |
| binary trait | phenotype |
| bipolar disorder | phenotype |
| cannabis use disorder | phenotype |
| case-control disease traits local | phenotype |
| CHARGE | cohort |
| Cohort 10 local | cohort |
| Cohort 7 local | cohort |
| Cohort 8 local | cohort |
| Cohort 9 local | cohort |
| Cohort-level sample size local | cohort |
| community sample | cohort |
| EADI local | cohort |
| βEffNk local | drug |
| Freeze 3 | drug |
| GERAD local | cohort |
| GWAS | cohort |
| GWAS cohort 1 local | cohort |
| GWAS cohort 10 local | cohort |
| GWAS cohort 2 local | cohort |
| GWAS cohort 3 local | cohort |
| GWAS cohort 4 local | cohort |
| GWAS cohort 5 local | cohort |
| GWAS cohort 6 local | cohort |
| GWAS cohort 7 local | cohort |
| GWAS cohort 8 local | cohort |
| GWAS cohort 9 local | cohort |
| GWAS summary statistics local | cohort |
| HapMap3 | cohort |
| heritability estimate local | phenotype |
| Inpatient sample local | cohort |
| LDSC intercept local | phenotype |
| liability-scale h2 local | phenotype |
| Liability-scale heritability local | phenotype |
| liability-scale heritability estimates local | phenotype |
| major depressive disorder | phenotype |
| MASS package local | drug |
| mvrnorm local | drug |
| obsessive-compulsive disorder | phenotype |
| population | cohort |
| Population prevalence local | phenotype |
| posttraumatic stress disorder | phenotype |
| Psychiatric Genomics Consortium | cohort |
| psychiatric traits | phenotype |
| schizophrenia | phenotype |
| Severity of cases local | phenotype |
| sex | phenotype |
| SNP | cohort |
| study cohort | cohort |
| Tourette syndrome | phenotype |
| vTotal local | drug |
No uploaded files.
| Citation | PMID | DOI | Status |
|---|---|---|---|
| AkingbuwaWA, HammerschlagAR, BartelsM, MiddeldorpCM (2022): Systematic review: Molecular studies of common genetic variation in child and adolescent psychiatric disorders [published correction appears in J Am Acad Child Adolesc Psychiatry 2022; 61:837]. J Am Acad Child Adolesc Psychiatry 61:227β242.3393249410.1016/j.jaac.2021.03.020 | β | β | β |
| ArnoldPD, AsklandKD, BarlassinaC, BellodiL, BienvenuOJ, BlackD (2018): Revealing the complex genetic architecture of obsessiveβcompulsive disorder using meta-analysis. Mol Psychiatry 23:1181β1188.2876108310.1038/mp.2017.154PMC6660151 | β | β | β |
| Bulik-SullivanB, FinucaneHK, AnttilaV, GusevA, DayFR, LohPR, (2015): An atlas of genetic correlations across human diseases and traits. Nat Genet 47:1236β1241.2641467610.1038/ng.3406PMC4797329 | β | β | β |
| Bulik-SullivanBK, LohPR, FinucaneHK, RipkeS, YangJ, Schizophrenia Working Group of the Psychiatric Genomics Consortium, (2015): LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet 47:291β295.2564263010.1038/ng.3211PMC4495769 | β | β | β |
| de la FuenteJ, GrotzingerAD, MarioniRE, NivardMG, Tucker-DrobEM (2021): Multivariate modeling of direct and proxy GWAS indicates substantial common variant heritability of Alzheimerβs disease. medRxiv. 10.1101/2021.05.06.21256747. | β | β | β |
| DemontisD, WaltersRK, MartinJ, MattheisenM, AlsTD, AgerboE, (2019): Discovery of the first genome-wide significant risk loci for attention deficit/hyperactivity disorder. Nat Genet 51:63β75.3047844410.1038/s41588-018-0269-7PMC6481311 | β | β | β |
| DempsterER, LernerIM (1950): Heritability of threshold characters. Genetics 35:212β236.1724734410.1093/genetics/35.2.212PMC1209482 | β | β | β |
| GrotzingerAD, RhemtullaM, de VlamingR, RitchieSJ, MallardTT, HillWD, (2019): Genomic structural equation modelling provides insights into the multivariate genetic architecture of complex traits. Nat Hum Behav 3:513β525.3096261310.1038/s41562-019-0566-xPMC6520146 | β | β | β |
| GroveJ, RipkeS, AlsTD, MattheisenM, WaltersRK, WonH, (2019): Identification of common genetic risk variants for autism spectrum disorder. Nat Genet 51:431β444.3080455810.1038/s41588-019-0344-8PMC6454898 | β | β | β |
| JohnsonEC, DemontisD, ThorgeirssonTE, WaltersRK, PolimantiR, HatoumAS, (2020): A large-scale genome-wide association study meta-analysis of cannabis use disorder [published correction appears in Lancet Psychiatry 2022; 9:e12]. Lancet Psychiatry 7:1032β1045.3309604610.1016/S2215-0366(20)30339-4PMC7674631 | β | β | β |
| KunkleBW, Grenier-BoleyB, SimsR, BisJC, DamotteV, NajAC, (2019): Genetic meta-analysis of diagnosed Alzheimerβs disease identifies new risk loci and implicates Ab, tau, immunity and lipid processing [published correction appears in Nat Genet 2019; 51: 1423β1424]. Nat Genet 51:414β430.3082004710.1038/s41588-019-0358-2PMC6463297 | β | β | β |
| LeeSH, WrayNR, GoddardME, VisscherPM (2011): Estimating missing heritability for disease from genome-wide association studies. Am J Hum Genet 88:294β305.2137630110.1016/j.ajhg.2011.02.002PMC3059431 | β | β | β |
| MaierRM, VisscherPM, RobinsonMR, WrayNR (2018): Embracing polygenicity: A review of methods and tools for psychiatric genetics research. Psychol Med 48:1055β1067.2884733610.1017/S0033291717002318PMC6088780 | β | β | β |
| MallardTT, LinnΓ©rRK, GrotzingerAD, Sanchez-RoigeS, SeidlitzJ, OkbayA, (2022): Multivariate GWAS of psychiatric disorders and their cardinal symptoms reveal two dimensions of cross-cutting genetic liabilities. Cell Genom 2:100140.10.1016/j.xgen.2022.100140PMC926440335812988 | β | β | β |
| MullinsN, ForstnerAJ, OβConnellKS, CoombesB, ColemanJRI, QiaoZ, (2021): Genome-wide association study of more than 40,000 bipolar disorder cases provides new insights into the underlying biology. Nat Genet 53:817β829.3400209610.1038/s41588-021-00857-4PMC8192451 | β | β | β |
| NievergeltCM, MaihoferAX, KlengelT, AtkinsonEG, ChenCY, ChoiKW, (2019): International meta-analysis of PTSD genome-wide association studies identifies sex-and ancestry-specific genetic risk loci. Nat Commun 10:4558.3159494910.1038/s41467-019-12576-wPMC6783435 | β | β | β |
| PeyrotWJ, BoomsmaDI, Penninx BWJH, Wray NR (2016): Disease and polygenic architecture: Avoid trio design and appropriately account for unscreened control subjects for common disease. Am J Hum Genet 98:382β391.2684911310.1016/j.ajhg.2015.12.017PMC4746372 | β | β | β |
| PrivΓ©F, ArbelJ, AschardH, VilhjΓ‘lmssonBJ (2022): Identifying and correcting for misspecifications in GWAS summary statistics and polygenic scores. HGG Adv 3(4):100136.10.1016/j.xhgg.2022.100136PMC946534336105883 | β | β | β |
| SpeedD, CaiN, UCLEB Consortium, JohnsonMR, NejentsevS, BaldingDJ (2017): Reevaluation of SNP heritability in complex human traits. Nat Genet 49:986β992.2853067510.1038/ng.3865PMC5493198 | β | β | β |
| TrubetskoyV, PardiΓ±asAF, QiT, PanagiotaropoulouG, AwasthiS, BigdeliTB, (2022): Mapping genomic loci implicates genes and synaptic biology in schizophrenia. Nature 604:502β508.3539658010.1038/s41586-022-04434-5PMC9392466 | β | β | β |
| TurleyP, WaltersRK, MaghzianO, OkbayA, LeeJJ, FontanaMA, (2018): Multi-trait analysis of genome-wide association summary statistics using MTAG [published correction appears in Nat Genet 2019; 51:1190 and Nat Genet 2019; 51:1295]. Nat Genet 50:229β237.2929238710.1038/s41588-017-0009-4PMC5805593 | β | β | β |
| WaltersRK, PolimantiR, JohnsonEC, McClintickJN, AdamsMJ, AdkinsAE, (2018): Transancestral GWAS of alcohol dependence reveals common genetic underpinnings with psychiatric disorders. Nat Neurosci 21:1656β1669.3048294810.1038/s41593-018-0275-1PMC6430207 | β | β | β |
| WatsonHJ, YilmazZ, ThorntonLM, HΓΌbelC, ColemanJRI, GasparHA, (2019): Genome-wide association study identifies eight risk loci and implicates metabo-psychiatric origins for anorexia nervosa. Nat Genet 51:1207β1214.3130854510.1038/s41588-019-0439-2PMC6779477 | β | β | β |
| WrayNR, RipkeS, MattheisenM, TrzaskowskiM, ByrneEM, AbdellaouiA, (2018): Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression. Nat Genet 50:668β681.2970047510.1038/s41588-018-0090-3PMC5934326 | β | β | β |
| YangJ, BakshiA, ZhuZ, HemaniG, VinkhuyzenAA, NolteIM, (2015): Genome-wide genetic homogeneity between sexes and populations for human height and body mass index. Hum Mol Genet 24:7445β7449.2649490110.1093/hmg/ddv443 | β | β | β |
| YangJ, LeeSH, GoddardME, VisscherPM (2011): GCTA: A tool for genome-wide complex trait analysis. Am J Hum Genet 88: 76β82.2116746810.1016/j.ajhg.2010.11.011PMC3014363 | β | β | β |
| YuD, SulJH, TsetsosF, NawazMS, HuangAY, ZelayaI, (2019): Interrogating the genetic determinants of Touretteβs syndrome and other tic disorders through genome-wide association studies. Am J Psychiatry 176:217β227.3081899010.1176/appi.ajp.2018.18070857PMC6677250 | β | β | β |
In this knowledge base
| Title | Year | PMID |
|---|---|---|
| Genome-wide analyses identify 30 loci associated with obsessive-compulsive disorder. | 2025 | 40360802 |
External
| Title | Authors | Journal | Year | Link |
|---|---|---|---|---|
| Comprehensive benchmarking single and multi ancestry polygenic score methods with the PGS-hub platform. | Chen X et al. | β | 2026 | β |
| Genome-wide association study identifies candidate genes regulating multi-stage body size traits in Longling yellow goats. | He XY et al. | β | 2026 | β |
| Genome-wide association study of major anxiety disorders in 122,341 European-ancestry cases identifies 58 loci and highlights GABAergic signaling. | Strom NI et al. | β | 2026 | β |
| Mapping the genetic landscape across 14 psychiatric disorders. | Grotzinger AD et al. | β | 2026 | β |
| Shared Genetic Liability across Systems of Psychiatric and Physical Illness. | Lawrence JM et al. | β | 2026 | β |
| UK Biobank at 20 - a growing, global resource for dementia research. | Matthews PM et al. | β | 2026 | β |
| Age-specific childhood obesity and adult cholelithiasis: association and shared transcriptomic bases. | Liu L et al. | β | 2025 | β |
| Comparison of the multivariate genetic architecture of eight major psychiatric disorders across sex. | Schwaba T et al. | β | 2025 | β |
| Estimating disorder probability based on polygenic prediction using the BPC approach. | Uffelmann E et al. | β | 2025 | β |
| Evaluating metabolome-wide causal effects on risk for psychiatric and neurodegenerative disorders. | Gilchrist L et al. | β | 2025 | β |
| Examining the genetic links between clusters of immune-mediated diseases and psychiatric disorders. | Breunig S et al. | β | 2025 | β |
| Examining the genetic relationship between Alzheimer's disease, schizophrenia and their shared risk factors using genomic structural equation modelling. | Almeida E Sousa I et al. | β | 2025 | β |
| Genetic associations with educational fields. | Cheesman R et al. | β | 2025 | β |
| Genome-wide analyses identify 30 loci associated with obsessive-compulsive disorder. | Strom NI et al. | β | 2025 | β |
| Genome-wide association study meta-analysis provides insights into the etiology of heart failure and its subtypes. | Henry A et al. | β | 2025 | β |
| Host genetic regulation of specific functional groups in the rumen microbiome of dairy cows: Implications for lactation trait. | Bai H et al. | β | 2025 | β |
| Immune-Developmental Processes Contribute to Schizophrenia Risk: Insights From a Genetic Overlap Study With Height. | Romero C et al. | β | 2025 | β |
| PGSFusion streamlines polygenic score construction and epidemiological applications in biobank-scale cohorts. | Yang S et al. | β | 2025 | β |
| Polygenic risk score prediction accuracy convergence. | Henches L et al. | β | 2025 | β |
| Protocol for finding genetic variation associated with unmeasured traits through GenomicSEM common-factor GWAS. | Johnston KJA et al. | β | 2025 | β |
| The Genetic Architecture of Differentiating Behavioral and Emotional Problems in Early Life. | Askelund AD et al. | β | 2025 | β |
| Visualizing the Genetic Heterogeneity of Major Depressive Disorder using GDVIS | Thijssen AB et al. | β | 2025 | β |
| Characterizing genetic pathways unique to autism spectrum disorder at multiple levels of biological analysis. | Schaffer LS et al. | β | 2024 | β |
| Dementias Platform UK: Bringing genetics into life. | Leonenko G et al. | β | 2024 | β |
| Examining the role of common variants in rare neurodevelopmental conditions. | Huang QQ et al. | β | 2024 | β |
| Gene discovery and biological insights into anxiety disorders from a large-scale multi-ancestry genome-wide association study. | Friligkou E et al. | β | 2024 | β |
| Genetic architecture of the structural connectome. | Wainberg M et al. | β | 2024 | β |
| Genome-wide association analysis provides insights into the molecular etiology of dilated cardiomyopathy. | Zheng SL et al. | β | 2024 | β |
| Genome-wide meta-analysis of ascertainment and symptom structures of major depression in case-enriched and community cohorts. | Adams MJ et al. | β | 2024 | β |
| Genomic SEM applied to explore etiological divergences in bipolar subtypes. | Lawrence JM et al. | β | 2024 | β |
| Partitioning the Genomic Components of Behavioral Disinhibition and Substance Use (Disorder) Using Genomic Structural Equation Modeling. | Horwitz TB et al. | β | 2024 | β |
| Pervasive biases in proxy genome-wide association studies based on parental history of Alzheimer's disease. | Wu Y et al. | β | 2024 | β |
| Role of the X Chromosome in Alzheimer Disease Genetics. | Belloy ME et al. | β | 2024 | β |
| Statistical examination of shared loci in neuropsychiatric diseases using genome-wide association study summary statistics. | Spargo TP et al. | β | 2024 | β |
| Cross-disorder genetic analysis of immune diseases reveals distinct gene associations that converge on common pathways. | Demela P et al. | β | 2023 | β |
| Examining the shared etiology of psychopathology with genome-wide association studies. | Mallard TT et al. | β | 2023 | β |
| GWAS meta-analysis of over 29,000 people with epilepsy identifies 26 risk loci and subtype-specific genetic architecture. | International League Against Epilepsy Consortium on Complex Epilepsies | β | 2023 | β |
| Inferring disease architecture and predictive ability with LDpred2-auto. | PrivΓ© F et al. | β | 2023 | β |
| Multivariate genome-wide association meta-analysis of over 1 million subjects identifies loci underlying multiple substance use disorders. | Hatoum AS et al. | β | 2023 | β |
| Multivariate genomic architecture of cortical thickness and surface area at multiple levels of analysis. | Grotzinger AD et al. | β | 2023 | β |
| New insights from the last decade of research in psychiatric genetics: discoveries, challenges and clinical implications. | Andreassen OA et al. | β | 2023 | β |
| Shared genetic liability for alcohol consumption, alcohol problems, and suicide attempt: Evaluating the role of impulsivity. | Stephenson M et al. | β | 2023 | β |
| Step by step: towards a better understanding of the genetic architecture of Alzheimer's disease. | Lambert JC et al. | β | 2023 | β |
| The complex genetic architecture of Alzheimer's disease: novel insights and future directions. | Andrews SJ et al. | β | 2023 | β |
| Transcriptome-Wide Structural Equation Modeling of 13 Major Psychiatric Disorders for Cross-Disorder Risk and Drug Repurposing. | Grotzinger AD et al. | β | 2023 | β |
| Identifying and correcting for misspecifications in GWAS summary statistics and polygenic scores. | PrivΓ© F et al. | β | 2022 | β |
| Integrated analysis of direct and proxy genome wide association studies highlights polygenicity of Alzheimer's disease outside of the APOE region. | de la Fuente J et al. | β | 2022 | β |
| Transcriptome-wide and stratified genomic structural equation modeling identify neurobiological pathways shared across diverse cognitive traits. | Grotzinger AD et al. | β | 2022 | β |