Bayesian methods for examining Hardy-Weinberg equilibrium.
- Authors
- Wakefield, Jon
- Year
- 2010
- Journal
- Biometrics
- PMID
- 19459838
- DOI
- 10.1111/j.1541-0420.2009.01267.x
- PMCID
- PMC4535922
Testing for Hardy-Weinberg equilibrium is ubiquitous and has traditionally been carried out via frequentist approaches. However, the discreteness of the sample space means that uniformity of p-values under the null cannot be assumed, with enumeration of all possible counts, conditional on the minor allele count, offering a computationally expensive way of p-value calibration. In addition, the interpretation of the subsequent p-values, and choice of significance threshold depends critically on sample size, because equilibrium will always be rejected at conventional levels with large sample sizes. We argue for a Bayesian approach using both Bayes factors, and the examination of posterior distributions. We describe simple conjugate approaches, and methods based on importance sampling Monte Carlo. The former are convenient because they yield closed-form expressions for Bayes factors, which allow their application to a large number of single nucleotide polymorphisms (SNPs), in particular in genome-wide contexts. We also describe straightforward direct sampling methods for examining posterior distributions of parameters of interest. For large numbers of alleles at a locus we resort to Markov chain Monte Carlo. We discuss a number of possibilities for prior specification, and apply the suggested methods to a number of real datasets.
Prior distribution for a generic fixation index with (a) k = 4 alleles, (b) k = 9 alleles, given a Dirichlet prior with parameters 1, on the 10 (k = 4) and 45 (k = 9) allele frequencies.
Prior (top row) and posterior (bottom row) distributions based on 5000 samples, for the four group data and the single inbreeding coefficient f model. The MLE is indicated as the cross in (f). We examine p1 for illustration, and could just as easily have picked p2, p3, or p4.
Bayes factor and p-value summaries for the GWAS for age-related macular degeneration data: (a) –log10 p-values from a χ2 test versus those from the exact test; (b) histogram of exact p-values (vertical axis truncated, the count at a p-value of 1 is indicated); (c) QQ-plot of observed versus expected –log10 p-values, assuming uniformity of p-values under the null; and (d) –log10 p-values against –log10 Bayes factors. The dashed lines corresponds to the Bonferroni threshold (p-value axes) and Bayes factor thresholds (Bayes factor axes).
No entities extracted from this document yet.
No uploaded files.
In this knowledge base
| Title | Year | PMID |
|---|---|---|
| Quality control and quality assurance in genotypic data for genome-wide association studies. | 2010 | 20718045 |
External
| Title | Authors | Journal | Year | Link |
|---|---|---|---|---|
| <i>COMT</i> and <i>ACE</i> (Epi)genetic Variation Is Associated with Cognitive and Metabolic Resilience in Swiss Tactical Athletes. | Flück M et al. | — | 2026 | → |
| Tests for segregation distortion in tetraploid F1 populations. | Gerard D et al. | — | 2025 | → |
| Tests for Segregation Distortion in Tetraploid F1 Populations | Gerard D et al. | — | 2024 | — |
| Double reduction estimation and equilibrium tests in natural autopolyploid populations. | Gerard D | — | 2023 | → |
| Checking for model failure and for prior-data conflict with the constrained multinomial model. | Englert BG et al. | — | 2021 | → |
| A Bayesian analysis for investigating the association between rs13266634 polymorphism in SLC30A8 gene and type 2 diabetes. | Soltanian AR et al. | — | 2020 | → |
| A test for deviations from expected genotype frequencies on the X chromosome for sex-biased admixed populations. | Backenroth D et al. | — | 2019 | → |
| Bayesian model selection for the study of Hardy-Weinberg proportions and homogeneity of gender allele frequencies. | Puig X et al. | — | 2019 | → |
| Interleukin-8 Gene -251 A/T (rs4073) Polymorphism and Coronary Artery Disease Risk: A Meta-Analysis. | Zhang S et al. | — | 2019 | → |
| The Summer Institute in Statistical Genetics. | Weir BS | — | 2019 | → |
| A Bayesian test for Hardy-Weinberg equilibrium of biallelic X-chromosomal markers. | Puig X et al. | — | 2017 | → |
| Association Between the Lower Extremity Deep Venous Thrombosis, the Warfarin Maintenance Dose, and CYP2C9*3, CYP2D6*10, and CYP3A5*3 Genetic Polymorphisms: A Case-Control Study. | Ju S et al. | — | 2017 | → |
| Testing Departure from Hardy-Weinberg Proportions. | Wang J et al. | — | 2017 | → |
| Testing for Hardy-Weinberg equilibrium at biallelic genetic markers on the X chromosome. | Graffelman J et al. | — | 2016 | → |
| Testing Hardy-Weinberg equilibrium with a simple root-mean-square statistic. | Ward R et al. | — | 2014 | → |
| A note on exact conditional and unconditional tests for Hardy-Weinberg equilibrium. | Shan G | — | 2013 | → |
| Inference of potential genetic risks associated with large-scale releases of red sea bream in Kanagawa prefecture, Japan based on nuclear and mitochondrial DNA analysis. | Blanco Gonzalez E et al. | — | 2013 | → |
| Reducing bias of allele frequency estimates by modeling SNP genotype data with informative missingness. | Lin WY et al. | — | 2012 | → |
| Testing departure from Hardy-Weinberg proportions. | Wang J et al. | — | 2012 | → |
| PlatinumCNV: a Bayesian Gaussian mixture model for genotyping copy number polymorphisms using SNP array signal intensity data. | Kumasaka N et al. | — | 2011 | → |
| Testing Hardy-Weinberg equilibrium: an objective Bayesian analysis. | Consonni G et al. | — | 2011 | → |
| Characterizing allelic association in the genome era. | Weir BS et al. | — | 2010 | → |
| Quality control and quality assurance in genotypic data for genome-wide association studies. | Laurie CC et al. | — | 2010 | → |
| Bayesian statistical methods for genetic association studies. | Stephens M et al. | — | 2009 | → |
| Exact tests for Hardy-Weinberg proportions. | Engels WR | — | 2009 | → |