Bayesian methods for examining Hardy-Weinberg equilibrium.
- Authors
- Wakefield, Jon
- Year
- 2010
- Journal
- Biometrics
- PMID
- 19459838
- DOI
- 10.1111/j.1541-0420.2009.01267.x
- PMCID
- PMC4535922
Testing for Hardy-Weinberg equilibrium is ubiquitous and has traditionally been carried out via frequentist approaches. However, the discreteness of the sample space means that uniformity of p-values under the null cannot be assumed, with enumeration of all possible counts, conditional on the minor allele count, offering a computationally expensive way of p-value calibration. In addition, the interpretation of the subsequent p-values, and choice of significance threshold depends critically on sample size, because equilibrium will always be rejected at conventional levels with large sample sizes. We argue for a Bayesian approach using both Bayes factors, and the examination of posterior distributions. We describe simple conjugate approaches, and methods based on importance sampling Monte Carlo. The former are convenient because they yield closed-form expressions for Bayes factors, which allow their application to a large number of single nucleotide polymorphisms (SNPs), in particular in genome-wide contexts. We also describe straightforward direct sampling methods for examining posterior distributions of parameters of interest. For large numbers of alleles at a locus we resort to Markov chain Monte Carlo. We discuss a number of possibilities for prior specification, and apply the suggested methods to a number of real datasets.
Prior distribution for a generic fixation index with (a) k = 4 alleles, (b) k = 9 alleles, given a Dirichlet prior with parameters 1, on the 10 (k = 4) and 45 (k = 9) allele frequencies.
Prior (top row) and posterior (bottom row) distributions based on 5000 samples, for the four group data and the single inbreeding coefficient f model. The MLE is indicated as the cross in (f). We examine p1 for illustration, and could just as easily have picked p2, p3, or p4.
Bayes factor and p-value summaries for the GWAS for age-related macular degeneration data: (a) βlog10 p-values from a Ο2 test versus those from the exact test; (b) histogram of exact p-values (vertical axis truncated, the count at a p-value of 1 is indicated); (c) QQ-plot of observed versus expected βlog10 p-values, assuming uniformity of p-values under the null; and (d) βlog10 p-values against βlog10 Bayes factors. The dashed lines corresponds to the Bonferroni threshold (p-value axes) and Bayes factor thresholds (Bayes factor axes).
| # | Section | Preview |
|---|---|---|
| 40 | 4. Examples β 4.3 Genome-Wide Data | Turning now to a Bayesian approach we assume Dir(1,1,1) priors under the alternative and compareβ¦ |
| 41 | 4. Examples β 4.3 Genome-Wide Data | To pick a threshold for rejection of HWE using Bayes factors we need to specify the PO of H0, andβ¦ |
| 42 | 4. Examples β 4.3 Genome-Wide Data | In a GWAS context, another approach to prior specification is to recognize that under theβ¦ |
| 43 | 5. Discussion | In this article, we have described a Bayesian approach to examination of HWE, using a variety of⦠|
| 44 | 5. Discussion | For illustration, consider the two allele case and suppose we have a p-value based on the Wald⦠|
| 45 | 5. Discussion | data. This p-value prior gives ABFp=(1+K)1β2exp(βnf^22K1+K) and under this prior identicalβ¦ |
| 46 | 5. Discussion | R and WinBUGS codes to implement the methods presented here are available from⦠|
No entities extracted from this document yet.
No uploaded files.
In this knowledge base
| Title | Year | PMID |
|---|---|---|
| Quality control and quality assurance in genotypic data for genome-wide association studies. | 2010 | 20718045 |
External
| Title | Authors | Journal | Year | Link |
|---|---|---|---|---|
| <i>COMT</i> and <i>ACE</i> (Epi)genetic Variation Is Associated with Cognitive and Metabolic Resilience in Swiss Tactical Athletes. | FlΓΌck M et al. | β | 2026 | β |
| Tests for segregation distortion in tetraploid F1 populations. | Gerard D et al. | β | 2025 | β |
| Tests for Segregation Distortion in Tetraploid F1 Populations | Gerard D et al. | β | 2024 | β |
| Double reduction estimation and equilibrium tests in natural autopolyploid populations. | Gerard D | β | 2023 | β |
| Checking for model failure and for prior-data conflict with the constrained multinomial model. | Englert BG et al. | β | 2021 | β |
| A Bayesian analysis for investigating the association between rs13266634 polymorphism in SLC30A8 gene and type 2 diabetes. | Soltanian AR et al. | β | 2020 | β |
| A test for deviations from expected genotype frequencies on the X chromosome for sex-biased admixed populations. | Backenroth D et al. | β | 2019 | β |
| Bayesian model selection for the study of Hardy-Weinberg proportions and homogeneity of gender allele frequencies. | Puig X et al. | β | 2019 | β |
| Interleukin-8 Gene -251 A/T (rs4073) Polymorphism and Coronary Artery Disease Risk: A Meta-Analysis. | Zhang S et al. | β | 2019 | β |
| The Summer Institute in Statistical Genetics. | Weir BS | β | 2019 | β |
| A Bayesian test for Hardy-Weinberg equilibrium of biallelic X-chromosomal markers. | Puig X et al. | β | 2017 | β |
| Association Between the Lower Extremity Deep Venous Thrombosis, the Warfarin Maintenance Dose, and CYP2C9*3, CYP2D6*10, and CYP3A5*3 Genetic Polymorphisms: A Case-Control Study. | Ju S et al. | β | 2017 | β |
| Testing Departure from Hardy-Weinberg Proportions. | Wang J et al. | β | 2017 | β |
| Testing for Hardy-Weinberg equilibrium at biallelic genetic markers on the X chromosome. | Graffelman J et al. | β | 2016 | β |
| Testing Hardy-Weinberg equilibrium with a simple root-mean-square statistic. | Ward R et al. | β | 2014 | β |
| A note on exact conditional and unconditional tests for Hardy-Weinberg equilibrium. | Shan G | β | 2013 | β |
| Inference of potential genetic risks associated with large-scale releases of red sea bream in Kanagawa prefecture, Japan based on nuclear and mitochondrial DNA analysis. | Blanco Gonzalez E et al. | β | 2013 | β |
| Reducing bias of allele frequency estimates by modeling SNP genotype data with informative missingness. | Lin WY et al. | β | 2012 | β |
| Testing departure from Hardy-Weinberg proportions. | Wang J et al. | β | 2012 | β |
| PlatinumCNV: a Bayesian Gaussian mixture model for genotyping copy number polymorphisms using SNP array signal intensity data. | Kumasaka N et al. | β | 2011 | β |
| Testing Hardy-Weinberg equilibrium: an objective Bayesian analysis. | Consonni G et al. | β | 2011 | β |
| Characterizing allelic association in the genome era. | Weir BS et al. | β | 2010 | β |
| Quality control and quality assurance in genotypic data for genome-wide association studies. | Laurie CC et al. | β | 2010 | β |
| Bayesian statistical methods for genetic association studies. | Stephens M et al. | β | 2009 | β |
| Exact tests for Hardy-Weinberg proportions. | Engels WR | β | 2009 | β |