The effect of minor allele frequency on the likelihood of obtaining false positives.
- Authors
- Tabangin, Meredith E; Woo, Jessica G; Martin, Lisa J
- Year
- 2009
- Journal
- BMC proceedings
- PMID
- 20018033
- DOI
- 10.1186/1753-6561-3-S7-S41
- PMCID
- PMC2795940
Determining the most promising single-nucleotide polymorphisms (SNPs) presents a challenge in genome-wide association studies, when hundreds of thousands of association tests are conducted. The power to detect genetic effects is dependent on minor allele frequency (MAF), and genome-wide association studies SNP arrays include SNPs with a wide distribution of MAFs. Therefore, it is critical to understand MAF's effect on the false positive rate.Data from the Framingham Heart Study simulated data (Problem 3, with answers) was used to examine the effects of varying MAFs on the likelihood of false positives. Replication set 1 was used to generate 1 million permutations of case/control status in unrelated individuals. Logistic regression was used to test for the association between each SNP and myocardial infarction using an additive model. We report the number of "significant" tests by MAF at alpha = 10-4, 10-5, and 10-6.Common SNPs exhibited fewer false positives than expected. At alpha = 10-4, SNPs with MAF 25% and 50% resulted in 69.2 [95%CI: 62.8-75.6] and 70.8 [95%CI: 61.3-80.4] false positives, respectively, compared to 100 expected. Rare SNPs exhibited more variability but did not show more false-positive results than expected by chance. However, at alpha = 10-4, MAF = 5% exhibited significantly more false positives (105.5 [95%CI: 81-130.1]) than MAF = 25% and 50%. Similar results were seen at the other alpha values.These results suggest that removal of low MAF SNPs from analysis due to concerns about inflated false-positive results may not be appropriate.
Permuted runs reaching significance. Means Β± 95% confidence intervals of number of permuted runs reaching significance at Ξ± β€ 10-4 (Panel A), Ξ± β€ 10-5 (Panel B), and Ξ± β€ 10-6 (Panel C). The dotted line in each panel represents the expected number of significant tests expected by chance.
LLM interpretation
This figure consists of three scatter plots (Panels A, B, and C) showing the number of permuted runs reaching significance across different Minor Allele Frequencies (1, 5, 10, 25, and 50). The y-axis represents the "# Meeting Threshold" for three different alpha levels ($\alpha \le 10^{-4}$, $10^{-5}$, and $10^{-6}$), with data points displayed as means $\pm$ 95% confidence intervals. A red dotted line in each panel indicates the expected number of significant tests by chance.
No entities extracted from this document yet.
No uploaded files.
| Citation | PMID | DOI | Status |
|---|---|---|---|
| ArdlieKGLunettaKLSeielstadMTesting for population subdivision and association in four case-control studiesAm J Hum Genet20027130431110.1086/34171912096349PMC379163 | β | β | β |
| CupplesLAArrudaHTBenjaminEJD'AgostinoRBSrDemissieSDeStefanoALDupuisJFallsKMFoxCSGottliebDJGovindarajuDRGuoCYHeard-CostaNLHwangSJKathiresanSKielDPLaramieJMLarsonMGLevyDLiuCYLunettaKLMailmanMDManningAKMeigsJBMurabitoJMNewton-ChehCO'ConnorGTO'DonnellCJPandeyMSeshadriSVasanRSWangZYWilkJBWolfPAYangQAtwoodLDThe Framingham Heart Study 100 k SNP genome-wide association study resource: overview of 17 phenotype working group reportsBMC Med Genet20078Suppl 1S110.1186/1471-2350-8-S1-S117903291PMC1995613 | β | β | β |
| FlorezJCManningAKDupuisJMcAteerJIrenzeKGianninyLMirelDBFoxCSCupplesLAMeigsJBA 100 k genome-wide association scan for diabetes and related traits in the Framingham Heart Study: replication and integration with other genome-wide datasetsDiabetes2007563063307410.2337/db07-045117848626 | β | β | β |
| GorlovIPGorlovaOYSunyaevSRSpitzMRAmosCIShifting paradigm of association studies: value of rare single-nucleotide polymorphismsAm J Hum Genet20088210011210.1016/j.ajhg.2007.09.00618179889PMC2253956 | β | β | β |
| LamACSchoutenMAulchenkoYSHaleyCSKoningD-JRapid and robust association mapping of expression QTLBMC Proc20071Suppl 1S14410.1186/1753-6561-1-s1-s14418466488PMC2367564 | β | β | β |
| MoskvinaVCraddockNHolmansPOwenMJO'DonovanMCEffects of differential genotyping error rate on the type I error probability of case-control studiesHum Hered200661556410.1159/00009255316612103 | β | β | β |
In this knowledge base
| Title | Year | PMID |
|---|---|---|
| Genome-wide association study of opioid dependence: multiple associations mapped to calcium and potassium pathways. | 2014 | 24143882 |
External
| Title | Authors | Journal | Year | Link |
|---|---|---|---|---|
| Aggregating single nucleotide polymorphisms improves filtering for false-positive associations postimputation. | Stahl K et al. | β | 2025 | β |
| Association Between Single Nucleotide Polymorphisms in the Aquaporin-4 Gene and Longitudinal Changes in White Matter Free Water and Cognitive Function in Non-Demented Older Adults. | Liu L et al. | β | 2025 | β |
| Causal association between blood and urine biomarkers, immune cells, and bladder cancer: A Mendelian randomization and mediation analysis. | Lin F et al. | β | 2025 | β |
| Exploring the diversity of three Northern Atlantic sea beet populations. | Bertram L et al. | β | 2025 | β |
| Genetic Variants of Interleukin-8 and Interleukin-16 and Their Association with Cervical Cancer Risk. | Watrowski R et al. | β | 2025 | β |
| Genomic variance partitioning of carcass and meat quality traits in Angus beef cattle. | Baneh H et al. | β | 2025 | β |
| Identification of Schizophrenia-Risk Regulatory Variant rs1399178 in the Non-coding Region With Its Impact on NRF1 Binding. | Ji L et al. | β | 2025 | β |
| Unraveling the genetic traits and functional diversity of the pan-genome in Pantoea dispersa. | He S et al. | β | 2025 | β |
| Whole plant transpiration responses of common bean (Phaseolus vulgaris L.) to drying soil: Water channels and transcription factors. | Cordoba-Novoa H et al. | β | 2025 | β |
| Association of tyrosine kinase 2 polymorphisms with susceptibility to microscopic polyangiitis in a Guangxi population. | Yang B et al. | β | 2024 | β |
| Genome-Wide Association Analysis of Effective Tillers in Rice under Different Nitrogen Gradients. | Liu Y et al. | β | 2024 | β |
| Systems genetics uncover new loci containing functional gene candidates in Mycobacterium tuberculosis-infected Diversity Outbred mice. | Gatti DM et al. | β | 2024 | β |
| Understanding genetic diversity in drought-adaptive hybrid parental lines in pearl millet. | Kandarkar K et al. | β | 2024 | β |
| Accurate sequencing of DNA motifs able to form alternative (non-B) structures. | Weissensteiner MH et al. | β | 2023 | β |
| An international wheat diversity panel reveals novel sources of genetic resistance to tan spot in Australia. | Taylor J et al. | β | 2023 | β |
| Comparing feature selection and machine learning approaches for predicting <i>CYP2D6</i> methylation from genetic variation. | Fong WJ et al. | β | 2023 | β |
| FREQ-Seq2: a method for precise high-throughput combinatorial quantification of allele frequencies. | Zhao R et al. | β | 2023 | β |
| Genetic diversity and population structure of Vernonia amygdalina Del. in Uganda based on genome wide markers. | Nantongo JS et al. | β | 2023 | β |
| Genome-wide association mapping in a sweet cherry germplasm collection (<i>Prunus avium</i> L.) reveals candidate genes for fruit quality traits. | Donkpegan ASL et al. | β | 2023 | β |
| Genome-wide association mapping of resistance to the foliar diseases septoria nodorum blotch and tan spot in a global winter wheat collection. | Peters Haugrud AR et al. | β | 2023 | β |
| Role of 19 SNPs in 10 genes with type 2 diabetes in the Pakistani population. | Khan N et al. | β | 2023 | β |
| Selection of genetic instruments in Mendelian randomisation studies of sleep traits. | Paz V et al. | β | 2023 | β |
| The PD-1 single-nucleotide polymorphism rs11568821 and rs2227981 as a novel prognosis modelΒ in a triple-negative breast cancer patient. | Boguszewska-Byczkiewicz K et al. | β | 2023 | β |
| Traces of Human-Mediated Selection in the Gene Pool of Red Deer Populations. | MoravΔΓkovΓ‘ N et al. | β | 2023 | β |
| Associations between Gene-Gene Interaction and Overweight/Obesity of 12-Month-Old Chinese Infants. | Mei H et al. | β | 2022 | β |
| A tree-based gene-environment interaction analysis with rare features. | Liu M et al. | β | 2022 | β |
| Interaction Effects of DRD2 Genetic Polymorphism and Interpersonal Stress on Problematic Gaming in College Students. | Kim E et al. | β | 2022 | β |
| Landscape Genomics Provides Evidence of Ecotypic Adaptation and a Barrier to Gene Flow at Treeline for the Arctic Foundation Species <i>Eriophorum vaginatum</i>. | Stunz E et al. | β | 2022 | β |
| Population-specific, recent positive selection signatures in cultivated Cucumis sativus L. (cucumber). | Lin X et al. | β | 2022 | β |
| SilicoDArT and SNP markers for genetic diversity and population structure analysis of Trema orientalis; a fodder species. | Nantongo JS et al. | β | 2022 | β |
| The Association Between Genomic Heterozygosity and Carcass Merit in Cattle. | Kenny D et al. | β | 2022 | β |
| Transcription factor 7-like 2 single nucleotide polymorphisms rs290487 and rs290481 are associated with dyslipidemia in the Balinese population. | Limardi PC et al. | β | 2022 | β |
| Circulating Adiponectin and Its Association with Metabolic Traits and Type 2 Diabetes: Gene-Diet Interactions Focusing on Selected Gene Variants and at the Genome-Wide Level in High-Cardiovascular Risk Mediterranean Subjects. | Coltell O et al. | β | 2021 | β |
| Genetic Susceptibility to Acute Kidney Injury. | Ortega-Loubon C et al. | β | 2021 | β |
| Genetic Variation in <i>ABCC4</i> and <i>CFTR</i> and Acute Pancreatitis during Treatment of Pediatric Acute Lymphoblastic Leukemia. | Bartram T et al. | β | 2021 | β |
| GWAS significance thresholds for deep phenotyping studies can depend upon minor allele frequencies and sample size. | Asif H et al. | β | 2021 | β |
| Interaction of Sirtuin 1 (SIRT1) candidate longevity gene and particulate matter (PM2.5) on all-cause mortality: a longitudinal cohort study in China. | Yao Y et al. | β | 2021 | β |
| OsTPR boosts the superior grains through increase in upper secondary rachis branches without incurring a grain quality penalty. | Pasion EA et al. | β | 2021 | β |
| Polymorphism in the MAGI2 Gene Modifies the Effect of Amyloid Ξ² on Neurodegeneration. | Kim HR et al. | β | 2021 | β |
| Regarding the F-word: The effects of data filtering on inferred genotype-environment associations. | Ahrens CW et al. | β | 2021 | β |
| Characterization of genetic diversity and population structure in wheat using array based SNP markers. | Kumar D et al. | β | 2020 | β |
| Genomic selection for lentil breeding: Empirical evidence. | Haile TA et al. | β | 2020 | β |
| Identification of Stripe Rust Resistance Loci in U.S. Spring Wheat Cultivars and Breeding Lines Using Genome-Wide Association Mapping and <i>Yr</i> Gene Markers. | Liu L et al. | β | 2020 | β |
| Measuring the Microscopic Structures of Human Dental Enamel Can Predict Caries Experience. | Kelly AM et al. | β | 2020 | β |
| Pathogen Genetic Control of Transcriptome Variation in the <i>Arabidopsis thaliana</i> - <i>Botrytis cinerea</i> Pathosystem. | Soltis NE et al. | β | 2020 | β |
| Association Analysis of 14 Candidate Gene Polymorphism with Depression and Stress among Gestational Diabetes Mellitus. | Lee KW et al. | β | 2019 | β |
| Association of HTRA1 and ARMS2 gene polymorphisms with response to intravitreal ranibizumab among neovascular age-related macular degenerative subjects. | Mohamad NA et al. | β | 2019 | β |
| Generating High Density, Low Cost Genotype Data in Soybean [<i>Glycine max</i> (L.) Merr.]. | Happ MM et al. | β | 2019 | β |
| Genetic Diversity and Population Structure of a <i>Camelina sativa</i> Spring Panel. | Luo Z et al. | β | 2019 | β |
| Association between inflammatory-response gene polymorphisms and risk of acute kidney injury in children. | He J et al. | β | 2018 | β |
| Association mapping of quantitative resistance to charcoal root rot in mulberry germplasm. | Pinto MV et al. | β | 2018 | β |
| Efficiency of different strategies to mitigate ascertainment bias when using SNP panels in diversity studies. | Malomane DK et al. | β | 2018 | β |
| Genetic Loci Controlling Carotenoid Biosynthesis in Diverse Tropical Maize Lines. | Azmach G et al. | β | 2018 | β |
| Genome-Wide Association Mapping of Loci for Resistance to Stripe Rust in North American Elite Spring Wheat Germplasm. | Godoy JG et al. | β | 2018 | β |
| Mating Design and Genetic Structure of a Multi-Parent Advanced Generation Intercross (MAGIC) Population of Sorghum (<i>Sorghum bicolor</i> (L.) Moench). | Ongom PO et al. | β | 2018 | β |
| Phenotypic Data from Inbred Parents Can Improve Genomic Prediction in Pearl Millet Hybrids. | Liang Z et al. | β | 2018 | β |
| The search for loci under selection: trends, biases and progress. | Ahrens CW et al. | β | 2018 | β |
| Ultra-high-throughput DArTseq-based silicoDArT and SNP markers for genomic studies in macadamia. | Alam M et al. | β | 2018 | β |
| A Novel QTL for Powdery Mildew Resistance in Nordic Spring Barley (<i>Hordeum vulgare</i> L. ssp. <i>vulgare</i>) Revealed by Genome-Wide Association Study. | Bengtsson T et al. | β | 2017 | β |
| A Review of the Genetics of Hypertension with a Focus on Gene-Environment Interactions. | Waken RJ et al. | β | 2017 | β |
| Association of polymorphisms in the intron of <i>TCF4</i> gene to late-onset Fuchs endothelial corneal dystrophy: An Indian cohort study. | Rao BS et al. | β | 2017 | β |
| A user guide to the Brassica 60K Illumina Infiniumβ’ SNP genotyping array. | Mason AS et al. | β | 2017 | β |
| Complement Polymorphisms in Kidney Transplantation: Critical in Graft Rejection? | Michielsen LA et al. | β | 2017 | β |
| Cross-platform compatibility of de novo-aligned SNPs in a nonmodel butterfly genus. | Campbell EO et al. | β | 2017 | β |
| Deciphering the regulation of porcine genes influencing growth, fatness and yield-related traits through genetical genomics. | MartΓnez-Montes AM et al. | β | 2017 | β |
| Genetic variants in <i>KCNJ11</i>, <i>TCF7L2</i> and <i>HNF4A</i> are associated with type 2 diabetes, BMI and dyslipidemia in families of Northeastern Mexico: A pilot study. | Gallardo-Blanco HL et al. | β | 2017 | β |
| Genome-wide mapping and prediction suggests presence of local epistasis in a vast elite winter wheat populations adapted to Central Europe. | He S et al. | β | 2017 | β |
| Structural variants in genes associated with human Williams-Beuren syndrome underlie stereotypical hypersociability in domestic dogs. | vonHoldt BM et al. | β | 2017 | β |
| Using RNA-Seq SNP data to reveal potential causal mutations related to pig production traits and RNA editing. | MartΓnez-Montes AM et al. | β | 2017 | β |
| A General Framework for the Evaluation of Genetic Association Studies Using Multiple Marginal Models. | Kitsche A et al. | β | 2016 | β |
| Development and application of a novel genome-wide SNP array reveals domestication history in soybean. | Wang J et al. | β | 2016 | β |
| Impact of imputation methods on the amount of genetic variation captured by a single-nucleotide polymorphism panel in soybeans. | Xavier A et al. | β | 2016 | β |
| Regional heritability mapping method helps explain missing heritability of blood lipid traits in isolated populations. | Shirali M et al. | β | 2016 | β |
| Type I error rates of rare single nucleotide variants are inflated in tests of association with non-normally distributed traits using simple linear regression methods. | Schwantes-An TH et al. | β | 2016 | β |
| Walking through the statistical black boxes of plant breeding. | Xavier A et al. | β | 2016 | β |
| A cautionary note on the impact of protocol changes for genome-wide association SNPΒ ΓΒ SNP interaction studies: an example on ankylosing spondylitis. | Bessonov K et al. | β | 2015 | β |
| A genome-wide association study in large white and landrace pig populations for number piglets born alive. | Bergfelder-DrΓΌing S et al. | β | 2015 | β |
| Genome-wide association meta-analysis identifies novel variants associated with fasting plasma glucose in East Asians. | Hwang JY et al. | β | 2015 | β |
| Genome-wide association study of production and stability traits in barley cultivated under future climate scenarios | Ingvordsen CH et al. | β | 2015 | β |
| Starch phosphorylation in potato tubers is influenced by allelic variation in the genes encoding glucan water dikinase, starch branching enzymes I and II, and starch synthase III. | Carpenter MA et al. | β | 2015 | β |
| The effect of rare variants on inflation of the test statistics in case-control analyses. | Pirie A et al. | β | 2015 | β |
| A first step toward the development of a barley NAM population and its utilization to detect QTLs conferring leaf rust seedling resistance. | Schnaithmann F et al. | β | 2014 | β |
| Candidate gene approach for parasite resistance in sheep--variation in immune pathway genes and association with fecal egg count. | Periasamy K et al. | β | 2014 | β |
| Discovering pair-wise genetic interactions: an information theory-based approach. | Ignac TM et al. | β | 2014 | β |
| Dissection of additive genetic variability for quantitative traits in chickens using SNP markers. | Abdollahi-Arpanahi R et al. | β | 2014 | β |
| Genome-Wide Association in Tomato Reveals 44 Candidate Loci for Fruit Metabolic Traits. | Sauvage C et al. | β | 2014 | β |
| Genome-wide association study of opioid dependence: multiple associations mapped to calcium and potassium pathways. | Gelernter J et al. | β | 2014 | β |
| Combining association mapping and transcriptomics identify HD2B histone deacetylase as a genetic factor associated with seed dormancy in Arabidopsis thaliana. | Yano R et al. | β | 2013 | β |
| Gene variants within the COL1A1 gene are associated with reduced anterior cruciate ligament injury in professional soccer players. | Ficek K et al. | β | 2013 | β |
| Genome-wide analysis of blood pressure variability and ischemic stroke. | Yadav S et al. | β | 2013 | β |
| The MCIC collection: a shared repository of multi-modal, multi-site brain image data from a clinical investigation of schizophrenia. | Gollub RL et al. | β | 2013 | β |
| TLR-10 polymorphism and papillary thyroid cancer: one more SNP to consider? | Boufraqech M et al. | β | 2013 | β |
| ANK3 and CACNA1C--missing genetic link for bipolar disorder and major depressive disorder in two German case-control samples. | Kloiber S et al. | β | 2012 | β |
| Constructing endophenotypes of complex diseases using non-negative matrix factorization and adjusted rand index. | Wang HM et al. | β | 2012 | β |
| Reliable single chip genotyping with semi-parametric log-concave mixtures. | Rippe RC et al. | β | 2012 | β |
| The genetic association of DUSP6 with bipolar disorder and its effect on ERK activity. | Kim SH et al. | β | 2012 | β |
| Improving the signal-to-noise ratio in genome-wide association studies. | Martin LJ et al. | β | 2009 | β |