Simultaneous genotype calling and haplotype phasing improves genotype accuracy and reduces false-positive associations for genome-wide association studies.
- Authors
- Browning, Brian L; Yu, Zhaoxia
- Year
- 2009
- Journal
- American journal of human genetics
- PMID
- 19931040
- DOI
- 10.1016/j.ajhg.2009.11.004
- PMCID
- PMC2790566
We present a novel method for simultaneous genotype calling and haplotype-phase inference. Our method employs the computationally efficient BEAGLE haplotype-frequency model, which can be applied to large-scale studies with millions of markers and thousands of samples. We compare genotype calls made with our method to genotype calls made with the BIRDSEED, CHIAMO, GenCall, and ILLUMINUS genotype-calling methods, using genotype data from the Illumina 550K and Affymetrix 500K arrays. We show that our method has higher genotype-call accuracy and yields fewer uncalled genotypes than competing methods. We perform single-marker analysis of data from the Wellcome Trust Case Control Consortium bipolar disorder and type 2 diabetes studies. For bipolar disorder, the genotype calls in the original study yield 25 markers with apparent false-positive association with bipolar disorder at a p < 10(-7) significance level, whereas genotype calls made with our method yield no associated markers at this significance threshold. Conversely, for markers with replicated association with type 2 diabetes, there is good concordance between genotype calls used in the original study and calls made by our method. Results from single-marker and haplotypic analysis of our method's genotype calls for the bipolar disorder study indicate that our method is highly effective at eliminating genotyping artifacts that cause false-positive associations in genome-wide association studies. Our new genotype-calling methods are implemented in the BEAGLE and BEAGLECALL software packages.
No figures extracted from this document.
No chunks β full text not yet ingested.
No entities extracted from this document yet.
No uploaded files.
No citations found.
In this knowledge base
External
| Title | Authors | Journal | Year | Link |
|---|---|---|---|---|
| Enabling efficient analysis of biobank-scale data with genotype representation graphs. | DeHaas D et al. | β | 2025 | β |
| GDBIG: A Pioneering Birth Cohort Genomic Platform Facilitating Intergenerational Genetic Research. | Huang S et al. | β | 2025 | β |
| Integrative genome-wide association and haplotype-based analyses reveal genetic structure and local adaptation in Korean landrace soybeans. | Kim EG et al. | β | 2025 | β |
| Sex-Biased Admixture Followed by Isolation and Adaptive Evolution Shaped the Genomic and Blood Pressure Diversity of the LopNur People. | Wen J et al. | β | 2025 | β |
| GDBIG: the first birth cohort genomic database and platform facilitating intergenerational genetic research | Huang S et al. | β | 2024 | β |
| GWAS for identification of genomic regions and candidate genes in vegetable crops. | Nandi S et al. | β | 2024 | β |
| Introgression and disruption of migration routes have shaped the genetic integrity of wildebeest populations. | Liu X et al. | β | 2024 | β |
| Low-coverage whole genome sequencing for a highly selective cohort of severe COVID-19 patients. | Santos R et al. | β | 2024 | β |
| Segregation GWAS to linearize a non-additive locus with incomplete penetrance: an example of horn status in sheep. | Duijvesteijn N et al. | β | 2024 | β |
| The Born in Guangzhou Cohort Study enables generational genetic discoveries. | Huang S et al. | β | 2024 | β |
| Genomics of <i>ERBB2</i>-Positive Breast Cancer in Young Women Before and After Exposure to Chemotherapy Plus Trastuzumab. | Lipsyc-Sharf M et al. | β | 2023 | β |
| QTLs and Candidate Genes Associated with Semen Traits in Merino Sheep. | Hodge MJ et al. | β | 2023 | β |
| Sequenced-based GWAS for linear classification traits in Belgian Blue beef cattle reveals new coding variants in genes regulating body size in mammals. | GualdrΓ³n Duarte JL et al. | β | 2023 | β |
| A Fast and Robust Strategy to Remove Variant-Level Artifacts in Alzheimer Disease Sequencing Project Data. | Belloy ME et al. | β | 2022 | β |
| An empirical evaluation of genotype imputation of ancient DNA. | Ausmees K et al. | β | 2022 | β |
| Benchmarking phasing software with a whole-genome sequenced cattle pedigree. | Oget-Ebrad C et al. | β | 2022 | β |
| Editorial: Genetic validation and its role in crop improvement. | Sallam A et al. | β | 2022 | β |
| Genomic prediction in a numerically small breed population using prioritized genetic markers from whole-genome sequence data. | Moghaddar N et al. | β | 2022 | β |
| Genotyping-by-sequencing and genome-wide association study reveal genetic diversity and loci controlling agronomic traits in triticale. | Cao D et al. | β | 2022 | β |
| Genotyping, the Usefulness of Imputation to Increase SNP Density, and Imputation Methods and Tools. | Phocas F | β | 2022 | β |
| PolyHaplotyper: haplotyping in polyploids based on bi-allelic marker dosage data. | Voorrips RE et al. | β | 2022 | β |
| Population genomic monitoring provides insight into conservation status but no correlation with demographic estimates of extinction risk in a threatened trout. | Hemstrom W et al. | β | 2022 | β |
| The genome and diet of a 35,000-year-old <i>Canis lupus</i> specimen from the Paleolithic painted cave, Chauvet-Pont d'Arc, France. | Elalouf JM et al. | β | 2022 | β |
| Usefulness of temperate-adapted maize lines developed by doubled haploid and single-seed descent methods. | Santos IGD et al. | β | 2022 | β |
| A beginner's guide to low-coverage whole genome sequencing for population genomics. | Lou RN et al. | β | 2021 | β |
| Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle. | Lee D et al. | β | 2021 | β |
| A Genome-Wide Association Study Pinpoints Quantitative Trait Genes for Plant Height, Heading Date, Grain Quality, and Yield in Rye (<i>Secale cereale</i> L.). | Siekmann D et al. | β | 2021 | β |
| A population-specific reference panel for improved genotype imputation in African Americans. | O'Connell J et al. | β | 2021 | β |
| Dissection of the Genetic Basis of Yield-Related Traits in the Chinese Peanut Mini-Core Collection Through Genome-Wide Association Studies. | Zhou X et al. | β | 2021 | β |
| Diversification, Introgression, and Rampant Cytonuclear Discordance in Rocky Mountains Chipmunks (Sciuridae: Tamias). | Sarver BAJ et al. | β | 2021 | β |
| Efficient phasing and imputation of low-coverage sequencing data using large reference panels. | Rubinacci S et al. | β | 2021 | β |
| Genetic Admixture in the Culturally Unique Peranakan Chinese Population in Southeast Asia. | Wu D et al. | β | 2021 | β |
| Genetic Origins and Sex-Biased Admixture of the Huis. | Ma X et al. | β | 2021 | β |
| Genome-wide association studies: assessing trait characteristics in model and crop plants. | Alseekh S et al. | β | 2021 | β |
| Trans-ethnic genome-wide association study of severe COVID-19. | Wu P et al. | β | 2021 | β |
| Using off-target data from whole-exome sequencing to improve genotyping accuracy, association analysis and polygenic risk prediction. | Dou J et al. | β | 2021 | β |
| Appraisal of wheat genomics for gene discovery and breeding applications: a special emphasis on advances in Asia. | Rasheed A et al. | β | 2020 | β |
| Combining sequence data from multiple studies: Impact of analysis strategies on rare variant calling and association results. | Chen Z et al. | β | 2020 | β |
| Genome-Wide Association for HbA1c in Malay Identified Deletion on SLC4A1 that Influences HbA1c Independent of Glycemia. | Chai JF et al. | β | 2020 | β |
| Oligonucleotide hybridization with magnetic separation assay for multiple SNP phasing. | Lee Yu HL et al. | β | 2020 | β |
| The paternal and maternal genetic history of Vietnamese populations. | Macholdt E et al. | β | 2020 | β |
| Crop genome-wide association study: a harvest of biological relevance. | Liu HJ et al. | β | 2019 | β |
| Discovery of common and rare genetic risk variants for colorectal cancer. | Huyghe JR et al. | β | 2019 | β |
| Genetic analyses of human fetal retinal pigment epithelium gene expression suggest ocular disease mechanisms. | Liu B et al. | β | 2019 | β |
| Genome-wide meta-analysis of SNP-by9-ACEI/ARB and SNP-by-thiazide diuretic and effect on serum potassium in cohorts of European and African ancestry. | Irvin MR et al. | β | 2019 | β |
| High-quality, genome-wide SNP genotypic data for pedigreed germplasm of the diploid outbreeding species apple, peach, and sweet cherry through a common workflow. | Vanderzande S et al. | β | 2019 | β |
| Large-Scale Whole-Genome Sequencing of Three Diverse Asian Populations in Singapore. | Wu D et al. | β | 2019 | β |
| Protective HLA alleles are associated with reduced LPS levels in acute HIV infection with implications for immune activation and pathogenesis. | Claiborne DT et al. | β | 2019 | β |
| RNA-Seq in 296 phased trios provides a high-resolution map of genomic imprinting. | Jadhav B et al. | β | 2019 | β |
| SNP genotype calling and quality control for multi-batch-based studies. | Seo S et al. | β | 2019 | β |
| Statistical methods for genome-wide association studies. | Wang MH et al. | β | 2019 | β |
| Genetic Regulatory Mechanisms of Smooth Muscle Cells Map to Coronary Artery Disease Risk Loci. | Liu B et al. | β | 2018 | β |
| Genomic prediction of the polled and horned phenotypes in Merino sheep. | Duijvesteijn N et al. | β | 2018 | β |
| Genotype Imputation from Large Reference Panels. | Das S et al. | β | 2018 | β |
| Haplotype phasing in single-cell DNA-sequencing data. | Satas G et al. | β | 2018 | β |
| Interplay of host genetics and gut microbiota underlying the onset and clinical presentation of inflammatory bowel disease. | Imhann F et al. | β | 2018 | β |
| Evidence for SNP-SNP interaction identified through targeted sequencing of cleft case-parent trios. | Xiao Y et al. | β | 2017 | β |
| Genomewide Association Study of Alcohol Dependence Identifies Risk Loci Altering Ethanol-Response Behaviors in Model Organisms. | Adkins AE et al. | β | 2017 | β |
| Genome-Wide Association Study to Find Modifiers for Tetralogy of Fallot in the 22q11.2 Deletion Syndrome Identifies Variants in the <i>GPR98</i> Locus on 5q14.3. | Guo T et al. | β | 2017 | β |
| Genomic landscape of high-grade meningiomas. | Bi WL et al. | β | 2017 | β |
| Germline and somatic BAP1 mutations in high-grade rhabdoid meningiomas. | Shankar GM et al. | β | 2017 | β |
| Identifying and mitigating batch effects in whole genome sequencing data. | Tom JA et al. | β | 2017 | β |
| Landscape of Genomic Alterations in Pituitary Adenomas. | Bi WL et al. | β | 2017 | β |
| PhredEM: a phred-score-informed genotype-calling approach for next-generation sequencing studies. | Liao P et al. | β | 2017 | β |
| Predicting Cognitive Executive Functioning with Polygenic Risk Scores for Psychiatric Disorders. | Benca CE et al. | β | 2017 | β |
| A new model calling procedure for Illumina BeadArray data. | Li G | β | 2016 | β |
| Coverage recommendation for genotyping analysis of highly heterologous species using next-generation sequencing technology. | Song K et al. | β | 2016 | β |
| Evaluating Imputation Algorithms for Low-Depth Genotyping-By-Sequencing (GBS) Data. | Chan AW et al. | β | 2016 | β |
| Genome-wide association study of 12 agronomic traits in peach. | Cao K et al. | β | 2016 | β |
| MAPK activation and HRAS mutation identified in pituitary spindle cell oncocytoma. | Miller MB et al. | β | 2016 | β |
| Phasing for medical sequencing using rare variants and large haplotype reference panels. | Sharp K et al. | β | 2016 | β |
| Accuracy of genotype imputation based on random and selected reference sets in purebred and crossbred sheep populations and its effect on accuracy of genomic prediction. | Moghaddar N et al. | β | 2015 | β |
| A genome-wide association study confirms PNPLA3 and identifies TM6SF2 and MBOAT7 as risk loci for alcohol-related cirrhosis. | Buch S et al. | β | 2015 | β |
| An efficient and scalable analysis framework for variant extraction and refinement from population-scale DNA sequence data. | Jun G et al. | β | 2015 | β |
| ARID1A and TERT promoter mutations in dedifferentiated meningioma. | Abedalthagafi MS et al. | β | 2015 | β |
| Comparing variant calling algorithms for target-exon sequencing in a large sample. | Lo Y et al. | β | 2015 | β |
| Correcting for Sample Contamination in Genotype Calling of DNA Sequence Data. | Flickinger M et al. | β | 2015 | β |
| DISSCO: direct imputation of summary statistics allowing covariates. | Xu Z et al. | β | 2015 | β |
| Excess of rare variants in genes that are key epigenetic regulators of spermatogenesis in the patients with non-obstructive azoospermia. | Li Z et al. | β | 2015 | β |
| Exploring the genetic signature of body size in Yucatan miniature pig. | Kim H et al. | β | 2015 | β |
| Functional Impact and Evolution of a Novel Human Polymorphic Inversion That Disrupts a Gene and Creates a Fusion Transcript. | Puig M et al. | β | 2015 | β |
| Genetic Affinity of the Bhil, Kol and Gond Mentioned in Epic Ramayana. | Chaubey G et al. | β | 2015 | β |
| Heritability, SNP- and Gene-Based Analyses of Cannabis Use Initiation and Age at Onset. | MinicΔ CC et al. | β | 2015 | β |
| Immunogenetic influences on acquisition of HIV-1 infection: consensus findings from two African cohorts point to an enhancer element in IL19 (1q32.2). | Li X et al. | β | 2015 | β |
| M(3)-S: a genotype calling method incorporating information from samples with known genotypes. | Li G et al. | β | 2015 | β |
| Whole-genome sequencing to understand the genetic architecture of common gene expression and biomarker phenotypes. | Wood AR et al. | β | 2015 | β |
| A 660-Kb deletion with antagonistic effects on fertility and milk production segregates at high frequency in Nordic Red cattle: additional evidence for the common occurrence of balancing selection in livestock. | Kadri NK et al. | β | 2014 | β |
| ANGSD: Analysis of Next Generation Sequencing Data. | Korneliussen TS et al. | β | 2014 | β |
| Another explanation for apparent epistasis. | Wood AR et al. | β | 2014 | β |
| Assessing single nucleotide variant detection and genotype calling on whole-genome sequenced individuals. | Cheng AY et al. | β | 2014 | β |
| Characterizing bias in population genetic inferences from low-coverage sequencing data. | Han E et al. | β | 2014 | β |
| Host genetics and immune control of HIV-1 infection: fine mapping for the extended human MHC region in an African cohort. | Prentice HA et al. | β | 2014 | β |
| iCall: a genotype-calling algorithm for rare, low-frequency and common variants on the Illumina exome array. | Zhou J et al. | β | 2014 | β |
| Identification of allelic imbalance with a statistical model for subtle genomic mosaicism. | Xia R et al. | β | 2014 | β |
| Investigation of de novo unique differentially expressed genes related to evolution in exercise response during domestication in Thoroughbred race horses. | Park W et al. | β | 2014 | β |
| KRLMM: an adaptive genotype calling method for common and low frequency variants. | Liu R et al. | β | 2014 | β |
| Molecular validation of the schizophrenia spectrum. | Bigdeli TB et al. | β | 2014 | β |
| PedBLIMP: extending linear predictors to impute genotypes in pedigrees. | Chen W et al. | β | 2014 | β |
| Quantitative genetics of CTCF binding reveal local sequence effects and different modes of X-chromosome association. | Ding Z et al. | β | 2014 | β |
| Whole-genome sequence variation, population structure and demographic history of the Dutch population. | Genome of the Netherlands Consortium | β | 2014 | β |
| Accurate local-ancestry inference in exome-sequenced admixed individuals via off-target sequence reads. | Hu Y et al. | β | 2013 | β |
| A comprehensive SNP and indel imputability database. | Duan Q et al. | β | 2013 | β |
| Analysis of immune-related loci identifies 48 new susceptibility variants for multiple sclerosis. | International Multiple Sclerosis Genetics Consortium (IMSGC) et al. | β | 2013 | β |
| A primer for disease gene prioritization using next-generation sequencing data. | Wang S et al. | β | 2013 | β |
| Building a genome analysis pipeline to predict disease risk and prevent disease. | Bromberg Y | β | 2013 | β |
| Converging Evidence for Epistasis between ANK3 and Potassium Channel Gene KCNQ2 in Bipolar Disorder. | Judy JT et al. | β | 2013 | β |
| Estimating individual admixture proportions from next generation sequencing data. | Skotte L et al. | β | 2013 | β |
| Genetic recombination is targeted towards gene promoter regions in dogs. | Auton A et al. | β | 2013 | β |
| Genome-wide association analysis in primary sclerosing cholangitis and ulcerative colitis identifies risk loci at GPR35 and TCF4. | Ellinghaus D et al. | β | 2013 | β |
| HapFABIA: identification of very short segments of identity by descent characterized by rare variants in large sequencing data. | Hochreiter S | β | 2013 | β |
| Imputation of coding variants in African Americans: better performance using data from the exome sequencing project. | Duan Q et al. | β | 2013 | β |
| Multi-population classical HLA type imputation. | Dilthey A et al. | β | 2013 | β |
| Next-generation sequencing in the clinic: promises and challenges. | Xuan J et al. | β | 2013 | β |
| Non-Hodgkin lymphoma risk and variants in genes controlling lymphocyte development. | Schuetz JM et al. | β | 2013 | β |
| Peeling back the evolutionary layers of molecular mechanisms responsive to exercise-stress in the skeletal muscle of the racing horse. | Kim H et al. | β | 2013 | β |
| Re-ranking sequencing variants in the post-GWAS era for accurate causal variant identification. | Faye LL et al. | β | 2013 | β |
| Single Nucleotide Polymorphism (SNP) Detection and Genotype Calling from Massively Parallel Sequencing (MPS) Data. | Li Y et al. | β | 2013 | β |
| Whole-exome sequencing of 2,000 Danish individuals and the role of rare coding variants in type 2 diabetes. | Lohmueller KE et al. | β | 2013 | β |
| A beginners guide to SNP calling from high-throughput DNA-sequencing data. | Altmann A et al. | β | 2012 | β |
| Absolute quantification of somatic DNA alterations in human cancer. | Carter SL et al. | β | 2012 | β |
| A fast and accurate SNP detection algorithm for next-generation sequencing data. | Xu F et al. | β | 2012 | β |
| Amino acid position 11 of HLA-DRΞ²1 is a major determinant of chromosome 6p association with ulcerative colitis. | Achkar JP et al. | β | 2012 | β |
| A pathway analysis method for genome-wide association studies. | Shahbaba B et al. | β | 2012 | β |
| A Ξ½-support vector regression based approach for predicting imputation quality. | Huang YH et al. | β | 2012 | β |
| Data mining approaches for genome-wide association of mood disorders. | Pirooznia M et al. | β | 2012 | β |
| Detecting and estimating contamination of human DNA samples in sequencing and array-based genotype data. | Jun G et al. | β | 2012 | β |
| Detecting rare variant associations by identity-by-descent mapping in case-control studies. | Browning SR et al. | β | 2012 | β |
| Efficiency and power as a function of sequence coverage, SNP array density, and imputation. | Flannick J et al. | β | 2012 | β |
| Exome sequencing and complex disease: practical aspects of rare variant association studies. | Do R et al. | β | 2012 | β |
| Extremely low-coverage sequencing and imputation increases power for genome-wide association studies. | Pasaniuc B et al. | β | 2012 | β |
| Family-based association tests using genotype data with uncertainty. | Yu Z | β | 2012 | β |
| Genotype calling from next-generation sequencing data using haplotype information of reads. | Zhi D et al. | β | 2012 | β |
| Haplotype inference. | Delaneau O et al. | β | 2012 | β |
| Identity by descent between distant relatives: detection and applications. | Browning SR et al. | β | 2012 | β |
| M(3): an improved SNP calling algorithm for Illumina BeadArray data. | Li G et al. | β | 2012 | β |
| The MTAP-CDKN2A locus confers susceptibility to a naturally occurring canine cancer. | Shearin AL et al. | β | 2012 | β |
| Accuracy and computational efficiency of a graphical modeling approach to linkage disequilibrium estimation. | Abel HJ et al. | β | 2011 | β |
| A fast, powerful method for detecting identity by descent. | Browning BL et al. | β | 2011 | β |
| A framework for variation discovery and genotyping using next-generation DNA sequencing data. | DePristo MA et al. | β | 2011 | β |
| Approximate and exact tests of Hardy-Weinberg equilibrium using uncertain genotypes. | Shriner D | β | 2011 | β |
| A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. | Li H | β | 2011 | β |
| Comparing genotyping algorithms for Illumina's Infinium whole-genome SNP BeadChips. | Ritchie ME et al. | β | 2011 | β |
| DASH: a method for identical-by-descent haplotype mapping uncovers association with recent variation. | Gusev A et al. | β | 2011 | β |
| Genome-wide association analysis of age at onset and psychotic symptoms in bipolar disorder. | Belmonte Mahon P et al. | β | 2011 | β |
| Genome-wide association study in bipolar patients stratified by co-morbidity. | Kerner B et al. | β | 2011 | β |
| Haplotype phasing: existing methods and new developments. | Browning SR et al. | β | 2011 | β |
| Increased power of mixed models facilitates association mapping of 10 loci for metabolic traits in an isolated population. | Kenny EE et al. | β | 2011 | β |
| Inferring haplotypes of copy number variations from high-throughput data with uncertainty. | Kato M et al. | β | 2011 | β |
| Linkage disequilibrium based genotype calling from low-coverage shotgun sequencing reads. | Duitama J et al. | β | 2011 | β |
| Low-coverage sequencing: implications for design of complex trait association studies. | Li Y et al. | β | 2011 | β |
| SNP detection and genotyping from low-coverage sequencing data on multiple diploid samples. | Le SQ et al. | β | 2011 | β |
| The influence of genetics on response to treatment with ranibizumab (Lucentis) for age-related macular degeneration: the Lucentis Genotype Study (an American Ophthalmological Society thesis). | Francis PJ | β | 2011 | β |
| Genotype imputation for genome-wide association studies. | Marchini J et al. | β | 2010 | β |
| High-resolution detection of identity by descent in unrelated individuals. | Browning SR et al. | β | 2010 | β |