A comprehensive evaluation of SNP genotype imputation.
- Authors: Nothnagel, Michael; Ellinghaus, David; Schreiber, Stefan; Krawczak, Michael; Franke, Andre
- Year: 2009
- Journal: Human genetics
- PMID: 19089453
- DOI: 10.1007/s00439-008-0606-5
Genome-wide association studies have contributed significantly to the genetic dissection of complex diseases. In order to increase the power of existing marker sets even further, methods have been proposed to predict individual genotypes at un-typed loci from other marker sets by imputation, usually employing HapMap data as a reference. Although various imputation algorithms have been used in practice already, a comprehensive evaluation and comparison of these approaches, using genome-wide SNP data from one and the same population is still lacking. We therefore investigated four publicly available programs for genotype imputation (BEAGLE, IMPUTE, MACH, and PLINK) using data from 449 German individuals genotyped in our laboratory for three genome-wide SNP sets [Affymetrix 5.0 (500 k), Affymetrix 6.0 (1,000 k), and Illumina 550 k]. We observed that HapMap-based imputation in a northern European population is powerful and reliable, even in highly variable genomic regions such as the extended MHC on chromosome 6p21. However, while genotype predictions were found to be highly accurate with all four programs, the number of SNPs for which imputation was actually carried out ('imputation efficacy') varied substantially. BEAGLE, IMPUTE, and MACH yielded nearly identical trade-offs between imputation accuracy and efficacy whereas PLINK performed consistently poorer. We nevertheless recommend either MACH or BEAGLE for practical use because these two programs are more user-friendly and generally require less memory than IMPUTE.
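The trade-off between imputation accuracy and efficacy mentioned in the abstract can be illustrated with a small sketch. It assumes, as a simplification not taken from the paper, that each imputed genotype carries a confidence score and that a genotype counts as "imputed" only when that score reaches a chosen threshold. The function name `accuracy_and_efficacy`, the 0/1/2 genotype coding, and the toy data are all hypothetical.

```python
from typing import Optional, Sequence, Tuple


def accuracy_and_efficacy(
    true_genotypes: Sequence[int],
    imputed_genotypes: Sequence[int],
    confidences: Sequence[float],
    threshold: float,
) -> Tuple[Optional[float], float]:
    """Return (accuracy, efficacy) at one confidence threshold.

    efficacy: fraction of masked genotypes that were actually called,
              i.e. whose confidence is >= threshold (assumed definition).
    accuracy: fraction of the called genotypes that match the truth,
              or None when nothing was called.
    """
    n = len(true_genotypes)
    if not (n == len(imputed_genotypes) == len(confidences)):
        raise ValueError("all inputs must have the same length")
    if n == 0:
        raise ValueError("need at least one genotype")

    # Keep only genotype pairs whose confidence passes the threshold.
    called = [
        (t, g)
        for t, g, c in zip(true_genotypes, imputed_genotypes, confidences)
        if c >= threshold
    ]
    efficacy = len(called) / n
    if not called:
        return None, efficacy
    accuracy = sum(t == g for t, g in called) / len(called)
    return accuracy, efficacy


# Hypothetical toy data: genotypes coded as minor-allele counts (0, 1, 2).
truth = [0, 1, 2, 1, 0, 2]
imputed = [0, 1, 2, 0, 0, 1]
conf = [0.99, 0.95, 0.90, 0.60, 0.85, 0.55]

for thr in (0.5, 0.8, 0.95):
    acc, eff = accuracy_and_efficacy(truth, imputed, conf, thr)
    print(f"threshold={thr:.2f}  accuracy={acc:.3f}  efficacy={eff:.3f}")
```

In this toy data, raising the threshold discards the two wrong low-confidence calls, so accuracy rises while efficacy falls. That is the kind of trade-off the abstract compares across programs.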
External
| Title | Authors | Journal | Year | Link |
|---|---|---|---|---|
| Assessing Genotype Imputation Methods for Low-Coverage Sequencing Data in Populations With Differing Relatedness and Inbreeding Levels. | Vi T et al. | - | 2025 | - |
| Genetic basis of phenotypic diversity in *C. stenophylla*: a stepping stone for climate-adapted coffee cultivar development. | Lahai PM et al. | - | 2025 | - |
| KBeagle: An Adaptive Strategy and Tool for Improving Imputation Accuracy and Computation Time. | Guo X et al. | - | 2025 | - |
| Association Between Polymorphisms in DNA Repair Genes and Glioma Susceptibility: A Meta-Analysis of Four Single Nucleotide Polymorphisms (rs3212986, rs13181, rs25487, and rs861539). | Fotakopoulos G et al. | - | 2024 | - |
| Genome-Wide Association Study and Phenotype Prediction of Reproductive Traits in Large White Pigs. | Zhang H et al. | - | 2024 | - |
| Protective function of sclerosing cholangitis on IBD. | Bedke T et al. | - | 2024 | - |
| A joint use of pooling and imputation for genotyping SNPs. | Clouard C et al. | - | 2022 | - |
| Association of IL1R1 gene (SNP rs2071374) with the risk of preeclampsia. | Sivaraj N et al. | - | 2022 | - |
| Genome-Wide Association Study Identifies Two Common Loci Associated with Pigment Dispersion Syndrome/Pigmentary Glaucoma and Implicates Myopia in its Development. | Simcoe MJ et al. | - | 2022 | - |
| Association mapping and genomic selection for sorghum adaptation to tropical soils of Brazil in a sorghum multiparental random mating population. | Bernardino KC et al. | - | 2021 | - |
| A survival of the fittest strategy for the selection of genotypes by which drug responders and non-responders can be predicted in small groups. | Höhle D et al. | - | 2021 | - |
| False positive findings during genome-wide association studies with imputation: influence of allele frequency and imputation accuracy. | Zhang Z et al. | - | 2021 | - |
| An Automated Method To Predict Mouse Gene and Protein Sequences Using Variant Data. | Dornbos P et al. | - | 2020 | - |
| Overcoming the challenges of imputation of rare variants in a Taiwanese cohort. | Chattopadhyay A et al. | - | 2020 | - |
| The Global Durum Wheat Panel (GDP): An International Platform to Identify and Exchange Beneficial Alleles. | Mazzucotelli E et al. | - | 2020 | - |
| A combined linkage, microarray and exome analysis suggests MAP3K11 as a candidate gene for left ventricular hypertrophy. | Silva CT et al. | - | 2018 | - |
| Genotype imputation performance of three reference panels using African ancestry individuals. | Vergara C et al. | - | 2018 | - |
| Modeling coverage gaps in haplotype frequencies via Bayesian inference to improve stem cell donor selection. | Louzoun Y et al. | - | 2018 | - |
| Validation of genotype imputation in Southeast Asian populations and the effect of single nucleotide polymorphism annotation on imputation outcome. | Lert-Itthiporn W et al. | - | 2018 | - |
| Identifying tagging SNPs for African specific genetic variation from the African Diaspora Genome. | Johnston HR et al. | - | 2017 | - |
| IL-8 -251A/T and +781C/T polymorphisms were associated with risk of breast cancer in a Chinese population. | Zhang J et al. | - | 2017 | - |
| Polymorphisms in Renal Ammonia Metabolism Genes Correlate With 24-Hour Urine pH. | Canales BK et al. | - | 2017 | - |
| Securing the use of existing sample collections for future human genetic research. | Kanoungi G et al. | - | 2017 | - |
| Sequencing and de novo assembly of 150 genomes from Denmark as a population reference. | Maretty L et al. | - | 2017 | - |
| Bias Characterization in Probabilistic Genotype Data and Improved Signal Detection with Multiple Imputation. | Palmer C et al. | - | 2016 | - |
| FAPI: Fast and accurate P-value Imputation for genome-wide association study. | Kwan JS et al. | - | 2016 | - |
| Genetic variants in the mTOR pathway and breast cancer risk in African American women. | Cheng TD et al. | - | 2016 | - |
| Multi-generational imputation of single nucleotide polymorphism marker genotypes and accuracy of genomic selection. | Toghiani S et al. | - | 2016 | - |
| Pharmacometabolomics-aided Pharmacogenomics in Autoimmune Disease. | Katsila T et al. | - | 2016 | - |
| Drug-Gene Interactions of Antihypertensive Medications and Risk of Incident Cardiovascular Disease: A Pharmacogenomics Study from the CHARGE Consortium. | Bis JC et al. | - | 2015 | - |
| SNP imputation bias reduces effect size determination. | Khankhanian P et al. | - | 2015 | - |
| Accuracy of imputation to whole-genome sequence data in Holstein Friesian cattle. | van Binsbergen R et al. | - | 2014 | - |
| Drug-gene interactions and the search for missing heritability: a cross-sectional pharmacogenomics study of the QT interval. | Avery CL et al. | - | 2014 | - |
| Evaluation of genome based estimated breeding values for meat quality in a berkshire population using high density single nucleotide polymorphism chips. | Baby S et al. | - | 2014 | - |
| Impact of pre-imputation SNP-filtering on genotype imputation results. | Roshyara NR et al. | - | 2014 | - |
| Impact of reference population on accuracy of imputation from 6K to 50K single nucleotide polymorphism chips in purebred and crossbreed beef cattle. | Ventura RV et al. | - | 2014 | - |
| Imputation and quality control steps for combining multiple genome-wide datasets. | Verma SS et al. | - | 2014 | - |
| On the performance of multiple imputation based on chained equations in tackling missing data of the African α3.7 -globin deletion in a malaria association study. | Sepúlveda N et al. | - | 2014 | - |
| Schizophrenia miR-137 locus risk genotype is associated with dorsolateral prefrontal cortex hyperactivation. | van Erp TG et al. | - | 2014 | - |
| The utility of low-density genotyping for imputation in the Thoroughbred horse. | Corbin LJ et al. | - | 2014 | - |
| A pharmacokinetic/pharmacodynamic model of tumor lysis syndrome in chronic lymphocytic leukemia patients treated with flavopiridol. | Ji J et al. | - | 2013 | - |
| Assets of imputation to ultra-high density for productive and functional traits. | Jiménez-Montero JA et al. | - | 2013 | - |
| Comparison of different imputation methods from low- to high-density panels using Chinese Holstein cattle. | Weng Z et al. | - | 2013 | - |
| Comparison of the performance of two commercial genome-wide association study genotyping platforms in Han Chinese samples. | Jiang L et al. | - | 2013 | - |
| Genome-wide investigation of gene-environment interactions in colorectal cancer. | Siegert S et al. | - | 2013 | - |
| Genotype imputation reference panel selection using maximal phylogenetic diversity. | Zhang P et al. | - | 2013 | - |
| Genotype imputation via matrix completion. | Chi EC et al. | - | 2013 | - |
| GStream: improving SNP and CNV coverage on genome-wide association studies. | Alonso A et al. | - | 2013 | - |
| Ischemic stroke is associated with the ABO locus: the EuroCLOT study. | Williams FM et al. | - | 2013 | - |
| Resequencing three candidate genes for major depressive disorder in a Dutch cohort. | Verbeek EC et al. | - | 2013 | - |
| Windfalls and pitfalls: Applications of population genetics to the search for disease genes. | Edge MD et al. | - | 2013 | - |
| 1000 Genomes-based imputation identifies novel and refined associations for the Wellcome Trust Case Control Consortium phase 1 Data. | Huang J et al. | - | 2012 | - |
| A fine-mapping study of 7 top scoring genes from a GWAS for major depressive disorder. | Verbeek EC et al. | - | 2012 | - |
| Assessment of genotype imputation performance using 1000 Genomes in African American studies. | Hancock DB et al. | - | 2012 | - |
| Association mapping and disease: evolutionary perspectives. | Besenbacher S et al. | - | 2012 | - |
| A ν-support vector regression based approach for predicting imputation quality. | Huang YH et al. | - | 2012 | - |
| Comparison of requirements and capabilities of major multipurpose software packages. | Igo RP et al. | - | 2012 | - |
| Comprehensive evaluation of imputation performance in African Americans. | Chanda P et al. | - | 2012 | - |
| Effect of genome-wide genotyping and reference panels on rare variants imputation. | Zheng HF et al. | - | 2012 | - |
| Efficiency and power as a function of sequence coverage, SNP array density, and imputation. | Flannick J et al. | - | 2012 | - |
| Evaluation of the imputation performance of the program IMPUTE in an admixed sample from Mexico City using several model designs. | Krithika S et al. | - | 2012 | - |
| Fine-mapping of a region of chromosome 5p15.33 (TERT-CLPTM1L) suggests a novel locus in TERT and a CLPTM1L haplotype are associated with glioma susceptibility in a Chinese population. | Zhao Y et al. | - | 2012 | - |
| Folic acid supplementation, MTHFR and MTRR polymorphisms, and the risk of childhood leukemia: the ESCALE study (SFCE). | Amigou A et al. | - | 2012 | - |
| Functional variant in the autophagy-related 5 gene promotor is associated with childhood asthma. | Martin LJ et al. | - | 2012 | - |
| Genome-wide association study identifies novel loci associated with circulating phospho- and sphingolipid concentrations. | Demirkan A et al. | - | 2012 | - |
| Genome-wide search for gene-gene interactions in colorectal cancer. | Jiao S et al. | - | 2012 | - |
| Genotype imputation for African Americans using data from HapMap phase II versus 1000 genomes projects. | Sung YJ et al. | - | 2012 | - |
| Genotype Imputation for Latinos Using the HapMap and 1000 Genomes Project Reference Panels. | Gao X et al. | - | 2012 | - |
| Merging pharmacometabolomics with pharmacogenomics using '1000 Genomes' single-nucleotide polymorphism imputation: selective serotonin reuptake inhibitor response pharmacogenomics. | Abo R et al. | - | 2012 | - |
| Methods for meta-analyses of genome-wide association studies: critical assessment of empirical evidence. | Gögele M et al. | - | 2012 | - |
| Performance of genotype imputations using data from the 1000 Genomes Project. | Sung YJ et al. | - | 2012 | - |
| Personal receptor repertoires: olfaction as a model. | Olender T et al. | - | 2012 | - |
| Population-based case-control association studies. | Hancock DB et al. | - | 2012 | - |
| TRM: a powerful two-stage machine learning approach for identifying SNP-SNP interactions. | Lin HY et al. | - | 2012 | - |
| An empirical evaluation of imputation accuracy for association statistics reveals increased type-I error rates in genome-wide associations. | Almeida MA et al. | - | 2011 | - |
| Efficient genomewide selection of PCA-correlated tSNPs for genotype imputation. | Javed A et al. | - | 2011 | - |
| Genetic risk profiles for depression and anxiety in adult and elderly cohorts. | Demirkan A et al. | - | 2011 | - |
| Genetics of infectious diseases: hidden etiologies and common pathways. | Orlova M et al. | - | 2011 | - |
| Genetics of vesicoureteral reflux. | Puri P et al. | - | 2011 | - |
| Genetic variants in LPL, OASL and TOMM40/APOE-C1-C2-C4 genes are associated with multiple cardiovascular-related traits. | Middelberg RP et al. | - | 2011 | - |
| Genome-wide association study for serum urate concentrations and gout among African Americans identifies genomic risk loci and a novel URAT1 loss-of-function allele. | Tin A et al. | - | 2011 | - |
| Identification of KIF3A as a novel candidate gene for childhood asthma using RNA expression and population allelic frequencies differences. | Kovacic MB et al. | - | 2011 | - |
| Imputation of genotypes from low- to high-density genotyping platforms and implications for genomic selection. | Berry DP et al. | - | 2011 | - |
| Imputation of low-frequency variants using the HapMap3 benefits from large, diverse reference sets. | Jostins L et al. | - | 2011 | - |
| Meta-analysis of genome-wide association for migraine in six population-based European cohorts. | Ligthart L et al. | - | 2011 | - |
| Molecular characterization of a long range haplotype affecting protein yield and mastitis susceptibility in Norwegian Red cattle. | Sodeland M et al. | - | 2011 | - |
| Novel loci for major depression identified by genome-wide association study of Sequenced Treatment Alternatives to Relieve Depression and meta-analysis of three studies. | Shyn SI et al. | - | 2011 | - |
| ParaHaplo 3.0: A program package for imputation and a haplotype-based whole-genome association study using hybrid parallel computing. | Misawa K et al. | - | 2011 | - |
| Pathway-based identification of SNPs predictive of survival. | Pang H et al. | - | 2011 | - |
| Practical Consideration of Genotype Imputation: Sample Size, Window Size, Reference Choice, and Untyped Rate. | Zhang B et al. | - | 2011 | - |
| Strategies for genetic model specification in the screening of genome-wide meta-analysis signals for further replication. | Pereira TV et al. | - | 2011 | - |
| The effect of genome-wide association scan quality control on imputation outcome for common variants. | Southam L et al. | - | 2011 | - |
| The effect of reference panels and software tools on genotype imputation. | Nho K et al. | - | 2011 | - |
| Using penalised logistic regression to fine map HLA variants for rheumatoid arthritis. | Vignal CM et al. | - | 2011 | - |
| A meta-analysis of genome-wide data from five European isolates reveals an association of COL22A1, SYT1, and GABRR2 with serum creatinine level. | Pattaro C et al. | - | 2010 | - |
| A new statistic to evaluate imputation reliability. | Lin P et al. | - | 2010 | - |
| APOE is not associated with Alzheimer disease: a cautionary tale of genotype imputation. | Beecham GW et al. | - | 2010 | - |
| Dealing with missing values in large-scale studies: microarray data imputation and beyond. | Aittokallio T | - | 2010 | - |
| Genome-wide approaches to schizophrenia. | Duan J et al. | - | 2010 | - |
| Genome-wide association study of PR interval. | Pfeufer A et al. | - | 2010 | - |
| Germline variation in apoptosis pathway genes and risk of non-Hodgkin's lymphoma. | Kelly JL et al. | - | 2010 | - |
| Identification of Diabetic Retinopathy Genes through a Genome-Wide Association Study among Mexican-Americans from Starr County, Texas. | Fu YP et al. | - | 2010 | - |
| Practical considerations for imputation of untyped markers in admixed populations. | Shriner D et al. | - | 2010 | - |
| Quality control for genome-wide association studies. | Weale ME | - | 2010 | - |
| The role of genetics in the etiology of schizophrenia. | Gejman PV et al. | - | 2010 | - |
| Utilizing genotype imputation for the augmentation of sequence data. | Fridley BL et al. | - | 2010 | - |
| Assessment of genotype imputation methods. | Biernacka JM et al. | - | 2009 | - |
| Current software for genotype imputation. | Ellinghaus D et al. | - | 2009 | - |
| Genome-wide association scan meta-analysis identifies three Loci influencing adiposity and fat distribution. | Lindgren CM et al. | - | 2009 | - |
| Genome-wide association studies--a summary for the clinical gastroenterologist. | Melum E et al. | - | 2009 | - |
| Genomic dissection of population substructure of Han Chinese and its implication in association studies. | Xu S et al. | - | 2009 | - |
| Replication in genome-wide association studies. | Kraft P et al. | - | 2009 | - |
| Single-nucleotide polymorphism bioinformatics: a comprehensive review of resources. | Johnson AD | - | 2009 | - |
| Single versus multiple imputation for genotypic data. | Fridley BL et al. | - | 2009 | - |
| The relationship between imputation error and statistical power in genetic association studies in diverse populations. | Huang L et al. | - | 2009 | - |
| Validating, augmenting and refining genome-wide association signals. | Ioannidis JP et al. | - | 2009 | - |