Case-control association testing in the presence of unknown relationships.
- Authors
- Choi, Yoonha; Wijsman, Ellen M; Weir, Bruce S
- Year
- 2009
- Journal
- Genetic epidemiology
- PMID
- 19333967
- DOI
- 10.1002/gepi.20418
- PMCID
- PMC2790016
Genome-wide association studies result in inflated false-positive results when unrecognized cryptic relatedness exists. A number of methods have been proposed for testing association between markers and disease with a correction for known pedigree-based relationships. However, in most case-control studies, relationships are generally unknown, yet the design is predicated on the assumption of at least ancestral relatedness among cases. Here, we focus on adjusting cryptic relatedness when the genealogy of the sample is unknown, particularly in the context of samples from isolated populations where cryptic relatedness may be problematic. We estimate cryptic relatedness using maximum-likelihood methods and use a corrected chi(2) test with estimated kinship coefficients for testing in the context of unknown cryptic relatedness. Estimated kinship coefficients characterize precisely the relatedness between truly related people, but are biased for unrelated pairs. The proposed test substantially reduces spurious positive results, producing a uniform null distribution of P-values. Especially with missing pedigree information, estimated kinship coefficients can still be used to correct non-independence among individuals. The corrected test was applied to real data sets from genetic isolates and created a distribution of P-value that was close to uniform. Thus, the proposed test corrects the non-uniform distribution of P-values obtained with the uncorrected test and illustrates the advantage of the approach on real data.
Box plots of estimated kinship coefficients. A: the simulated sample of Scenario I using 50 and 400 microsatellite markers. B: CEPH families using 500, 5,000 and 16,977 SNP markers.
Quantile-quantile plots of Ο2 test and corrected Ο2 tests for 50 markers (dashed line) and 400 markers (solid line) in the simulated sample of Scenario I. A: Classical Ο2 test. B: Corrected Ο2 test with actual kinships. C: Corrected Ο2 test using estimated kinships. D: Corrected Ο2 test using pedigree-based kinships.
Quantile-quantile plots of Ο2 test and corrected Ο2 tests in the simulated sample of Scenario II.
Comparison of kinship coefficients: actual kinships vs. estimated, pedigree-based and posterior kinships in the simulated sample of Scenario III.
Estimated k-coefficients of pairs in (A) Guam and (B) Kosrae samples. Cumulative distribution of estimated kinship coefficients of cases and controls in the (C) Guam and (D) Kosrae data sets. Quantile-quantile plot of p-values of Ο2 test and corrected Ο2 test in (E) Guam and (F) Kosrae samples. In the Kosrae study, Ο is the pedigree-based kinship coefficient and ΟΜ is the estimated kinship coefficient.
No entities extracted from this document yet.
No uploaded files.
In this knowledge base
External
| Title | Authors | Journal | Year | Link |
|---|---|---|---|---|
| Assessing the genetic diversity of Ethiopian indigenous goat ecotypes at the hemoglobin locus and its associations with morphometric traits. | Tilahun K et al. | β | 2025 | β |
| Genomics-Driven Monitoring of Fraxinus latifolia (Oregon Ash) to Inform Conservation and EAB-Resistance Breeding. | Melton AE et al. | β | 2025 | β |
| Germline predisposition in multiple myeloma. | Martins Rodrigues F et al. | β | 2025 | β |
| Precise estimation of in-depth relatedness in biobank-scale datasets using deepKin. | Zhang QX et al. | β | 2025 | β |
| Construction of relatedness matrices in autopolyploid populations using low-depth high-throughput sequencing data. | Bilton TP et al. | β | 2024 | β |
| Detecting inbreeding depression in structured populations. | Lavanchy E et al. | β | 2024 | β |
| A novel rare variants association test for binary traits in family-based designs via copulas. | Dossa HRG et al. | β | 2023 | β |
| Dense residential areas promote gene flow in dengue vector mosquito <i>Aedes albopictus</i>. | Yeo H et al. | β | 2023 | β |
| An unbiased kinship estimation method for genetic data analysis. | Jiang W et al. | β | 2022 | β |
| Privacy-aware estimation of relatedness in admixed populations. | Wang S et al. | β | 2022 | β |
| To Modify or Not to Modify: Allele-Specific Effects of 3'UTR-<i>KCNQ1</i> Single Nucleotide Polymorphisms on Clinical Phenotype in a Long QT 1 Founder Population Segregating a Dominant-Negative Mutation. | Winbo A et al. | β | 2022 | β |
| Correcting statistical bias in correlation-based kinship estimators | Jiang W et al. | β | 2021 | β |
| Estimating FST and kinship for arbitrary population structures. | Ochoa A et al. | β | 2021 | β |
| Fine-scale population structure and demographic history of British Pakistanis. | Arciero E et al. | β | 2021 | β |
| Accounting for Group-Specific Allele Effects and Admixture in Genomic Predictions: Theory and Experimental Evaluation in Maize. | Rio S et al. | β | 2020 | β |
| Beyond broad strokes: sociocultural insights from the study of ancient genomes. | Racimo F et al. | β | 2020 | β |
| Historic and modern genomes unveil a domestic introgression gradient in a wild red junglefowl population. | Wu MY et al. | β | 2020 | β |
| Improving predictive models for Alzheimer's disease using GWAS data by incorporating misclassified samples modeling. | Romero-Rosales BL et al. | β | 2020 | β |
| Association score testing for rare variants and binary traits in family data with shared controls. | Saad M et al. | β | 2019 | β |
| Principals about principal components in statistical genetics. | Abegaz F et al. | β | 2019 | β |
| Human activities and landscape features interact to closely define the distribution and dispersal of an urban commensal. | Tang Q et al. | β | 2018 | β |
| Urban landscape genomics identifies fine-scale gene flow patterns in an avian invasive. | Low GW et al. | β | 2018 | β |
| Design Considerations for Genetic Linkage and Association Studies. | Nsengimana J et al. | β | 2017 | β |
| Efficient Estimation of Realized Kinship from Single Nucleotide Polymorphism Genotypes. | Wang B et al. | β | 2017 | β |
| Estimation of kinship coefficient in structured and admixed populations using sparse sequencing data. | Dou J et al. | β | 2017 | β |
| Genetic Diversity and Population Structure of Ethiopian Sheep Populations Revealed by High-Density SNP Markers. | Edea Z et al. | β | 2017 | β |
| Identification of genetic outliers due to sub-structure and cryptic relationships. | Schlauch D et al. | β | 2017 | β |
| Quickly identifying identical and closely related subjects in large databases using genotype data. | Jin Y et al. | β | 2017 | β |
| Estimating relationships between phenotypes and subjects drawn from admixed families. | Blue EM et al. | β | 2016 | β |
| Inference of kinship using spatial distributions of SNPs for genome-wide association studies. | Lee H et al. | β | 2016 | β |
| Model-free Estimation of Recent Genetic Relatedness. | Conomos MP et al. | β | 2016 | β |
| Multipoint genome-wide linkage scan for nonword repetition in a multigenerational family further supports chromosome 13q as a locus for verbal trait disorders. | Truong DT et al. | β | 2016 | β |
| Observed and expected frequencies of structural hemoglobin variants in newborn screening surveys in Africa and the Middle East: deviations from Hardy-Weinberg equilibrium. | Piel FB et al. | β | 2016 | β |
| On the use of dense SNP marker data for the identification of distant relative pairs. | Sun M et al. | β | 2016 | β |
| NgsRelate: a software tool for estimating pairwise relatedness from next-generation sequencing data. | Korneliussen TS et al. | β | 2015 | β |
| Parente2: a fast and accurate method for detecting identity by descent. | Rodriguez JM et al. | β | 2015 | β |
| PBAP: a pipeline for file processing and quality control of pedigree data with dense genetic markers. | Nato AQ et al. | β | 2015 | β |
| A statistical framework to guide sequencing choices in pedigrees. | Cheung CY et al. | β | 2014 | β |
| A unified GMDR method for detecting gene-gene interactions in family and unrelated samples with application to nicotine dependence. | Chen GB et al. | β | 2014 | β |
| Improved maximum likelihood reconstruction of complex multi-generational pedigrees. | Sheehan NA et al. | β | 2014 | β |
| Power of family-based association designs to detect rare variants in large pedigrees using imputed genotypes. | Saad M et al. | β | 2014 | β |
| CrypticIBDcheck: an R package for checking cryptic relatedness in nominally unrelated individuals. | Nembot-Simo A et al. | β | 2013 | β |
| Genetic and neurophysiological correlates of the age of onset of alcohol use disorders in adolescents and young adults. | Chorlian DB et al. | β | 2013 | β |
| Identity by descent: variation in meiosis, across genomes, and in populations. | Thompson EA | β | 2013 | β |
| Maximum likelihood pedigree reconstruction using integer linear programming. | Cussens J et al. | β | 2013 | β |
| REFINING GENETICALLY INFERRED RELATIONSHIPS USING TREELET COVARIANCE SMOOTHING. | Crossett A et al. | β | 2013 | β |
| A high-performance computing toolset for relatedness and principal component analysis of SNP data. | Zheng X et al. | β | 2012 | β |
| Design considerations for genetic linkage and association studies. | Nsengimana J et al. | β | 2012 | β |
| Estimating kinship in admixed populations. | Thornton T et al. | β | 2012 | β |
| Family-based association studies for next-generation sequencing. | Zhu Y et al. | β | 2012 | β |
| Genome-wide association study of N370S homozygous Gaucher disease reveals the candidacy of CLN8 gene as a genetic modifier contributing to extreme phenotypic variation. | Zhang CK et al. | β | 2012 | β |
| Inferring coancestry in population samples in the presence of linkage disequilibrium. | Brown MD et al. | β | 2012 | β |
| XM: association testing on the X-chromosome in case-control samples with related individuals. | Thornton T et al. | β | 2012 | β |
| A fast, powerful method for detecting identity by descent. | Browning BL et al. | β | 2011 | β |
| Effective sample size: Quick estimation of the effect of related samples in genetic case-control association analyses. | Yang Y et al. | β | 2011 | β |
| Genome-wide association of familial late-onset Alzheimer's disease replicates BIN1 and CLU and nominates CUGBP2 in interaction with APOE. | Wijsman EM et al. | β | 2011 | β |
| Linkage analysis without defined pedigrees. | Day-Williams AG et al. | β | 2011 | β |
| Population-genetic comparison of the Sorbian isolate population in Germany with the German KORA population using genome-wide SNP arrays. | Gross A et al. | β | 2011 | β |
| Variation in actual relationship as a consequence of Mendelian sampling and linkage. | Hill WG et al. | β | 2011 | β |
| Characterizing allelic association in the genome era. | Weir BS et al. | β | 2010 | β |
| Exploring genetic susceptibility to cancer in diverse populations. | Haiman CA et al. | β | 2010 | β |
| ROADTRIPS: case-control association testing with partially or completely unknown population and pedigree structure. | Thornton T et al. | β | 2010 | β |
| Robust relationship inference in genome-wide association studies. | Manichaikul A et al. | β | 2010 | β |
| Statistical genetic issues for genome-wide association studies. | Weir BS | β | 2010 | β |
| The potential for enhancing the power of genetic association studies in African Americans through the reuse of existing genotype data. | Chen GK et al. | β | 2010 | β |
| Tuberculosis case-contact research in endemic tropical settings: design, conduct, and relevance to other infectious diseases. | Hill PC et al. | β | 2010 | β |
| Variance component model to account for sample structure in genome-wide association studies. | Kang HM et al. | β | 2010 | β |
| Combining information from linkage and association methods. | Marchani EE et al. | β | 2009 | β |
| Contrasting identity-by-descent estimators, association studies, and linkage analyses using the Framingham Heart Study data. | Marchani EE et al. | β | 2009 | β |
| Identification of novel susceptibility loci for Guam neurodegenerative disease: challenges of genome scans in genetic isolates. | Sieh W et al. | β | 2009 | β |