Construction of the model for the Genetic Analysis Workshop 14 simulated data: genotype-phenotype relationships, gene interaction, linkage, association, disequilibrium, and ascertainment effects for a complex phenotype.
- Authors
- Greenberg, David A; Zhang, Junying; Shmulewitz, Dvora; Strug, Lisa J; Zimmerman, Regina; Singh, Veena; Marathe, Sudhir
- Year
- 2005
- Journal
- BMC genetics
- PMID
- 16451639
- DOI
- 10.1186/1471-2156-6-S1-S3
- PMCID
- PMC1866756
The Genetic Analysis Workshop 14 simulated dataset was designed 1) To test the ability to find genes related to a complex disease (such as alcoholism). Such a disease may be given a variety of definitions by different investigators, have associated endophenotypes that are common in the general population, and is likely to be not one disease but a heterogeneous collection of clinically similar, but genetically distinct, entities. 2) To observe the effect on genetic analysis and gene discovery of a complex set of gene x gene interactions. 3) To allow comparison of microsatellite vs. large-scale single-nucleotide polymorphism (SNP) data. 4) To allow testing of association to identify the disease gene and the effect of moderate marker x marker linkage disequilibrium. 5) To observe the effect of different ascertainment/disease definition schemes on the analysis. Data was distributed in two forms. Data distributed to participants contained about 1,000 SNPs and 400 microsatellite markers. Internet-obtainable data consisted of a finer 10,000 SNP map, which also contained data on controls. While disease characteristics and parameters were constant, four "studies" used varying ascertainment schemes based on differing beliefs about disease characteristics. One of the studies contained multiplex two- and three-generation pedigrees with at least four affected members. The simulated disease was a psychiatric condition with many associated behaviors (endophenotypes), almost all of which were genetic in origin. The underlying disease model contained four major genes and two modifier genes. The four major genes interacted with each other to produce three different phenotypes, which were themselves heterogeneous. The population parameters were calibrated so that the major genes could be discovered by linkage analysis in most datasets. The association evidence was more difficult to calibrate but was designed to find statistically significant association in 50% of datasets. We also simulated some marker x marker linkage disequilibrium around some of the genes and also in areas without disease genes. We tried two different methods to simulate the linkage disequilibrium.
No figures extracted from this document.
| # | Section | Preview |
|---|---|---|
| 20 | Background — Details of data simulation — 2. Areas with LD/unique characteristics | There is LD between the SNPs of the detailed map (markers named beginning with the letter "B"… |
| 21 | Background — Details of data simulation — 2. Areas with LD/unique characteristics | To generate the LD, we generated 13- or 15-SNP haplotypes based on two-SNP haplotype probabilities… |
| 22 | Background — Details of data simulation — 2. Areas with LD/unique characteristics | 1. Haplotypes were sorted, treating the haplotypes as a character string. A group of haplotypes that… |
| 23 | Background — Details of data simulation — 2. Areas with LD/unique characteristics | 2. After generating 2,000 haplotypes, the haplotypes were sorted by frequency. Then, starting with… |
| 24 | Background — Details of data simulation — 2. Areas with LD/unique characteristics | Region 1 was created without LD, that is, the probabilities of all haplotypes were randomly… |
| 25 | Background — Details of data simulation — 2. Areas with LD/unique characteristics | The locations and method used to generate the haplotypes, and disease-carrying haplotype frequencies… |
| 26 | Background — Details of data simulation — 2. Areas with LD/unique characteristics | 1. Locus D1: SNP loci B01T0554-B01T0567. No LD. There are a total of 500 haplotypes, 30 of which… |
| 27 | Background — Details of data simulation — 2. Areas with LD/unique characteristics | 2. Locus D2: SNP loci B03T3056-B03T3068. There is LD in this region. The disease-carrying haplotypes… |
| 28 | Background — Details of data simulation — 2. Areas with LD/unique characteristics | 3. Locus D3: B05T4135-B05T4142: There is LD in this region. Disease-carrying haplotypes chosen by… |
| 29 | Background — Details of data simulation — 2. Areas with LD/unique characteristics | 4. Locus D4: B09T8331-B09T8342: LD is present. 18/200 haplotypes carry disease alleles. |
| 30 | Background — Details of data simulation — 2. Areas with LD/unique characteristics | In addition, the following non-disease-related regions were generated with LD: loci… |
| 31 | Background — Details of data simulation — 3. Phenotypes | There are three underlying latent traits designated P1, P2, and P3 (see Figure 1), the genetic… |
| 32 | Background — Details of data simulation — 3. Phenotypes | The following description of the simulated disease was distributed with the original data. |
| 33 | Background — Description of clinical characteristics of KPD and how the family data were collected and classified | KPD (DSM 301.98.6), first described by Matheson (1959) and sometimes called… |
| 34 | Background — Description of clinical characteristics of KPD and how the family data were collected and classified | The condition is thought to be genetic in origin, possibly exacerbated by prevailing social… |
| 35 | Background — Description of clinical characteristics of KPD and how the family data were collected and classified | The frequency of KPD is difficult to estimate, given the ambiguous nature of its symptoms. It has… |
| 36 | Background — Description of clinical characteristics of KPD and how the family data were collected and classified | Studies have demonstrated a number of characteristics (phenotypes) which, while apparently… |
| 37 | Background — Description of clinical characteristics of KPD and how the family data were collected and classified | 1. One hallmark of KPD patients is their constant concern with their emotional state and this… |
| 38 | Background — Description of clinical characteristics of KPD and how the family data were collected and classified | 2. Another manifestation of the "communal" pathology of KPD sufferers is that an unusually high… |
| 39 | Background — Description of clinical characteristics of KPD and how the family data were collected and classified | 3. Any humorous stories (jokes) induce strong psychological discomfort and feelings of being… |
No entities extracted from this document yet.
No uploaded files.
| Citation | PMID | DOI | Status |
|---|---|---|---|
| Matheson, R, Collected Stories, 1959, The creeping terror | — | — | — |
In this knowledge base
| Title | Year | PMID |
|---|---|---|
| Genetic Analysis Workshop 14: microsatellite and single-nucleotide polymorphism marker loci for genome-wide scans. | 2005 | 16451554 |
External
| Title | Authors | Journal | Year | Link |
|---|---|---|---|---|
| Accurate phenotyping: Reconciling approaches through Bayesian model averaging. | Chen CC et al. | — | 2017 | → |
| Computer simulation is an undervalued tool for genetic analysis: a historical view and presentation of SHIMSHON--a Web-based genetic simulation package. | Greenberg DA | — | 2011 | → |
| A novel analytical framework for dissecting the genetic architecture of behavioral symptoms in neuropsychiatric disorders. | Deo AJ et al. | — | 2010 | → |
| Computation of the posterior probability of linkage using 'high effect' genetic model priors. | Logue MW et al. | — | 2008 | → |
| A power study of bivariate LOD score analysis of a complex trait and fear/discomfort with strangers. | Ji F et al. | — | 2005 | → |
| Application of family-based association testing to assess the genotype-phenotype association involved in complex traits using single-nucleotide polymorphisms. | Wang MH et al. | — | 2005 | → |
| Assessment and implications of linkage disequilibrium in genome-wide single-nucleotide polymorphism and microsatellite panels. | Goode EL et al. | — | 2005 | → |
| Bias of allele-sharing linkage statistics in the presence of intermarker linkage disequilibrium. | Goode EL et al. | — | 2005 | → |
| Comparison of single-nucleotide polymorphisms and microsatellite markers for linkage analysis in the COGA and simulated data sets for Genetic Analysis Workshop 14: Presentation Groups 1, 2, and 3. | Wilcox MA et al. | — | 2005 | → |
| Dissection of heterogeneous phenotypes for quantitative trait mapping. | Bickeböller H et al. | — | 2005 | → |
| Genetic Analysis Workshop 14: microsatellite and single-nucleotide polymorphism marker loci for genome-wide scans. | Bailey-Wilson JE et al. | — | 2005 | → |
| Methods for detecting gene x gene interaction in multiplex extended pedigrees. | Brock GN et al. | — | 2005 | → |
| Summary of contributions to GAW Group 5: linkage mapping methods, Problem 2. | Cordell HJ | — | 2005 | → |