Algorithms for large-scale genotyping microarrays.
- Authors
- Liu, Wei-mn; Di, Xiaojun; Yang, Geoffrey; Matsuzaki, Hajime; Huang, Jing; Mei, Rui; Ryder, Thomas B; Webster, Teresa A; Dong, Shoulian; Liu, Guoying; Jones, Keith W; Kennedy, Giulia C; Kulp, David
- Year
- 2003
- Journal
- Bioinformatics (Oxford, England)
- PMID
- 14668223
- DOI
- 10.1093/bioinformatics/btg332
MOTIVATION: Analysis of many thousands of single nucleotide polymorphisms (SNPs) across whole genome is crucial to efficiently map disease genes and understanding susceptibility to diseases, drug efficacy and side effects for different populations and individuals. High density oligonucleotide microarrays provide the possibility for such analysis with reasonable cost. Such analysis requires accurate, reliable methods for feature extraction, classification, statistical modeling and filtering. RESULTS: We propose the modified partitioning around medoids as a classification method for relative allele signals. We use the average silhouette width, separation and other quantities as quality measures for genotyping classification. We form robust statistical models based on the classification results and use these models to make genotype calls and calculate quality measures of calls. We apply our algorithms to several different genotyping microarrays. We use reference types, informative Mendelian relationship in families, and leave-one-out cross validation to verify our results. The concordance rates with the single base extension reference types are 99.36% for the SNPs on autosomes and 99.64% for the SNPs on sex chromosomes. The concordance of the leave-one-out test is over 99.5% and is 99.9% higher for AA, AB and BB cells. We also provide a method to determine the gender of a sample based on the heterozygous call rate of SNPs on the X chromosome. See http://www.affymetrix.com for further information. The microarray data will also be available from the Affymetrix web site. AVAILABILITY: The algorithms will be available commercially in the Affymetrix software package.
No figures extracted from this document.
No chunks โ full text not yet ingested.
No entities extracted from this document yet.
No uploaded files.
No citations found.
In this knowledge base
| Title | Year | PMID |
|---|---|---|
| Description of the data from the Collaborative Study on the Genetics of Alcoholism (COGA) and single-nucleotide polymorphism genotyping for Genetic Analysis Workshop 14. | 2005 | 16451628 |
External
| Title | Authors | Journal | Year | Link |
|---|---|---|---|---|
| A Luminex-based single DNA fragment amplification assay as a practical tool for detecting and serotyping dengue virus. | Cabral-Castro MJ et al. | โ | 2016 | โ |
| Multicentric Genome-Wide Association Study for Primary Spontaneous Pneumothorax. | Sousa I et al. | โ | 2016 | โ |
| Detection of quantitative trait loci for mineral content of Nelore longissimus dorsi muscle. | Tizioto PC et al. | โ | 2015 | โ |
| A note on statistical method for genotype calling of high-throughput SNP arrays. | Yang J et al. | โ | 2013 | โ |
| Prohibitin (PHB) inhibits apoptosis in rat granulosa cells (GCs) through the extracellular signal-regulated kinase 1/2 (ERK1/2) and the Bcl family of proteins. | Chowdhury I et al. | โ | 2013 | โ |
| A mild form of Mucopolysaccharidosis IIIB diagnosed with targeted next-generation sequencing of linked genomic regions. | Selmer KK et al. | โ | 2012 | โ |
| A new genotype calling method for affymetrix SNP arrays. | Fu B et al. | โ | 2011 | โ |
| A review of software for microarray genotyping. | Lamy P et al. | โ | 2011 | โ |
| DNA microarray-based mutation discovery and genotyping. | Gresham D | โ | 2011 | โ |
| Identification of rare DNA variants in mitochondrial disorders with improved array-based sequencing. | Wang W et al. | โ | 2011 | โ |
| Investigation of parameters that affect the success rate of microarray-based allele-specific hybridization assays. | Poulsen L et al. | โ | 2011 | โ |
| ALCHEMY: a reliable method for automated SNP genotype calling for small batch sizes and highly homozygous populations. | Wright MH et al. | โ | 2010 | โ |
| Assessment of variability in GWAS with CRLMM genotyping algorithm on WTCCC coronary artery disease. | Zhang L et al. | โ | 2010 | โ |
| BRAF mutation is rare in advanced-stage low-grade ovarian serous carcinomas. | Wong KK et al. | โ | 2010 | โ |
| Integrated profiling reveals a global correlation between epigenetic and genetic alterations in mesothelioma. | Christensen BC et al. | โ | 2010 | โ |
| TumorBoost: normalization of allele-specific tumor copy numbers from a single pair of tumor-normal genotyping microarrays. | Bengtsson H et al. | โ | 2010 | โ |
| Upregulation of FOXM1 induces genomic instability in human epidermal keratinocytes. | Teh MT et al. | โ | 2010 | โ |
| Copy number variation has little impact on bead-array-based measures of DNA methylation. | Houseman EA et al. | โ | 2009 | โ |
| FOXM1 upregulation is an early event in human squamous cell carcinoma and it is enhanced by nicotine during malignant transformation. | Gemenetzidis E et al. | โ | 2009 | โ |
| Genome-wide association study to identify novel loci associated with therapy-related myeloid leukemia susceptibility. | Knight JA et al. | โ | 2009 | โ |
| Genome-wide linkage analysis with clustered SNP markers. | Selmer KK et al. | โ | 2009 | โ |
| Single nucleotide polymorphism arrays: a decade of biological, computational and technological advances. | LaFramboise T | โ | 2009 | โ |
| Uniparentalism in sporadic colorectal cancer is independent of imprint status, and coordinate for chromosomes 14 and 18. | Darbary HK et al. | โ | 2009 | โ |
| A dual-probe hybridization method for reducing variability in single nucleotide polymorphism analysis with oligonucleotide microarrays. | Yin BC et al. | โ | 2008 | โ |
| A new framework for the selection of tag SNPs by multimarker haplotypes. | Huang YT et al. | โ | 2008 | โ |
| Assessing batch effects of genotype calling algorithm BRLMM for the Affymetrix GeneChip Human Mapping 500 K array set using 270 HapMap samples. | Hong H et al. | โ | 2008 | โ |
| Bayesian Gaussian Mixture Models for High-Density Genotyping Arrays. | Sabatti C et al. | โ | 2008 | โ |
| Common statistical issues in genome-wide association studies: a review on power, data quality control, genotype calling and population structure. | Teo YY | โ | 2008 | โ |
| LINKGEN: a new algorithm to process data in genetic linkage studies. | Secolin R et al. | โ | 2008 | โ |
| Major copy proportion analysis of tumor samples using SNP arrays. | Li C et al. | โ | 2008 | โ |
| Molecular genetics of adult ADHD: converging evidence from genome-wide association and extended pedigree linkage studies. | Lesch KP et al. | โ | 2008 | โ |
| MPDA: microarray pooled DNA analyzer. | Yang HC et al. | โ | 2008 | โ |
| Reproducibility of Genotypes as Measured by the Affymetrix GeneChipยฎ 100K Human Mapping Array Set. | Fridley BL et al. | โ | 2008 | โ |
| Segmental uniparental disomy is a commonly acquired genetic event in relapsed acute myeloid leukemia. | Raghavan M et al. | โ | 2008 | โ |
| Smarter clustering methods for SNP genotype calling. | Lin Y et al. | โ | 2008 | โ |
| Validation and extension of an empirical Bayes method for SNP calling on Affymetrix microarrays. | Lin S et al. | โ | 2008 | โ |
| A multi-array multi-SNP genotyping algorithm for Affymetrix SNP microarrays. | Xiao Y et al. | โ | 2007 | โ |
| Association mapping using pooled DNA. | Yang HC et al. | โ | 2007 | โ |
| Automated SNP genotype clustering algorithm to improve data completeness in high-throughput SNP genotyping datasets from custom arrays. | Smith EM et al. | โ | 2007 | โ |
| Characteristics of gastrin controlled ECL cell specific gene expression. | Friis-Hansen L et al. | โ | 2007 | โ |
| Counting clusters using R-NN curves. | Guha R et al. | โ | 2007 | โ |
| Exploration, normalization, and genotype calls of high-density oligonucleotide SNP array data. | Carvalho B et al. | โ | 2007 | โ |
| Genome-wide, high-resolution detection of copy number, loss of heterozygosity, and genotypes from formalin-fixed, paraffin-embedded tumor tissue using microarrays. | Jacobs S et al. | โ | 2007 | โ |
| Genomic DNA pooling for whole-genome association scans in complex disease: empirical demonstration of efficacy in rheumatoid arthritis. | Steer S et al. | โ | 2007 | โ |
| Pharmacogenetics and pharmacogenomics of schizophrenia: a review of last decade of research. | Arranz MJ et al. | โ | 2007 | โ |
| Short oligonucleotide probes containing G-stacks display abnormal binding affinity on Affymetrix microarrays. | Wu C et al. | โ | 2007 | โ |
| SNiPer-HD: improved genotype calling accuracy by an expectation-maximization algorithm for high-density SNP arrays. | Hua J et al. | โ | 2007 | โ |
| AccuTyping: new algorithms for automated analysis of data from high-throughput genotyping with oligonucleotide microarrays. | Hu G et al. | โ | 2006 | โ |
| A genome-wide study of preferential amplification/hybridization in microarray-based pooled DNA experiments. | Yang HC et al. | โ | 2006 | โ |
| A genotype calling algorithm for affymetrix SNP arrays. | Rabbee N et al. | โ | 2006 | โ |
| Algorithm for automatic genotype calling of single nucleotide polymorphisms using the full course of TaqMan real-time data. | Callegaro A et al. | โ | 2006 | โ |
| Analysis and visualization of chromosomal abnormalities in SNP data with SNPscan. | Ting JC et al. | โ | 2006 | โ |
| A role for mitotic recombination in leukemogenesis. | Young BD et al. | โ | 2006 | โ |
| Genome-wide loss of heterozygosity and copy number alteration in esophageal squamous cell carcinoma using the Affymetrix GeneChip Mapping 10 K array. | Hu N et al. | โ | 2006 | โ |
| Genotyping pooled DNA using 100K SNP microarrays: a step towards genomewide association scans. | Meaburn E et al. | โ | 2006 | โ |
| Inferring loss-of-heterozygosity from unpaired tumors using high-density oligonucleotide SNP arrays. | Beroukhim R et al. | โ | 2006 | โ |
| MACGT: multi-dimensional automated clustering genotyping tool for analysis of microarray-based mini-sequencing data. | Walley DC et al. | โ | 2006 | โ |
| Pooled DNA genotyping on Affymetrix SNP genotyping arrays. | Kirov G et al. | โ | 2006 | โ |
| Analysis of single nucleotide polymorphisms in the promoter region of interleukin-10 by denaturing high-performance liquid chromatography. | Guzowski D et al. | โ | 2005 | โ |
| Applications of whole-genome high-density SNP genotyping. | Craig DW et al. | โ | 2005 | โ |
| Description of the data from the Collaborative Study on the Genetics of Alcoholism (COGA) and single-nucleotide polymorphism genotyping for Genetic Analysis Workshop 14. | Edenberg HJ et al. | โ | 2005 | โ |
| Dynamic model based algorithms for screening and genotyping over 100 K SNPs on oligonucleotide microarrays. | Di X et al. | โ | 2005 | โ |
| Genome-wide association study in esophageal cancer using GeneChip mapping 10K array. | Hu N et al. | โ | 2005 | โ |
| Genomic profiling maps loss of heterozygosity and defines the timing and stage dependence of epigenetic and genetic events in Wilms' tumors. | Yuan E et al. | โ | 2005 | โ |
| Genotyping DNA pools on microarrays: tackling the QTL problem of large samples and large numbers of SNPs. | Meaburn E et al. | โ | 2005 | โ |
| Silhouette scores for assessment of SNP genotype clusters. | Lovmar L et al. | โ | 2005 | โ |
| SNiPer: improved SNP genotype calling for Affymetrix 10K GeneChip microarray data. | Huentelman MJ et al. | โ | 2005 | โ |
| The Autism Genome Project: goals and strategies. | Hu-Lince D et al. | โ | 2005 | โ |
| Toward genome-wide SNP genotyping. | Syvรคnen AC | โ | 2005 | โ |
| Allelic imbalance analysis by high-density single-nucleotide polymorphic allele (SNP) array with whole genome amplified DNA. | Wong KK et al. | โ | 2004 | โ |
| Genotyping over 100,000 SNPs on a pair of oligonucleotide arrays. | Matsuzaki H et al. | โ | 2004 | โ |
| MARA: a novel approach for highly multiplexed locus-specific SNP genotyping using high-density DNA oligonucleotide arrays. | Shapero MH et al. | โ | 2004 | โ |
| Parallel genotyping of over 10,000 SNPs using a one-primer assay on a high-density oligonucleotide array. | Matsuzaki H et al. | โ | 2004 | โ |
| Whole genome DNA copy number changes identified by high density oligonucleotide arrays. | Huang J et al. | โ | 2004 | โ |
| Large-scale genotyping of complex DNA. | Kennedy GC et al. | โ | 2003 | โ |