On correcting the overestimation of the permutation-based false discovery rate estimator.
- Authors
- Jiao, Shuo; Zhang, Shunpu
- Year
- 2008
- Journal
- Bioinformatics (Oxford, England)
- PMID
- 18573796
- DOI
- 10.1093/bioinformatics/btn310
- PMCID
- PMC2638866
MOTIVATION: Recent attempts to account for multiple testing in the analysis of microarray data have focused on controlling the false discovery rate (FDR), which is defined as the expected percentage of the number of false positive genes among the claimed significant genes. As a consequence, the accuracy of the FDR estimators will be important for correctly controlling FDR. Xie et al. found that the standard permutation method of estimating FDR is biased and proposed to delete the predicted differentially expressed (DE) genes in the estimation of FDR for one-sample comparison. However, we notice that the formula of the FDR used in their paper is incorrect. This makes the comparison results reported in their paper unconvincing. Other problems with their method include the biased estimation of FDR caused by over- or under-deletion of DE genes in the estimation of FDR and by the implicit use of an unreasonable estimator of the true proportion of equivalently expressed (EE) genes. Due to the great importance of accurate FDR estimation in microarray data analysis, it is necessary to point out such problems and propose improved methods. RESULTS: Our results confirm that the standard permutation method overestimates the FDR. With the correct FDR formula, we show the method of Xie et al. always gives biased estimation of FDR: it overestimates when the number of claimed significant genes is small, and underestimates when the number of claimed significant genes is large. To overcome these problems, we propose two modifications. The simulation results show that our estimator gives more accurate estimation.
No figures extracted from this document.
No chunks β full text not yet ingested.
No entities extracted from this document yet.
No uploaded files.
In this knowledge base
| Title | Year | PMID |
|---|---|---|
| Genomic regions identified by overlapping clusters of nominally-positive SNPs from genome-wide studies of alcohol and illegal substance dependence. | 2011 | 21818250 |
External
| Title | Authors | Journal | Year | Link |
|---|---|---|---|---|
| Bioinformatics identification of lncRNA biomarkers associated with the progression of esophageal squamous cell carcinoma. | Yu J et al. | β | 2019 | β |
| MAP: model-based analysis of proteomic data to detect proteins with significant abundance changes. | Li M et al. | β | 2019 | β |
| Analysis of phosphoproteomics data. | Schaab C | β | 2011 | β |
| Bayesian hierarchical modeling and selection of differentially expressed genes for the EST data. | Yu F et al. | β | 2011 | β |
| Genomic regions identified by overlapping clusters of nominally-positive SNPs from genome-wide studies of alcohol and illegal substance dependence. | Johnson C et al. | β | 2011 | β |
| Comments on 'On correcting the overestimation of the permutation-based false discovery rate estimator'. | Xie Y | β | 2008 | β |