Classification and selection of biomarkers in genomic data using LASSO.
- Authors
- Ghosh, Debashis; Chinnaiyan, Arul M
- Year
- 2005
- Journal
- Journal of biomedicine & biotechnology
- PMID
- 16046820
- DOI
- 10.1155/JBB.2005.147
- PMCID
- PMC1184048
High-throughput gene expression technologies such as microarrays have been utilized in a variety of scientific applications. Most of the work has been done on assessing univariate associations between gene expression profiles with clinical outcome (variable selection) or on developing classification procedures with gene expression data (supervised learning). We consider a hybrid variable selection/classification approach that is based on linear combinations of the gene expression profiles that maximize an accuracy measure summarized using the receiver operating characteristic curve. Under a specific probability model, this leads to the consideration of linear discriminant functions. We incorporate an automated variable selection approach using LASSO. An equivalence between LASSO estimation with support vector machines allows for model fitting using standard software. We apply the proposed method to simulated data as well as data from a recently published prostate cancer study.
No figures extracted from this document.
No entities extracted from this document yet.
No uploaded files.
In this knowledge base
| Title | Year | PMID |
|---|---|---|
| Predicting alcohol use disorder remission: a longitudinal multimodal multi-featured machine learning approach. | 2021 | 33723218 |
External
| Title | Authors | Journal | Year | Link |
|---|---|---|---|---|
| A hybrid feature extraction framework combining PCA and mutual information for gene expression based lung cancer classification. | Shah SNA et al. | β | 2026 | β |
| Construction of epilepsy diagnosis model based on cell senescence-related genes and its potential mechanism. | Gong X et al. | β | 2025 | β |
| Identification of protein biomarkers to differentiate between gram-negative and gram-positive infections in adults suspected of sepsis. | Irani Shemirani M et al. | β | 2025 | β |
| Identifying potential biomarkers for type 2 diabetes in the adipose tissue of older adults via multiple machine learning algorithms. | Yu YS et al. | β | 2025 | β |
| Integrative multi-omics identifies S100A8/IGFBP5/CTSK/S100P as dual diagnostic biomarkers and therapeutic targets in Crohn's disease: from computational discovery to preclinical validation. | Wu J et al. | β | 2025 | β |
| miRNA panel from HER2+ and CD24+ plasma extracellular vesicle subpopulations as biomarkers of early-stage breast cancer. | Spychalski GB et al. | β | 2025 | β |
| $$ {\ell}_1 $$ -Penalized Multinomial Regression: Estimation, Inference, and Prediction, With an Application to Risk Factor Identification for Different Dementia Subtypes. | Tian Y et al. | β | 2024 | β |
| Development and evaluation of a chronic kidney disease risk prediction model using random forest. | Mendapara K | β | 2024 | β |
| Exploring combinations of dimensionality reduction, transfer learning, and regularization methods for predicting binary phenotypes with transcriptomic data. | Oshternian SR et al. | β | 2024 | β |
| Classification of COVID19 Patients Using Robust Logistic Regression. | Ghosh A et al. | β | 2022 | β |
| Development and Internal Validation of a Prognostic Model of the Probability of Death or Lung Transplantation Within 2 Years for Patients With Cystic Fibrosis and FEV<sub>1</sub>Β β€ 50%Β Predicted. | Ramos KJ et al. | β | 2022 | β |
| Identification of molecular subtyping system and four-gene prognostic signature with immune-related genes for uveal melanoma. | Xia F et al. | β | 2022 | β |
| Immunoarray Measurements of Parathyroid Hormone-Related Peptides Combined with Other Biomarkers to Diagnose Aggressive Prostate Cancer. | Dhanapala L et al. | β | 2022 | β |
| LncRNA Biomarkers of Inflammation and Cancer. | Reggiardo RE et al. | β | 2022 | β |
| Gene Correlation Guided Gene Selection for Microarray Data Classification. | Yang D et al. | β | 2021 | β |
| Multi-parametric MRI phenotype with trustworthy machine learning for differentiating CNS demyelinating diseases. | Huang J et al. | β | 2021 | β |
| Predicting alcohol use disorder remission: a longitudinal multimodal multi-featured machine learning approach. | Kinreich S et al. | β | 2021 | β |
| Predicting risk for Alcohol Use Disorder using longitudinal data with multimodal biomarkers and family history: a machine learning study. | Kinreich S et al. | β | 2021 | β |
| Sign-based Shrinkage Based on an Asymmetric LASSO Penalty. | Kawaguchi ES et al. | β | 2021 | β |
| Sparse Regression in Cancer Genomics: Comparing Variable Selection and Predictions in Real World Data. | O'Shea RJ et al. | β | 2021 | β |
| Hellinger distance-based stable sparse feature selection for high-dimensional class-imbalanced data. | Fu GH et al. | β | 2020 | β |
| Liquid Chromatography-Mass Spectrometry-Based Nontargeted Metabolomics Predicts Prognosis of Hepatocellular Carcinoma after Curative Resection. | Wang Q et al. | β | 2020 | β |
| Metabolic Fingerprinting on Synthetic Alloys for Medulloblastoma Diagnosis and Radiotherapy Evaluation. | Cao J et al. | β | 2020 | β |
| A biological function based biomarker panel optimization process. | Lee MY et al. | β | 2019 | β |
| Blind exploration of the unreferenced transcriptome reveals novel RNAs for prostate cancer diagnosis | Pinskaya M et al. | β | 2019 | β |
| Identification of a Five-CpG Signature with Diagnostic Value in Thyroid Cancer. | Jia X et al. | β | 2019 | β |
| Penalized integrative semiparametric interaction analysis for multiple genetic datasets. | Li Y et al. | β | 2019 | β |
| Predicting drug-target interactions using Lasso with random forest based on evolutionary information and chemical structure. | Shi H et al. | β | 2019 | β |
| Reference-free transcriptome exploration reveals novel RNAs for prostate cancer diagnosis. | Pinskaya M et al. | β | 2019 | β |
| Validation of miRNAs as Breast Cancer Biomarkers with a Machine Learning Approach. | Rehman O et al. | β | 2019 | β |
| Collective feature selection to identify crucial epistatic variants. | Verma SS et al. | β | 2018 | β |
| Gene selection for microarray data classification via subspace learning and manifold regularization. | Tang C et al. | β | 2018 | β |
| Improving stability of prediction models based on correlated omics data by using network approaches. | Tissier R et al. | β | 2018 | β |
| Systematic Analysis and Biomarker Study for Alzheimer's Disease. | Li X et al. | β | 2018 | β |
| Variable selection in omics data: A practical evaluation of small sample sizes. | Kirpich A et al. | β | 2018 | β |
| Diagnosis of major depressive disorder by combining multimodal information from heart rate dynamics and serum proteomics using machine-learning algorithm. | Kim EY et al. | β | 2017 | β |
| Interaction-Based Feature Selection for Uncovering Cancer Driver Genes Through Copy Number-Driven Expression Level. | Park H et al. | β | 2017 | β |
| Maxdenominator Reweighted Sparse Representation for Tumor Classification. | Li W et al. | β | 2017 | β |
| Tracing the breeding farm of domesticated pig using feature selection (Sus scrofa). | Kwon T et al. | β | 2017 | β |
| Folded concave penalized learning in identifying multimodal MRI marker for Parkinson's disease. | Liu H et al. | β | 2016 | β |
| Identification of an Immune-Neuroendocrine Biomarker Panel for Detection of Depression: A Joint Effects Statistical Approach. | Chan MK et al. | β | 2016 | β |
| A model for predicting clinical outcome in patients with human papillomavirus-positive tonsillar and base of tongue cancer. | Tertipis N et al. | β | 2015 | β |
| Constraint programming based biomarker optimization. | Zhou M et al. | β | 2015 | β |
| Development of a blood-based molecular biomarker test for identification of schizophrenia before disease onset. | Chan MK et al. | β | 2015 | β |
| Gene Selection Integrated with Biological Knowledge for Plant Stress Response Using Neighborhood System and Rough Set Theory. | Meng J et al. | β | 2015 | β |
| AucPR: an AUC-based approach using penalized regression for disease prediction with high-dimensional omics data. | Yu W et al. | β | 2014 | β |
| Classification techniques in analyzing surgical outcomes data. | Fabri PJ | β | 2014 | β |
| Identifying informative imaging biomarkers via tree structured sparse learning for AD diagnosis. | Liu M et al. | β | 2014 | β |
| Multiple instance learning for classification of dementia in brain MRI. | Tong T et al. | β | 2014 | β |
| Sparse representation for tumor classification based on feature extraction using latent low-rank representation. | Gan B et al. | β | 2014 | β |
| A mass spectrometry-based plasma protein panel targeting the tumor microenvironment in patients with breast cancer. | Cohen A et al. | β | 2013 | β |
| Complex biomarker discovery in neuroimaging data: Finding a needle in a haystack. | Atluri G et al. | β | 2013 | β |
| Biomarker identification and cancer classification based on microarray data using Laplace naive Bayes model with mean shrinkage. | Wu MY et al. | β | 2012 | β |
| Combining multiple approaches for gene microarray classification. | Nanni L et al. | β | 2012 | β |
| Predicting the lethal phenotype of the knockout mouse by integrating comprehensive genomic data. | Yuan Y et al. | β | 2012 | β |
| Tree-guided sparse coding for brain disease classification. | Liu M et al. | β | 2012 | β |
| Cancer classification from gene expression data by NPPC ensemble. | Ghorai S et al. | β | 2011 | β |
| Metasample-based sparse representation for tumor classification. | Zheng CH et al. | β | 2011 | β |
| Recipe for uncovering predictive genes using support vector machines based on model population analysis. | Li HD et al. | β | 2011 | β |
| Sparse logistic regression for diagnosis of liver fibrosis in rat by using SCAD-penalized likelihood. | Yan FR et al. | β | 2011 | β |
| Genome-wide discovery of human heart enhancers. | Narlikar L et al. | β | 2010 | β |
| A regularized method for selecting nested groups of relevant genes from microarray data. | De Mol C et al. | β | 2009 | β |
| Key wavelengths screening using competitive adaptive reweighted sampling method for multivariate calibration. | Li H et al. | β | 2009 | β |
| Regularized gene selection in cancer microarray meta-analysis. | Ma S et al. | β | 2009 | β |
| Early detection of ovarian cancer using group biomarkers. | Tchagang AB et al. | β | 2008 | β |
| Penalized feature selection and classification in bioinformatics. | Ma S et al. | β | 2008 | β |
| Sparse optimal scoring for multiclass cancer diagnosis and biomarker detection using microarray data. | Leng C | β | 2008 | β |
| A blocking strategy to improve gene selection for classification of gene expression data. | Bontempi G | β | 2007 | β |
| A review of feature selection techniques in bioinformatics. | Saeys Y et al. | β | 2007 | β |
| Empirical study of supervised gene screening. | Ma S | β | 2006 | β |
| Regularized binormal ROC method in disease classification using microarray data. | Ma S et al. | β | 2006 | β |
| Regularized ROC method for disease classification and biomarker selection with microarray data. | Ma S et al. | β | 2005 | β |