paperKB
coga / coga-kb
Help
Sign in

Chunk #7 — Transcriptome analysis: Bulk and single cell

Source
Comprehensive functional genomic resource and integrative model for the human brain.
Embedded
yes

Text

Second, we used an unsupervised analysis to identify the primary components of bulk expression variation. We decomposed the bulk gene expression matrix by using nonnegative matrix factorization (NMF) (B ≈ VH, where B, V, and H represent matrices) and determined whether the top components (NMF-TCs), capturing the majority of covariance (columns of V) (Fig. 2B), were consistently associated with the single-cell signatures (Fig. 2C) (21). A number of NMF-TCs were, in fact, highly correlated with cell types from matrix C for both TPM and UMI data—e.g., component NMF-17 is correlated with the Ex2 cell type (correlation coefficient r = 0.63) (Fig. 2C and fig. S9). This demonstrates that an unsupervised analysis derived solely from bulk data can roughly recapitulate the single-cell signatures, partially corroborating them.