
Chunk #75 — STAR Methods — QUANTIFICATION AND STATISTICAL ANALYSIS — ICA based analysis and clustering

Source
Molecular Diversity and Specializations among the Cells of the Adult Mouse Brain.

Text

To identify finer substructure amongst these classes, classes with more than 200 cells were selected for subclustering. The largest 50% of the cells from each of these clusters were subjected to variable gene selection, scaling, and independent component analysis (ICA). The independent component space depends strongly on the number of components, K, selected for computation. To nominate a value for K automatically, we took advantage of the fact that the fastICA algorithm begins with a whitening step, in which a singular value decomposition is used to select the top K eigenvectors (i.e., principal components) for maximization of non-Gaussianity (Hyvärinen, 1999). We therefore calculated the number of statistically meaningful principal components using the Jackstraw method (Chung and Storey, 2015) to obtain a suitable value for K. This value was used in almost all instances of subclustering, with a few exceptions in which K was increased slightly. These values of K were then used to compute independent components (ICs) for each subclustered class.
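The link between the number of significant principal components and K can be illustrated with a minimal NumPy sketch. It is not the authors' pipeline: the permutation test below is a simplified stand-in for the Jackstraw method (it compares each observed eigenvalue with eigenvalues from column-shuffled data), and the function names `n_significant_pcs` and `whiten_top_k`, the thresholds, and the simulated matrix are all hypothetical choices made for illustration.

```python
import numpy as np

def n_significant_pcs(X, n_perm=50, alpha=0.05, seed=0):
    """Count leading PCs whose eigenvalues exceed a permutation null.

    Simplified stand-in for Jackstraw (an assumption, not the published
    method). X is cells x genes.
    """
    rng = np.random.default_rng(seed)
    Xc = X - X.mean(axis=0)
    obs = np.linalg.svd(Xc, compute_uv=False) ** 2
    null = np.empty((n_perm, obs.size))
    for i in range(n_perm):
        # Shuffle each gene independently to destroy gene-gene correlation.
        Xp = np.column_stack([rng.permutation(col) for col in Xc.T])
        null[i] = np.linalg.svd(Xp, compute_uv=False) ** 2
    k = 0
    for j in range(obs.size):
        # Empirical p-value for the j-th eigenvalue against its null rank.
        p = (1 + np.sum(null[:, j] >= obs[j])) / (n_perm + 1)
        if p >= alpha:
            break  # stop at the first non-significant component
        k += 1
    return k

def whiten_top_k(X, k):
    """Project centered data onto the top k PCs, scaled to unit variance.

    This mirrors the whitening step that precedes the ICA rotation.
    """
    Xc = X - X.mean(axis=0)
    U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    return U[:, :k] * np.sqrt(Xc.shape[0] - 1)

# Simulated example (hypothetical data): 300 cells, 40 genes, 3 latent factors.
rng = np.random.default_rng(1)
latent = rng.laplace(size=(300, 3))      # non-Gaussian sources
loadings = rng.normal(size=(3, 40))      # gene loadings
X = latent @ loadings + rng.normal(size=(300, 40))

k = n_significant_pcs(X)   # nominated value of K
Z = whiten_top_k(X, k)     # whitened data the ICA rotation would act on
```

In a sketch like this, `k` would then be supplied as the number of components to a fastICA implementation (for example, the `n_components` argument of scikit-learn's `FastICA`), which performs the rotation that maximizes non-Gaussianity within the whitened K-dimensional space.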