paperKB
coga / coga-kb
Help
Sign in

Chunk #67 — Materials and methods — Expanded design matrices

Source
Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2.
Embedded
yes

Text

For consistency with our software’s documentation, in the following text we will use the terminology of the R statistical language. In linear modeling, a categorical variable or factor can take on two or more values or levels. In standard design matrices, one of the values is chosen as a reference value or base level and absorbed into the intercept. In standard GLMs, the choice of base level does not influence the values of contrasts (LFCs). This, however, is no longer the case in our approach using ridge-regression-like shrinkage on the coefficients (described below), when factors with more than two levels are present in the design matrix, because the base level will not undergo shrinkage while the other levels do.