Chunk #83 — Online Methods — Cross-validation of model fit.

Source: Flexible statistical methods for estimating and testing effects in genomic studies with multiple conditions.
Embedded: yes

Text

To compare model-fitting procedures, we randomly divide the rows of the data matrix $\hat{B}$ into two subsets: a “training set” $\hat{B}_{\text{train}}$ and a “test set” $\hat{B}_{\text{test}}$. We then apply mash to $\hat{B}_{\text{train}}$, yielding estimates $(\hat{\pi}, \hat{U})$, and assess the “fit” of these estimates by computing the log-likelihood of the test data, $\log p(\hat{B}_{\text{test}} \mid \hat{\pi}, \hat{U}, V)$, which is given by eq. 4. This cross-validation strategy can also be used to compare different approaches to estimating $\hat{U}$, and our current strategy was developed and refined using this framework.
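The train/test procedure described above can be sketched in a few lines of Python. This is a minimal illustration, not the mash implementation: it assumes that eq. 4 (not reproduced in this section) is the likelihood of a zero-mean multivariate normal mixture in which each row has marginal covariance U_k + V under component k, that any scaling of the covariance matrices has already been folded into the list of U_k, and that V is shared across rows. The function names `split_rows` and `heldout_loglik` are hypothetical.

```python
import numpy as np
from scipy.special import logsumexp
from scipy.stats import multivariate_normal


def split_rows(n_rows, train_fraction=0.5, seed=0):
    """Randomly partition row indices into a training set and a test set."""
    rng = np.random.default_rng(seed)
    perm = rng.permutation(n_rows)
    n_train = int(round(train_fraction * n_rows))
    return np.sort(perm[:n_train]), np.sort(perm[n_train:])


def heldout_loglik(B_test, pi, U, V):
    """Log-likelihood of held-out rows under a zero-mean normal mixture.

    B_test : (J, R) array of observed effects in the test set.
    pi     : (K,) array of mixture weights estimated on the training set.
    U      : (K, R, R) array of prior covariance matrices (assumption:
             any scaling factors are already applied to each U[k]).
    V      : (R, R) error covariance (assumption: shared across rows).
    """
    B_test = np.atleast_2d(np.asarray(B_test, dtype=float))
    pi = np.asarray(pi, dtype=float)
    n_rows, n_cond = B_test.shape
    log_dens = np.empty((n_rows, len(pi)))
    for k, U_k in enumerate(U):
        # Under component k, each row is marginally N(0, U_k + V).
        log_dens[:, k] = multivariate_normal.logpdf(
            B_test, mean=np.zeros(n_cond), cov=U_k + V
        )
    # Row-wise log of sum_k pi_k * density_k, then sum over rows.
    return float(logsumexp(log_dens, axis=1, b=pi).sum())
```

In use, one would fit the weights and covariances on `B[train_idx]` with each candidate procedure, then call `heldout_loglik(B[test_idx], pi_hat, U_hat, V)` for each; the procedure with the higher held-out log-likelihood fits the test data better. Working on the log scale with `logsumexp` avoids underflow when individual component densities are very small.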