test, Δχ2(72) = 307.918, p < 0.001, this form of comparative model testing can be sensitive to relatively trivial differences in model fit when the sample size is large. Thus, we re-scaled χ2 differences to an RMSEA metric (Hildebrandt, Wilhelm, & Robitzsch, 2009), whereby values greater than 0.05 on the resulting Index of Root Deterioration per Restriction (RDR) suggest that the change in model fit is significant. The resulting value (RDR = 0.038), indicated that we could constrain loadings to be equal by sex, and this model (the initial model) was used to derive all factor scores for enrichment testing.