All models controlled for program condition (i.e., intervention vs. comparison group). Residuals from the HGLM analysis indicated biased estimates, which may cause over or under estimation of the amount of variability in substance use outcomes. Thus, we used robust standard errors for hypotheses testing (Raudenbush & Bryk, 2002).