Finally, we cannot tell why we observed treatment differences for PDA and abstinence outcomes, but not for DDD and DrInC outcomes. It is a common finding that on average participants improve across multiple outcomes when they receive treatment (e.g., Donovan et al., 2005). Our failure to find treatment differences in DDD, however, may have been due to insufficient power to detect a smaller effect than that observed with PDA.