higher internal consistency, larger effect sizes with scales used for validation, and larger effect sizes with demographic and alcohol and other substance characteristics. It also performed slightly better when heritability was measured in a genetic framework (further details about this will be presented in a publication in preparation). Although none of these improvements reached statistical significance on its own, we consider the sum of these improvements supports the decision to improve item balance. More details about these results are omitted for the sake of brevity.