compared by conducting a likelihood ratio test calculated as the difference in the χ2 values and degrees of freedom between the two models. Such tests, however, are overpowered for large samples, especially with non-normal data. Therefore, we used differences in BIC values to compare the relative goodness of fit of competing models given its greater weight to model parsimony. A difference in BIC of 0-2 is considered weak evidence of support for the model with the lower BIC value, a difference of 2-6 is considered positive evidence, a difference of 6-10 is considered strong evidence, and a difference greater than 10 is considered very strong evidence31.