where z is a normalization constant called the partition function, F α (x) = U1(x) + αL(x) where U1(x) and L(x) are energy functions associated with p(y | x) and p(x) respectively, and α (a positive scalar) is a tuning or regularization parameter. Then