The initial G×E effect (3) did not have an overwhelmingly impressive p value, but it was robust, having been 1) discovered in an epidemiologically sound longitudinal cohort study; 2) tested in a straightforward and transparent analysis; 3) reproduced across two stressors, child maltreatment and adult stressful life events; and 4) reproduced across four depression phenotypes. How has this hypothesis fared in observational studies since it was initially tested?