each permutation was run through the TADA de novo algorithm with the same parameters used above for the observed data to assess the per gene q-values. We then recorded the number of probable genes (q < 0.3) and high confidence genes (q < 0.1) that were observed at each cohort size and plotted the smoothed trend line using local polynomial regression fitting (loess in R). The regression model also predicted the number of genes identified at a given number of trios.