We collated experimentally defined proteomic data sets corresponding to the structures listed in Table 2; details of how the resulting gene sets were assembled are provided in Supplementary Section 10. We also examined gene sets defined by Gene Ontology annotations (GO sets), using the gene2go file as available from the National Center for Biotechnology Information (NCBI) on 28 July 2010 (Supplementary Section 11).
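As an illustration of how GO sets can be derived from a gene2go-style file, the following is a minimal sketch and not the procedure of Supplementary Section 11. It assumes the standard tab-delimited gene2go layout (tax_id, GeneID, GO_ID, Evidence, Qualifier, GO_term, PubMed, Category). The function name `build_go_sets`, the choice of taxon 9606, and the sample rows (including the gene identifiers) are hypothetical and serve only to make the example self-contained.

```python
import io
from collections import defaultdict


def build_go_sets(handle, tax_id):
    """Map each GO ID to the set of GeneIDs annotated with it for one taxon.

    `handle` is any iterable of lines in the tab-delimited gene2go layout:
    tax_id, GeneID, GO_ID, Evidence, Qualifier, GO_term, PubMed, Category.
    """
    go_sets = defaultdict(set)
    wanted = str(tax_id)
    for line in handle:
        # Skip blank lines and the '#'-prefixed header line.
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        if fields[0] != wanted:
            continue
        gene_id, go_id = fields[1], fields[2]
        go_sets[go_id].add(gene_id)
    return dict(go_sets)


# Illustrative rows only (hypothetical gene IDs, not real annotations).
SAMPLE = (
    "#tax_id\tGeneID\tGO_ID\tEvidence\tQualifier\tGO_term\tPubMed\tCategory\n"
    "9606\t1001\tGO:0045202\tIDA\t-\tsynapse\t-\tComponent\n"
    "9606\t1002\tGO:0045202\tIEA\t-\tsynapse\t-\tComponent\n"
    "9606\t1002\tGO:0005739\tIDA\t-\tmitochondrion\t-\tComponent\n"
    "10090\t2001\tGO:0045202\tIDA\t-\tsynapse\t-\tComponent\n"
)

go_sets = build_go_sets(io.StringIO(SAMPLE), tax_id=9606)
for go_id in sorted(go_sets):
    print(go_id, sorted(go_sets[go_id]))
```

The sketch keeps every annotation regardless of evidence code or qualifier. Any such filtering would be a further assumption, since none is described in the text above.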