in the data set constructed. Table 1.The statistics of phosphorylation sites obtained from Phospho.ELM and Swiss-ProtData sourceNumber of phosphorylated proteinsNumber of phosphorylation sitesSerine (S)Threonine (T)Tyrosine (Y)Histidine (H)TotalPhospho.ELM3674991718901804113 612Swiss-Prot*314848461035901426832Combined (non-redundant)584211 888243321794316 551It notices that the sum of serine, threonine, tyrosine and histidine in Swiss-Prot is not equal to 6832, because there are several phosphorylation sites located on other kinds of residue. *The entries which contain residues annotated as ‘phosphorylation’ in the ‘MOD_RES’ are extracted and the entries annotated as ‘by similarity’, ‘potential’ and ‘probable’ are excluded.