in Spearmint (2) or manually curated rules (UniRule) (3–6) based on protein families. UniProtKB/TrEMBL contains the translations of all coding sequences (CDS) present in the EMBL/GenBank/DDBJ Nucleotide Sequence Databases (7) and sequences from TAIR Arabidopsis thaliana (8), SGD (9) and Ensembl Homo sapiens (10) with some defined exclusions. Records are selected for full manual annotation and integration into UniProtKB/Swiss-Prot according to defined annotation priorities.