paperKB
coga / coga-kb
Help
Sign in

Chunk #6 — EXPANDED CONTENT

Source
DGIdb 3.0: a redesign and expansion of the drug-gene interaction database.
Embedded
yes

Text

The enhancements to the online updaters have also been applied to Entrez Gene, from which 99% of all gene claims made by the DGIdb constituent sources were grouped (Supplementary Figure S2) (3). Another major change from 2.0 to 3.0 is that the canonical drug source for the DGIdb has switched from using PubChem compounds to ChEMBL molecules (14). This switch has added 1.7 million ChEMBL molecules to the database for potential matching to drug claims. Importantly, switching to ChEMBL has added 195 antibody drugs (e.g. trastuzumab, cetuximab), a drug class that is absent from the PubChem database and frequently requested by users. These antibody drugs matched to 539 distinct drug claims from the constituent sources of the DGIdb. With ChEMBL as the canonical drug source and the improvements to the grouping strategy below, 80.2% of all drug claims now group. Many of the resources we pull from strive to be as comprehensive as possible, and sometimes include broad classes of drug or therapy (e.g. ‘hormone therapy’, ‘mtor inhibitors’, ‘chemotherapy’, ‘radiation’, ‘antibiotics’, etc.), which account for a large percentage of the remaining drug claims.