paperKB
coga / coga-kb
Help
Sign in

Chunk #2 — Introduction

Source
JASPAR 2024: 20th anniversary of the open-access database of transcription factor binding profiles.
Embedded
yes

Text

Position frequency matrices (PFMs) remain the most common computational representation of TF-DNA interactions. PFMs are quantitative summaries of the DNA-binding preferences of a given TF, tallying the frequency of each nucleotide at each position of an aligned set of TFBSs and storing this information in a matrix form. These matrices can be converted into probabilistic matrices termed position weight matrices (PWMs) (8). The primary function of PWMs is to model the binding affinity or probability of interaction between a TF and a DNA sequence (8). As such, PWMs are used to predict TFBSs within any DNA sequence. To systematically explore the functional effects stemming from the binding process these models describe, it is necessary to have an extensive compilation that mirrors their diversity. To address this issue, multiple database solutions have been developed to collect and store PFMs, such as JASPAR (9), CIS-BP (10), and HOCOMOCO (11).