paperKB
coga / coga-kb
Help
Sign in

Chunk #32 — Methods — Generalized linear models

Source
Statistical modeling for sensitive detection of low-frequency single nucleotide variants.
Embedded
yes

Text

The Poisson GLM for erroneous sequencing read counts with log link function is expressed in eq. (1), where Ns,b,l is the observed number of erroneous reads for strand s (forward or reverse) with alternative base b (three possible values other than the reference) at location l, λs,b,l represents the expected mean for Ns,b,l, cs,b,l is the vector of genomic sequence context covariates, and β is the vector of fitted coefficients. The sequencing depth for strand s at location l is treated as the offset.1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ \log \left({\lambda}_{s,b,l}\right)= \log \left(E\left({N}_{s,b,l}\Big|{\boldsymbol{c}}_{s,b,l}\right)\right)= \log \left({d}_{s,l}\right)+{\boldsymbol{\beta}}^{\boldsymbol{\hbox{'}}}{\boldsymbol{c}}_{s,b,l} $$\end{document}logλs,b,l=logENs,b,l|cs,b,l=logds,l+β'cs,b,l