paperKB
coga / coga-kb
Help
Sign in

Chunk #15 — ONLINE METHODS — Imputation server architecture

Source
Next-generation genotype imputation service and methods.
Embedded
yes

Text

using minimac3 on the previously generated chunk. If the user has uploaded unphased genotypes, the data are prephased with one of the available phasing engines: Eagle 2, HAPI-UR34, or SHAPEIT17. A post-processing step generates a zipped and indexed VCF file (using bgzip and tabix35) for each imputed chromosome. To minimize the input/output load, the reference panel is distributed across available nodes in the cluster using the distributed cache feature of Hadoop. To ensure data security, imputation results are encrypted on the fly using a one-time password. All result files and reports can be viewed or downloaded via the web interface.