paperKB
coga / coga-kb
Help
Sign in

Chunk #71 — Findings — PLINK 2.0 design — Data compression

Source
Second-generation PLINK: rising to the challenge of larger and richer datasets.
Embedded
yes

Text

We note that LD-based compression of variant groups is also possible, and Sambo’s SNPack software [44] applies this to the PLINK 1 binary format. We do not plan to support this in PLINK 2.0 due to the additional software complexity required to handle probabilistic and multiallelic data, but we believe this is a promising avenue for development and look forward to integrating it in the future.