paperKB
coga / coga-kb
Help
Sign in

Chunk #36 — Online Methods — Virtual tumor benchmarking approach

Source
Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples.
Embedded
yes

Text

First, we randomly divide the sequencing data into several partitions. We chose to create 6 partitions from each of the 3 libraries (18 partitions total), therefore creating data partitions with ~5x each. We accomplished this by sorting the BAM by name using SortSam from the Picard (http://picard.sourceforge.net) tools to effectively give the reads random ordering. We then randomly allocate each read to one of the partitions and write it to a partition-specific BAM file.