paperKB
coga / coga-kb
Help
Sign in

Chunk #36 — Methods — Gene expression arrays (‘gexarray’) — Re-mapping of array probes

Source
Common genetic variation drives molecular heterogeneity in human iPSCs.
Embedded
yes

Text

We first mapped the sequences allowing no mismatches (-n 0, seeding disabled) and kept uniquely mapping probes with a minimum mapping quality (MAPQ) of 10 (–q 10). These sequences were then mapped again, this time allowing one mismatch (-n 1). Again, only uniquely mapping probes with MAPQ > 10 were retained, resulting in a total of N = 37,740 probes. We further removed all probes that overlapped with any variant with a minor allele frequency greater than 0.05 in the main imputed dataset (Nlines = 858). Remaining probes were annotated with Gencode version 19 gene annotations 56 and only probes mapping uniquely to a single gene were kept (final probeset Nprobes = 25,604, representing 17,116 unique genes of which 14,569 are protein coding.