paperKB
coga / coga-kb
Help
Sign in

Chunk #15 — Methods — Eigenmaps

Source
Using ancestry matching to combine family-based and unrelated samples for genome-wide association studies.
Embedded
yes

Text

Rather than using traditional PCA, we utilize a variant of this approach that arises from spectral graph theory [12]. The basic idea is to represent the population as a weighted graph, where the weights reflect the degree of similarity between pairs of subjects. As with PCA, the graph is then embedded to a lower dimensional space using the top eigenvectors of a function of the weight matrix. Lee et al. [12] show that the spectral graph analysis (SGA) leads to more meaningful clusters than ancestry estimated via PCA. Eigenvectors calculated based upon PCA are strongly affected by uneven sampling of populations [32]. While somewhat susceptible to this bias, the SGA is more robust to cluster size [33]. Moreover, SGA also identifies eigenvectors that successfully separate the data into homogeneous clusters that frequently correspond to demographic labels [12].