Is it more correct to use r2 or D' for selecting SNPs in LD?
7.1 years ago
Milos Pjanic ▴ 30

If two SNPs are co-inherited 100% of the time, D' will always be 1, while r2 also depends on allele frequencies and will be lower than 1 whenever the allele frequencies of the two SNPs differ (for example, with complete disequilibrium, D' = 1, if one SNP has a minor allele frequency of 50% and the other of 1%, r2 will drop to about 0.01). Doesn't this mean that r2 is useful only when searching either for SNPs in LD or for a proxy SNP, because it prevents us from selecting a SNP that is not common in the population (possibly a rare variant)?
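To make the numbers in this example concrete, here is a minimal sketch of the standard two-locus definitions of D, D' and r2 computed from haplotype frequencies. The function name `ld_stats` and its parameters are my own for illustration and do not come from any particular package. It assumes two biallelic SNPs and that the rare allele (1%) always sits on the same background as one allele of the common SNP (50%).

```python
def ld_stats(p_a, p_b, p_ab):
    """Return (D, D', r^2) for two biallelic SNPs.

    p_a  : frequency of allele A at the first SNP
    p_b  : frequency of allele B at the second SNP
    p_ab : frequency of the A-B haplotype
    (hypothetical helper, written only for this illustration)
    """
    d = p_ab - p_a * p_b
    # D' rescales D by the largest value it could take given the allele frequencies
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0
    # r^2 is the squared correlation between the two allele indicators
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    r2 = d * d / denom if denom > 0 else 0.0
    return d, d_prime, r2


# Frequencies differ (0.5 vs 0.01), rare allele always on the A background
_, dprime_mismatch, r2_mismatch = ld_stats(0.5, 0.01, 0.01)

# Both alleles rare (0.01) and always found together
_, dprime_match, r2_match = ld_stats(0.01, 0.01, 0.01)

print(dprime_mismatch, round(r2_mismatch, 4))
print(dprime_match, round(r2_match, 4))
```

In the second call both variants are rare but have the same frequency, and r2 comes out at 1 (up to floating-point rounding). So under these definitions a low r2 reflects a mismatch in allele frequencies between the two SNPs, not rarity as such.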

LD linkage disequilibrium • 3.9k views
7.0 years ago
LauferVA 4.2k

It is not more or less correct except in relation to a given question or methodology.

For instance, many methods take r or r^2 as input for purposes other than choosing proxy SNPs. The PAINTOR algorithm, for example, uses an LD matrix consisting of r values, rather than D' values, to prioritize candidate causal variants.
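As a rough illustration of what an LD matrix of r values looks like, here is a minimal sketch that computes the signed Pearson correlation between allele indicators for every pair of SNPs. The toy haplotypes and the helper names `pearson_r` and `ld_r_matrix` are made up for this example. It assumes phased 0/1 haplotypes and says nothing about the file format PAINTOR itself expects, so check its documentation for that.

```python
from math import sqrt


def pearson_r(x, y):
    """Signed Pearson correlation between two equal-length 0/1 allele vectors."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return float("nan")  # monomorphic SNP: correlation is undefined
    return cov / sqrt(var_x * var_y)


def ld_r_matrix(haplotypes):
    """Build a SNP-by-SNP matrix of r values.

    haplotypes: list of rows, one per haplotype, each a list of 0/1 alleles.
    """
    columns = list(zip(*haplotypes))  # one tuple of alleles per SNP
    return [[pearson_r(ci, cj) for cj in columns] for ci in columns]


# Toy data (invented): 4 haplotypes x 3 SNPs
toy_haplotypes = [
    [0, 0, 1],
    [0, 0, 1],
    [1, 1, 0],
    [1, 1, 1],
]

ld = ld_r_matrix(toy_haplotypes)
for row in ld:
    print([round(v, 3) for v in row])
```

Unlike r^2 or D', the r values keep their sign, which records which alleles tend to travel together. Squaring each entry gives the usual r^2.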

This is a case where we would use r or r^2, and the choice has nothing to do with the rarity of the variant.

Thus, the question as phrased is too broad. It could be correct or incorrect to use either D' or r^2 in certain situations, and those may or may not have to do with rarity of variation.
