Forum:How to calculate Tajima's D and Fay & Wu's H for unphased data?
1
0
Entering edit mode
8.2 years ago
RoseString ▴ 10

Hi,

I have a small number of samples (~10) for my species of interest (non-model organism), so it's almost impossible to phase the data. I am interested in doing some site-frequency spectrum methods to detect positive selection in the genome, but they require the calculation of nucleotide diversity (pi). Is it possible to do so without phasing the data?

Thanks in advance!

Evolution Genomics Nucleotide-diversity Genetics • 4.2k views
ADD COMMENT
0
Entering edit mode
8.2 years ago
jsgounot ▴ 170

Maybe you could use VariScan. However, I don't know if it's the best way to do it for unphased data since you will have to produce 2 sequences for each individual, and therefore randomly assign each variant to one sequence.

ADD COMMENT
0
Entering edit mode

Thanks!

Do you know any literature doing the random assignment of variants if the data is unphased?

ADD REPLY
0
Entering edit mode

Just an update. I found a study using your method. They call this process 'haploidize data'.

ADD REPLY

Login before adding your answer.

Traffic: 2052 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6