Compare random and non-random mutation distribution
2
2
Entering edit mode
7.9 years ago
sacha ★ 2.4k

Hi, This is a statistics question! I hope you may help me ! On a DNA sequence region, there are several mutation hits. I want to know if the hits distribution follow or not a random distribution. In other words, imagine a target dart. How can get a p-value, to know if darts has been throw randomly or not .

Exemple

statistics pvalue • 1.8k views
ADD COMMENT
2
Entering edit mode
7.9 years ago
Michael 54k

Both distributions are random, with the number of variants in a region as a discrete random variable. The opposite of random is deterministic, which would mean that the number of events is fixed, which is obviously not what you mean. Your second example shows a uniform distribution of events, while the first case should look like a deviation from uniform distribution, such that you would look for a test for deviation from uniformity. See here: https://stats.stackexchange.com/questions/25827/how-does-one-measure-the-non-uniformity-of-a-distribution The simplest test proposed here is a Chi-square test.

ADD COMMENT
2
Entering edit mode

Chi-squared test is based on an approximation for large sample size. There could be need for creating large enough sub-bins of the DNA region so that several mutations are expected to occur. Also, something to be careful for is the mutational process that generated those mutations. There could be preferences in the base context of the mutations (e.g. CpG mutations) that could alter the expectation of a uniform distribution because of the particular sequence of the DNA region.

ADD REPLY
0
Entering edit mode
7.9 years ago
sacha ★ 2.4k

Thank you ! Your answer was clear and perfect !

ADD COMMENT

Login before adding your answer.

Traffic: 1545 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6