Upsampling BAMs, or downsampling
1
0
Entering edit mode
5.2 years ago
HI • 0

I have four ChIP-seq sample, which has about 10-20M reads , I want to do downsampling or upsampling to make them have the same genome coverage, for example, 15M reads, I know the software picard can do downsampling .My question is, is there any software to do up sampling, Thank you very much!

ChIP-Seq • 2.1k views
ADD COMMENT
0
Entering edit mode

Thank you very much! The reason that I do downsampling or upsampling is to normalize the coverage between materials, then I can compare them with the result from the deeptools(bamCoverage normalized with RPKM in a bin) (The materials were mapped to different reference sequence,so they could not be compared with existing software), is it right to do so? The reason that I want to do upsampling is there exists a sample with quite little reads.

ADD REPLY
0
Entering edit mode

The downsampling is a valid thing to do, and useful in many cases where normalision to read depth doesn't quite cut it. But upsampling is never valid, you are inventing data that doesn't really exist. Either downsample to the lowest sample, or discard the lowest sample.

Generally we pick a threshold, discard samples with fewer reads than that threshold and downsample the rest to it.

ADD REPLY
2
Entering edit mode
5.2 years ago

Up-sampling would just introduce noise, simply down-sample and call it done.

ADD COMMENT
1
Entering edit mode

Adding on this, naive upsampling would invent information which is not there, while subsampling reduces existing information. Therefore downsampling is imho preferred. Note that in ChIP-seq because of immunoprecipitation efficiency and/or antibody quality the total read count is not necessarily informative. What is your analysis goal?

ADD REPLY

Login before adding your answer.

Traffic: 1837 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6