Merging chip-seq samples sequenced in two different runs
1
0
Entering edit mode
3.2 years ago
srhic ▴ 60

Hello,

I have a chip-seq experiment in which the number of read is rather low (~10M per sample). However, most qc parameters show that the chip efficiency is not bad. So we have decided to resequence the same samples again. My question is what would be the best approach to combine the new data with the existing one? Would I just treat the new data as a technical replicate and merge the fastqs/bams together? Or would I treat it as a biological replicate and account for batch effect when using edgeR for differential analysis between my conditions?

(also a sort of unrelated question I have is that I have noticed that in most of my chip-seq experiments, the input sample always has more reads than the treatment samples and I was wondering if this is normal or just something random).

Thanks

ChIP-Seq RNA-Seq • 1.0k views
ADD COMMENT
3
Entering edit mode
3.2 years ago

I usually consider re-sequencing of the same sample (same library prep) as a technical replicate and merge the bam files once I have confirmed that the replicates are technically sound. For instance, for ChIP-seq, I would look at some control peak and assess if the replicates behave similarly. If they don't, this certainly raises a flag and I would not merge them unless there is a good reason for the difference.

Concerning having more reads in the input vs IP, I guess it totally depends on how you pooled the barcoded libraries before sequencing. If you aimed for an equimolar pool, and have more reads in your input, then this likely reflects library prep quantification issues or adapter contamination in your IP – which happen more frequently when the IP efficiency is low. That being said, nothing stops you from stepping away from an equimolar pool and mixing more material from the IP than from the input. After all, the IP reads are usually more informative than the input reads (PS : I'm not saying that the input control is not important here).

ADD COMMENT
1
Entering edit mode

Thanks. I was also planning to merge the bams.

My libraries are prepared by a core facility but I assume they aim for an equal ratio. My next question would have been if I can ask them to add more of the treatment sample but you already answered it. As you said the input is important but I would rather have a some extra reads in my treatment samples. I have noticed that the input almost always has ~20% more reads than my chip samples.

ADD REPLY

Login before adding your answer.

Traffic: 1632 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6