Adjusting Peak Calling For Broad Enrichment Of Histone Modifications.
3
8
Entering edit mode
13.3 years ago
Dave Gerrard ▴ 190

We are working on some ChIP-seq data targeting histone modifications which may be widely dispersed across the genome. We expect the enrichment to be much less peaky and more 'foothills vs. plains'. Does anyone have any tips or tricks for getting the most out of peak calling software in this situation or any alternative strategies? We are using Macs14 and QuEST but are open to trying other methods.

chip-seq peak-calling • 6.4k views
ADD COMMENT
7
Entering edit mode
13.3 years ago

We have looked at this type of data in the following papers:

  1. DOI: 10.1038/nature08924
  2. DOI: 10.1016/j.molcel.2010.01.030
  3. DOI: 10.1371/journal.pgen.1001134

We ended up writing our own set of perl scripts and changed variables based on the type of data examined (full details and rationales in the methods section of these papers)

Our 'peak finder' defines a peak as an area where values are above the height threshold, less than the specified gap apart and the whole area falls within the length thresholds. It outputs the peaks as a Bed file with statistics. I'll put our peak finding script here but please be warned that is was not intended for release and it is still largely undocumented. If there is a need, we can tidy it up and put it on the galaxy toolshed.

ADD COMMENT
1
Entering edit mode

Can you please update the link to your perl script!

ADD REPLY
0
Entering edit mode

I removed this script as there now exists a range of decent pack callers.. See this wikipedia page

ADD REPLY
2
Entering edit mode
13.3 years ago

Nice discussion topic -- I've also struggled with large histone modification peaks. One approach which we had some success with is using MACS 1.4 with the call-subpeaks option to subdivide the larger peaks using PeakSplitter:

http://www.ebi.ac.uk/bertone/software.html

We then overlapped these with nucleosome position calls from NPS:

http://liulab.dfci.harvard.edu/NPS/

This was helpful to get a more refined set of reference nucleosome regions that could then be used for comparisons between experiments. Here's the python code used to combine the NPS and MACS calls:

https://github.com/chapmanb/mgh_projects/blob/master/cy_histone_chipseq/merge_nps_macs.py

ADD COMMENT
2
Entering edit mode
13.3 years ago

I've used SICER and CCAT with reasonable success for this kind of problem. The former is specifically designed for diffuse enrichment regions (but not TFs) and the latter has a "peak mode" for TFs and similar cases and a "region mode" for cases where you expect a more spread-out enrichment.

ADD COMMENT

Login before adding your answer.

Traffic: 1514 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6