upscaling ChIP-seq data
9.3 years ago
R ▴ 10

Hi,

Could someone please tell me how to upscale a ChIP-seq dataset?

For downscaling I use samtools view -bs, but I could not find anything for upscaling.

Thanks

ChIP-Seq • 2.1k views

What is "upscale"? In other words, what do you want to do?


To scale unequal datasets, one option is to upscale the smaller dataset or downscale the larger one. I want to upscale one of my datasets to increase its number of reads.


You cannot validly do that for any use case that I know of. You could subset the larger one, though.


Given the number of peak callers out there already, why are you rolling your own?


Also, have you read the source code for MACS or another peak caller? Many of them allow this and you could just see directly how they do it.


But I do not want to call peaks! I want to make tracks with equal numbers of reads.


Then subset the larger one.
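
A minimal sketch of subsetting the larger dataset with samtools view -s, the subsampling option mentioned in the question. The function names, the seed of 42, and the file arguments are hypothetical. It assumes samtools is on the PATH and that the count from samtools view -c (which includes every alignment record, unmapped and secondary ones too) is an acceptable read count for your purposes.

```shell
# Hypothetical helper: build the SEED.FRACTION argument for `samtools view -s`
# so that a file with TOTAL reads is downsampled to roughly TARGET reads.
# Usage: subsample_arg SEED TARGET TOTAL
subsample_arg() {
    awk -v seed="$1" -v target="$2" -v total="$3" 'BEGIN {
        # Fail if there is nothing to downsample.
        if (total + 0 <= 0 || target + 0 >= total + 0) exit 1
        frac = sprintf("%.4f", target / total)    # e.g. "0.2500"
        if (frac + 0 >= 1) exit 1                 # fraction rounded up to 1
        printf "%d%s\n", seed, substr(frac, 2)    # e.g. "42.2500"
    }'
}

# Hypothetical wrapper: downsample the larger BAM to about the size of the
# smaller one. All three arguments are placeholder file names.
equalize_bams() {
    small_bam=$1; large_bam=$2; out_bam=$3
    n_small=$(samtools view -c "$small_bam") || return 1
    n_large=$(samtools view -c "$large_bam") || return 1
    arg=$(subsample_arg 42 "$n_small" "$n_large") || return 1
    # -s takes the integer part as the random seed and the decimal part
    # as the fraction of alignments to keep.
    samtools view -b -s "$arg" "$large_bam" > "$out_bam"
}
```

For example, equalize_bams small.bam large.bam large.sub.bam would write a subsampled copy of large.bam. The result is only approximately equal in size, because samtools keeps each read with the given probability rather than drawing an exact count.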

9.3 years ago
bede.portz ▴ 540

HOMER may do what you need. If by "track" you mean a file you can visualize in the UCSC browser, IGV, etc., then you can use HOMER to make a tag directory from your reads and then make a bedGraph from that directory. With the makeUCSCfile command, HOMER normalizes each file, or "track," to 10 million reads by default. You can set the number of reads each dataset is normalized to with the -norm <#> option.

Starting with your reads:

makeTagDirectory <directory> <reads.bed>
makeUCSCfile <directory> -o auto -norm <#>

The first command creates a tag directory holding the reads you want to analyze, split by chromosome, which speeds up subsequent operations. The second command makes a bedGraph normalized to the number of reads you specify, or 10 million by default. With the -o auto option, the bedGraph is written into the tag directory it was generated from.
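
To make the effect of -norm concrete, here is a rough sketch of the arithmetic, assuming the normalization is a simple linear scaling of coverage so that each dataset totals the -norm value. The norm_factor function is a hypothetical helper for illustration, not part of HOMER, and the read counts are made-up examples.

```shell
# Hypothetical helper: weight applied to each read if coverage is scaled
# linearly so that the track total equals NORM reads.
# Usage: norm_factor NORM TOTAL_READS
norm_factor() {
    awk -v norm="$1" -v total="$2" 'BEGIN {
        if (total + 0 <= 0) exit 1        # avoid division by zero
        printf "%.4f\n", norm / total
    }'
}

norm_factor 10000000 25000000   # 25M-read dataset is scaled down: 0.4000
norm_factor 10000000 5000000    # 5M-read dataset is scaled up:   2.0000
```

Under that assumption, the smaller dataset is "upscaled" by weighting its existing reads more heavily in the track. No new reads are created, so tracks made with the same -norm value are comparable without subsampling either file.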

If you want to normalize total read number among datasets for other analyses, such as composite plots or heatmaps, you can do so with HOMER's annotatePeaks tool.
