Hi!
I was wondering if there is a way to normalise the number of reads in a region and the RPKM of the gene nearest to that region, so that a correlation can be computed between the two.
For example, the data below show the number of tags in the first column and the RPKM in the second column:
Tags RPKM
15 0.14619
11 0
203 0.2259
129 10.701
300 7.0772
122 2.3234
346 10.666
77 3.117
201 16.749
What is the best way to normalise or scale the two columns so that a correlation can be computed?
Thank you
Wouldn't that just be computing the RPKM for your region?
The aim is to correlate occupancy (the tag count in the region) with the RPKM of the nearest gene.
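One point that may help here: a correlation coefficient does not require the two columns to be on the same scale. Pearson's r is unchanged if either column is multiplied by a constant, and Spearman's rho depends only on ranks, so the tags and RPKM values can be correlated as they are. What usually matters more is the skew of count and expression data. Below is a minimal sketch, using only the nine rows posted above, that computes Spearman's rho and Pearson's r on log-transformed values. The pseudocount of 1 (needed because one RPKM is 0) and the function names are my own assumptions, not an established recipe. If the regions differ in length, the tag counts would first need dividing by region length, which is not given in the post.

```python
import math

# The nine (tags, RPKM) pairs from the question.
tags = [15, 11, 203, 129, 300, 122, 346, 77, 201]
rpkm = [0.14619, 0, 0.2259, 10.701, 7.0772, 2.3234, 10.666, 3.117, 16.749]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def ranks(values):
    """1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(x, y):
    """Spearman's rho: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def log_pearson(x, y, pseudocount=1.0):
    """Pearson's r on log2(value + pseudocount); the pseudocount is an assumption."""
    log_x = [math.log2(v + pseudocount) for v in x]
    log_y = [math.log2(v + pseudocount) for v in y]
    return pearson(log_x, log_y)


if __name__ == "__main__":
    print("Spearman rho:", round(spearman(tags, rpkm), 4))
    print("Pearson r on log2(x + 1):", round(log_pearson(tags, rpkm), 4))
```

For these nine rows Spearman's rho comes out at about 0.62, and it stays the same if the tag counts are multiplied by any positive constant. With so few points the estimate is very uncertain, so it should be recomputed on the full set of regions.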