Question: Visualizing CNV data based from bed or bedgraph format
0
Entering edit mode

Hi,

I was wondering if there is a package or script which may visualize multi-panel CNV just IGV does but with more flexibility from a cohort of bedgraph files? preferably in R? Sushi R package does not have a point based visualization otherwise might have been perfect.

Best

Entering edit mode
1

Have you checked aCNViewer tool?

ADD REPLYlink 11 months ago
arup
♦ 1.3k
2
Entering edit mode

Hi sm.hasehmin,

You can use the Bioconductor package karyoploteR for this. You can plot the data as points using kpPoints as in the example below or you could change that to kpSegments to show the width of your data elements, for example. There's more information on the available functions and customization options at the karyoploteR tutorial.

To load the bedgraphs into R you can use toGRanges from regioneR. In the example I'm using bedgraph example data from Sushi which is not CNV data, but with only minor changes (setting ymin and ymax) you should be able to adapt the code to CNV values. To load your data, you can give toGRanges the file name of your bedgraphs and it should load them.

For the vertical positioning (r0 and r1) I'm using autotrack. This function is not yet in the tutorial but the inline documentation should suffice.

Note: since the example datasets have only data for a small region of chromosome 11 this is what we are plotting. Removing the zoom argument from plotKaryotype will plot the whole genome.

#Load 3 example bedgraphs from the datasets in the Sushi package
library(Sushi)
data(list=c("Sushi_DNaseI.bedgraph","Sushi_ChIPSeq_CTCF.bedgraph",
        "Sushi_ChIPExo_CTCF.bedgraph"), package="Sushi")

dd <- list(CTCF_Exo=Sushi_ChIPExo_CTCF.bedgraph,
           CTCF_Chip=Sushi_ChIPSeq_CTCF.bedgraph,
           DNase=Sushi_DNaseI.bedgraph)

png("bedgraphPlot.png", width = 1500, height = 1000)
  kp <- plotKaryotype(zoom="chr11:1643216-2359707", main = "BedGraphs", cex=3)
  kpAddBaseNumbers(kp, tick.dist = 10e4, add.units = TRUE, cex=1.8)
  for(i in seq_len(length(dd))) {
    names(dd[[i]]) <- c("chr", "start", "end", "value") 
    gr <- toGRanges(dd[[i]])
    at <- autotrack(i, length(dd), margin = 0.1)
    kpPoints(kp, data=gr, ymax=max(gr$value), r0=at$r0, r1=at$r1, col=rainbow(10)[i])
    kpAddLabels(kp, labels = names(dd)[i], r0=at$r0, r1=at$r1, cex=2)
  }
dev.off()

And you would get something like this

enter image description here

ADD COMMENTlink 10 months ago bernatgel ♦ 1.9k
Entering edit mode
0

Thanks alot. I removed the zoom argument and the representation was a bit two detailed for an overview of genomic changes. Is the reperesentaion like this possible ? https://us.v-cdn.net/5019796/uploads/FileUpload/58/15ca12b0618250b498391fe5562fb9.png

ADD REPLYlink 11 months ago
sm.hashemin
• 60
Entering edit mode
1

Sure, to have all chromosomes in one line use

kp <- plotKaryotype(plot.type=4)

and keep everything else the same. There are various plot types available to choose from. You can find them in the tutorial page.

The SNP-array data example might be useful to you. At the end your data is very similar to the LRR in SNP arrays.

enter image description here

ADD REPLYlink 11 months ago
bernatgel
♦ 1.9k
0
Entering edit mode

some tools I wrote:

http://lindenb.github.io/jvarkit/WesCnvSvg.html

http://lindenb.github.io/jvarkit/WesCnvTView.html

ADD COMMENTlink 11 months ago Pierre Lindenbaum 120k
0
Entering edit mode

Thanks a lot for the answers I am more searching for a representation like the following. As I have no normal samples I had to use ControlFREEC and reanalysis with another software is not an option.

enter image description here

ADD COMMENTlink 11 months ago sm.hashemin • 60

Login before adding your answer.

Powered by the version 1.8