Question

How to exclude noisy CNV from study?

0

7.1 years ago

rmateos.1 ▴ 10

Hi,

I am currently working with whole-exome sequencing (WES) data. After running VarScan2 and DNAcopy, I found that some samples are quite noisy:

Example of noisy sample

My plan is to exclude these samples, but I would like a statistical justification for doing so. Could anyone advise on how to approach this?

Thank you in advance

CNV copy number variation DNAcopy Varscan WES • 1.8k views

ADD COMMENT • link updated 7.0 years ago by ivivek_ngs ★ 5.2k • written 7.1 years ago by rmateos.1 ▴ 10

0

A description of the axes and colors would help in interpreting the figure.

ADD REPLY • link 7.1 years ago by Carlo Yague 8.7k

1

You are absolutely right.

The green/black colors mark different chromosomes. The red fragments are the segments obtained by applying the DNAcopy package to the output from VarScan2.

The Y axis is the log ratio between normal and tumor.

The X axis is just an index that puts all the segments in order. It does not reflect the real length of each fragment in the chromosome; it is more like a running count of the outputs produced by the package.

ADD REPLY • link 7.1 years ago by rmateos.1 ▴ 10

0

Thx for clarifying. I'm not very familiar with varscan and DNAcopy, but perhaps using longer segments could help stabilize the log ratio.
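
To illustrate why longer segments (or larger bins) tend to stabilize the log ratio, here is a minimal Python sketch on simulated data. The function name `bin_log_ratios`, the noise level, and the bin sizes are all made up for illustration; nothing here comes from VarScan2 or DNAcopy. It only shows that averaging more points per bin shrinks the spread of the resulting values.

```python
import numpy as np

def bin_log_ratios(log_ratios, bin_size):
    """Average consecutive log ratios in non-overlapping bins of bin_size points."""
    values = np.asarray(log_ratios, dtype=float)
    n_bins = len(values) // bin_size           # an incomplete trailing bin is dropped
    trimmed = values[: n_bins * bin_size]
    return trimmed.reshape(n_bins, bin_size).mean(axis=1)

# Simulated copy-neutral sample: true log ratio 0, noise sd 0.5 (arbitrary value).
rng = np.random.default_rng(0)
raw = rng.normal(loc=0.0, scale=0.5, size=10_000)

# The spread of the binned values shrinks as the bin size grows.
for size in (1, 4, 16, 64):
    binned = bin_log_ratios(raw, size)
    print(f"bin size {size:>2}: sd = {binned.std(ddof=1):.3f}")
```

The trade-off is resolution: larger bins give smoother values but can hide short copy number changes.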

ADD REPLY • link 7.1 years ago by Carlo Yague 8.7k

0

Thank you for your advice. Other patients get very clear segmentation, and I know that the data is indeed noisy... I am still wondering if forcing longer segmentation in noisy samples is worth it...

Thank you again

ADD REPLY • link 7.1 years ago by rmateos.1 ▴ 10

0

Just a suggestion: you could try CNV-seq, which does more or less the same kind of analysis but automatically calculates the 'optimal' segment size based on the read coverage. That makes sense because higher read coverage gives more power to detect differences over smaller segments.

ADD REPLY • link 7.1 years ago by Carlo Yague 8.7k

score 0 · Answer 1 · 2017-04-11

Just to add to what Carlo said: there are plenty of other tools you can use. DNAcopy, which performs the circular binary segmentation (CBS) in the VarScan2 workflow, might not be the right choice for your data. If the depth is too high, the segmentation can produce this kind of noise. So you can use longer segments, or alternatively try methods such as ADTEx, Control-FREEC, or EXCAVATOR, and see how the results vary.
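
Whichever tool you use, one way to put a number on "noisy" is a per-sample noise score compared across the cohort. The sketch below is only an illustration of that idea and is not taken from any of the tools above. It scores each sample by the median absolute deviation (MAD) of the differences between neighbouring log ratios, then flags samples that sit far above the cohort median. The function names, the simulated data, and the cutoff of 3 MADs are assumptions chosen for the example; the input is assumed to be each sample's log ratios in genomic order.

```python
import numpy as np

def adjacent_diff_noise(log_ratios):
    """Robust noise estimate: scaled MAD of differences between neighbouring points."""
    diffs = np.diff(np.asarray(log_ratios, dtype=float))
    mad = np.median(np.abs(diffs - np.median(diffs)))
    # 1.4826 makes the MAD comparable to a standard deviation for normal data;
    # dividing by sqrt(2) converts the spread of differences to a per-point spread.
    return 1.4826 * mad / np.sqrt(2)

def flag_noisy_samples(noise_by_sample, n_mads=3.0):
    """Return samples whose noise exceeds cohort median + n_mads * scaled MAD."""
    names = list(noise_by_sample)
    scores = np.array([noise_by_sample[name] for name in names])
    center = np.median(scores)
    spread = 1.4826 * np.median(np.abs(scores - center))
    cutoff = center + n_mads * spread          # n_mads = 3 is an arbitrary choice
    return [name for name, score in zip(names, scores) if score > cutoff]

# Simulated cohort: nine quiet samples (sd 0.2) and one noisy sample (sd 0.8).
rng = np.random.default_rng(1)
cohort = {f"sample_{i}": rng.normal(0.0, 0.2, 2000) for i in range(9)}
cohort["noisy_sample"] = rng.normal(0.0, 0.8, 2000)

noise = {name: adjacent_diff_noise(values) for name, values in cohort.items()}
print(flag_noisy_samples(noise))
```

Working on differences between neighbours means that real copy number changes, which shift whole stretches of points together, contribute little to the score, so it mostly reflects point-to-point scatter. The cutoff should be fixed before looking at the results, so that the exclusion criterion is not tuned to the samples you already suspect.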