Running 1.5M potentially different generalized linear models depending on distribution of read depth information to study CNV
1
0
Entering edit mode
7.9 years ago
LauferVA 4.2k

Background (don't need help on these sections, yet): I have read depth information on ~300 whole genomes. I am aware of many pitfalls of analyzing read depth as a proxy for CNV and have taken many steps to obtain quality-controlled read depth information that I am ready to analyze.

With this read depth data, I want to look for associations between this standardized, QCed, read depth information and my phenotype of interest in a covariate-controlled analysis.

However, I have been looking at the distributions of read depth information by window. Looking across windows, these windows have a distribution, but looking within window, there are (sometimes very) different distributions per window.

If the windows were all distributed the same, I could for instance run a poisson regression 1.5M times and be done. However, they are not. As such, the generalized linear model that I select should possibly be changed depending on the window to maximize power to analyze any given window.

Does anyone have experience automating the process of model fitting? Or is this inappropriate? Another method would of course be to use nonparametric analysis, but then I lose potentially very interesting information on the distribution of a given window.

CNV GLM read depth • 1.9k views
ADD COMMENT
4
Entering edit mode
7.9 years ago

If this is a learning experience ignore the following advice.

There are many CNV callers that will model the read depth: -GenomeSTRiP -CNVKIT -WaveCNV -cn.mops ...

I would suggest trying a published tool.

Otherwise, The associated publications should help you figure out what model you want to use.

ADD COMMENT
2
Entering edit mode
ADD REPLY
0
Entering edit mode

Thanks, Zev! It's both. I will read and mess around with these.

ADD REPLY

Login before adding your answer.

Traffic: 1548 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6