Question

association analysis with stratification

1

Entering edit mode

6.8 years ago

alessandrotestori7 ▴ 420

Hello! I want to run a classical genome-wide association analysis (cases vs controls) in order to understand whether particular polymorphisms are relevant in a disease. The problem I have encountered is that cases come from a different population than controls. Is it possible to run an association analysis anyway (with a correction) and get reliable statistics or should I discard a lot of samples? Are there any detailed pipelines or packages I can use? Please let me know. Thanks!

GWAS population stratification • 1.4k views

ADD COMMENT • link updated 5.8 years ago by Biostar 20 • written 6.8 years ago by alessandrotestori7 ▴ 420

score 1 · Answer 1 · 2017-06-18

1

Entering edit mode

6.8 years ago

theobroma22 ★ 1.2k

You can use the mantelhaen.test function in the stats R package, which conveniently is loaded in the library directory when you installed R. Alternatively, look at the epiR package, although I've never used it.

ADD COMMENT • link 6.8 years ago by theobroma22 ★ 1.2k

1

Entering edit mode

Thanks for your reply, though I'm not sure that is exactly what I need. However, please do let me know if I'm wrong about it. Indeed, I was rather looking for a pipeline or a package which can adjust p-values from an association study in case the controls and the cases have different allele distributions. I think several softwares/tools are available (e.g.: EIGENSTRAT, Price el al 2006, GenABEL?), but I was unable to follow a pipeline or find a recent package.

ADD REPLY • link 6.8 years ago by alessandrotestori7 ▴ 420

0

Entering edit mode

If you need to adjust p-values you can get the q-values and local FDR values, using the qvalue package available on Bioconductor.

ADD REPLY • link 6.8 years ago by theobroma22 ★ 1.2k

0

Entering edit mode

Now I see your study more clearly, and think the non-population controls were added to the data set as variation, or noise.

ADD REPLY • link 6.8 years ago by theobroma22 ★ 1.2k

0

Entering edit mode

Interesting, can you elaborate how it deals with population stratification? I considered studies with different control population vs patients dead upon arrival.

ADD REPLY • link 6.8 years ago by WouterDeCoster 47k

0

Entering edit mode

It would've been correct to consider populations with vastly different polymorphisms as dead upon arrival...but, in statistical modelling you could have also considered 1) having two controls, or 2) contrasting population cases with non-population controls, or 3) stratifying them separately in the contingency table which can help with the analysis like detecting any confounding variables after you obtain the point estimates and chi-squared values.

ADD REPLY • link 6.8 years ago by theobroma22 ★ 1.2k