logistic regression with PLINK
2
0
Entering edit mode
6.6 years ago

Guys, I'm running logistic regressions with plink. From the current output, I can tell the total number of individuals (case+control) from the NMISS column. My questions is, is there a way to get the number of cases and controls for each SNP as well? For example, with SNP123456, I currently know the sample size is 5,000. What's the easiest way to tell how many cases and how many controls within the 5,000 people? Greatly appreciate your help! Charlie

P.S. we dropped some genotype calls due to bad quality and we don't plan to impute the missing calls. Therefore, each SNP may have a different number of cases and controls.

PLINK logistic regression • 3.5k views
ADD COMMENT
0
Entering edit mode

Thanks. --freq gives nice information based on allelic information. It will be nice for me to get the numbers of individuals based on disease status/phenotype. For example, I need to know how many T2D cases and T2D controls for each SNP analyzed.

ADD REPLY
0
Entering edit mode

Please use ADD COMMENT/ADD REPLY when responding to existing posts to keep threads logically organized.

ADD REPLY
3
Entering edit mode
6.6 years ago

With plink 1.9, "--freq case-control" stratifies the allele frequency report by disease status.

With plink 2.0, you can embed case/control allele counts directly in the logistic regression report by adding "cols=+totallelecc" after --logistic/--glm. (Note that since these are allele counts, not sample counts, the numbers will generally be twice as large.)

ADD COMMENT
0
Entering edit mode

Thank you. I guess it's the best option I have for now. Greatly appreciate it!

ADD REPLY
0
Entering edit mode
6.6 years ago

Maybe the "--freq" command will give you this info.

ADD COMMENT

Login before adding your answer.

Traffic: 2605 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6