Merging a PLINK file with itself after duplicating sampleIDs and rsIDs
0
0
Entering edit mode
6.9 years ago
aritra90 ▴ 70

Hi,

I want to merge a PLINK file with itself, maintaining high genotyping rate. Let's say I have a file test.bed/bim/fam, what I am doing is the following:

  1. randomly selecting few SNPs from test.bim into a file snpstoflip.txt
  2. flipping those SNPs using

    ./plink --bfile test --flip snpstoflip.txt --make-bed --out flipped_test

  3. Duplicating the samples and SNP ids for the flipped_test dataset. (Adding something like "Dup_" infront of each sampleIDs and rsIDs)

  4. Merge them using

    ./plink --bfile test --bmerge flipped_test --make-bed --out MergedFiles

But, doing this, the genotyping rate decreases a lot, to 0.5, that means we have a lot of missing entries. Is there a better way of achieving this? The idea is to replicate the PED file many times to generate a scaled up large data set from a relatively small data set. Please let me know, your views on this or how to tackle this problem, without getting a lot of missing entries.

Thanks.

SNP PLINK genotype merging • 1.8k views
ADD COMMENT

Login before adding your answer.

Traffic: 1968 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6