Biostar Beta. Not for public use.
Best way of Merging Multiple Lanes generated by HiSeq 2000, index BAM or VCF?
0
Entering edit mode
17 months ago
venuraherath • 20

Hi Folks, I just want to make sure I am on the right track. I have paired reads coming from three Lanes per each sample. Following is what I did so far; I mapped reads to reference genome using BWA in SAM tools.

Now I have indexed BAM files (3 per each sample).

I am thinking of merging bam files using Sam tools and then go for Variant calling.

I read that alternatively, variants can be called for each indexed bam file and then merge vcf files.

What is the more logical approach? Looking forward to get your suggestions. Thank you! 🙂

ADD COMMENTlink
5
Entering edit mode
6 weeks ago
ATpoint 17k
Germany

Given that these are lane replicates and variation between lanes is typically low, I would have merged already at the fastq level. Anyway, now you could merge them at the BAM level and then perform variant calling. Calling variants on the three technical replicates separately is probably not too informative as each of the three would have lower power to detect events than the merged file.

ADD COMMENTlink
0
Entering edit mode

Thank You!

I am going to use merge in SAMtools for merging index bam files. Do I need to keep bai files (this case 3) in the same folder as well? After merging do i need to index it again?

ADD REPLYlink
1
Entering edit mode

Merging does not require an index. Yes, you'll need to index the new file again, but you can do this in one command like:

samtools merge - file1.bam file2.bam file3.bam | tee merged.bam | samtools index - merged.bam.bai
ADD REPLYlink
1
Entering edit mode

Awesome! Thank you so much! Have a Great Day!

Venura

ADD REPLYlink

Login before adding your answer.

Similar Posts
Loading Similar Posts
Powered by the version 2.1