Calculating how many SNPs different Sequence A is to Sequence B, C, D etc...
1
1
Entering edit mode
7.7 years ago
f2369583 ▴ 10

I have genomes from clinical bacterial infections. I'm trying to find out whether any of the isolates have infected multiple people.

As such I want to find out how many SNPs difference each genome has across the dataset.

There aren't many good quality reference genomes available so individual mapping is hard, relying on the draft assemblies produced within.

Any good tools/pipelines to achieve this? I've used MEGA's pairwise matrix, but I'm dubious about the results. I'm not sure if the quality of the assemblies will have a large impact on the % of distance detected (i.e. if genome A's assemblies is 2.541mb and genome B's assemlies is 2.543, will it score that extra .002 as distance even though it's likely an error)?

bacteria SNP • 1.4k views
ADD COMMENT
0
Entering edit mode

Do you have a VCF file? Look into vcftools.

ADD REPLY
0
Entering edit mode
7.7 years ago
Rob ▴ 150

I think the fastest way is to use a tool which don't need a reference genome for finding your Snps (like DiscoSnp ++ http://colibread.inria.fr/software/discosnp/) and then compares them to each other for each of your datasets.

ADD COMMENT

Login before adding your answer.

Traffic: 1571 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6