Comparing SPAdes assemblies to each other?
2
2
Entering edit mode
5.5 years ago

I would like to compare three WGS datasets to each other, without using a reference genome. I am planning on running each set of paired end fastqs through SPAdes independently and then comparing the spades.contig files to each other.

I don't know of an alignment tool that doesn't require a reference genome as an input. I essentially want each of these files to be a reference to each other. I am also hoping for some kind of output that states the % aligned, similar to bowtie2.

DeNovo SPAdes • 1.7k views
ADD COMMENT
0
Entering edit mode

I'm also looking for a tool that can do this, any luck finding it yet?

ADD REPLY
0
Entering edit mode

Use minimap2 or mummer to align the genomes in an all-vs-all pairwise manner. You can compute statistics on the alignments easily enough, but they will probably give you some already.

ADD REPLY
0
Entering edit mode

What is the expected size of the genome? Besides the tools mentioned by @Joe, you may need to look at LASTZ if these are eukaryotic genomes.

ADD REPLY
0
Entering edit mode

I think, Quast work for you. All you need to do is to check the parameters. It has got alignment parameter.

ADD REPLY
1
Entering edit mode
3.8 years ago

See if using the interactive dot plotter works for your case. I recently discovered this implementation of dot plotting and found extremely usefuyl.

It uses nucmer alignments hence it is quite performant (unlike most dot plotters), plus it is interactive and fast

https://dnanexus.github.io/dot/

ADD COMMENT
0
Entering edit mode
3.8 years ago
h.mon 35k

You don't say if you are working with prokaryotes or eukaryotes, nor the estimated genome size of your organism.

In case you have bacterial assemblies: for such a small number of genomes, Mauve is a great option for alignment and visualization. Better than aligning draft assemblies, it can align annotated draft assemblies in GenBank format (produced by e.g. Prokka, and you can visualize the correspondence of annotations, when you zoom in.

In case you have eukaryotic assemblies: D-Genies is a project similar to Dot suggested by Istvan Albert , but uses minimap2 for the genome to genome alignment, which should make it even faster than Dot.

ADD COMMENT

Login before adding your answer.

Traffic: 2044 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6