SAVAGE is taking too long to run the haplotype reconstruction

1

Entering edit mode

5.9 years ago

fernandalpcosta • 0

Hello all,

I'm running SAVAGE (https://bitbucket.org/jbaaijens/savage/src/master/) to get the haplotypes of a big virus (almost 3x the HIV virus' size) and it took over 72h hours to get to stage b of the SAVAGE pipeline.

The sequencing has a coverage of 20000x and the reference genome has 32KB of size.

I'm using 7 threads to run this analysis on a 8 vCPUs, 52 GB RAM and 10T disk machine.

I also tested HaploClique (https://github.com/cbg-ethz/haploclique) and PredictHaplo (http://bmda.cs.unibas.ch/software.html) softwares on this analysis, but both also took over 72h and never finished.

Is there another software that runs the whole analysis in less than 72h per virus in the conditions/specifications I mentioned above?

Thank you all in advance for any tips or help you may give me,

savage haplotype virus haploclique predicthaplo • 1.1k views

ADD COMMENT • link 5.9 years ago by fernandalpcosta • 0

0

Entering edit mode

Is the ultra high depth of sequencing causing this issue? Do you really need that much coverage.

ADD REPLY • link 5.9 years ago by GenoMax 141k

0

Entering edit mode

I'm not sure if that much coverage is necessary, but it's not unusual for virus's haplotype reconstruction. You thing that 10000x is enough?

ADD REPLY • link 5.9 years ago by fernandalpcosta • 0

0

Entering edit mode

Someone else will have to comment on the re-construction part. You could try different amounts (starting with 1000x) and see if it makes a big difference in results as you go up.