Entering edit mode
4.8 years ago
kihugaharrison
•
0
I have 29 plasmodium falciparum whole genome sequences that i need to analyse for SNPs. The problem is, each sample contains multiple sequences of upto 768,000 neucleotides....i need to convert this into a single sequence. How do i do this?
Can you please try to explain better the data that you have? Show examples, if possible.
Dear Kevin,
I have raw whole genome sequences 150 bases long that look like this....
That looks like raw illumina data converted to fasta (with a changed header) and interleaved (R1/R2 following each other).
Are these in one file or you have 29 files? It would be useful to have Q-scores for alignments/SNP calling.
If possible you should acquire original fastq format data and start there.