comparing two genomes
2
0
Entering edit mode
8.5 years ago
dr_mm_sh • 0

I am trying to find low similarity DNA sequences from two genomes of very closely related fungal species. Blast+ alignment of concatenated genomic sequences showed hits with identities around 70~80%. However, when I retrieved these DNA sequences of the both query and subject and made blastn search against genomes of each other, I was able to see hits with 100% identities.

What do you recommend to improve the accuracy of alignment results?

Command used to produce the alignment was:

$ blastn -db db_file -query input_file.txt -out output.txt -max_target_seqs 100 -outfmt 7

blast alignment • 2.0k views
ADD COMMENT
0
Entering edit mode
8.5 years ago
edrezen ▴ 730

Hello,

The blastn command actually executes the megablast algorithm by default. The megablast algorithm is very fast but finds fewer results than the blastn algorithm.

If you want to execute the blastn algorithm, you should try the following :

blastn -task blastn -db db_file -query input_file.txt -out output.txt -max_target_seqs 100 -outfmt 7

This post talks about this "-task" option.

ADD COMMENT
0
Entering edit mode
8.5 years ago
Sishuo Wang ▴ 230

discontinuous megablast might also be helpful or you can change '-word_size' (=11 be default).

ADD COMMENT

Login before adding your answer.

Traffic: 2553 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6