Why does Blast report a different alignment when using the -m8 option
2
2
Entering edit mode
9.7 years ago

I really need to know why when I do a blast with the option -m8, I have different results (different % identities, and gaps). example..

normal

>tr|Q3T560|Q3T560_9GAMM Arsenite transporter ATPase-like protein OS=Acidithiobacillus caldus
              GN=arsA1 PE=4 SV=1
          Length = 612
 Score =  612 bits (1577), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 329/546 (<strong>60%</strong>), Positives = 400/546 (73%), Gaps = 5/546 (0%)

Query: 372511 RVLLVSTDPASNVGQVFGTDIGNHITQIGQVSGLEALEIDPEAAAEDYRERIIGPIRSTL 372570
              ++LLVSTDPASNVGQVF  +IGN IT I  V+GL ALEIDP+ AA+ YRERI+GP+R  L
Sbjct: 38     KILLVSTDPASNVGQVFSQEIGNKITTISTVAGLSALEIDPQQAAQQYRERIVGPVRGAL 97

Query: 372571 PEKALNSIKEQLSGACTTEIAAFDEFTALLTDPELIERYDHIVFDTAPTGHTIRLLKLPG 372630

with -m8 option

# BLASTP 2.2.25 [Feb-01-2011]
# Query:
# Database: /home/biolinux/Desktop/fasta/ars/arsenic_db_3.fasta
# Fields: Query id, Subject id, % identity, alignment length, mismatches, gap openings, q. start, q. end, s. start, s. end, e-value, bit score
1_0    tr|Q3T560|Q3T560_9GAMM    63.55    546    194    4    372511    373052    38    582    0.0     612

In "normal" I have 60% of identities and 5 gaps, but when I put -m8 option I have 63.55% identities and 4 gaps..

Any reason?

blast • 3.9k views
ADD COMMENT
2
Entering edit mode
9.7 years ago
5heikki 11k

I couldn't find legacy blast 2.2.26 release notes but this is from 2.2.26+:

ChangeLog:
-----------------------------
* 2.2.26 release.
..
Bug fixes:
..
* The percent identity calculation was wrong if the best-hit algorithm was used.

So, the most likely reason is that 2.2.25 had a bug related to -m8 output and percent identity.

ADD COMMENT
0
Entering edit mode

Perfect way to spell out: please use the most recent version to report bugs.

ADD REPLY
0
Entering edit mode
9.7 years ago

how i can know what version i am using?

ADD COMMENT
0
Entering edit mode

probably blastp -version.

ADD REPLY
0
Entering edit mode
biolinux@biolinux-PC[biolinux] blastp -version                                                                                        [ 2:45PM]
blastp: 2.2.28+
Package: blast 2.2.28, build May  2 2013 18:07:25
biolinux@biolinux-PC[biolinux]

I don't know why, on the header of my blast appears BLASTP 2.2.25 [Feb-01-2011]

ADD REPLY
0
Entering edit mode

What was the command that you ran that generated the problem in your OP?

ADD REPLY
0
Entering edit mode
blastall -p blastp -d database.fasta -i sequence.fasta -m8 >out
ADD REPLY
0
Entering edit mode

That is not the same program as blastp, which explains the different versions. The blastall program is a part of the legacy blast tools. Try again with the blastp command and see if the results differ.

ADD REPLY
0
Entering edit mode

This command invokes legacy blast, but you also seem to have blast+ installed ...

ADD REPLY
0
Entering edit mode

I put

blastp -db database.fasta -query query.fasta -out out.fasta

and appears this...

Warning: (1431.1) CFastaReader: Ignoring invalid residue 8 at line 112004, position 4
Warning: (1431.1) CFastaReader: Ignoring invalid residue 6 at line 112004, position 5
Warning: (1431.1) CFastaReader: Ignoring invalid residue 1 at line 112004, position 6
Warning: (1431.1) CFastaReader: Ignoring invalid residue 4 at line 112005, position 0
Warning: (1431.1) CFastaReader: Ignoring invalid residue 2 at line 112005, position 1
Warning: (1431.1) CFastaReader: Ignoring invalid residue 4 at line 112005, position 2
Warning: (1431.1) CFastaReader: Ignoring invalid residue 6 at line 112005, position 3
Warning: (1431.1) CFastaReader: Ignoring invalid residue 9 at line 112005, position 4
Warning: (1431.1) CFastaReader: Ignoring invalid residue 2 at line 112005, position 5

and keep going.....

that it's normal?

aps, and I am using a gbk query, I mean.. query.gbk, that care?

ADD REPLY
0
Entering edit mode

Your fasta file is improperly formatted, check that there are no invalid characters present in the sequence entry under each fasta header.

ADD REPLY
0
Entering edit mode

Is it really necessary to be a FASTA file?, or can be other like a GBK file?

ADD REPLY
0
Entering edit mode

Fasta, Gi or NCBI accession.

ADD REPLY
0
Entering edit mode

I apply the formatdb command...

or it's the makeformatdb command?

ADD REPLY
0
Entering edit mode

You may want to double check the documentation:

http://www.ncbi.nlm.nih.gov/guide/howto/run-blast-local/

Use the BLAST+ makeblastdb program (or formatdb in the legacy BLAST package) to convert a FASTA-formatted local file into a BLAST database.

It seems that you're trying to blast one fasta file against another. Note that is deprecated, you want to use makeblastdb. If that doesn't work, check that the sequence entries in your query file do not contain any invalid characters.

ADD REPLY
0
Entering edit mode

although one does the database, the blast is always performed against fasta file database.

ADD REPLY

Login before adding your answer.

Traffic: 2084 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6