Where is the "% Identity" column in BLAST XML?
4.6 years ago
helenhvalask • 30
United States

I am trying to parse a BLAST XML file from a blastp search using SearchIO in Biopython. However, I am not sure which attribute I should use to extract "% identity".

In a blast-tab file, I can use hsp.pident. Does anyone know the equivalent attribute name for blast-xml? Or should I derive it myself as "hsp.ident_num/hsp.aln_span*100"? Thanks.

15 months ago
Peter 5.8k
Scotland, UK

The BLAST XML output format does not contain the percentage identity as an explicit field, so yes, you must calculate it from the number of identities and the alignment length.

See for example my BLAST XML to tabular conversion script: https://github.com/peterjc/galaxy_blast/blob/master/tools/ncbi_blast_plus/blastxml_to_tabular.py#L230

(Note: if you are using Python 2, beware of integer division if you do the calculation as written in the question!)
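
For illustration, here is a minimal sketch of that calculation using only the Python standard library. It assumes the usual BLAST XML tag names Hsp_identity and Hsp_align-len; the XML fragment and the function names percent_identity and hsp_percent_identities are hypothetical, made up for this example, and real BLAST output contains many more elements.

```python
import xml.etree.ElementTree as ET

# Hypothetical, heavily trimmed BLAST XML fragment (illustration only).
SAMPLE_XML = """<Hit_hsps>
  <Hsp>
    <Hsp_num>1</Hsp_num>
    <Hsp_identity>45</Hsp_identity>
    <Hsp_align-len>60</Hsp_align-len>
  </Hsp>
  <Hsp>
    <Hsp_num>2</Hsp_num>
    <Hsp_identity>98</Hsp_identity>
    <Hsp_align-len>100</Hsp_align-len>
  </Hsp>
</Hit_hsps>"""


def percent_identity(ident_num, aln_span):
    """Return the identities as a percentage of the alignment length."""
    # Starting with the float 100.0 avoids integer division on Python 2.
    return 100.0 * ident_num / aln_span


def hsp_percent_identities(xml_text):
    """Compute the percentage identity for every <Hsp> element in xml_text."""
    root = ET.fromstring(xml_text)
    results = []
    for hsp in root.iter("Hsp"):
        identities = int(hsp.findtext("Hsp_identity"))
        align_len = int(hsp.findtext("Hsp_align-len"))
        results.append(percent_identity(identities, align_len))
    return results


pidents = hsp_percent_identities(SAMPLE_XML)
print(pidents)
```

With Bio.SearchIO, the same helper could presumably be called as percent_identity(hsp.ident_num, hsp.aln_span), using the attribute names from the question.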
