BLAST taxid to complete linage using biopython on a local machine.
0
1
Entering edit mode
6.0 years ago

Hello all,

Using BLAST, I am trying to identify thousands of sequences. To make my job a bit easier, I would like to classify these sequences first by sorting them on phylum e.g. only keep BLAST hits from a certain pylum. To do this, I need to generate het complete linage of the hit using the NCBI taxonomic database. I know you can query the database with biopython. over the internet, but the machine I am working on has no internet access.

Could anyone give me some insight into this?

BLAST Biopython parsng taxonomy • 1.4k views
ADD COMMENT
0
Entering edit mode

How big/long are the sequences?

One very hacky solution might be to run your sequences through Kraken which identifies species, typically in short read metagenomic data, but you could massage your input data potentially. It may work on pre-assembled contigs, in which case you should be fine.

You'll still need to be able to download the software and download/make a Kraken database though, so it's going to be tricky if you have no internet access at all.

NB, this may also only work for microbial data, I'm not 100% sure what Kraken can handle.

ADD REPLY
0
Entering edit mode

I did check our kraken, but it doesn't really do everything I need.

ADD REPLY

Login before adding your answer.

Traffic: 2828 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6