How to download all the parasite protein data from NCBI?
2
0
Entering edit mode
7.3 years ago
najibveto ▴ 110

I want to do a local blast using all the bacterial protein data from NCBI instead of NR. Is there any way to download all the data from NCBI? Or to filter the NR database locally?

parasite protein ncbi • 2.6k views
ADD COMMENT
0
Entering edit mode

thanks a lot for your help.

ADD REPLY
2
Entering edit mode
7.3 years ago
j_susat ▴ 40

Hello,

if you just want to Download all the bacterial proteins from NCBI you could use Entrez Direct: Click Here Here is some small example which could do the job (restricted to refseq)

esearch -db protein -query "bacteria [ORGN] AND refseq [filter]" | efetch -format fasta > bacterial_proteins
ADD COMMENT
1
Entering edit mode
7.3 years ago
natasha.sernova ★ 4.0k

See this post and my answer inside:

where can I get environmental bacteria genome in fasta format (as many as possible)?

NCBI structure has been changed, so I am not sure about recent archives.

But to find at least something see ftp://ftp.ncbi.nlm.nih.gov/genomes/archive/old_refseq/Bacteria/

and all.faa.tar.gz file. I don't know where to find a current version. The old one is from 02.06.2015..

ADD COMMENT

Login before adding your answer.

Traffic: 1522 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6